439453; BE284974; Hs.6566; thyroid hormone receptor interactor 13; thyroid hormone receptor interactor 13; 6.72
453348; BE272318; Hs.8595; hypothetical protein FLJ12438; hypothetical protein FLJ12438; 6.71

446113; AW967553; Hs.323518; Homo sapiens mRNA for FLJ00083 protein, partial cds; Homo sapiens mRNA for FLJ00083 protein, ; 6.70

445363; NM_005993; Hs.12570; tubulin-specific chaperone d; tubulin-specific chaperone d; 6.70

433701; AW445023; Hs.15155; ESTs; ESTs; 6.69 

418054; NM_002318; Hs.83354; lysyl oxidase-like 2; lysyl oxidase-like 2; 6.63

441031; AI110684; Hs.7645; fibrinogen, B beta polypeptide; fibrinogen, B beta polypeptide; 6.62

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens]; ESTs, Weakly similar to A47582 B-cell gr; 6.59

424381; AA285249; Hs.146329; protein kinase Chk2 (CHEK2); protein kinase Chk2 (CHEK2); 6.58

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity Ia, receptor for (CD64); Fc fragment of IgG, high affinity Ia, re; 6.58

418883; BE387036; Hs.1211; acid phosphatase 5, tartrate resistant; acid phosphatase 5, tartrate resistant; 6.57

414161; AA136106; Hs.184852; KIAA1553 protein; KIAA1553 protein; 6.57

437437; AA226869; Hs.351623; hypothetical protein DKFZp762L0311; hypothetical protein DKFZp762L0311; 6.55

425769; U72513; Hs.159486; Human RPL13-2 pseudogene mRNA, complete cds; Human RPL13-2 pseudogene mRNA, complete ; 6.55

452363; AI582743; Hs.94953; Homo sapiens, Similar to complement component 1, q subcomponent, c polypeptide, clone MGC:17279, mRNA, complete cds; Homo sapiens, Similar to complement comp; 6.55

422765; AW409701; Hs.1578; baculoviral IAP repeat-containing 5 (survivin); baculoviral IAP repeat-containing 5 (sur; 6.51

409893; AW247090; Hs.57101; minichromosome maintenance deficient (S. cerevisiae) 2 (mitotin); minichromosome maintenance deficient (S.; 6.50

418918; X07871; Hs.89476; CD2 antigen (p50), sheep red blood cell receptor; CD2 antigen (p50), sheep red blood cell ; 6.49

417911; AA333387; Hs.82916; chaperonin containing TCP1, subunit 6A (zeta 1); chaperonin containing TCP1, subunit 6A (; 6.44

427747; AW411425; Hs.180655; serine/threonine kinase 12; serine/threonine kinase 12; 6.43

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromosomes 4, yeast)-like 1; SMC4 (structural maintenance of chromoso; 6.42

430280; AA361258; Hs.237868; interleukin 7 receptor; interleukin 7 receptor; 6.42

432938; T27013; Hs.3132; steroidogenic acute regulatory protein; steroidogenic acute regulatory protein; 6.42

422809; AK001379; Hs.121028; hypothetical protein FLJ10549; hypothetical protein FLJ10549; 6.40

427578; AI591305; Hs.169084; ESTs, Highly similar to TUL3_HUMAN TUBBY RELATED PROTEIN 3 [H.sapiens]; ESTs, Highly similar to TUL3_HUMAN TUBBY; 6.40

417115; AW952792; Hs.334612; small nuclear ribonucleoprotein polypeptide E; small nuclear ribonucleoprotein polypept; 6.39

428227; AA321649; Hs.2248; small inducible cytokine subfamily B (Cys-X-Cys), member 10; small inducible cytokine subfamily B (Cy; 6.39 

441384; AA447849; Hs.288660; retinoic acid induced 3; retinoic acid induced 3; 6.38

438915; AA280174; Hs.285681; Williams-Beuren syndrome chromosome region 14; Williams-Beuren syndrome chromosome regi; 6.34
416773; AK000340; Hs.79828; hypothetical protein FLJ20333; hypothetical protein FLJ20333; 6.34

451999; AW176401; Hs.27424; DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 11 (S.cerevisiae CHL1-like helicase); DEAD/H (Asp-Glu-Ala-Asp/His) box polypep; 6.31

444159; AF116846; Hs.10431; dead ringer (Drosophila)-like 2 (bright and dead ringer); dead ringer (Drosophila)-like 2 (bright ; 6.31

425274; BE281191; Hs.155462; minichromosome maintenance deficient (mis5, S. pombe) 6; minichromosome maintenance deficient (mi; 6.31 

434649; AA738254; Hs.165390; ESTs, Highly similar to A40350 transcription repressor protein YY1 [H.sapiens]; ESTs, Highly similar to A40350 transcrip; 6.30

420507; AF093408; Hs.98397; A kinase (PRKA) anchor protein 3; A kinase (PRKA) anchor protein 3; 6.30 

415829; AW450198; Hs.163742; ESTs; ESTs; 6.28 

410342; R31350; Hs.743; Fc fragment of IgE, high affinity I, receptor for; gamma polypeptide; Fc fragment of IgE, high affinity I, rec; 6.26
446839; BE091926; Hs.16244; mitotic spindle coiled-coil related protein; mitotic spindle coiled-coil related prot; 6.25
457465; AW301344; Hs.122908; DNA replication factor; DNA replication factor; 6.25
428918; AL036967; Hs.2324; protamine 2; protamine 2; 6.24 

443523; AK001575; Hs.9536; hypothetical protein FLJ10713; hypothetical protein FLJ10713; 6.19
424415; NM_001975; Hs.146580; enolase 2, (gamma, neuronal); enolase 2, (gamma, neuronal); 6.19
407245; X90568; Hs.172004; titin; titin; 6.18

458627; AW088642; Hs.97984; SRY (sex determining region Y)-box 17 (SOX17); SRY (sex determining region Y)-box 17 (S; 6.18 

421379; Y15221; Hs.103982; small inducible cytokine subfamily B (Cys-X-Cys), member 11; small inducible cytokine subfamily B (Cy; 6.16

435099; AC004770; Hs.4756; flap structure-specific endonuclease 1; flap structure-specific endonuclease 1; 6.13

424308; AW975531; Hs.154443; minichromosome maintenance deficient (S. cerevisiae) 4; minichromosome maintenance deficient (S.; 6.12

430521; NMJM6383; Hs.242t83; HOM-TES-85 tumor antigen; HOM-TES-85 tumor antigen; 6.10 

444823; BE262989; Hs.12045; putative protein; putative protein; 6.10 

428484; AF104032; Hs.184601; solute carrier family 7 (cationic amino acid transporter, y+ system), member 5; solute carrier family 7 (cationic amino ; 6.09
402260; ; ; NM_001436*:Homo sapiens fibrillarin (FBL), mRNA. transcript) (F8A), mRNA.; NM_001436*:Homo sapiens fibrillarin (FBL; 6.09
441321; H17182; Hs.7771; B-cell associated protein; B-cell associated protein; 6.05

422283; AW411307; Hs.114311; CDC45 (cell division cycle 45, S.cerevisiae, homolog)-like; CDC45 (cell division cycle 45, S.cerevis; 6.04
431629; AU077025; Hs.265827; interferon, alpha-inducible protein (clone IFI-6-16); interferon, alpha-inducible protein (clo; 6.04
402678; ; ; Target Exon; Target Exon; 6.03 

453884; AA355925; Hs.36232; KIAA0186 gene product; KIAA0186 gene product; 6.01

439753; BE262233; Hs.7423; hypothetical protein from EUROIMAGE 2168212; hypothetical protein from EUROIMAGE 2168; 6.01 

420596; NM_002692; Hs.99185; polymerase (DNA directed), epsilon 2; polymerase (DNA directed), epsilon 2; 6.01

420676; AI434780; Hs.4248; vav 2 oncogene; vav 2 oncogene; 6.00

418756; AA252254; Hs.226949; ESTs; ESTs; 5.99 

454438; AA224053; Hs.172405; cell division cycle 27; cell division cycle 27; 5.98

407818; AL021938; Hs.40154; jumonji (mouse) homolog; jumonji (mouse) homolog; 5.98

413313; NM_002047; Hs.293885; glycyl-tRNA synthetase; glycyl-tRNA synthetase; 5.96

424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lysozyme (renal amyloidosis); 5.95

417777; AI823763; Hs.7055; ESTs, Weakly similar to I78885 serine/threonine-specific protein kinase [H.sapiens]; ESTs, Weakly similar to I78885 serine/th; 5.94
449569; AI656634; Hs.195389; ESTs; ESTs; 5.92
436576; AI458213; Hs.77542; ESTs; ESTs; 5.90

438746; AI885815; Hs.184727; Human melanoma-associated antigen p97 (melanotransferrin) mRNA, 3 flank; Human melanoma-associated antigen p97 (m; 5.89
420005; AW271105; Hs.133294; ESTs; ESTs; 5.89

417208; S67773; Hs.81665; v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog; v-kit Hardy-Zuckerman 4 feline sarcoma v; 5.88

403171; ; ; C2001472*:gi|5609678|gb|AAB41848.2| (U64675) sperm membrane protein BS-63 [Homo sapiens]; C2001472*:gi|5809678|gb|AAB41848.2| (U64; 5.87

448730; AB032983; Hs.21894; KIAA1157 protein; KIAA1157 protein; 5.87

406137; ; ; NM_000179*:Homo sapiens mutS (E. coli) homolog 6 (MSH6), mRNA. VERSION NM_000178.1 GI; NM_000179*:Homo sapiens mutS (E. coli) h; 5.85

423787; AJ295745; Hs.236204; nuclear pore complex protein; nuclear pore complex protein; 5.85

425126; N32759; Hs.172944; chorionic gonadotropin, beta polypeptide; chorionic gonadotropin, beta polypeptide; 5.84 

452796; AB011100; Hs.30656; KIAA0528 gene product; KIAA0528 gene product; 5.84

418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase); cyclin-dependent kinase inhibitor 3 (CDK; 5.80
447359; NM_012093; Hs.18268; adenylate kinase 5; adenylate kinase 5; 5.79

420344; BE463721; Hs.97101; putative G protein-coupled receptor; putative G protein-coupled receptor; 5.78

420297; AI628272; Hs.128757; ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU1_HUMAN ALU S; 5.75

414761; AU077228; Hs.77256; enhancer of zeste (Drosophila) homolog 2; enhancer of zeste (Drosophila) homolog 2; 5.75
430287; AW182459; Hs.125759; ESTs, Weakly similar to LEU5_HUMAN LEUKEMIA ASSOCIATED PROTEIN 5 [H.sapiens]; ESTs, Weakly similar to LEU5_HUMAN LEUKE; 5.74
436251; BE515065; Hs.296585; nucleolar protein (KKE/D repeat); nucleolar protein (KKE/D repeat); 5.73 

421535; AB002359; Hs.105478; phosphoribosylformylglycinamidine synthase (FGAR amidotransferase); phosphoribosylformylglycinamidine syntha; 5.71
414883; AA926960; Hs.348669; CDC28 protein kinase 1; CDC28 protein kinase 1; 5.69

425159; NM_004341; Hs.154868; carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase; carbamoyl-phosphate synthetase 2, aspart; 5.69
401704; ; ; NM_021195*:Homo sapiens claudin 6 (CLDN6), mRNA. VERSION NM_020982.1 GI; NM_021195*:Homo sapiens claudin 6 (CLDN6; 5.66
425358; AL079658; Hs.338207; FK506 binding protein 12-rapamycin associated protein 1; FK506 binding protein 12-rapamycin assoc; 5.65

402677; ; ; NM_000478:Homo sapiens alkaline phosphatase, liver/bone/kidney (ALPL), mRNA. alpha-hydroxylase), polypeptide 1 (CYP7B1), mRNA.; NM_000478:Homo sapiens alkaline phosphat; 5.64
409264; NM_014937; Hs.52463; KIAA0966 protein; KIAA0966 protein; 5.63
432185; AA221032; Hs.272838; hypothetical protein FLJ10494; hypothetical protein FLJ10494; 5.63
409012; AL117435; Hs.49725; DKFZP434I216 protein; DKFZP434I216 protein; 5.63

430252; AI638774; Hs.105328; testes development-related NYD-SP20; testes development-related NYD-SP20; 5.61

419359; AL043202; Hs.90073; chromosome segregation 1 (yeast homolog)-like; chromosome segregation 1 (yeast homolog); 5.61

452816; AA131789; Hs.61509; ESTs; ESTs; 5.60 

402679; ; ; NM_000478:Homo sapiens alkaline phosphatase, liver/bone/kidney (ALPL), mRNA. alpha-hydroxylase), polypeptide 1 (CYP7B1), mRNA.; NM_000478:Homo sapiens alkaline phosphat; 5.59

414291; AI289619; Hs.13040; G protein-coupled receptor 86; G protein-coupled receptor 86; 5.58 
453028; AB006532; Hs.31442; RecQ protein-like 4; RecQ protein-like 4; 5.58
453905; NM_002314; Hs.36566; LIM domain kinase 1; LIM domain kinase 1; 5.56

411263; BE297802; Hs.69360; kinesin-like 6 (mitotic centromere-associated kinesin); kinesin-like 6 (mitotic centromere-assoc; 5.55

419660; BE280337; Hs.194693; solute carrier family 7 (cationic amino acid transporter, y+ system), member 7; solute carrier family 7 (cationic amino ; 5.55

446979; AI654443; Hs.197683; ESTs; ESTs; 5.54 

424321; W74048; Hs.1765; lymphocyte-specific protein tyrosine kinase; lymphocyte-specific protein tyrosine kin; 5.53
418962; AA714835; Hs.271863; ESTs; ESTs; 5.53 

447388; AW630534; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; Homo sapiens, clone MGC:9381, mRNA, comp; 5.52

427247; AW504221; Hs.174103; integrin, alpha L (antigen CD11A (p180), lymphocyte function-associated antigen 1; alpha polypeptide); integrin, alpha L (antigen CD11A (p180),; 5.52

449322; AI638616; Hs.196566; ESTs; ESTs; 5.51 

428450; NM_014791; Hs.184339; KIAA0175 gene product; KIAA0175 gene product; 5.51

415141; AA189099; Hs.268171; ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU7_HUMAN ALU S; 5.48
454048; H05626; Hs.6921; ESTs; ESTs; 5.46

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; interleukin 1 receptor antagonist; 5.45

434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis, clone HEMBB1001304; Homo sapiens cDNA FLJ11980 fis, clone HE; 5.44
414334; AA824298; Hs.21331; hypothetical protein FLJ10036; hypothetical protein FLJ10036; 5.44

452291; AF015592; Hs.28853; CDC7 (cell division cycle 7, S. cerevisiae, homolog)-like 1; CDC7 (cell division cycle 7, S. cerevisi; 5.44
438564; AA381553; Hs.198253; major histocompatibility complex, class II, DQ alpha 1; major histocompatibility complex, class ; 5.44
427668; AA298760; Hs.180191; hypothetical protein FLJ14904; hypothetical protein FLJ14904; 5.43

449437; AI702038; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT05581; Homo sapiens cDNA: FLJ22902 fis, clone K; 5.41
453633; AA357001; Hs.34045; hypothetical protein FLJ20764; hypothetical protein FLJ20764; 5.40
450746; D82673; Hs.278589; general transcription factor II, i; general transcription factor II, i; 5.40
425966; NM_001761; Hs.1973; cyclin F; cyclin F; 5.39
418134; AA397769; Hs.86617; ESTs; ESTs; 5.38 

432141; BE410964; Hs.272736; nuclear receptor binding protein; nuclear receptor binding protein; 5.37

417141; U22662; Hs.347353; nuclear receptor subfamily 1, group H, member 3; nuclear receptor subfamily 1, group H, m; 5.36

428329; AA426091; Hs.98453; ESTs, Moderately similar to R27328 2 [H.sapiens]; ESTs, Moderately similar to R27328 2 [H.; 5.35

406811; U82979; Hs.67846; leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 4; leukocyte immunoglobulin-like receptor, ; 5.34
415819; AU077330; Hs.360791; transcription elongation factor A (SII), 1; transcription elongation factor A (SII),; 5.33
448133; AA723157; Hs.73769; folate receptor 1 (adult); folate receptor 1 (adult); 5.33

424762; AL119442; Hs.183684; eukaryotic translation initiation factor 4 gamma, 2; eukaryotic translation initiation factor; 5.32
421959; AW751497; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; cytochrome P450, subfamily IIS, polypept; 5.32
429271; AF039850; Hs.198515; dead ringer (Drosophila)-like 1; dead ringer (Drosophila)-like 1; 5.32

456373; BE247706; Hs.86693; membrane-spanning 4-domains, subfamily A, member 2 (CD20 antigen); membrane-spanning 4-domains, subfamily A; 5.30
414907; X90725; Hs.77597; polo (Drosophila)-like kinase; polo (Drosophila)-like kinase; 5.30
422997; BE018212; Hs.122908; DNA replication factor; DNA replication factor; 5.30

440014; AW960782; Hs.6856; ash2 (absent, small, or homeotic, Drosophila, homolog)-like; ash2 (absent, small, or homeotic, Drosop; 5.28
418399; AF131781; Hs.84753; hypothetical protein FLJ12442; hypothetical protein FLJ12442; 5.26

416178; AI808527; Hs.192822; serologically defined breast cancer antigen NY-BR-81; serologically defined breast cancer anti; 5.21

450377; AB033091; Hs.355925; KIAA1265 protein; KIAA1265 protein; 5.20

409670; AI368109; Hs.375604; KIAA1856 protein; KIAA1855 protein; 5.20 

429083; Y09397; Hs.227817; BCL2-related protein A1; BCL2-related protein A1; 5.20

449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; chemokine (C-C motif) receptor 5; 5.20

408908; BE296227; Hs.250822; serine/threonine kinase 15; serine/threonine kinase 15; 5.19

429732; U20158; Hs.2488; lymphocyte cytosolic protein 2 (SH2 domain-containing leukocyte protein of 76kD); lymphocyte cytosolic protein 2 (SH2 doma; 5.19

438089; W05391; Hs.351546; nuclear receptor subfamily 1, group I, member 3; nuclear receptor subfamily 1, group I, m; 5.16

437623; D63880; Hs.5719; chromosome condensation-related SMC-associated protein 1; chromosome condensation-related SMC-asso; 5.15

448181; AF272833; Hs.279763; hypothetical protein FLJ10504; hypothetical protein FLJ10504; 5.15

436540; BE397032; Hs.14466; hypothetical protein MGC14226; hypothetical protein MGC14226; 5.14 

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor type, C; protein tyrosine phosphatase, receptor t; 5.14

426752; X69490; Hs.172004; titin; titin; 5.13

415007; BE244332; Hs.77770; adaptor-related protein complex 3, mu 2 subunit; adaptor-related protein complex 3, mu 2 ; 5.13
400263; ; Hs.75309; Eos Control; Eos Control; 5.13

437099; N77793; Hs.48659; ESTs, Highly similar to S14458 laminin alpha-1 chain precursor [H.sapiens]; ESTs, Highly similar to S14458 laminin a; 5.12
427209; H06509; Hs.92423; KIAA1566 protein; KIAA1566 protein; 5.10

407347; AA829847; ; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:1370413 3' similar to contains Alu repetitive element mRNA sequence.; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens; 5.10
458933; AI638429; Hs.24763; RAN binding protein 1; RAN binding protein 1; 5.10
450431; AW136797; Hs.266041; ESTs; ESTs; 5.09 

434608; AA805443; Hs.179909; hypothetical protein FLJ22995; hypothetical protein FLJ22995; 5.08

410423; AW402432; Hs.63489; protein tyrosine phosphatase, non-receptor type 6; protein tyrosine phosphatase, non-recept; 5.08

417929; R27219; Hs.74647; Human T-cell receptor active alpha-chain mRNA from M cell line, complete cds; Human T-cell receptor active alpha-chain; 5.05

412723; AA648459; Hs.335951; hypothetical protein AF301222; hypothetical protein AF301222; 5.05 
447321; AW271217; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; Homo sapiens cDNA FLJ14028 fis, clone HE; 5.03
417866; AW067903; Hs.62772; collagen, type XI, alpha 1; collagen, type XI, alpha 1; 5.03
410082; M081594;Hs.158311;Musashi(Drosop^ 

433592; NM_004642; Hs.3436; deleted in oral cancer (mouse, homolog) 1; deleted in oral cancer (mouse, homolog) ; 5.02
425811; AL039104; Hs.159557; karyopherin alpha 2 (RAG cohort 1, importin alpha 1); karyopherin alpha 2 (RAG cohort 1, impor; 5.00
425237; U07695; Hs.155227; EphB4; EphB4; 5.00 

414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); transferrin receptor (p90, CD71); 4.99
402145; ; ; Target Exon; Target Exon; 4.99 

432126; AA865239; Hs.37196; ESTs; ESTs; 4.99

408279; AF216965; Hs.44095; Homo sapiens, clone MGC:12617, mRNA, complete cds; Homo sapiens, clone MGC:12617, mRNA, com; 4.98
419525; T79257; Hs.1259; asialoglycoprotein receptor 2; asialoglycoprotein receptor 2; 4.97
424439; AA579635; Hs.1770; ligase I, DNA, ATP-dependent; ligase I, DNA, ATP-dependent; 4.97 

427667; AK001279; Hs.180171; Homo sapiens cDNA FLJ10417 fis, clone NT2RP1000112; Homo sapiens cDNA FLJ10417 fis, clone NT; 4.96
457313; AF047002; Hs.241520; transcriptional coactivator; transcriptional coactivator; 4.96

448569; BE382657; Hs.21486; signal transducer and activator of transcription 1, 91kD; signal transducer and activator of trans; 4.94 
426427; M86699; Hs.169840; TTK protein kinase; TTK protein kinase; 4.91

440129; AA865818; Hs.369523; ESTs, Weakly similar to S71886 Ste20-like protein kinase [H.sapiens]; ESTs, Weakly similar to S71886 Ste20-lik; 4.91
453922; AF053306; Hs.35708; budding uninhibited by benzimidazoles 1 (yeast homolog), beta; budding uninhibited by benzimidazoles 1 ; 4.90
435602; AF217515; Hs.283532; uncharacterized bone marrow protein BM039; uncharacterized bone marrow protein BM03; 4.89
20 452698! NM01295;Hs.301921;chemokine(C-C motif) receptor ^^^etC^^JJ"^ 1 : 4 - 8 * . , . C1 l10 „ ftc!milQf (ft . ,00 
445333; BE537641; Hs.44278; hypothetical protein FLJ12538 similar to ras-related protein RAB17; hypothetical protein FLJ12538 similar to; 4.88
437162; AW005505; Hs.5464; thyroid hormone receptor coactivating protein; thyroid hormone receptor coactivating pr; 4.87
441600; AA939347; Hs.127223; Homo sapiens cysteine knot protein (ZSIG51) mRNA, complete cds; Homo sapiens cysteine knot protein (ZSIG; 4.87
421350; AW301608; Hs.278188; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; ESTs, Moderately similar to I54374 gene ; 4.87
409093; BE243834; Hs.50441; CGI-04 protein; CGI-04 protein; 4.86

424304; NM_001395; Hs.144879; dual specificity phosphatase 9; dual specificity phosphatase 9; 4.86
437696; Z83844; Hs.5790; hypothetical protein dJ37E16.5; hypothetical protein dJ37E16.5; 4.86

417381; AF164142; Hs.82042; solute carrier family 23 (nucleobase transporters), member 1; solute carrier family 23 (nucleobase tra; 4.84
416445; AL043004; Hs.79337; KIAA0135 protein; KIAA0135 protein; 4.83
448939; BE267795; Hs.22595; hypothetical protein FLJ10637; hypothetical protein FLJ10637; 4.82

413566; AW604451; Hs.285814; sprouty (Drosophila) homolog 4; sprouty (Drosophila) homolog 4; 4.82 
424081; NM_006413; Hs.139120; ribonuclease P (30kD); ribonuclease P (30kD); 4.81
425474; Z48054; Hs.158084; peroxisome receptor 1; peroxisome receptor 1; 4.81

412276; BE262621; Hs.73798; macrophage migration inhibitory factor (glycosylation-inhibiting factor); macrophage migration inhibitory factor (; 4.81

422689; AW856665; Hs.299797; gb:RC3-CT0297-29010W)1 3403 CT0297 Homo sapiens cDNA, mRNA sequence; gb:RC3-CT0297-2901 0041 3403 CT0297 Homo; 4.80

409101; NM_004297; Hs.50612; guanine nucleotide binding protein (G protein), alpha 14; guanine nucleotide binding protein (G pr; 4.79

412760; AW379030; Hs.41324; ESTs; ESTs; 4.79

447250; AI878909; Hs.17883; protein phosphatase 1G (formerly 2C), magnesium-dependent, gamma isoform; protein phosphatase 1G (formerly 2C), ma; 4.79

429345; R11141; Hs.199695; hypothetical protein; hypothetical protein; 4.78 
448950; AF288687; Hs.9275; CGI-152 protein; CGI-152 protein; 4.78

412926; AI879076; Hs.75061; macrophage myristoylated alanine-rich C kinase substrate; macrophage myristoylated alanine-rich C ; 4.78

412641; M16660; Hs.74335; heat shock 90kD protein 1, beta; heat shock 90kD protein 1, beta; 4.76

420261; AW205093; Hs.748; fibroblast growth factor receptor 1 (fms-related tyrosine kinase 2, Pfeiffer syndrome); fibroblast growth factor receptor 1 (fms; 4.76
421905; AI660247; Hs.32699; ESTs, Weakly similar to LIV-1 protein [H.sapiens]; ESTs, Weakly similar to LIV-1 protein [H; 4.75

413880; AI660842; Hs.110915; interleukin 22 receptor; interleukin 22 receptor; 4.75

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a)-related kinase 2 (NEK2); NIMA (never in mitosis gene a)-related k; 4.74
418355; L42563; Hs.1165; ATPase, H? transporting, nongastric, alpha polypeptide; ATPase, H? transporting, nongastric, alp; 4.74
435905; AW997484; Hs.5003; KIAA0456 protein; KIAA0456 protein; 4.74 

428024; Z29067; Hs.2236; NIMA (never in mitosis gene a)-related kinase 3; NIMA (never in mitosis gene a)-related k; 4.74
421846; AA017707; Hs.1432; protein kinase C substrate 80K-H; protein kinase C substrate 80K-H; 4.72
419138; U48508; Hs.89631; ryanodine receptor 1 (skeletal); ryanodine receptor 1 (skeletal); 4.72
437296; AA350994; Hs.20281; KIAA1700; KIAA1700; 4.70

450142; AW207469; Hs.24485; chondroitin sulfate proteoglycan 6 (bamacan); chondroitin sulfate proteoglycan 6 (bama; 4.70
421733; AL119671; Hs.1420; fibroblast growth factor receptor 3 (achondroplasia, thanatophoric dwarfism); fibroblast growth factor receptor 3 (ach; 4.69
449475; AI348027; Hs.129826; hypothetical protein PP1057; hypothetical protein PP1057; 4.69

420062;AW411096;Hs.94785;TGF(teto^ 4 . . ^^^(jy-m 

429336; AB005038; Hs.199270; cytochrome P450, subfamily XXVIIB (25-hydroxyvitamin D-1-alpha-hydroxylase), polypeptide 1; cytochrome P450, subfamily XXVIIB (25-hy; 4.69
436856; AI469355; Hs.127310; ESTs; ESTs; 4.68

406937; U14622; ; gb:Human transketolase-like protein gene, partial cds.; gb:Human transketolase-like protein gene; 4.67
411296; BE207307; Hs.10114; growth suppressor 1; growth suppressor 1; 4.67

426726; AA488915; Hs.171955; trophinin associated protein (tastin); trophinin associated protein (tastin); 4.67

409132; AJ224538; Hs.50732; protein kinase, AMP-activated, beta 2 non-catalytic subunit; protein kinase, AMP-activated, beta 2 no; 4.67

449230; BE613348; Hs.356392; melanoma cell adhesion molecule; melanoma cell adhesion molecule; 4.66

rc 431681; AK000378;Hs.267566;hyr J c1hefJc^ e ,. . 4R5 

443623; AA345519; Hs.9641; complement component 1, q subcomponent, alpha polypeptide; complement component 1, q subcomponent, ; 4.65

441595; AW206035; Hs.356457; ESTs; ESTs; 4.64 

423419; R55336; Hs.23539; ESTs; ESTs; 4.64 

415724; NM_003580; Hs.78687; neutral sphingomyelinase (N-SMase) activation associated factor; neutral sphingomyelinase (N-SMase) activ; 4.63
435045; BE297155; Hs.143698; ESTs; ESTs; 4.62
424441; X14850; Hs.147097; H2A histone family, member X; H2A histone family, member X; 4.62

414972; BE263782; Hs.77695; KIAA0008 gene product; KIAA0008 gene product; 4.62

436685; W28661; Hs.5288; Homo sapiens mRNA; cDNA DKFZp434M245 (from clone DKFZp434M245); Homo sapiens mRNA; cDNA DKFZp434M245 (fr; 4.62
449515; AI653378; Hs.302012; ESTs; ESTs; 4.61

425998; AU076629; Hs.165950; fibroblast growth factor receptor 4; fibroblast growth factor receptor 4; 4.61 
420027; AF009746; Hs.94395; ATP-binding cassette, sub-family D (ALD), member 4; ATP-binding cassette, sub-family D (ALD); 4.61
436469; AK001455; Hs.5198; Down syndrome critical region gene 2; Down syndrome critical region gene 2; 4.61
413441; AI929374; Hs.75357; Src-like-adapter; Src-like-adapter; 4.60
456847; AI360456; Hs.86088; ESTs; ESTs; 4.58

421506; BE302796; Hs.105097; thymidine kinase 1, soluble; thymidine kinase 1, soluble; 4.57 
426935; NM_000088; Hs.172928; collagen, type I, alpha 1; collagen, type I, alpha 1; 4.57
428782; X12830; Hs.193400; interleukin 6 receptor; interleukin 6 receptor; 4.56
409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; splicing factor, arginine/serine-rich 5; 4.56
412773; H15785; Hs.74573; similar to vaccinia virus HindIII K4L ORF; similar to vaccinia virus HindIII K4L OR; 4.55
443068; AI188710; Hs.374480; ESTs; ESTs; 4.55

441607; NM_005010; Hs.7912; neuronal cell adhesion molecule; neuronal cell adhesion molecule; 4.54

453227; AW135862; Hs.243991; ESTs; ESTs; 4.52 

438459; T49300; Hs.35304; Homo sapiens cDNA FLJ13655 fis, clone PLACE1011503; Homo sapiens cDNA FLJ13655 fis, clone PL; 4.51
422565; BE259035; Hs.118400; singed (Drosophila)-like (sea urchin fascin homolog like); singed (Drosophila)-like (sea urchin fas; 4.51
453613; F06838; Hs.374476; ESTs; ESTs; 4.50 

413900; AW409747; Hs.75612; stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein); stress-induced-phosphoprotein 1 (Hsp70/H; 4.50
412507; L36645; Hs.73964; EphA4; EphA4; 4.50 

419034; NM 002110; Hs.89555; hemopoietic cell kinase; hemopoietic cell kinase; 4.49 

413431; AW246428; Hs.75355; ubiquitin-conjugating enzyme E2N (homologous to yeast UBC13); ubiquitin-conjugating enzyme E2N (homolo; 4.49
406547; ; ; Target Exon; Target Exon; 4.49 

443216; W80487; Hs.324521; hypothetical protein DC50; hypothetical protein DC50; 4.48 

417497; AW402482; Hs.82212; CD53 antigen; CD53 antigen; 4.47

448595; AB014544; Hs.21572; KIAA0644 gene product; KIAA0644 gene product; 4.47

445350; AF052112; Hs.12540; lysophospholipase I; lysophospholipase I; 4.46

446236; NM_006293; Hs.301; TYRO3 protein tyrosine kinase; TYRO3 protein tyrosine kinase; 4.46

429150; AF120103; Hs.197366; smoothened (Drosophila) homolog; smoothened (Drosophila) homolog; 4.46

420340; NM_000734; Hs.97087; CD3Z antigen, zeta polypeptide (TiT3 complex); CD3Z antigen, zeta polypeptide (TiT3 com; 4.46

413426; U88837; Hs.75354; GCN1 (general control of amino-acid synthesis 1, yeast)-like 1; GCN1 (general control of amino-acid synt; 4.44

421819; NM_013403; Hs.108665; zinedin; zinedin; 4.44

409512; AW9791 87; Hs.293591; melanoma differentiation assodated prolein-5;melanorr«dife^ 

428995; AW004975; Hs.194716; MAD (mothers against decapentaplegic, Drosophila) homolog interacting protein, receptor activation anchor; MAD (mothers against decapentaplegic, Dr; 4.43

434551; BE387162; Hs.280858; ESTs, Highly similar to A35661 DNA excision repair cross-complementing protein ERCC3 [H.sapiens]; ESTs, Highly similar to A35661 DNA excis; 4.43

418295; AW970043; Hs.238039; hypothetical protein FLJ11090; hypothetical protein FLJ11090; 4.42
409243; AB037761; Hs.51743; KIAA1340 protein; KIAA1340 protein; 4.42 

437103; AW139408; Hs.152940; ESTs; ESTs; 4.42

413186; AU077141; Hs.374548; solute carrier family 16 (monocarboxylic acid transporters), member 1; solute carrier family 16 (monocarboxylic; 4.42
456362; AW973003; Hs.179909; hypothetical protein FLJ22995; hypothetical protein FLJ22995; 4.40
424078; AB006625; Hs.139033; paternally expressed 3; paternally expressed 3; 4.39

448153; Y10805; Hs.20521; HMT1 (hnRNP methyltransferase, S. cerevisiae)-like 2; HMT1 (hnRNP methyltransferase, S. cerevi; 4.37
410134; U68140; Hs.58927; nuclear VCP-like; nuclear VCP-like; 4.36

435523; T62849; Hs.11090; membrane-spanning 4-domains, subfamily A, member 7; membrane-spanning 4-domains, subfamily A; 4.35
400440; X83957; Hs.83870; nebulin; nebulin; 4.35

437218; AL117497; Hs.58185; ESTs, Weakly similar to T42727 proliferation potential-related protein - mouse [M.musculus]; ESTs, Weakly similar to T42727 prolifera; 4.34
430478; NM_014349; Hs.241535; apolipoprotein L, 3; apolipoprotein L, 3; 4.34
432744; AA988835; Hs.38664; ESTs; ESTs; 4.33

423173; AA442655; Hs.124942; protein phosphatase 2A 48 kDa regulatory subunit; protein phosphatase 2A 48 kDa regulatory; 4.33 
415995; NM_004573; Hs.355888; phospholipase C, beta 2; phospholipase C, beta 2; 4.33

408728; AL137379; Hs.47125; hypothetical protein FLJ13912; hypothetical protein FLJ13912; 4.30

431222; X56777; Hs.273790; zona pellucida glycoprotein 3A (sperm receptor); zona pellucida glycoprotein 3A (sperm re; 4.30

427792; M63928; Hs.180841; tumor necrosis factor receptor superfamily, member 7; tumor necrosis factor receptor superfami; 4.29

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mRNA, 3'UTR; Human proteinase activated receptor-2 mR; 4.29
427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, receptor for (CD16); Fc fragment of IgG, low affinity IIIb, r; 4.29

434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; pyruvate dehydrogenase phosphatase; 4.29

412314; AA825247; Hs.356084; downstream of: G protein-coupled receptor 27 (GPR27) (SREB1); downstream of: G protein-coupled recepto; 4.28
447827; U73727; Hs.19718; protein tyrosine phosphatase, receptor type, U; protein tyrosine phosphatase, receptor t; 4.28
426108; AA622037; Hs.166468; programmed cell death 5; programmed cell death 5; 4.28
428820; AA436187; Hs.172631; integrin, alpha M (complement component receptor 3, alpha; also known as CD11b (p170), macrophage antigen alpha polypeptide); integrin, alpha M (complement component ; 4.27
437908; AI082424; Hs.351043; ESTs; ESTs; 4.27

444664; N26362; Hs.11615; map kinase phosphatase-like protein MK-STYX; map kinase phosphatase-like protein MK-S; 4.27
429002; AW248439; Hs.2340; junction plakoglobin; junction plakoglobin; 4.26
439334; AI148976; Hs.112062; ESTs; ESTs; 4.26

425308; M97639; Hs.155585; receptor tyrosine kinase-like orphan receptor 2; receptor tyrosine kinase-like orphan rec; 4.25
413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; interleukin 2 receptor, beta; 4.25
453648; W21493; Hs.380100; hypothetical protein FLJ14005; hypothetical protein FLJ14005; 4.24

447200; BE543146; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; Homo sapiens cDNA FLJ14028 fis, clone HE; 4.24
447528; AI612027; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; Homo sapiens, clone MGC:9381, mRNA, comp; 4.23

452721; AJ269529; Hs.301871; solute carrier family 37 (glycerol-3-phosphate transporter), member 1; solute carrier family 37 (glycerol-3-pho; 4.22
449810; AB008681; Hs.23994; activin A receptor, type IIB; activin A receptor, type IIB; 4.22

447198; D61523; Hs.283435; ESTs; ESTs; 4.22

416714; AF283770; Hs.79630; CD79A antigen (immunoglobulin-associated alpha); CD79A antigen (immunoglobulin-associated; 4.22
425356; BE244879; Hs.155939; inositol polyphosphate-5-phosphatase, 145kD; inositol polyphosphate-5-phosphatase, 14; 4.22

422605; H16646; Hs.118666; hypothetical protein PP591; hypothetical protein PP591; 4.21

444535; AF011466; Hs.122575; EDG4 (endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 4); EDG4 (endothelial differentiation, lys; 4.21

417088; M54915; Hs.81170; pim-1 oncogene; pim-1 oncogene; 4.20

421707; NM_014921; Hs.107054; lectomedin-2; lectomedin-2; 4.20
408717; AF045458; Hs.47061; unc-51 (C. elegans)-like kinase 1; unc-51 (C. elegans)-like kinase 1; 4.20

438485; W57578; Hs.378718; RAB7, member RAS oncogene family; RAB7, member RAS oncogene family; 4.19

419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; PTK7 protein tyrosine kinase 7; 4.18 

418755; Y14443; Hs.88219; zinc finger protein 200; zinc finger protein 200; 4.18

417212; AW952823; Hs.351547; NS1-binding protein; NS1-binding protein; 4.17
413686; AI469213; Hs.71404; ESTs; ESTs; 4.17

419344; U94905; Hs.277445; diacylglycerol kinase, zeta (104kD); diacylglycerol kinase, zeta (104kD); 4.16

418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fusin); chemokine (C-X-C motif), receptor 4 (fus; 4.16

408482; NM_000676; Hs.45743; adenosine A2b receptor; adenosine A2b receptor; 4.16

422391; D63479; Hs.115907; diacylglycerol kinase, delta (130kD); diacylglycerol kinase, delta (130kD); 4.15
409421; AA199883; Hs.67624; ESTs; ESTs; 4.15

415938; BE383507; Hs.78921; A kinase (PRKA) anchor protein 1; A kinase (PRKA) anchor protein 1; 4.14

415198; AW009480; Hs.943; natural killer cell transcript 4; natural killer cell transcript 4; 4.14

424685; W21223; Hs.151734; nuclear transport factor 2 (placental protein 15); nuclear transport factor 2 (placental pr; 4.13
428479; Y00272; Hs.334562; cell division cycle 2, G1 to S and G2 to M; cell division cycle 2, G1 to S and G2 to; 4.13

423728; AW891294; Hs.132136; solute carrier family 4, sodium bicarbonate cotransporter, member 8; solute carrier family 4, sodium bicarbon; 4.13

433435; BE545277; Hs.340959; Ts translation elongation factor, mitochondrial; Ts translation elongation factor, mitoch; 4.13

420253; AI656055; Hs.96200; neighbor of A-kinase anchoring protein 95; neighbor of A-kinase anchoring protein 9; 4.12

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 2489 expressed sequence; DNA segment on chromosome 12 (unique) 24; 4.12

400205; ; Hs.81848; NM_006265*:Homo sapiens RAD21 (S. pombe) homolog (RAD21), mRNA. (APO-1/CD95 (Fas)-associated phosphatase) (PTPN13), mRNA.; NM_006265*:Homo sapiens RAD21 (S. pombe); 4.12
425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic polypeptide; protein kinase, DNA-activated, catalytic; 4.11
419607; R52557; Hs.91579; Homo sapiens clone 23783 mRNA sequence; Homo sapiens clone 23783 mRNA sequence; 4.10 

459035; AW291109; Hs.332563; ESTs, Weakly similar to T31611 hypothetical protein Y50E8A.g - Caenorhabditis elegans [C.elegans]; ESTs, Weakly similar to T31611 hypotheti; 4.10

432512; NM_003284; Hs.3017; transition protein 1 (during histone to protamine replacement); transition protein 1 (during histone to ; 4.10

452875; BE275760; Hs.30928; DNA segment on chromosome 19 (unique) 1177 expressed sequence; DNA segment on chromosome 19 (unique) 11; 4.09

423804; AW403448; Hs.1706; interferon-stimulated transcription factor 3, gamma (48kD); interferon-stimulated transcription fact; 4.09

413745; AW247252; Hs.75514; nucleoside phosphorylase; nucleoside phosphorylase; 4.09 

442091; AW770493; Hs.182874; guanine nucleotide binding protein (G protein) alpha 12; guanine nucleotide binding protein (G pr; 4.09

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease(APEX nuclease)-like 2 protein; apurinic/apyrimidinic endonuclease(APEX ; 4.08

421859; AA356620; Hs.108947; KIAA0050 gene product; KIAA0050 gene product; 4.08

432841; M93425; Hs.62; protein tyrosine phosphatase, non-receptor type 12; protein tyrosine phosphatase, non-recept; 4.08
452069; AB028949; Hs.183994; KIAA1026 protein; KIAA1026 protein; 4.08

425069; AA687465; Hs.298184; potassium voltage-gated channel, shaker-related subfamily, beta member 2; potassium voltage-gated channel, shaker-; 4.07
418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxylic acid transporters), member 3; solute carrier family 16 (monocarboxylic; 4.07
424517; AI539443; Hs.137447; Homo sapiens cDNA FLJ12169 fis, clone MAMMA1000643; Homo sapiens cDNA FLJ12169 fis, clone MA; 4.07
434224; AA380731; Hs.84; interleukin 2 receptor, gamma (severe combined immunodeficiency); interleukin 2 receptor, gamma (severe co; 4.06
446791; AI632278; Hs.195922; ESTs; ESTs; 4.06

432065; AA401039; Hs.2903; protein phosphatase 4 (formerly X), catalytic subunit; protein phosphatase 4 (formerly X), cata; 4.06

431194; D43704; Hs.250712; calcium channel, voltage-dependent, beta 3 subunit; calcium channel, voltage-dependent, beta; 4.06 

418751; BE389014; Hs.372548; phosphoinositide-3-kinase, regulatory subunit, polypeptide 3 (p55, gamma); phosphoinositide-3-kinase, regulatory su; 4.06

425923; NM_005026; Hs.162808; phosphoinositide-3-kinase, catalytic, delta polypeptide; phosphoinositide-3-kinase, catalytic, de; 4.05

408692; AL040127; Hs.34074; dipeptidylpeptidase VI; dipeptidylpeptidase VI; 4.04 

446272; BE268912; Hs.14601; hematopoietic cell-specific Lyn substrate 1; hematopoietic cell-specific Lyn substrat; 4.04 

439176; AI446444; Hs.190394; ESTs, Weakly similar to B28096 line-1 protein ORF2 [H.sapiens]; ESTs, Weakly similar to B28096 line-1 pr; 4.04

417880; BE241595; Hs.82848; selectin L (lymphocyte adhesion molecule 1); selectin L (lymphocyte adhesion molecule; 4.04

410068; AI633888; Hs.58435; FYN-binding protein (FYB-120/130); FYN-binding protein (FYB-120/130); 4.03

410639; BE269047; Hs.65234; hypothetical protein FLJ20596; hypothetical protein FLJ20596; 4.03

427716; L38951; Hs.180446; karyopherin (importin) beta 1; karyopherin (importin) beta 1; 4.03

451050; AW937420; Hs.351869; ESTs; ESTs; 4.02 

449667; AB023227; Hs.23850; KIAA1010 protein; KIAA1010 protein; 4.02 

448499; BE613280; Hs.77550; p53-regulated DDA3; p53-regulated DDA3; 4.01

437527; AI241019; Hs.145644; ESTs; ESTs; 4.01

425118; AU076611; Hs.154672; methylene tetrahydrofolate dehydrogenase (NAD dependent), methenyltetrahydrofolate cyclohydrolase; methylene tetrahydrofolate dehydrogenase; 4.00

451931; AK000208; Hs.27267; Homo sapiens cDNA FLJ20201 fis, clone COLF1210; Homo sapiens cDNA FLJ20201 fis, clone CO; 4.00
412939; AW411491; Hs.75C69; eukaryotic translation elongation factor 1 gamma; eukaryotic translation elongation factor; 4.00
409581; U66243; Hs.55039; mitogen-activated protein kinase 12; mitogen-activated protein kinase 12; 3.99
433577; AW007080; Hs.284192; ESTs; ESTs; 3.99 

439963; AW247529; Hs.6793; platelet-activating factor acetylhydrolase, isoform Ib, gamma subunit (29kD); platelet-activating factor acetylhydrola; 3.99
418629; BE247550; Hs.86859; growth factor receptor-bound protein 7; growth factor receptor-bound protein 7; 3.99 
448633; AA311426; Hs.21635; tubulin, gamma 1; tubulin, gamma 1; 3.98

402398; ; ; C19000263:gi|3108023|gb|AAC15755.1| (AC004659) BC62940.2 [Homo sapiens]jl56335; C19000263:gi|3108023|gb|AAC15755.1| (AC0; 3.97
408414; AI114688; Hs.193400; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; ESTs, Weakly similar to 2109260A B cell ; 3.97
415012; NM_004383; Hs.77793; c-src tyrosine kinase; c-src tyrosine kinase; 3.97

416084; L16991; Hs.79006; deoxythymidylate kinase (thymidylate kinase); deoxythymidylate kinase (thymidylate kin; 3.95

422051; AW327546; Hs.111024; solute carrier family 25 (mitochondrial carrier, citrate transporter), member 1; solute carrier family 25 (mitochondrial ; 3.95
447887; AA114050; Hs.211610; caspase 8, apoptosis-related cysteine protease; caspase 8, apoptosis-related cysteine pr; 3.95
430770; AA765694; Hs.123296; ESTs; ESTs; 3.94 
442994; AI026718; Hs.16954; ESTs; ESTs; 3.94 

420333; AJ001383; Hs.97084; lymphocyte antigen 94 (mouse) homolog (activating NK-receptor; NK-p46); lymphocyte antigen 94 (mouse) homolog (a; 3.94
438456; AA913381; Hs.279763; ESTs; ESTs; 3.94 

422599; BE387202; Hs.118638; non-metastatic cells 1, protein (NM23A) expressed in; non-metastatic cells 1, protein (NM23A) ; 3.93
420162; BE378432; Hs.95577; cyclin-dependent kinase 4; cyclin-dependent kinase 4; 3.93

424829; NM_002507; Hs.1827; nerve growth factor receptor (TNFR superfamily, member 16); nerve growth factor receptor (TNFR super; 3.93
447574; AF162666; Hs.18895; tousled-like kinase 1; tousled-like kinase 1; 3.93

425797; AF002986; Hs.159545; platelet activating receptor homolog; platelet activating receptor homolog; 3.93 

421910; NM_014586; Hs.109437; hormonally upregulated neu tumor-associated kinase; hormonally upregulated neu tumor-associa; 3.92

413781; J05272; Hs.850; IMP (inosine monophosphate) dehydrogenase 1; IMP (inosine monophosphate) dehydrogenas; 3.92

434334; AA912476; Hs.116750; Homo sapiens cDNA FLJ13221 fis, clone NT2RP4002075; Homo sapiens cDNA FLJ13221 fis, clone NT; 3.92

400262; ; Hs.75309; Eos Control; Eos Control; 3.90

424977; AA349289; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT05581; Homo sapiens cDNA: FLJ22902 fis, clone K; 3.90
409799; D11928; Hs.76845; phosphoserine phosphatase-like; phosphoserine phosphatase-like; 3.90
435206; AI432364; Hs.160594; ESTs; ESTs; 3.90 

439863; BE547830; Hs.375208; paired immunoglobulin-like receptor beta; paired immunoglobulin-like receptor beta; 3.90
413627; BE182082; Hs.246973; intron of Bicaudal D homolog 1; intron of Bicaudal D homolog 1; 3.90
426265; AA421069; Hs.97896; ESTs; ESTs; 3.89 

451063; AW163702; Hs.25911; HLA-B associated transcript-2; HLA-B associated transcript-2; 3.89
407013; U35637; Hs.83870; gb:Human nebulin mRNA, partial cds; gb:Human nebulin mRNA, partial cds; 3.89
437239; AW503395; Hs.5541; ATPase, Ca transporting, ubiquitous; ATPase, Ca transporting, ubiquitous; 3.88 
400261; ; Hs.1802; Eos Control; Eos Control; 3.88 

450447; AF212223; Hs.25010; hypothetical protein P15-2; hypothetical protein P15-2; 3.88 

422293; X94453; Hs.114366; pyrroline-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase); pyrroline-5-carboxylate synthetase (glut; 3.87
414251; AL042306; Hs.97689; VASA protein; VASA protein; 3.87 

417767; BE242241; Hs.82542; acyloxyacyl hydrolase (neutrophil); acyloxyacyl hydrolase (neutrophil); 3.87

414443; AU077268; Hs.76144; platelet-derived growth factor receptor, beta polypeptide; platelet-derived growth factor receptor,; 3.87
444099; D87432; Hs.10315; solute carrier family 7 (cationic amino acid transporter, y system), member 6; solute carrier family 7 (cationic amino ; 3.86
419596; BE379320; Hs.91448; MKP-1 like protein tyrosine phosphatase; MKP-1 like protein tyrosine phosphatase; 3.86
427022; AW245839; Hs.173255; small nuclear ribonucleoprotein polypeptide A; small nuclear ribonucleoprotein polypept; 3.86
443661; AA336609; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; Homo sapiens cDNA: FLJ23313 fis, clone H; 3.86
408056; AA312329; Hs.42331; ephrin-A4; ephrin-A4; 3.86 

410552; X66945; Hs.748; fibroblast growth factor receptor 1 (fms-related tyrosine kinase 2, Pfeiffer syndrome); fibroblast growth factor receptor 1 (fms; 3.85 
450778; U81375; Hs.25450; solute carrier family 29 (nucleoside transporters), member 1; solute carrier family 29 (nucleoside tra; 3.85 
418978; T85295; Hs.268606; ESTs; ESTs; 3.84

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, regulatory subunit 1 (p35); cyclin-dependent kinase 5, regulatory su; 3.84
442980; AA857025; Hs.8878; kinesin-like 1; kinesin-like 1; 3.84

447232; AW499834; Hs.327; interleukin 10 receptor, alpha; interleukin 10 receptor, alpha; 3.84

434689; AF154115; Hs.4076; CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) phosphatase, subunit 1; CTD (carboxy-terminal domain, RNA polyme; 3.83
432539; AL138169; Hs.278378; karyopherin beta 2b, transportin; karyopherin beta 2b, transportin; 3.83
415684; 059356; Hs.374480; sorbitol dehydrogenase; sorbitol dehydrogenase; 3.83 
451598; N29102; Hs.79658; ESTs; ESTs; 3.82

449433; AI672096; Hs.9012; ESTs, Weakly similar to S26650 DNA-binding protein 5 [H.sapiens]; ESTs, Weakly similar to S26650 DNA-bindi; 3.82
426359; AA376409; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; Homo sapiens cDNA: FLJ23313 fis, clone H; 3.81
435160; AB002374; Hs.4791; KIAA0376 protein; KIAA0376 protein; 3.80

443402; U77846; Hs.9295; elastin (supravalvular aortic stenosis, Williams-Beuren syndrome); elastin (supravalvular aortic stenosis, ; 3.80
422753; AI928995; Hs.1575; small nuclear ribonucleoprotein D3 polypeptide (18kD); small nuclear ribonucleoprotein D3 polyp; 3.79
421508; NM_004833; Hs.105115; absent in melanoma 2; absent in melanoma 2; 3.79
414806; D14694; Hs.77329; phosphatidylserine synthase 1; phosphatidylserine synthase 1; 3.79

428023; AL038843; Hs.374530; Homo sapiens cDNA: FLJ23602 fis, clone LNG15735; Homo sapiens cDNA: FLJ23602 fis, clone L; 3.79

421654; AW163267; Hs.106469; suppressor of var1 (S.cerevisiae) 3-like 1; suppressor of var1 (S.cerevisiae) 3-like; 3.79

439668; AI091277; Hs.302634; frizzled (Drosophila) homolog 8; frizzled (Drosophila) homolog 8; 3.79 

411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; cytochrome b-245, alpha polypeptide; 3.78

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; dual specificity phosphatase 4; 3.78

425354; U62027; Hs.155935; complement component 3a receptor 1; complement component 3a receptor 1; 3.78

423909; AJ223183; Hs.135194; immunoglobulin superfamily, member 6; immunoglobulin superfamily, member 6; 3.78

429687; AI675749; Hs.211608; nucleoporin 153kD; nucleoporin 153kD; 3.77

414177; AI351355; Hs.356303; uncharacterized hypothalamus protein HARP11; uncharacterized hypothalamus protein HAR; 3.77
414135; NM_004419; Hs.2128; dual specificity phosphatase 5; dual specificity phosphatase 5; 3.77
445817; NM_003642; Hs.13340; histone acetyltransferase 1; histone acetyltransferase 1; 3.77

410422; AL042014; Hs.63348; Homo sapiens, clone MGC:15203, mRNA, complete cds; Homo sapiens, clone MGC:15203, mRNA, com; 3.76
412146; M92444; Hs.73722; APEX nuclease (multifunctional DNA repair enzyme); APEX nuclease (multifunctional DNA repai; 3.76
413011; AW068115; Hs.821; biglycan; biglycan; 3.76
428157; AI738719; Hs.198427; hexokinase 2; hexokinase 2; 3.76

400288; X06256; Hs.149609; integrin, alpha 5 (fibronectin receptor, alpha polypeptide); integrin, alpha 5 (fibronectin receptor,; 3.75
427378; BE515037; Hs.177556; melanoma antigen, family D, 1; melanoma antigen, family D, 1; 3.75

405484; ; ; C3002124*:gi|12737280|ref|XP_006682.2| keratin 18 [Homo sapiens]H6633; C3002124*:gi|12737280|ref|XP_006682.2| k; 3.75
450998; BE387614; Hs.25797; splicing factor 3b, subunit 4, 49kD; splicing factor 3b, subunit 4, 49kD; 3.75
432460; H12912; Hs.274691; adenylate kinase 3; adenylate kinase 3; 3.75 

428816; AA004986; Hs.193852; ATP-binding cassette, sub-family C (CFTR/MRP), member 2; ATP-binding cassette, sub-family C (CFTR; 3.74

431884; AA521246; Hs.210792; ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU8_HUMAN ALU S; 3.74

453329; T97205; Hs.193400; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; ESTs, Weakly similar to 2109260A B cell ; 3.74
426440; BE382756; Hs.169902; solute carrier family 2 (facilitated glucose transporter), member 1; solute carrier family 2 (facilitated glu; 3.74
438330; AW450572; Hs.257316; ESTs; ESTs; 3.74 

419911; L15301; Hs.1276; BN51 (BHK21) temperature sensitivity complementing; BN51 (BHK21) temperature sensitivity com; 3.74
452695; AW780199; Hs.30327; mitogen-activated protein kinase-activated protein kinase 5; mitogen-activated protein kinase-activat; 3.73
442233; AW967149; Hs.28439; ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]; ESTs, Weakly similar to I38022 hypotheti; 3.73
451295; AI557212; Hs.17132; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; ESTs, Moderately similar to I54374 gene ; 3.73
410772; BE275297; Hs.194685; Homo sapiens clone 24675 mRNA sequence; Homo sapiens clone 24675 mRNA sequence; 3.73
426251; M24283; Hs.168383; intercellular adhesion molecule 1 (CD54), human rhinovirus receptor; intercellular adhesion molecule 1 (CD54); 3.72
449843; R85337; Hs.24030; solute carrier family 31 (copper transporters), member 2; solute carrier family 31 (copper transpo; 3.71
423523; AW299828; Hs.193580; ESTs; ESTs; 3.71 

413407; AI356293; Hs.75339; inositol polyphosphate phosphatase-like 1; inositol polyphosphate phosphatase-like ; 3.71
448336; R53848; Hs.44976; ESTs; ESTs; 3.70 

422083; NM_001141; Hs.111256; arachidonate 15-lipoxygenase, second type; arachidonate 15-lipoxygenase, second typ; 3.70
416087; AF045184; Hs.79008; SKI-INTERACTING PROTEIN; SKI-INTERACTING PROTEIN; 3.70
442200; AW590572; Hs.235768; ESTs; ESTs; 3.70
414280; BE410769; Hs.75873; zyxin; zyxin; 3.69

409354; N68188; Hs.159472; Homo sapiens cDNA: FLJ22224 fis, clone HRC01703; Homo sapiens cDNA: FLJ22224 fis, clone H; 3.69
415276; U68666; Hs.78353; SFRS protein kinase 2; SFRS protein kinase 2; 3.69 

439659; AW970780; Hs.59483; leucine-rich repeat-containing G protein-coupled receptor 6 (LGR6) mRNA; leucine-rich repeat-containing G protein; 3.69

446522; NM_003876; Hs.15196; putative receptor protein; putative receptor protein; 3.69 

422785; AI824114; Hs.283088; heat shock 90kD protein 1, alpha; heat shock 90kD protein 1, alpha; 3.68

401083; ; ; NM_016582*:Homo sapiens peptide transporter 3 (LOC51296), mRNA. VERSION NM_016579.1 GI; NM_016582*:Homo sapiens peptide transpo; 3.68
413048; M93221; Hs.75182; mannose receptor, C type 1; mannose receptor, C type 1; 3.68
452690; AI536070; Hs.15085; ESTs; ESTs; 3.68 

428981; BE313077; Hs.93135; ESTs, Weakly similar to ALU2_HUMAN ALU SUBFAMILY SB SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU2_HUMAN ALU S; 3.68

415010; NM_004203; Hs.77783; membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase; membrane-associated tyrosine- and threon; 3.68

428579; NM_005756; Hs.184942; G protein-coupled receptor 64; G protein-coupled receptor 64; 3.68

446430; AA346837; Hs.15075; hypothetical protein DKFZp434E2216; hypothetical protein DKFZp434E2216; 3.66

442013; AA506476; Hs.375009; Human DNA sequence from clone RP11-353C18 on chromosome 20 Contains ESTs, STSs, GSSs and CpG Islands. Contains the NIFS gene for cysteine desulfurase, two genes for novel proteins and the gene for the; Human DNA sequence from clone RP11-353C1; 3.66
416602; NM_006159; Hs.367895; Protein kinase C-binding protein NELL2; Protein kinase C-binding protein NELL2; 3.65

441226; BE563042; Hs.118820; Homo sapiens, Similar to RIKEN cDNA 0610012G03 gene, clone MGC:14132, mRNA, complete cds; Homo sapiens, Similar to RIKEN cDNA 0610; 3.65

413076; U10564; Hs.75188; wee1 (S. pombe) homolog; wee1 (S. pombe) homolog; 3.65

429303; AW137635; Hs.44238; ESTs, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens]; ESTs, Weakly similar to S65657 alpha-1C-; 3.65
452060; W26980; Hs^49089; ATP-binding cassette, sub-family F (GCN20), member 2; ATP-binding cassette, sub-family F (GCN2; 3.65
451558; NM_001089; Hs.26630; ATP-binding cassette, sub-family A (ABC1), member 3; ATP-binding cassette, sub-family A (ABC1; 3.65
414774; X02419; Hs.77274; plasminogen activator, urokinase; plasminogen activator, urokinase; 3.65
417426; NM_002291; Hs.82124; laminin, beta 1; laminin, beta 1; 3.65

450296; AL041949; Hs.24756; hepatocyte growth factor-regulated tyrosine kinase substrate; hepatocyte growth factor-regulated tyros; 3.64 
437669; AI358105; Hs.123164; ESTs, Weakly similar to match to ESTs AA667999 [H.sapiens]; ESTs, Weakly similar to match to ESTs AA; 3.64
414029; BE297731; Hs.75709; mannose-6-phosphate receptor (cation dependent); mannose-6-phosphate receptor (cation dep; 3.64
444388; AB033058; Hs.11101; KIAA1232 protein; KIAA1232 protein; 3.64 

425910; AA830797; Hs.184760; CCAAT-box-binding transcription factor; CCAAT-box-binding transcription factor; 3.63

451484; AV648896; Hs.283771; hypothetical protein; hypothetical protein; 3.63 

444613; H29627; Hs.79092; hypothetical protein FLJ14427; hypothetical protein FLJ14427; 3.63

447495; AW401864; Hs.18720; programmed cell death 8 (apoptosis-inducing factor); programmed cell death 8 (apoptosis-induc; 3.62 

408101; AW968504; Hs.278346; CDC2-related protein kinase 7; CDC2-related protein kinase 7; 3.62

424732; D80001; Hs.152629; KIAA0179 protein; KIAA0179 protein; 3.62

411165; NM_000169; Hs.69089; galactosidase, alpha; galactosidase, alpha; 3.62

422112; BE540240; Hs.111783; Lsm1 protein; Lsm1 protein; 3.62

453020; AL162039; Hs.31422; Homo sapiens mRNA; cDNA DKFZp434M229 (from clone DKFZp434M229); Homo sapiens mRNA; cDNA DKFZp434M229 (fr; 3.61
438795; AA825792; Hs.377119; gb:od84b1U1 NCLCGAP_Ov2 Homo sapiens cDNA clone, mRNA sequence; gb:od84b1 U1 NCI CGAP_Ov2 Homo sapiens ; 3.61 
445515; BE388665; Hs.179999; Homo sapiens, clone IMAGE:3457003, mRNA; Homo sapiens, clone IMAGE:3457003, mRNA; 3.61
407797; AK000524; Hs.39850; hypothetical protein FLJ20517; hypothetical protein FLJ20517; 3.60

423217; NM_000094; Hs.1640; collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive); collagen, type VII, alpha 1 (epidermolys; 3.60

444985; AI677737; Hs.380100; hypothetical protein FLJ14005; hypothetical protein FLJ14005; 3.60

433764; AW753676; Hs.39982; zinc finger protein RINZF (NM_023929); zinc finger protein RINZF (NM_023929); 3.59

427857; AL133017; Hs.288679; hypothetical protein FLJ22865; hypothetical protein FLJ22865; 3.59

415020; BE249915; Hs.293533; Human DNA sequence from clone RP11-127L20 on chromosome 10. Contains ESTs, STSs, GSSs and CpG islands. Contains the gene for a novel glutathione-S-transferase and five novel genes; Human DNA sequence from clone RP11-127L2; 3.59
415149; X12451; Hs.78056; cathepsin L; cathepsin L; 3.57

458715; AK000973; Hs.1706; hypothetical protein FLJ10111; hypothetical protein FLJ10111; 3.57

423576; NM_000383; Hs.129829; autoimmune regulator (automimmune polyendocrinopathy candidiasis ectodermal dystrophy); autoimmune regulator (automimmune polyen; 3.57
440270; NM_015986; Hs.7120; cytokine receptor-like molecule 9; cytokine receptor-like molecule 9; 3.57 
434883; AW381538; Hs.19807; hypothetical protein MGC12959; hypothetical protein MGC12959; 3.57

404976; ; ; NM_014323*:Homo sapiens zinc finger protein 278 (ZNF278), transcript variant 1, mRNA.; NM_014323*:Homo sapiens zinc finger prot; 3.57
449656; M002008; Hs.188633; ESTs; ESTs; 3.56 
413795; AL040178; Hs.142003; ESTs; ESTs; 3.56 

406859; AI581134; Hs.181357; laminin receptor 1 (67kD, ribosomal protein SA); laminin receptor 1 (67kD, ribosomal prot; 3.56
411030; BE387193; Hs.67896; 7-60 protein; 7-60 protein; 3.56 

447079; AA280057; Hs.105280; ESTs, Weakly similar to dJ963K23.2 [H.sapiens]; ESTs, Weakly similar to dJ963K23.2 [H.sa; 3.56

424263; M77640; Hs.1757; L1 cell adhesion molecule (hydrocephalus, stenosis of aqueduct of Sylvius 1, MASA (mental retardation, aphasia, shuffling gait and adducted thumbs) syndrome, spastic paraplegia 1); L1 cell adhesion molecule (hydrocephalus; 3.55
413472; BE242870; Hs.75379; solute carrier family 1 (glial high affinity glutamate transporter), member 3; solute carrier family 1 (glial high affi; 3.55 
443466; BE243123; Hs.321045; IKK-related kinase epsilon; inducible IkappaB kinase; IKK-related kinase epsilon; inducible Ik; 3.55

426746; J03626; Hs.2057; uridine monophosphate synthetase (orotate phosphoribosyl transferase and orotidine-5'-decarboxylase); uridine monophosphate synthetase (orotat; 3.55

450931; N25156; Hs.25648; tumor necrosis factor receptor superfamily, member 5; tumor necrosis factor receptor superfami; 3.55 

425836; AW955696; Hs.90960; ESTs; ESTs; 3.54 

441054; AA913591; Hs.126480; ESTs; ESTs; 3.54

440592; AL137268; Hs.7285; KIAA0759 protein; KIAA0759 protein; 3.54

458946; AA009716; Hs.42311; ESTs; ESTs; 3.53 

449027; AJ271216; Hs.22880; dipeptidylpeptidase III; dipeptidylpeptidase III; 3.53
421662; NM_014141; Hs.106552; cell recognition molecule Caspr2; cell recognition molecule Caspr2; 3.53 
422732; AA577455; Hs.24937; transformer-2 alpha (htra-2 alpha); transformer-2 alpha (htra-2 alpha); 3.53 
424870; T15545; Hs.244624; ESTs; ESTs; 3.52 

442794; AI744130; Hs.356753; hypothetical protein MGC2975; hypothetical protein MGC2975; 3.52

417640; O30B57; Hs.82353; protein C receptor, endothelial (EPCR); protein C receptor, endothelial (EPCR); 3.51 

419971; AA400027; Hs.296234; ESTs, Weakly similar to T31613 hypothetical protein Y50E8A.I - Caenorhabditis elegans [C.elegans]; ESTs, Weakly similar to T31613 hypotheti; 3.51
410257; BE244044; Hs.61469; hypothetical protein; hypothetical protein; 3.51

424837; BE276113; Hs.333034; N-acetyltransferase, homolog of S. cerevisiae ARD1; N-acetyltransferase, homolog of S. cerev; 3.51

421921; H83363; Hs.355993; translocase of inner mitochondrial membrane 10 (yeast) homolog; translocase of inner mitochondrial membr; 3.50

454128; AL031259; Hs.367900; programmed cell death 2; programmed cell death 2; 3.50

434049; M501430; Hs.5771; amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 2; amyotrophic lateral sclerosis 2 (juvenil; 3.50

453641; AA444140; Hs.90960; ESTs; ESTs; 3.50 

429592; AB029041; Hs.209646; KIAA1118 protein; KIAA1118 protein; 3.49 

430647; AC003682; Hs.127988; ESTs, Weakly similar to Z211_HUMAN ZINC FINGER PROTEIN 211 [H.sapiens]; ESTs, Weakly similar to Z211_HUMAN ZINC ; 3.47

410855; X97795; Hs.66718; RAD54 (S.cerevisiae)-like; RAD54 (S.cerevisiae)-like; 3.44

413372; H55532; Hs.349695; tubulin, alpha 2; tubulin, alpha 2; 3.07

437224; AL117628; Hs.97808; ESTs; ESTs; 2.77

430439; AL133561; ; DKFZP434B061 protein; DKFZP434B061 protein; 2.76

435897; AF269223; Hs.128322; t-complex 11 (a murine tcp homolog); t-complex 11 (a murine tcp homolog); 2.53

417592; AA204664; Hs.182437; ESTs, Weakly similar to I54383 chromosome segregation protein smc1 [H.sapiens]; ESTs, Weakly similar to I54383 chromosom; 2.46
412026; AA383618; Hs.73073; testis-specific ankyrin motif containing protein; testis-specific ankyrin motif containing; 2.35
422789; AK001113; Hs.120842; hypothetical protein FLJ10251; hypothetical protein FLJ10251; 2.33
426627; AF012359; Hs.195685; ESTs; ESTs; 2.12 

438983; AF085884; Hs.20029; proacrosin binding protein sp32 precursor; proacrosin binding protein sp32 precurso; 2.07
425709; AA383076; Hs.159274; outer dense fibre of sperm tails 1; outer dense fibre of sperm tails 1; 1.99
433724; AI827749; Hs.144924; serine/threonine protein kinase SSTK; serine/threonine protein kinase SSTK; 1.68
420710; NM_007009; Hs.99875; zona pellucida binding protein; zona pellucida binding protein; 1.54

TABLE 57B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession
417886 1031334_1 AA210987 D57294 AA214584 AA207006 D56572

432407 MH1429J2 BG036675 BF772005 BF771866 BG960385 BG960381 NM_005712 AF110315 BE074534 BE182776 BE158000 BE157999 BE714315 AW818104 AW847519 AA099426 AW817981 AW856395 BG961122 AA224498 AA308542 AW821833 BF302155 AI732411 BG778834 BG283641 BE748279 BE748870 BG319540 BE748864 BF739224 BG986155 AK057283 BI861465 AA653341 AA457591 BG949294 AW392886 AA071122 AA227849 AA584918 BG959570 BF773486 AL041698 BF959013 R87170 C16859 BF770411 BF771298 AI075321 L13823 AA216700 BF771864 AW861859 BE537068 C18935 AA155719 BF771172 BF769107 BF804964 AW818172 AW818143 AW392930 AW817057 AW858044 BF746211 AA179928 AW861687 AW821826 BI055726 BF242643 AA207189 BF770412 BF771157 BG430030 AA055592

434414 35978_1 AF134164 BF809407 AA218567 BF842863 AI267168 BF876178 BG999253 AW851851 AW858362 AI817548 BF771300 AA113928 AA223422 AA055556 BF773400 BF998859 BE081333 BE073424 BE142245 H59571 H59570 BF871558 BF871064 BE001132 BF826831 AW754298 AA223267 BG997895 BG997897 AW991957 AA534354 BG319501 BF736309 AI694265 AA045564 BG950256 AI829309 BG987850 BE093175 BF854337

427298 115241_1 AA933717 BF061897 AW628327 AA641788 AA400495

427521 513212_1 AW973352 BF222929 AW016853 BF059130 AI651829 BE551767 AA558414 AI339359 BF059601 AI961162 AI341422 AI206248 AI206165 AA548736 AA768578 AI539081 AW025957 AA736837 N79575 AW594357 AA480892
407347 810943_1 T23514 AK55785

430439 6750_2 AL133561 AL117481 AL122069 AW439292 AI968826 AL041090
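
As an illustration of how rows of Table 57B might be read programmatically, the following is a minimal sketch. It assumes each record sits on one whitespace-separated line with the Pkey first, the CAT number second, and the Genbank accessions after that. The names `ClusterRow` and `parse_cluster_row` are hypothetical and are not part of the original disclosure.

```python
from typing import Dict, List, NamedTuple


class ClusterRow(NamedTuple):
    """One Table 57B record (hypothetical container for illustration)."""
    pkey: str
    cat_number: str
    accessions: List[str]


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into Pkey, CAT number, accessions."""
    tokens = line.split()
    if len(tokens) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return ClusterRow(tokens[0], tokens[1], tokens[2:])


# Example using the last row of Table 57B (assumes OCR spacing is cleaned up).
row = parse_cluster_row(
    "430439 6750_2 AL133561 AL117481 AL122069 AW439292 AI968826 AL041090"
)

# Index by Pkey so a probeset in Table 57A can be mapped to its cluster members.
by_pkey: Dict[str, ClusterRow] = {row.pkey: row}
print(by_pkey["430439"].accessions)
```

Keeping the Pkey as a string avoids silently altering identifiers that were damaged during text extraction.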



TABLE 57C 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref      Strand  Nt_position
402199  8576116  Minus   84187-84744
402680  6113438  Plus    137634-137768,139702-139893,140475-14059
402260  3399665  Minus   113765-113910,115653-115765,115808-11694
402678  8113438  Plus    37395-37514,37866-37981
403171  9838164  Minus   74502-74703
406137  9166422  Minus   30487-31058
401704  3097841  Plus    24712-25374
402677  8113438  Plus    22135-22309,23063-23238
402679  8113438  Plus    132079-132216
402145  8018280  Plus    113086-114800
406547  7711513  Minus   172780-174358
402398  4092817  Minus   24019-24973
405484  5922025  Plus    199214-199579,199672-199920,200262-20049
401083  3242744  Plus    33192-33360
404976  3419864  Minus   139625-140632
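
The Nt_position column of Table 57C packs one or more predicted exons into a comma-separated list of start-end pairs. The sketch below shows one way such a field could be split into numeric ranges. It assumes inclusive coordinates, which the table key does not state, and the function names are hypothetical.

```python
from typing import List, Tuple


def parse_nt_positions(field: str) -> List[Tuple[int, int]]:
    """Split a Nt_position field into (start, end) exon ranges."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming both coordinates are inclusive."""
    return sum(end - start + 1 for start, end in exons)


# Example using the Nt_position value listed for Pkey 402678.
exons = parse_nt_positions("37395-37514,37866-37981")
print(exons)
print(total_exon_length(exons))
```

Some entries in the table appear to be cut off at the column width (for example the final range listed for Pkey 402680), so any range whose end is smaller than its start should be treated as truncated rather than used as is.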



TABLE 58A: 434 genes upregulated in bladder cancer relative to normal body tissues

Table 58A lists about 434 genes upregulated in bladder cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules, peptides,
or antibodies. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis
was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains
indicative of having oncogenic function, of transducing intracellular signals, or of being modulatable by small molecules, peptides, or antibodies (e.g. pkinase, death-domain, 7tm,
phosphatase, or ion_transporter). Certain predicted protein domains are noted.

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar accession number, GenBank accession number 

UniGenelD: UniGene number 

Pred.ProLDomains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS, signal sequence; =Y, very likely to contain; =M, likely to contain; olher 

protein domain abbreviations are from PFAM (Nucleic Acids Research. 2002. 30^276-280). 
UniGene Title: UniGene gene title 

R1: 90th percentile of bladder tumor AIs divided by the 50th percentile of normal tissue AIs
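
As an illustration of the legend above, the following Python sketch computes a ratio of the 90th percentile of tumor average-intensity values to the 50th percentile of normal-tissue values, and splits one semicolon-delimited table row into its fields. It is a sketch under stated assumptions, not the method used to build the table: the source does not say how percentiles were interpolated (linear interpolation is assumed here), and the names `percentile`, `r1_ratio`, and `parse_row` are hypothetical.

```python
from typing import Dict, List, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile with linear interpolation (the method is an assumption)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def r1_ratio(tumor_ai: Sequence[float], normal_ai: Sequence[float]) -> float:
    """90th percentile of tumor values over the median of normal values."""
    denominator = percentile(normal_ai, 50)
    if denominator <= 0:
        raise ValueError("median of normal values must be positive")
    return percentile(tumor_ai, 90) / denominator


def parse_row(line: str) -> Dict[str, object]:
    """Split one table row; the last field is taken to be R1."""
    fields = [field.strip() for field in line.split(";")]
    if len(fields) < 6:
        raise ValueError("expected at least six semicolon-separated fields")
    pkey, exaccn, unigene_id, title = fields[:4]
    domains: List[str] = []
    flags: Dict[str, str] = {}
    for item in fields[4:-1]:
        if not item:
            continue
        if item.startswith("TM=") or item.startswith("SS="):
            key, value = item.split("=", 1)
            flags[key] = value
        else:
            # Domain names are comma-separated; 'none' marks no prediction.
            for name in item.split(","):
                name = name.strip()
                if name and name != "none":
                    domains.append(name)
    return {
        "pkey": pkey,
        "exaccn": exaccn,
        "unigene_id": unigene_id,
        "title": title,
        "domains": domains,
        "flags": flags,
        "r1": float(fields[-1]),
    }


row = parse_row(
    "430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin;TM=M;; 35.25"
)
print(row["pkey"], row["domains"], row["flags"], row["r1"])
```

Rows whose gene title itself contains a semicolon would need extra handling; this sketch does not attempt it.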

Pkey; ExAccn; UniGeneID; UniGene Title; Pred.Prot_Domains; R1

430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin;TM=M;; 35.25 

422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 33.25

414555; N98569; Hs.76422; phospholipase A2, group IIA (platelets, ; phoslip;TM=M;SS=Y; 31.68

415192; D17793; Hs.78183; aldo-keto reductase family 1, member C3 ; aldo_ket_red;TM=M;; 31.04

417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 28.50

439180; AI393742; Hs.199067; v-erb-b2 avian erythroblastic leukemia v; Furin-like,pkinase,Recep_L_domain,Furin-like,pkinase,Recep_L_domain,Peptidase_M24; 27.43

417079; U65590; Hs.81134; interleukin 1 receptor antagonist; IL1;SS=M; 25.98

413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 25.33

418818; AA228899; Hs.101307; Homo sapiens HUT11 protein mRNA, partial; UT,none; 25.28

425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c;SS=M; 23.58

421733; AL119671; Hs.1420; fibroblast growth factor receptor 3 (ach; ig,pkinase;TM=Y;SS=M; 21.24

424008; R02740; Hs.137555; putative chemokine receptor; GTP-binding; 7tm_1;TM=Y;SS=M; 20.45

447343; AA256641; Hs.236894; ESTs, Highly similar to S02392 alpha-2-m; none,none; 19.78

408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;; 18.90

427490; Z95152; Hs.178695; mitogen-activated protein kinase 13; pkinase;TM=M;; 18.75

410687; U24389; Hs.65436; lysyl oxidase-like 1; Lysyl_oxidase;SS=M; 18.63

444381; BE387335; Hs.283713; ESTs, Weakly similar to S64054 hypotheti; Collagen;TM=M;SS=M; 18.60

452747; BE153855; Hs.61460; Ig superfamily receptor LNIR; ig,Rhabd_glycop;TM=Y;SS=M; 18.55

415444; BE247295; Hs.78452; solute carrier family 20 (phosphate tran; PHO4,LIM;TM=M;; 18.25
413132; NM_006823; Hs.75209; protein kinase (cAMP-dependent, catalyti; PKI;SS=M; 17.73

427337; Z45223; Hs.176653; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 17.68

450746; D82673; Hs.278589; general transcription factor II, i; none,SH3,PX; 17.12

418945; BE246762; Hs.89499; arachidonate lipoxygenase; lipoxygenase,PLAT;TM=M;; 16.88

420981; L40904; Hs.100724; peroxisome proliferative activated recep; hormone_rec,zf-C4;TM=M;; 16.78

439941; AI392640; Hs.18272; amino acid transporter system A1; Aa_trans;TM=Y;; 16.75

431846; BE019924; Hs.271580; uroplakin 1B; transmembrane4;TM=Y;SS=M; 16.56

424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lys.ig,FAD_SyntraJh,ldti^Cpkinase;SS=M; 16.43 

414883; AA926960; ; CDC28 protein kinase 1; CKS;; 16.20 

438091; AW373062; ; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 15.80

439963; AW247529; Hs.6793; platelet-activating factor acetylhydrola; PAF-AHJb,UpaseJ30SL;TM=M;; 15.70

428450; NM_014791; Hs.184339; KIAA0175 gene product; KA1,pkinase;TM=M;; 15.63

422278; AF072873; Hs.114218; frizzled (Drosophila) homolog 6; Fz,Frizzled,7tm_2;TM=Y;SS=M; 15.45

434293; NM_004445; Hs.3796; EphB6; EPH_lbd,fn3,pkinase,SAM;TM=Y;SS=M; 15.43

417880; BE241595; Hs.82848; selectin L (lymphocyte adhesion molecule; EGF,lectin_c,sushi;TM=M;SS=M; 15.28

443991; NM_002250; Hs.10082; potassium intermediate/small conductance; CaMBD,SK_channel,ion_trans;TM=Y;SS=M; 15.10

416305; AU076628; Hs.79187; coxsackie virus and adenovirus receptor; ig;TM=Y;SS=M; 14.90

432306; Y18207; Hs.303090; protein phosphatase 1, regulatory (inhib; CBM_21;TM=M;; 14.80

413076; U10564; Hs.75188; wee1 (S. pombe) homolog; pkinase;TM=M;; 14.73

429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAM; 14.58 

449230; BE613348; Hs.211579; melanoma cell adhesion molecule; ig;eodh.Ribosomal_L6.F-box;TM=Y;SS=M; 14.55
421508; NM_004833; Hs.105115; absent in melanoma 2; PAAD_DAPIN,HIN;TM=M;; 14.53
446005; NM_004403; Hs.13530; deafness, autosomal dominant 5; none;TM=M;SS=M; 14.35
429555; AW139399; Hs.98988; ESTs; none;TM=M;; 14.18

416224; NM_002902; Hs.79088; reticulocalbin 2, EF-hand calcium bindin; efhand;SS=M; 14.13

429573; AA884407; Hs.211595; protein tyrosine phosphatase, non-recept; Y_phosphatase,Band_41,PDZ;SS=M; 13.90

426657; NM_015865; Hs.171731; solute carrier family 14 (urea transport; UT;TM=Y;; 13.83

428157; AI738719; Hs.198427; hexokinase 2; hexokinase,hexokinase2,none; 13.80

400843; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 13.78

439453; BE264974; Hs.6566; thyroid hormone receptor interactor 13; AAA,ABC_tran,CoaE;TM=M;; 13.38

432314; AA533447; Hs.312989; ESTs; Xlink,none; 13.25

413109; AW389845; Hs.110855; ESTs; PHO4,none; 13.15

424490; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pkinase;TM=M;; 13.13 

426490; NM_001621; Hs.170087; aryl hydrocarbon receptor; PAC,PAS;TM=M;; 1193

426158; NM_001982; Hs.199067; v-erb-b2 avian erythroblastic leukemia v; Furin-like,pkinase,Recep_L_domain,Furin-like,pkinase,Recep_L_domain,Peptidase_M24; 12.43
440249; AI246590; Hs.249175; ESTs; TatD_DNase,pkinase,death,none; 12.38
425177; AF127577; Hs.155017; nuclear receptor interacting protein 1; none;SS=M; 12.38
404942; U30825; ; splicing factor, arginine/serine-rich 9; CD36;TM=Y;SS=M; 12.03

439569; AW602166; Hs.222399; CEGP1 protein; EGF,TNFR_c6,granulin,CUB,Keratin_B2,TIL;TM=M;SS=M; 11.93
443195; BE148235; Hs.193063; Homo sapiens cDNA FLJ14201 fis, clone NT; Aa_trans,none; 11.88

408000; L11690; Hs.198689; bullous pemphigoid antigen 1 (230/240kD); emand 1 spectiin.GAS2,SH3 < Plectin l RA^ylose isom.niO,bZlP,Tropomyosln,Myr> 
L2.MJdh_C.CH^lP3;TM=M;; 11.88 ~ ^ 

412182; AA205588; Hs.155160; Splicing factor, arginine/serine-rich, 4; rrm,hormone_rec,zf-C4,sugar_tr; 11.85

433470; AW960564; ; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 11.80

425118; AU076611; Hs.154672; methylene tetrahydrofolate dehydrogenase; myb_DNA-binding,THF_DHG_CYH,THF_DHG_CYH_C,CAP_GLY,AAA,LON,PepW cotransp;TM=M;; 11.69

418870; AF147204; Hs.89414; chemokine (C-X-C motif), receptor 4 (fus; 7tm_1,7tm_2;TM=Y;SS=M; 11.50

426761; AI015709; Hs.172089; Homo sapiens mRNA; cDNA DKFZp586l2022 (f; none;TM=Y;SS=M; 11.48

439750; AL359053; Hs.57664; Homo sapiens mRNA full length insert cDN; IMPDH_C,IMPDH_N,CBS,integrin_B,Ricin_B_lectin; 11.38

446742; AA232119; Hs.16085; putative G protein-coupled receptor; none;TM=Y;SS=M; 11.23

436729; BE621807; ; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 11.18

409960; BE261944; Hs.339673; hexokinase 1; none,none; 11.02

426539; AB011155; Hs.170290; discs, large (Drosophila) homolog 5; SH3,PDZ,Guanylate_kin;TM=M;; 10.78

417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofilin_ADF;SS=M; 10.63

427654; AA410183; Hs.137475; ESTs; ion_trans,vwc,IGFBP,tsp_1; 10.58

418004; U37519; Hs.87539; aldehyde dehydrogenase 3 family, member ; aldedh;TM=M;SS=M; 10.53

447355; BE383676; Hs.334; Rho guanine nucleotide exchange factor (; SH3,PH,RhoGEF;TM=M;; 10.53

449437; AI702038; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone K; none,none; 10.52

436856; AI469355; Hs.127310; ESTs; pkinase,rrm;TM=M;; 10.48

451035; AU076785; Hs.430; plastin 1 (I isoform); efhand.CH^dapun_N;SS=M; 10.38

418299; AA279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), ty; integrin_B,EGF,PSI;TM=Y;SS=M; 10.35

431890; X17033; Hs.271986; integrin, alpha 2 (CD49B, alpha 2 subuni; vwa,integrin_A,FG-GAP;TM=Y;SS=M; 10.34

440006; AK000517; Hs.6844; hypothetical protein FLJ20510; AAA,NB-ARC,PAAD_DAPIN;NA;NA; 10.25

420344; BE463721; Hs.97101; putative G protein-coupled receptor; Methyltransf_5;TM=Y;SS=M; 10.18

437852; BE001836; Hs.256897; ESTs, Weakly similar to dJ36501Z1 (Ksa; GPS,7tm_2;TM=Y;; 10.13

400752; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3,ldl_recept_a,ldl_recept_b; 10.08

427700; AA262294; Hs.180383; dual specificity phosphatase 6; Rhodanese,DSPc;TM=M;; 10.05 

426728; NM_007118; Hs.171957; triple functional domain (PTPRF interact; SH3,ig,pkinase,PH,spectrin,RhoGEF;TM=M;; 10.05

400496; ; ; ENSP0Q400224716*:GTP-binding protein SAR; none;TM=Y;; 10.01

413899; AF083892; Hs.75608; tight junction protein 2 (zona occludens; SH3,PDZ,Guanylate_kin;TM=M;; 10.00

404568; ; ; NM_02207r:Homo sapiens hypothetical pro; SH2;TM=M;; 10.00 

444823; BE262989; Hs.12045; putative protein; Mra1,MBOAT;TM=M;SS=Y; 9.93 

421425; AK001564; Hs.104222; hypothetical protein FLJ10702; efhand,kazal,arf,ras,7tm_1;TM=M;; 9.90

413441; AI929374; Hs.75367; Src-like-adapter; SH2,SH3;TM=M;; 9.90

424954; NM_000546; Hs.1846; tumor protein p53 (Li-Fraumeni syndrome); P53,WD40,IRK;TM=M;; 9.88
452289; BE566205; Hs.28827; mitogen-activated protein kinase kinase ; pkinase;TM=M;; 9.65
439223; AW238299; Hs.250618; UL16 binding protein 2; ldl_recept_a,PKD,MHC_I;TM=M;SS=Y; 9.83
429238; NM_002849; Hs.198288; protein tyrosine phosphatase, receptor t; Y_phosphatase;TM=Y;SS=M; 9.80
414135; NM_004419; Hs.2128; dual specificity phosphatase 5; Rhodanese,DSPc,Y_phosphatase;TM=M;; 9.73
452239; AW379378; Hs.170121; protein tyrosine phosphatase, receptor t; none,none; 9.73
418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK; Y_phosphatase,DSPc;TM=M;; 9.72
403912; ; ; C5000394*:gi|12737280|ref|XP_006682.2| k; none;TM=M;; 9.70

427315; AA179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp564N0763 (f; none,spectrin,SH3.PH,CH; 9.70 
428428; AL037544; Hs.184298; cyclin-dependent kinase 7 (homolog of Xe; pkinase;TM=M;; 9.68

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; kinesin,fn3,Y_phosphatase;TM=M;; 9.63

429083; Y09397; Hs.227817; BCL2-related protein A1; Bcl-2;TM=M;; 9.63

425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic; PI3_PI4_kinase,FAT,FATC;TM=M;; 9.55

430259; BE550182; Hs.127826; RalGEF-like protein 3, mouse homolog; fn3,RA,RasGEF;TM=M;SS=M; 9.50

428520; AA331901; Hs.184736; hypothetical protein FLJ10097; none;TM=M;; 9.50

418969; W33191; Hs.28907; hypothetical protein FLJ20258; SH3;TM=M;; 9.50

448913; AA1 94422; Hs.22564; myosin VI; rnn.zf-ftanBP.pkinase.GSTJXEts.SAMJ'rtf .ABC^ 

C2H2,PHD.BTBjniSAT_hook,SAM;TM=M;; 9.50 
414911; NM_000107; Hs.77602; damage-specific DNA binding protein 2 (4; WD40,homeobox,LIM;TM=M;; 9.48
451295; AI557212; Hs.17132; ESTs, Moderately similar to I54374 gene ; pkinase,DAG_PE-bind,pkinase_C,OPR,none; 9.45
402328; ; ; Target Exon; pkinase;TM=M;; 9.44

443710; AI928136; Hs.9691; Homo sapiens cDNA: FLJ23249 fis, clone C; G-alpha,none; 9.42
414987; AA524394; Hs.294022; hypothetical protein FLJ14950; SH2;TM=M;; 9.42

434375; BE277910; Hs.3833; 3'-phosphoadenosine 5'-phosphosulfate sy; AFSJtee.ATp.sulfurytase.PRKJhymi^ 9.40
418827; BE327311; Hs.47166; HT021; none;TM=M;; 9.40

440675; AW005054; Hs.47883; ESTs, Weakly similar to KCC1_HUMAN CALCI; pkinase,none; 9.35
433376; AI249361; Hs.74122; caspase 4, apoptosis-related cysteine pr; CARD,ICE_p10,ICE_p20;SS=M; 9.28
410668; BE379794; Hs.65403; hypothetical protein; death,TNFR_c6;TM=Y;SS=M; 9.25
430024; AI808780; Hs.227730; integrin, alpha 6; integrin_A,FG-GAP;TM=Y;SS=M; 9.23
452696; AI826645; Hs.211534; ESTs; ArfGap,PH,ank,Guanylate_kin,PDZ,SH3; 9.13
434263; N34895; Hs.44648; ESTs; ig,none; 9.13

407949; W21874; Hs^47057; ESTs, Weakly similar to 2109260A B cell ; Ribosomal_S14,ank,pkinase,death,none; 9.10

429332; AF030403; Hs.199263; Ste-20 related kinase; pkinase,metalthio;TM=M;SS=M; 9.08

417426; NM_002291; Hs.82124; laminin, beta 1; laminin_EGF,laminin_Nterm,integrin_B;SS=M; 9.08

443951; F13272; Hs.111334; ferritin, light polypeptide; PMP22_Claudin,none; 9.07

414368; W70171; Hs.75939; uridine monophosphate kinase; PRK,CoaE;; 8.98

451292; AB037716; Hs.26204; KIAA1295 protein; SH3;TM=M;; 8.93

438000; AI825880; Hs.5985; non-kinase Cdc42 effector protein SPEC2; none;TM=M;; 8.90

446620; AA128808; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 8.90

436075; BE090176; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 8.88

437056; AI147061; ; gb:ok33a11.s1 Soares_NSF_F8_9W_OT_PA_P_S; none,spectrin,SH3,PH,CH; 8.78

445496; AB007860; Hs.12802; development and differentiation enhancin; SH3,ank,PH,ArfGap;TM=M;; 8.78

418203; X54942; Hs.83758; CDC28 protein kinase 2; CKS;; 8.75

434608; AA805443; Hs.179909; hypothetical protein FLJ22995; none;TM=M;; 8.70

445033; AV652402; Hs.72901; mucin 13, epithelial transmembrane; ank;; 8.68

417640; D30857; Hs.82353; protein C receptor, endothelial (EPCR); none;TM=M;SS=M; 8.65

432841; M93425; Hs.62; protein tyrosine phosphatase, non-recept; Y_phosphatase;SS=M; 8.65

430397; AI924533; Hs.105607; bicarbonate transporter related protein ; HCO3_cotransp;TM=Y;; 8.64

448888; AW196663; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;; 8.60

442994; AI026718; Hs.16954; ESTs; ank,pkinase,death,Ribosomal_S14; 8.60

429109; AL008637; Hs.196352; neutrophil cytosolic factor 4 (40kD); SH3,OPR,PX;TM=M;; 8.60

455439; AW945484; Hs.184252; ESTs, Weakly similar to ALU8_HUMAN ALU S; none,7tm_1; 8.55

437763; AA469369; Hs.5831; tissue inhibitor of metalloproteinase 1 ; TIMP,pkinase,DAG_PE-bind,RBD; 8.43

417035; AA192455; Hs.22968; Homo sapiens clone IMAGE:451939, mRNA se; none,none; 8.40

418478; U38945; Hs.1174; cyclin-dependent kinase inhibitor 2A (me; ank;; 8.39

448209; AW160489; Hs.20709; tetraspan 5; transmembrane4;TM=Y;SS=M; 8.33

450139; AK001838; Hs.296323; serum/glucocorticoid regulated kinase; none,none; 8.33

445350; AF052112; Hs.12540; lysophospholipase I; abhydrolase_2;TM=M;; 8.31

427509; M62505; Hs.2161; complement component 5 receptor 1 (C5a I; 7tm_1;TM=Y;SS=M; 8.30

445633; AI453386; Hs.17287; ESTs, Weakly similar to S26689 hypotheti; IRK,none; 8.28

446719; W39500; Hs.301872; hypothetical protein MGC4840; AAA,SKI;TM=M;; 8.23

424762; AL119442; Hs.183684; eukaryotic translation initiation factor; none,none; 8.20

427274; NM_005211; Hs.174142; colony stimulating factor 1 receptor, fo; ig,pkinase;TM=Y;SS=M; 8.18

416094; AW995512; Hs.225977; nuclear receptor coactivator 3; none,none; 8.15

419262; AA834654; Hs.29131; nuclear receptor coactivator 2; PAS,zf-C2H2,SET; 8.15

417386; AL037228; Hs.82043; D123 gene product; NUOIX^Y,E1_derrydrog,tramkeLpyr;TM=Y;SS=M; 8.13 

431236; AV656840; Hs.285115; interleukin 13 receptor, alpha 1; fn3;TM=Y;SS=M; 8.10

439658; AA332057; Hs.6639; hypothetical protein MGC15440; none;TM=M;SS=M; 8.09

425424; NM_004954; Hs.157199; ELKL motif kinase; pkinase,UBA,KA1;TM=M;; 8.08

422573; AW297985; Hs.295726; integrin, alpha V (vitronectin receptor; FG-GAP,integrin_A,none; 8.05

408912; AB011084; Hs.48924; KIAA0512 gene product ALEX2; Armadillo_seg;TM=M;SS=M; 8.03

447674; BE270540; Hs.19192; cyclin-dependent kinase 2; pkinase;SS=M; 8.03

404891; ; ; Target Exon; none,none; 7.95

414278; AA330116; Hs.77273; Human glucose transporter pseudogene; none,none; 7.93 

452888; AW955454; Hs.30942; ephrin-B2; Ephrin,fn2;TM=Y;SS=M; 7.93 

433211; H11850; Hs.l2B08; MARK; pkinase,UBA,KA1;SS=M; 7.91

438485; W57578; Hs^37955; RAB7, member RAS oncogene family; pkinase,ABC1,none; 7.90 

413219; AAB78200; Hs.1 18727; Homo sapiens cDNA FU13592 fis, done PU HLH,death,TNFR„c6^W^hydro; 7.90 

417381; AF164142; Hs.82042; solute carrier family 23 (nucleobase tra; xan_ur_permease,RA; 7.88

427122; AW057736; Hs.323910; HER2 receptor tyrosine kinase (c-erb-b2,; Furin-like,pkinase,Recep_L_domain,YLP;TM=Y;SS=M; 7.85
405036; ; ; NM_021628*:Homo sapiens arachidonate lip; lipoxygenase,complex1_49Kd,PLAT;TM=M;; 7.83

418529; AW005695; Hs.250897; TRK-fused gene; Band_41 ,ERM,pldnase,LRRlfWCT.N!AM.rto 7.t 

431912; AI660552; Hs.76549; ESTs, Weakly similar to A56154 AW subst; noneA^W^dhA^-CoA^dh.M^CoA_dh Jl; 7.80 

432981; NM_002733; Hs.3136; protein kinase, AMP-activated, gamma 1 n; CBS,Aa_trans;TM=M;; 7.78

422940; BE077458; ; gb:RC1.BT0606490500-015-b04 BT0606 Homo; Sec7,PrUVNFjeceptor,Iig_chan 1 WD40 1 lRK; 7.78 

446636; AC002563; Hs.15767; citron (rho-interacting, serine/threonin; CNH,DAG_PE-bind,PH,Involucrin,M;TM=M;; 7.78

431183; NM_006855; Hs.250696; KDEL (Lys-Asp-Glu-Leu) endoplasmic retic; ER_lumen_recept;TM=M;SS=M; 7.78

400845; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 7.73

409142; AL136877; Hs.50758; SMC4 (structural maintenance of chromoso; ABC_tran,M,SMC_N,SMC_C,DUF164,none; 7.73

437192; AW975786; Hs.75355; ubiquitin-conjugating enzyme E2N (homolo; UQ_con,Y_phosphatase,SH2; 7.70

403212; ; ; NM_019595:Homo sapiens intersectin 2 (IT; SH3,efhand,C2,PH,RhoGEF;TM=M;; 7.70

441190; H09073; Hs.25046; ESTs; E1-E2_ATPase.C^liorU\TPase_C,Cati()n>TPase.N,Hydrolas 7.68 
409745; AA077391; ; gb:7B14E12 Chromosome 7 Fetal Brain cDNA; 7tm_1,zf-C3HC4,fn3,SPRY,KRAB,zf-C2H2,rve,zf-B_box;TM=Y;SS=M; 7.68
428840; M15990; Hs.194148; v-yes-1 Yamaguchi sarcoma viral oncogene; SH2,SH3,pkinase;SS=M; 7.65
447898; AW969538; Hs.112318; 6.2 kd protein; none,none; 7.65

425289; AW139342; Hs.155530; interferon, gamma-inducible protein 16; PAAD_DAPIN,HIN;SS=M; 7.63
401927; ; ; C 1 700091 4*:gi|8394367|ref|NPJ)58549.1| s; none- 7.60 

407347; AA829847; ; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens; RhoGAP,SH2,pkinase,POLO_box,none; 7.58
449924; W30681; Hs.146233; Homo sapiens cDNA: FLJ22130 fis, clone H; SH3,none; 7.57

429952; AF080158; Hs.226573; inhibitor of kappa light polypeplide gen; pWnase,ubiquith,&iterotoxinJV,PH04,pkinase ubiquitin- 7 55 

445800; AA126419; Hs.32944; inositol polyphosphate-4-phosphatase, ty; none,none; 7.55

421489; AI922821; Hs.32433; ESTs; none,PI-PLC-X,PI-PLC-Y,C2; 7.53

430486; BE062109; Hs.241551; chloride channel, calcium activated, fam; none;TM=Y;SS=M; 7.53

431605; AW972407; Hs.124370; gb:EST384498 MAGE resequences, MAGI Homo; adenylatekinase,SRP54;TM=M;; 7.50

430570; AM17881; Hs.292464; ESTs; 7tm_2,Fz,Frizzled,none; 7.50

431441; U81961; Hs.2794; sodium channel, nonvoltage-gated 1 alpha; ASC;TM=Y;; 7.48

420676; AI434780; Hs.4248; vav 2 oncogene; RhoGEF,PH,CH,SH2,SH3,DAG_PE-bind,none; 7.48

444252; R21135; Hs.54985; ESTs; none,none; 7.47

414914; U49844; Hs.77613; ataxia telangiectasia and Rad3 related; FAT,FATC,PI3_PI4_kinase;TM=M;; 7.47
443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=H; 7.45
418546; AA224827; ; gb:nc32g04.s1 NCI_CGAP_Pr2 Homo sapiens ; vwa,integrin_A,FG-GAP,none; 7.45
437860; AA333063; Hs.279898; Homo sapiens cDNA: FLJ23165 fis, clone L; none;NA;NA; 7.43
452007; AA426234; Hs.34906; ESTs, Weakly similar to T17210 hypotheti; none,pkinase; 7.40

432407; AA221036; ; gb:zrC3f12.r1 Stratagene NT2 neuronal pr; DEAD,helicase_C,rrm,Ndr,Cys_knot,TIL,vwa,vwc,vwd,IQ,RIIa,abhydrolase,TGF-beta,DUF139,TPR,DSPc,tsp_1,Ribosomal_S21,rvp;TM=M;; 7.40

fl^ 2 3 "?; 153924: death-assodaled protein kinase 1; ank,pkinase,death,SPRY,SAP,Rirjo^^ 7.40 
421429; NM_014922; Hs.104305; death effector filament-forming Ced-4-li; LRR,PAAD_DAPIN,AAA,CARD,NB-ARC;NA;NA; 7.38
417141; U22662; Hs.347353; nuclear receptor subfamily 1, group H, m; hormone_rec,zf-C4;SS=M; 7.38
430016; NM_004736; Hs.227656; xenotropic and polytropic retrovirus rec; SPX,EXS;TM=Y;; 7.38
422813; AV656571; Hs.121068; transmembrane 4 superfamily member 6; transmembrane4;TM=Y;SS=M; 7.32
433390; AA586950; Hs.260180; Homo sapiens mRNA; cDNA DKFZp761G18121 (; none,spectrin,SH3,PH,CH; 7.25
409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; SH2,SH3,pkinase;TM=M;; 7.24
444745; AF117754; Hs.11861; thyroid hormone receptor-associated prot; none;TM=M;; 7.23
452817; AA322859; Hs.284275; Homo sapiens PAK2 mRNA, complete cds; pkinase,PBD;TM=M;; 7.19
407591; NM_000910; Hs.37125; neuropeptide Y receptor Y2; 7tm_1;TM=Y;; 7.18
427231; AW851989; Hs.285814; sprouty (Drosophila) homolog 4; SH2,SH3;TM=M;SS=M; 7.18
450285; AW383256; Hs.24752; spectrin SH3 domain binding protein 1; SH3;TM=M;; 7.18
403344; ; ; NM_000341:Homo sapiens solute carrier fa; alpha-amylase;TM=Y;; 7.15
427268; X78520; Hs.174139; chloride channel 3; CBS,voltage_CLC;TM=Y;; 7.14
442213; N36110; Hs.305971; solute carrier family 2 (facilitated glu; sugar_tr;TM=Y;SS=M; 7.14
404875; ; ; NM_022819*:Homo sapiens phospholipase A2; phoslip;SS=M; 7.11
433618; AA602539; Hs.345494; ESTs; G-alpha,A_deaminase; 7.10

417165; R80137; Hs.302738; Homo sapiens cDNA: FLJ21425 fis, clone C; Sulfate_transp,STAS,HMG_box; 7.08

413073; AL038165; Hs.75187; translocase of outer mitochondrial membr; MAS20,zf-A20,VPS9;TM=M;SS=M; 7.05

426655; AL049589; Hs.171723; neuronal cell death-related protein; TFIID-31;TM=M;; 7.05 

423387; AJ012074; ; vasoactive intestinal peptide receptor 1; 7tm_2,HRM,CSD;TM=Y;SS=M; 7.03 

400211; ; ; NM_003899*:Homo sapiens PAK-interacting ; SH3,PH,RhoGEF,Terpene_synth;TM=M;; 7.03

438150; AA037534; Hs.342874; transforming growth factor, beta recepto; zona_pellucida,none; 6.93

431341; AA307211; Hs.251531; proteasome (prosome, macropain) sub-unit,; proteasome;TM=M;; 6.93

405275; AB028989; ; mitogen-activated protein kinase 8 inter; Cys_knot,TGF-beta,vwa,vwc,vwd,TIL,DUF139;SS=M; 6.93

415392; Z44057; Hs.10957; ESTs; PIP5K,none; 6.89

429355; AW973253; Hs.292689; ESTs; pkinase,bZIP,Armadillo_seg,none; 6.88

429663; M68874; Hs.211587; phospholipase A2, group IVA (cytosolic, ; C2,PLA2_B;TM=M;; 6.85

427832; AF038362; Hs.180930; TBP-associated factor 172; SNF2_N,helicase_C,Armadillo_seg,HEAT;TM=M;; 6.83

447887; AA114050; Hs.19949; caspase 8, apoptosis-related cysteine pr; ICE_p10,ICE_p20,DED;TM=M;; 6.80

400158; ; ; ENSP00000244302*:CDNA FLJ11591 fis, clon; Sm;SS=M; 6.78

401917; AL050149; ; RAN binding protein 3; Orexin,SH2,STAT,STAT_bind,STAT_prot,ion_trans,PAC,PAS,none; 6.78

400844; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 6.73

457238; U07358; Hs.211601; mitogen-activated protein kinase kinase ; pkinase;; 6.73

404440; ; ; NM_021048:Homo sapiens melanoma antigen,; MAGE;TM=M;; 6.73

424241; AW995948; Hs.182339; Homo sapiens pyruvate dehydrogenase kina; Ets,SAM_PNT;TM=M;; 6.70

426746; J03626; Hs.2057; uridine monophosphate synthetase (orotat; Pribosyltran,OMPdecase;TM=M;; 6.70

429429; AA829725; Hs.334437; hypothetical protein MGC4248; none,transmembrane4; 6.65

450825; AC005954; Hs.25527; tight junction protein 3 (zona occludens; PDZ,Guanylate_kin;SS=M; 6.64

408638; AW451353; Hs.173328; ESTs; B56,none; 6.63 

408056; AA312329; Hs.42331; ephrin-A4; Ephrin;TM=M;SS=M; 6.61 

401057; BE563196; ; eukaryotic translation elongation factor; ion_trans,IQ;TM=Y;; 6.60

446526; H89616; Hs.296290; Homo sapiens cDNA FLJ13357 fis, clone PL; none,none; 6.60

400528; ; ; NM_020975*:Homo sapiens ret proto-oncoge; cadherin,pkinase;TM=Y;SS=M; 6.58

418562; R60869; Hs.124831; CGI-67 protein; none,Skp1,AAA; 6.57

453826; AL138129; ; gb:DKFZp547F152_r1 547 (synonym: hfbrl) ; PK,PK_C,none; 6.55

437412; BE069288; Hs.34744; Homo sapiens mRNA; cDNA DKFZp547C136 (fr; ABC_tran,GTP_EFTU,ABC_membrane,none; 6.54
415088; AI077288; Hs.296323; serum/glucocorticoid regulated kinase; none,none; 6.50

453489; AA300067; Hs.33032; hypothetical protein DKFZp434N185; F5_F8_type_C,pkinase,Ets,F5_F8_type_C,pkinase,Ets; 6.47

439866; AA280717; Hs.6727; Ras-GTPase activating protein SH3 domain; rrm,NTF2;TM=M;; 6.46

456376; AA663904; Hs.89862; TNFRSF1A-associated via death domain; death;TM=M;; 6.46

441712; AW391927; Hs.7946; KIAA1288 protein; AIP3;TM=M;; 6.44

414557; AA340111; Hs.100009; acyl-Coenzyme A oxidase 1, palmitoyl; pkinase,7tm_1; 6.43

432211; BE274530; Hs.273333; hypothetical protein FLJ10986; FGGY_C;TM=M;; 6.43

432945; AL043683; Hs.8173; hypothetical protein FLJ10803; none;TM=M;SS=M; 6.40

438941; AF075047; Hs.31864; ESTs; Ca_channel_B,SH3,arf,none; 6.40

441466; AW673081; Hs.54828; ESTs; pkinase,zf-C2H2,KRAB,none; 6.33

438598; AW297855; Hs.125815; ESTs, Weakly similar to I38022 hypotheti; lipoxygenase,PLAT,none; 6.33
433255; AI274270; Hs.96840; KIAA1527 protein; MHCK_EF2_kinase;TM=M;SS=M; 6.33
427801; AW979155; Hs.298275; amino acid transporter 2; Aa_trans;TM=M;SS=M; 6.32
407970; AW403814; Hs.41714; BCL2-associated athanogene; ubiquitin,BAG,Tropomyosin;; 6.30
438454; AA659735; Hs.324743; protein phosphatase 4 regulatory subunit; none,none; 6.30
453977; AA886006; Hs.250427; ESTs; pkinase,P2X_receptor,E1-E2_ATPase,Hydrolase; 6.30
412491; W31589; Hs.73957; RAB5A, member RAS oncogene family; ras,arf,PP2C;TM=M;; 6.30

413235; BE243445; Hs.75248; topoisomerase (DNA) II beta (180kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c,DNA_g; 6.29
404342; ; ; C7002192*:gi|7299207|gb|AAF54404.1| (AE0; none;TM=M;; 6.27
409274; NM_003930; Hs.52644; SKAP55 homologue; SH3,PH;SS=M; 6.25

419693; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 6.23
405429; ; ; Target Exon; Y_phosphatase,none; 6.23

404975; AL042279; ; uncharacterized hypothalamus protein HT0; kringle;TM=Y;SS=M; 6.20

452929; AW954938; Hs.172816; neuregulin 1; Neuregulin,EGF,ig,Neuregulin,EGF,ig; 6.18

446883; AW452756; Hs.16364; hypothetical protein FU10955; DEAD,helicasej:,rm*Ndr,CysJcnotJIU^^ 

beta,DUF139;TPR,DSPc.tspJ i rcboso^ 
433233; AB040927; Hs.301804; KIAA1494 protein; SH3,zf-C3HC4;TM=M;; 6.15 
458791; BE615453; Hs.346509; dedicator of cyto-kinesis 1; none;TM=Y;; 6.15 
421836; AF109219; Hs.108787; phosphatidylinositol glycan, class N; none,none; 6.14 
447727; AI421079; ; tumor necrosis factor receptor superfami; none,synaptobrevin; 6.13
401536; ; ; NM_002530*:Homo sapiens neurotrophic tyr; ig,pkinase,LRRNT,LRRCT;TM=M;SS=M; 6.11

444317; AI140566; Hs.143436; ESTs, Weakly similar to PLHU plasmin [H.; PAN,kringle,trypsin,PI-PLC-X,C2,SH2,PH,SH3,PI-PLC-Y,PAN; 6.10
427557; NM_002659; Hs.179657; plasminogen activator, urokinase recepto; UPAR_LY6,ET,PLA2_inh;SS=M; 6.08
450447; AF212223; Hs.25010; hypothetical protein P15-2; NTF2;TM=M;; 6.08
450107; AI873287; Hs.257ei2; ESTs; ICE_p20,DED;TM=M;; 6.05

418175; AW967054; Hs.206312; ESTs, Weakly similar to I38022 hypotheti; zf-C2H2,BTB,K_tetra,Syntaxin,none; 6.05

408983; NM_000492; Hs.663; cystic fibrosis transmembrane conductanc; ABC_tran,ABC_membrane,PRK,Bac_export_3;TM=Y;; 6.05

439738; BE246502; Hs.9598; sema domain, immunoglobulin domain (Ig), ; Sema,PSI,integrin_B;TM=Y;; 6.03

427625; AF008216; Hs.285013; putative human HLA class II associated p; none,none; 6.03

409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; DSPc,Rhodanese,none; 6.03

446650; AB016625; Hs.15813; solute carrier family 22 (organic cation; sugar_tr;TM=Y;SS=M; 6.03

405102; ; ; C15001220*:gi|4469558|gb|AAD21311.1| (AF; DAG_PE-bind,PH,RhoGEF,DC1;SS=M; 6.03

400121; ; ; Eos Control; SH3,PH,RhoGEF,Terpene_synth;TM=M;; 6.03

415327; H22769; ; gb:ym54c02.r1 Soares infant brain 1NIB H; SH3,PDZ,Guanylate_kin;SS=M; 6.03

404148; ; ; NM_002944*:Homo sapiens v-ros avian UR2 ; fn3,pkinase,DUF139;TM=Y;SS=M; 6.03

405531; ; ; Target Exon; PDZ,CARD,Guanylate_kin;TM=M;; 6.00

433363; AA584829; Hs.275163; non-metastatic cells 2, protein (NM23B) ; NDK,none; 6.00

427270; H47921; Hs.174139; chloride channel 3; voltage_CLC,CBS,none; 5.99

423774; 139064; Hs.1702; interleukin 9 receptor; none;TM=M;SS=M; 5.98

424124; AA335609; Hs.7589; ESTs, Weakly similar to A46010 X-linked ; pkinase,TBC; 5.98

411040; AF007393; Hs.177574; protein-kinase, interferon-inducible dou; HLH;TM=M;; 5.95

423422; AC005175; Hs.128425; NY-REN-24 antigen; Sulfotransfer,7tm_1,none; 5.88

453902; BE502341; Hs.3402; ESTs; none,none; 5.88 

409636; AA305729; Hs.18272; amino acid transporter system A1; Aa_trans;TM=Y;; 5.86

436154; AA764950; Hs.119898; ESTs; efhand,DAG_PE-bind,DAGKa,PHD,DAGKc,PSI,none; 5.85

455358; AW902641; ; gb:QV3-NN1024-100500-181-<l08 NN1024 Homo; Sulfatase,Somatomedin_B,Phosphodiest,none; 5.83

414108; AI267592; Hs.75761; SFRS protein kinase 1; ank,PH,Oxysterol_BP,pkinase;TM=M;; 5.82

452547; AA335295; Hs.74120; adipose specific 2; LEA;TM=M;; 5.82

415204; T27434; ; gb:hbc2294 Human pancreatic islet Homo s; Na_Ca_Ex,Calx-beta,none; 5.80

435563; AF210317; Hs.95497; solute carrier family 2 (facilitated glu; sugar_tr;TM=Y;; 5.78

456097; C15702; Hs.288028; ESTs, Moderately similar to I54374 gene ; dsrm,FKBP; 5.78

422445; M23114; Hs.1526; ATPase, Ca transporting, cardiac muscle,; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase;TM=Y;; 5.77

436246; AW450963; Hs.1 19991; ESTs; none,DNA^gyraseB,D^topo^ c; 5.75 

422953; AA488860; Hs.245043; hypothetical protein FLJ14297; ABC_tran,PRK;TM=Y;SS=M; 5.75

425854; AA749190; ; ESTs; RhoGAP,SH2,pkinase,POLO_box,none; 5.74

424160; T74062; ; gb7c81f01.r1 Soares Infant brain 1NIB H; ROK,none; 5.70

417389; BE260964; Hs.82045; midkine (neurite growth-promoting factor; PTN_MK;TM=M;SS=Y; 5.69

437613; R19S92; Hs.10267; MIL1 protein; none,none; 5.68

410820; BE391493; Hs.16475; Human DNA sequence from clone RP5452M4 ; TBC;SS=M; 5.68
423393; R37772; Hs.21420; p21-activated protein kinase 6; pkinase,PBD;TM=M;; 5.66
426500; NM_014638; Hs.170155; KIAA0450 gene product; C2,PI-PLC-Y;TM=M;; 5.63
406930; U04691; ; gb:Human olfactory receptor (OR17-219) g; none;TM=Y;SS=M; 5.60
401044; ; ; Target Exon; none,ICE_p20,ICE_p10,CARD,Peptidase_M1; 5.56

428479; Y00272; Hs.334562; cell division cycle 2, G1 to S and G2 to; pkinase,ICE_p10,ICE_p20;TM=M;SS=M; 5.55
421970; AF227156; Hs.110103; RNA polymerase I transcription factor RR; 

aaj)ermeases f pyridoxaIJeC,brorno(tomain,PHD,MBD^T_h 
426248; T18988; Hs.293668; ESTs; pkinase,none; 5.50 

418426; NM_003804; Hs.296327; receptor (TNFRSF)-interacting serine-thr; pkinase,death;TM=M;; 5.43

417086; AA194446; ; ESTs, WeaHy similar to S55024 nebuttn, ; ank,ck^,ZU5,EGFJmrs^ 5.43 

447437; U07225; Hs.339; purinergic receptor P2Y, G-protein coupl; 7tm_1,SH2;TM=Y;SS=M; 5.40

412247; AF022375; Hs.73793; vascular endothelial growth factor; PDGF;SS=M; 5.40

434938; AW500718; Hs.8115; Homo sapiens, clone MGC:16169, mRNA, com; ptee,TBC,Rhodanese;TM=M;; 5.38
419355; AA428520; Hs.90051; progesterone binding protein; heme_1;TM=Y;SS=M; 5.35
411188; BE161168; ; gb:PM0-HT0425-170100-002-a1O HT0425 Homo; adenylatekinase,none; 5.35
422461; NM_003417; Hs.117077; zinc finger protein 264; zf-C2H2,KRAB,TFIIS;TM=M;; 5.28
426348; BE466586; Hs.17433; hypothetical protein FLJ20967; none,none; 5.25

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; Rhodanese,DSPc,Y_phosphatase,Ribosomal_S3_N;TM=M;; 5.23
429592; AB029041; Hs.209646; KIAA1118 protein; Troponin,Exo_endo_phos,IQ;TM=M;; 5.22

434821; AA159111; Hs.284281; Human putative ribosomal protein S1 mRNA; ER_lumen_recept,Ribosomal_L11,Ribosomal_L11_N;TM=Y;SS=M; 5.21

434368; AW519020; Hs.73893; dopamine receptor D2; pkinase,SH3,none; 5.15

405586; ; ; NM_000299:Homo sapiens plakophilin 1 (ec; Armadillo_seg;TM=M;; 5.13

408176; AK001553; Hs.43436; adenylate kinase 3 alpha like; adenylatekinase,none; 5.12

407443; AF227138; ; gb:Homo sapiens candidate taste receptor; none;TM=Y;SS=M; 5.11

416817; AA398045; Hs.104679; ESTs; Furin-like,pkinase,Recep_L_domain,fn3,none; 5.10

401886; ; ; NM_021783:Homo sapiens XEDAR (XEDAR), mR; TNFR_c6;TM=M;SS=M; 5.08

410314; AW860708; Hs.18851; hypothetical protein FU10875; rnyb.DrMrbirio1ng,PAaBAH.broniodomain.PHD,SET;TM^ 5.08 
401579; AL031447; ; Homo sapiens, clone IMAGE:4053044, mRNA,; Neur_chan_LBD,Neur_chan_memb,none; 5.05

445873; AA250970; Hs.251946; poly(A)-binding protein, cytoplasmic 1-1; PABP,rrm,pkinase,14-3-3; 5.05

417529; AA203634; ; gb:zx58b09.r1 Soares_fetal_liver_spleen_; pkinase,UBA,KA1,none; 5.03

417527; AA203524; ; gb:zx56e10.r1 Soares_fetal_liver_spleen_; SH3;SS=M; 4.98

407722; BE252241; Hs.38041; pyridoxal (pyridoxine, vitamin B6) kinas; pfkB;TM=M;; 4.96

431321; AW1 36372; Hs.1 851 acid phosphatase, prostate; aridj>hospnat,none; 4.93 

404298; ; ; C6001238*:gi|121715lsp|P26697|GTA3_CHlCK; none.GST_C,GST_N,pkinase; 4.85 

407603; AW955705; Hs.62604; Homo sapiens, clone IMAGE:4299322, mRNA,; none;TM=M;; 4.82 

424099; AF071202; Hs.139336; ATP-binding cassette, sub-family C (CFTR; ABCJran.ABCjnembrane;TM=Y;: 4.73 

422366; TB38B2; Hs.97927; ESTs; pkinase.none; 4.64 

424905; NMJW2497; Hs.153704; NIMA (never in mitosis gene a)-related k; pkinase;TM=M;; 4.48 

426925; NlvLOOt 196; Hs.31 5689; Homo sapiens cDNA: FU22373 fis, clone H; Esterase 1 enotase.Pepidase_S9;TM=M;; 4.45 

439606; W79123; Hs.58561; G protein-coupled receptor 87; 7tmJ;TM=Y;SS=M; 4.40 

400749; ; ; NM_003105*:Homo sapiens sortifowelated; EGF,fn3JdljecepLaJdUece^ 4.38 

425721; AC002115; Hs.159309; uroplakin 1A; trarsrnembrane4;TM=Y;SS=M; 4.33 

444006; BE395085; Hs.10086; type I transmembrane protein Fn14; Ml jecepLa,PKD.MHC_l;TM=M;SS=Y; 4.31 

400751; ; ; NNLQ03105*:Homo sapiens sortiHn-related; EGF,fn3.ldljec€pLa.ldi_recepLb.granunn,BNR;TM=Y;SS=M; 4.18 

408908; BE296227; Hs.250822; serineAhreonine kinase 15; pkinase;SS=M; 3.90 

422152; AA909249; Hs.1 12282; solute carrier family 30 (zinc transport; none,none; 3.88 

458760; AI498631; Hs.1 11334; ferritin, Tight polypeptide; cystatin,ferritin,hlstor«,HC03jx)te 3.85 
441218; BE327561; Hs.202345; ESTs; none,WD40,E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase; 3.78 
419073; AW372170; Hs.183918; Homo sapiens cDNA FLJ12797 fis, clone NT; death,ZU5;SS=M; 3.76 

451385; AA017656; ; gb:ze39h01.r1 Soares retina N2b4HR Homo ; Atrapr^1,enoiase^Uophin-1 i Yj)hospr^e.SH2.fibrinogen_C > TIM; 3.60 

412604; AW978324; Hs.1904; protein kinase C, iota; pkinase,DAG_PE-bind,pkinase_C,OPR;TM=M;; 3.30 

409582; R27430; Hs.271565; ESTs; none,Neur_chan_LBD,Neur_chan_memb; 3.28 

441155; AW161008; Hs.7719; GABA(A) receptor-associated protein; MAP1_LC3;SS=M; 3.23 

436740; AW975133; ; gb:EST387239 MAGE resequences, MAGN Homo; none,EPH_lbd,fn3,pkinase,SAM; 3.20 

418319; AW611703; Hs.190173; ESTs, Weakly similar to A46010 X-linked ; none,IRK; 3.20 

409744; AW675258; Hs.56265; Homo sapiens mRNA; cDNA DKFZp586P2321 (f; none;NA;NA; 3.13 

418764; N30531; Hs.42215; protein phosphatase 1, regulatory subuni; none,none; 3.10 

400846; ; ; sortilin-related receptor, L(DLR class) ; EGF,fn3,ldl_recept_a,ldl_recept_b,granulin,BNR;TM=Y;SS=M; 3.09 

422005; BE266556; Hs.110702; Homo sapiens mRNA; cDNA DKFZp761E212 (fr; none,Na_H_Exchanger; 3.03 

426440; BE382756; Hs.169902; solute carrier family 2 (facilitated glu; sugar_tr;TM=Y;SS=M; 3.02 

424187; AA336561; Hs.17287; ESTs, Weakly similar to S26689 hypotheti; IRK,none; 2.98 

425852; AK001504; Hs.159651; death receptor 6, TNF superfamily member; death,TNFR_c6;TM=Y;SS=M; 2.93 

401279; ; ; C13000351*:gi|2494033|sp|Q64398|KDGD_MES; none,none; 2.88 

406671; AA129547; Hs.285754; met proto-oncogene (hepatocyte growth fa; Sema,pkinase,TIG,PSI,none; 2.83 
447081; Y13896; Hs.17287; potassium inwardly-rectifying channel, s; IRK;TM=Y^ 2.80 
453619; H87648; Hs.33922; Homo sapiens, clone MGC:9084, mRNA, comp; pkinase;TM=M;; 2.75 
441699; AW511126; Hs.127572; ESTs; none,Aa_trans; 2.73 

458781; AI444821; Hs.63085; ESTs, Weakly similar to MPP3_HUMAN MAGUK; SH3,PDZ,Guanylate_kin,L27;TM=M;; 2.73 

446913; AA430650; Hs.16529; transmembrane 4 superfamily member (tetr; transmembrane4;TM=Y;SS=M; 2.70 

453487; R31770; Hs.56562; ESTs; 7tm_1,none; 2.68 

421279; AW664878; Hs.106645; ESTs; pkinase,none; 2.68 

419720; AA249131; Hs.337778; hypothetical protein FLJ11068; none,none; 2.65 

452345; AA293279; Hs.29173; hypothetical protein FLJ20515; DSPc;TM=M;; 2.63 

422247; U18244; Hs.113602; solute carrier family 1 (high affinity a; SDF;TM=Y;; 2.62 

425212; AW962253; Hs.171618; ESTs; pkinase,none; 2.60 

427344; NM_000869; Hs.2141; 5-hydroxytryptamine (serotonin) receptor; Neur_chan_LBD,Neur_chan_memb;TM=Y;SS=M; 2.58 

423629; AW021173; Hs.18611; Homo sapiens cDNA: FLJ21909 fis, clone H; voltage_CLC,CBS,none; 2.55 

456737; BE247203; Hs.124831; CGI-67 protein; abhydrolase_2;TM=M;SS=M; 2.53 

421585; U95626; Hs.302043; chemokine (C-C motif) receptor-like 2; 7tm_1;TM=Y;SS=M; 2.53 

424028; AF055084; Hs.153692; Homo sapiens cDNA FLJ14354 fis, clone Y7; none,none; 2.50 

448324; AI571356; Hs.34174; ESTs, Moderately similar to ALU8_HUMAN A; ICE_p20,CARD,ICE_p10,none; 2.50 

402256; ; ; Target Exon; pkinase,UBA,none; 2.43 

452256; AK000933; Hs.28661; Homo sapiens cDNA FLJ10071 fis, clone HE; GDI,7tm_1,none; 2.40 

419169; AW851980; Hs.262346; ESTs, Weakly similar to S72482 hypotheti; none,spectrin,SH3,PH,CH; 2.40 

420634; S42457; Hs.1323; cyclic nucleotide gated channel alpha 1; cNMP_binding,ion_trans;TM=Y;; 2.35 

419630; W57756; ; gb:zd20g10.r1 Soares_fetal_heart_NbHH19W; zf-C3HC4,none; 2.35 

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mR; 7tm_1;TM=Y;SS=M; 2.35 

400704; ; ; Target Exon; lig_chan,SBP_bac_3,ANF_receptor;TM=Y;SS=M; 2.33 

400149; ; ; Eos Control; acid_phosphat;TM=Y;SS=M; 2.30 

459327; AW149706; Hs.7859; gb:xf41d01.x1 NCI_CGAP_Brn50 Homo sapien; PHD,PWWP,SET,pkinase,ig; 2.30 
452220; BE158006; Hs.212296; ESTs; inlegrinJ^FG-GAP,none; 2.25 
416690; H84078; Hs.108551; ESTs; pkinase,none; 2.23 
408354; AI382803; Hs.159235; ESTs; none,none; 2.23 

452203; X57522; ; transporter 1, ATP-binding cassette, sub; ABC_tran,ABC_membrane,SRP54,Thymidylate_kin;TM=Y;SS=^ 2.21 

405093; ; ; C12001101*:gi|7522643|pir||T32733 AMPA g; none,none; 2.20 

412723; AA648459; Hs.335951; hypothetical protein AF301222; none;TM=M;; 2.20 

418738; AW388633; Hs.6682; solute carrier family 7, (cationic amino; none,none; 2.18 

417185; NM_002484; Hs.81469; nucleotide binding protein 1 (E.coli Min; ParA,fer4_NifH,ArsA_ATPase;TM=M;; 2.18 

433222; AW514472; Hs.238415; dickkopf (Xenopus laevis) homolog 4; none,PHO4; 2.18 

413627; BE182082; Hs.246973; ESTs; Armadillo_seg,IBB;TM=M;; 2.18 

407415; AF073328; ; gb:Homo sapiens tetracyline transporter-; none,none; 2.15 

450592; AI701555; Hs.202561; ESTs; pkinase,none; 2.15 

428767; AI421971; Hs.98802; ESTs, Moderately similar to T14342 NSD1 ; none,pkinase,ig; 2.15 

429012; AW629596; Hs.194726; BCL2-associated athanogene 4; Sm,BAG;SS=M; 2.15 

419122; AI401360; Hs.44410; ESTs; ABC_tran,ABC_membrane,none; 2.10 

446420; AW015693; Hs.135614; ESTs; ion_trans,none; 2.05 

420076; AA827860; Hs.293717; ESTs; DUF59,pkinase; 2.05 

409416; AW388359; Hs.10667; ESTs; transmembrane4;TM=Y;SS=M; 2.03 

428766; AA477989; Hs.98800; ESTs; TPR,7tm_1; 2.03 

427001; NM_006482; Hs.173135; dual-specificity tyrosine-(Y)-phosphoryl; pkinase;TM=M;; 2.03 
453709; AL119133; ; protein kinase C substrate 80K-H; none,histone; 2.03 
423341; AW242394; Hs.108660; ESTs; none,none; 2.00 

456772; AW604297; Hs.129711; hepatitis A virus cellular receptor 1; ig;TM=Y;SS=M; 1.88 
427786; BE407863; Hs.256871; ESTs; none,FG-GAP,7tm_1; 1.65 
423508; AW604297; Hs.129711; hepatitis A virus cellular receptor 1; ig;TM=Y;SS=M; 1.00 
447993; AW139525; Hs.170362; ESTs; none,none; 1.00 

TABLE 58B 

Pkey: Unique Eos probeset Identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

414883 8371.2 AF274943 BG494894 AI719075 AA908783 AI935150 A1422691 AA910644 AA583187 BM272167 A1828996 AA527373 AW972459 A1831360 AA772418 
A1033892 AA100926 AU154749 AI459432 AI423513 AI094597 AA740817 A1991988 A1090262 AI312104 BI256707 AA459522 AA416871 A1075239 
AI339996 AA701623 AI139549 A1336880 AA633648 A1989380 A1362835 AA399239 A1146955 BF514270 N92892 AI348243 AI278887 AA459292 
AI494230 BF507531 A1492600 AA962596 AW613002 AA293140 AA235549 BF108854 AA954344 N49682 AI457100 AW569407 AW300758 BE220715 
BE220698 BE569091 BM009647 BF900351 AI537692 AJ203723 A1857576 AA564410 AW371667 BM172363 

438091 22448.1 AK054860 AV652198 AV652192AV652138AV652127AV652194 BE935919 AV652017AV651995AV651548 AV646063AV651985 AV646184 
AV646179 AW880409 AA345002 BF155189 BE068931 X56197 AU03014 AW953629 BM263546 BE550772 AA701084 AI681352 AA35B689 
AW938841 BF438147 W05391 H75313 BF326185 AV646335 AV651589 AV646340 AV651992 AV646384 AV646354 AV687497 BF155183 AV646370 
AW797876 A1906821 X561S6 BE833835 AA628440 BE833808 BF224205 AA709126 BE673807 A1923886 AA947932 AI276125 AI185720 AW510698 
AA987230 BE467708 AW898628 AW898544 A1146984 AW043642 A1288245 AH 86932 AI635262 A! 139455 AI298739 AI81 3854 AI024768 BE699445 
BE699444 A1707B07 D52654 AI214518 AI004723 AI698085 AW087420AI565133 AA845571 AW898622 BF110144 AW5132B0 AI061 126 BF362770 
A1268939 AI435B18 BF475318 A1024767 BE174213 AA757598 AA513019 AA902959 A1860794 AI334784 BF10841 1 BM310532 AW513771 AI951391 
AI337671 BF095606 BF095601 BF095468 AW890091 BF095753 AW243400 AW898607 AW898616 BF362762 AI922204 AW898625 BE699468 
BE174196 AW102923 D52715 BE699456 D52477 D55017 BF955933 BG623563 AV646254 AA463522 B1003244 AI299190 W40186 BE174210 
BF939091 BF434180 AW579001 T55662 H0181 1 T52522 BF945037 BF955938 D54679 D53933 R67100 BG925552 BF999056 R83430 Z29922 T85791 
W03942 H63289 A1091537 BF086583 AA345570 H48870 H80720 T83523 BI039626 BI037700 R00353 BF155184 N98343 N79072 H01812 T55581 

433470 6624.1 X75684 AL573167 AI445461 AI453743A1983655A1564644 AA977180 AI694111 A1591 358 AW07 1 625 AI67871 2 AI720939 AI927769 BE439796 

A1963432 AA292956 AW192593 AI865838 AI696905 AI424384 A1161312 AI91 1921 A1597801 BI494959 AI240988 A1492554 AW262737 BE044033 
AW008570 AW629505 BI494958 AA088439 AA706057 BF222820 BF593608 BE501957 AA524526 BE044134 AW572531 AW015724 BE349186 
, AA043217 BE219784 AI799814 AA129575 AI671727 AM70033 BE646195 AW779725 AA903050 AA147228 AA404570 AI075878 W38161 AI972739 
AW673152 AA723200 C06123 BF057147 AA627686 AA157944 AI990245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 A1990827 
AVW75048M182640M478328A1298935AW085158AW471421AW103470AWM^ 

AW339104 AA724739 AA41 1 100 AA191349 AA757735 AA037696 AI769516 AW772283 AA01 0631 A1692846 AI061065 H80983 R79933 AI9S0693 
A1245632 A1349390 AA148284 AI798502 AA487893 AI621320 AW194272 C06365 AA953883 BE858936 AI918523 AI872628 AI927217 AI453453 
A11B9366 AW338678 AI261 359 AI500576 BF477735 AI032569 AI972899 AI985583 Z28771 AI363829 A1693030 AA603586 BE773488 AW339301 
BE773489 BE773462 BE773495 AI650338 BE773499 AI745717 BE811475 BE811470 BE811464 BE811418 BE811415 BE811400 BE811398 
BE81 1388 BE81 1352 BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773461 BEB11350 BE811337 BF593847 BGO55071 
AW675302 BF003068 AA71 9173 BE811348 AI582462 AI685240 BE773500 A1244845 A1565439 AI918453 AI472527 AI446740 AA035576 AA191414 
AW674145 C05782 A15B9264 057558 A1468237 AI432033 AA989662 R21 752 BF002457 AA98B297 AL574095 AL576200 AL571074 AL574525 
AL578810 BG49B381 AI928364 BE879732 AA479834 AA479712 C17732 BM091258 BF843901 AW820230 C17476 BE327120 AA129574 M136645 
BF843900 AW806193 AA502832 AA649494 AL568520 AL547960 BE705937 BE811360 BE773498 BE81 1401 BE773484 BE811437 BE81 1380 
BE81 1399 BF997171 BF757734 BE926037 AI377596 C06111 AW088968 BEB11404 BE811472 AI865912 AI925607 AI871950 AI093510 BE905927 
BE81 1435 AA191387 AW772000 BEB1 1453 BE614379 BF844522 BI044896 A1744233 AW984527 C17504 BF843883 AI248307 BE773483 A1567995 
W60075 BF941 183 AT738844 BE81 1458 BE773481 AI262930 AA948565 BE706942 BE156360 T65026 AW242958 AW197954 BE905184 AA722206 
AI344943 AI348877 AI334860 BE621857 BE156280 AA454099 AA037722 BF843897 AW806183 AA043216 BG482896 AA182734 AA877242 
AW372926 H27252 R38114 BF851858 BE156214 AA190427 T91762 AA035067 AA837326 T10930 BF906587 BI755027 BG506731 BC008442 
BC010166 AL550134 AL553096 AL548700 AL550751 AL54797B AL545286 AL540643 AU118627 AL601379 BI259821 BG741786 BI868522 AU135866 
B1552770 BI259210B1256520 BI255569 BG485098 BI258228 BG498501 BM044512 AU133984 AL556586 BE745111 BI222633 AU133917 BG288151 
BI260715 BI550550 BG500773 B1551761 BG707601 B1818593 BF691383 BG721129 BG541578 BE906666 BG751098 BI224135 BG400746 BG478065 
BE790436 AW0B0238 AU137549 BG429896 BE392486 AW961686 BG721056 BE908365 BE546656 BG541235AW583735 BG528290 B1260895 
AW651691 BM048974 BM043805 BG142185 AA315188 AI446615 C06300 BG497644 AA088544 AI815987 BG528631 BE619182 AW239185 
AWO62910AWrc2902AA347236F11933 M488O05AA301631AA376800D 

BI870221 BE910282 BG538748 AW960564 AV732879 D16854 AA192519 BF922148 AA216013 BG624091 BE544387 BG507008 AW1 76446 
BF790033 BE088925 BE088854 AA921353 R21800 AA01 1222 T97525 
436729 6624 1 X75684 AL573167 A1445461 AW53743AI983655 AI564644 AA977180AI694111 A1591358 AW071625AI678712AI720939 AI927769 BE439796 
AI963432 AA292956 AW192593 AI865838 AI696905 A1424384 A1161312 AI91 1921 AI597801 BI494959 AI240988 AI492554 AW262737 BE044O33 
AW008570 AW629505 BI494958 AA088439 AA706057 BF222820 BF593608 BE501957 AA524526 BE044134 AW572531 AW01 5724 BE349186 
AA043217 BE219784 AI799814 AA129575 A1671727 AM70033 BE646195 AW779725 AA903050 AA147228 AA404570 A1075878 W38161 AI972739 
AW673152 AA723200 C06123 BF057147 AA627686 AA157944 AI990245 AA662517 T32487 AI800106 AI333170AI859160 W45410 AI990827 
AW275048 AA182640 AA478328 AI298935 AW085158 AW471421 AW103470 AW30O456 AW191997 A1823466 AA962397 AA136658 AI251817 
AW339104 AA724739 AA41 1 1 00 AA191 349 AA757735 AA037696 AI769516 AW772283 AA010631 AI692846 AI061 065 H80983 R79933 A1950693 
A1245632 A1349390 AA148284 A1798502 AA487893 A1621320 AW194272 C06365 AA953883 BE858936 AI918523 AI872628 AI927217 A1453453 
Al 1 B9366 AW338678 AI261 359 AI500576 BF477735 AI032569 AI972899 AI985583 Z28771 A1363829 A1693030 AA603586 BE773488 AW339301 
BE773489 BE773462 BE773495 AI650338 BE773499 AI745717 BEB11475 BE811470 BE811464 BE811418 BE811415 BE811400 BE811398 
BE811388 BE811352 BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773461 BE81 1 350 BE81 1337 BF593847 BG055071 
AW675302 BF003068 AA719173 BE811348 A1582462 AI686240 BE773500 AI244845 AI565439 AI918453 AI472527 A1446740 AA035576 AA191414 
AW674145 C05782 AI589264 D5755B AI468237 AI432033 AA989662 R21 752 BF002457 AA988297 AL574095 AL576200 AL571074 AL574525 
AI578810 BG498381 A1928364 BE879732 AA479834 AA479712 C17732 BM091258 BF843901 AW820230 C17476 BE327120 AA129574 AA136645 
BF843900 AW806193 AA502832 AA649494 AL568520 AL547960 BE706937 BE811360 BE773498 BE811401 BE773484 BE811437 BE811380 
BE81 1399 BF997171 BF757734 BE926037 A1377596 C0611 1 AW088968 BE81 1404 BE81 1472 A1865912 AI925607 AI871950 AI093510 BE905927 
BE81 1435 AA191 387 AW772000 BE81 1453 BE614379 BF844522 BI044896 AI744233 AW984527 C17504 BF843883 AI248307 BE773483 AI567995 
W60075 BF941 183 AI738844 BE81 1458 BE773481 AI262930 AA948565 BE706942 BE156360 T65026 AW242958 AW197954 BE9051 84 AA722206 
AI344943 A1348877 A1334860 BE621857 BE156280 AA454099 AA037722 BF843897 AW805183 AA043216 BG482896 AA182734 AA877242 
AW372926 H27252 R38114 BF85185B BE 15621 4 AA1 90427 T91 762 AA035067 AA837326 T10930 BF906587 B1755027 BG506731 BC008442 
BC010166 AL550134 AL553096 AL548700 AL550751 AL547978 AL545286 AL540643 AU1 18627 AL601379 BI259821 BG7417B6 BI868522 AU135866 
BI552770 BI259210 BI256520 BI255569 BG485098 BI258228 BG498501 BM04451 2 AU 133984 AL556586 BE745111 BI222633AU1 33917 BG288151 
BI260715 BI550550 BG500773 BI551761 BG707601 BI818593 BF691383 BG721129 BG541578 BE906666 BG751098 BI224135 BG400746 BG478065 
BE790436 AW0B0238 AU137549 BG429896 BE392486 AW961686 BG721056 BE908365 BE546656 BG541235 AW583735 BG528290 BI260895 
437056 
422940 
409745 



407347 
418546 
432407 



428504.3 

58443.1 

MH1944.5 



810943 1 
242836. 



423387 2612.2 



400211 



400158 



453826 
447727 
400121 



3532.1 



2166.1 



415327 



415204 
425854 



424160 
417086 



411188 
417529 
417527 
451385 
436740 
419630 
400149 



AW651691 BM048974 BMD43805BG142185 AA315183 AI446615 C06300 BG497644 AA088544 AI815987 BG528631 BE619182 AW239185 
AW062910 AW062902 AA347236 F11933 AA488005 M301631 AA376800 D56120 AA343532 AA308536 F00242 AA376086 AA316968 AA343799 
BI870221 BE910282 BG538748 AW960564 AV732879 D16854 AA192519 BF922148 AA216013 BG624091 BE544387 BG507008 AW176446 
BF790033 BE088925 BE088854 AA921 353 R2 1 800 AA01 1222 T97525 
AW976398AI147061 AA765223 AA743380 A1803927 
BC012771 BG397153 BF366196 AA337277 AA319285 AW843252 

B1030997 AA921874 AW188822 BI027862 AI347618 AI361453 AI088754 AW207491 AA077391 BG012775 BG997382 AA286833 AA150722 BI007625 
B1027864 BI009100 BI006275 BI006270 B1031000 BI029B64 B1005277 BI007627 B1006266 BI006991 B1006990 BI007763 B1007762 BG997377 
AA150780 BI033518 B1027818 BG015789 BI033807 AA341445 
T23514AI655785 

T59708 AA224827 T59843 BE156903 

MH1429 12 BG036675BF772005BF771866BG960386BG960381 NM_005712AF110315 BE074534 BE182776BE158000BE157999BE714315AW818104 
AW847519 AA099426 AWB17981 AW856396 BG961 122 AA224498 AA308542 AW821833 BF902155 AI732411 BG778834 BG283641 BE748279 
BE748870 BG319540 BE748864 BF739224 BG986155 AK057283 BI861466 AA663341 AA457591 BG949294 AW392B86 AA071122 AA227849 
AA584918 BG959570 BF773486 AL041698 BF959013 R87170 C16859 BF770411 BF771298 AI075321 LI 3823 AA216700 BF771864 AW861859 
8E537068 C18935 M155719 BI771172 BR69107 BF804964 AW818172 AW818143 A 
AWB61687 AW821826 BI055726 BF242643 AA207189 BF770412 BF771 157 BG430030 M055592 

L13288 AA928785 AI60891 2 AW872978 AA565655 AI02291 5 AI304920 AI564366 AI668793 AI094557 T60038 R72302 H45409 AA508805 R46356 
M418798 BM1 29553 BM129126 BM1 29292 BM1 28865 A1808418 A16B9932 A1806573 BF431 808 AW872985 AW1 66269 H73241 T161 82 AI264547 
R73391 R72085 R72840 T83751 X75299 BF754348 R94105 AW449839 R73300 NM.004624 AI797007 BE045543 BF1 10021 BF754250 T83923 
AW884084 AA903896 AA418962 120295 R72351 H45098 AA961010 R73210 R46451 AW884085 BI022902 BI763932 B1910138 
NM 003899 D63476 BM456434 AA778936 AA452871 A1052466 AW014138 AA448725 BE673088 AW028198 BI856378 BM150466 BM150674 
BM?4B451 AW500880 AA1 80228 BE243507 BM144903 AA333656 AW503767 AA305470 AW504819 AA9781 94 AW500776 BE872488 AJ032663 
AA704686 AA652189 AA179463 AL535925 BE275744 BE277708 BE275715 AW504259 AA354483 BE244197 BE246232 D17055 AW013B76 
AW014877 T09464 T08407 AA830246 AW897881 BE501192 BE501195 AL044534 AA258853 BI037915 M448037 BM461769 B1825965 BE763352 
AW167531 Z45588 AV721881 AA527273 AJ573219 AA457036 AW439651 AW26441B AA577618 A1802954 AA902292 AA468752 AI380374 AA722690 
AI867708 AA9169B2 AI291576 AW190427 AI338089 AI653744 AI306665 AW513541 AW440077 AI370014 AA904269 AW1 88378 AI671644 AW193386 
AI261832 AA775336 BF436811 AI582703 A127B635 BE440186 AA617898 AA648948 B1491837 BF590311 AA448633 F27048 F37022 AW770819 
AA258808 A1369564 AW503675 AA7771 94 BE501 048 BF222087 AA042973 AI868087 AA91 1460 Z41 274 AI919082 T16746 AA447634 A1282427 
F22456 T15901 AA825298 AW007436 BE934303 BG981939 AW373814 BM151638 AW956921 8M150080 BM153173 BM147451 BF953992 AA916696 
AW444935 M78398 AW581147AW608258 AA651910 AA132152 AW806295 T30326 D20054 AA310837 T06543 BM194508 BM193225 8M469348 
AW964920 AA325930 B1833627 AW952193 AA738189 M321051 BG987199 BF953967 T08890 BE869543 BG742857 BG988685 AA456880 
BG001842BF809452 AW892083BF944342T49551 W69981 BF764519 T1 5869 AA1 32030 

BC000486 BC001930 U15008 NM_004597 BG326561 AV761269 BF212132 BG113313 B1838038 BG944198 BG255198 BG164334 BM013507 
AV682717 BF028761 BG755489BG030141 BG1 05499 BE4401 03 BF727449 BM454228 BG912181 BM015197BG395995 Al 1 26579 AW248204 
BI857394 BI857319 BG944205 BG323958 BI092930 AV756576 AA280959 BF033078 BE539360 AW247174 BG107836 BE796975 BG1 70920 A! 148142 
AI092009 A1336349 AV713409 BF685591 BE616029 AV713166 BE909458 BE743868 BM018218 A1340153 BE792567 BE2O6806 AI159916 A1184271 
BE208585 BI862286 C18760 BG219364 AI122677 BE207297 BF754798 AA314384 BE873951 BG207294 AI148198 AI336189 AI141094 A1308985 
BF220098 A! 1 43289 BE856397 AA644001 BE675402 AJ346708 AI052809 Al 140082 BF081 542 A1342428 A1479625 AA278456 AI750163 BE707501 
BG944510 AA314616AA315774 BF360721 BE738404AA814194 AA477195AA932107 F3685BAA363159 BE865458 AA354147AW904224 A1055855 
AA380365 F28474 D59140 AA341946 AA993339 AA323350 F24344 AA374436 BG942061 AA636015 AAB52919 T31486 F36334 BG943190 T30319 
BG943153 BG941836 T32569 T31 160 AA31 1749 BE710086 BE709911 W23826 BG941639 BF769034 M026407 AA229555 AV756340 AV7 14380 
AL138129 BG003205 AL138179 B1021773 BE054231 

AL520734 BI666852 BF870571 BF945351 BF945354 AI770086 81861 138 AW21079 AL043200 BG683612 AW946265 
NMJQ3899 D63476 BM456434 AA778935 AA452871 AI052466 AW014138 AA448725 BE673088 AW028198 BI856378 BM1 50466 BM1 50674 
BM148451 AW500880 AA180228 BE243507 BM144903 AA333656 AW503767 AA305470 AW504819 AA978194 AW500776 BE872488 A1032663 
AA704686 AA652189 AA179463 AL535925 BE275744 BE277708 BE275715 AW504259 M354483 BE244197 BE246232 D17055 AW013876 
AW014877 T09464 T08407 AA830246 AW897881 BE501 192 BE501 195 AL044534 AA258853 B!03791 5 AA448037 BM461769 BI825965 BE763352 
AW167531 Z45588 AV721881 AA527273 AI573219 AA457036 AW439651 AW264418 AA577618 AI802954 AA902292 AA468752 AI380374 AA722690 
A1867708 AA916982 AI291 576 AW190427 A1338089 AI653744 AI306665 AW513541 AW440077 AI370014 AA904269 AW1 88378 AI671644 AW193386 
AI261832 AA775336 BF43681 1 A1582703 Al 278635 BE440186 AA617B98 AA648948 BI491837 BF59031 1 AA448633 F27048 F37022 AW770819 
AA258B08 AI369564 AW503675 M7771 94 BE501 048 BF222087 M042973 AI868087 AA91 1 460 Z41274 AI919082 T16746 AA447634 A1282427 
F22456 T15901 AA825298 AW007436 BE934303 BG9B1939 AW373814 BM151638 AW956921 BM150080 BM153173 BM147451 BF953992 AA916695 
AW444935 ^8398AW581147AW608258 M651910M132152AW806295 T30326O20054AA310837 T06M^ 
AW964920 AA325930 B1833627 AW952193 AA738189 AA321051 BG987199 BF953967 T08890 BE869543 BG742857 BG988685 AA456880 
BG001842BF809452 AW892083BF944342 T49551 W69981 BF76451 9 T1 5869 AA1 32030 

BM048470 BG983877 N92089 Z43545 FQ5783 R35182 H22769 H71928 H63921 R94394 AW195830 AA630220 F02030 R60760 R08204 H71929 
BE464731 R62568 H63520 H77840 AL515501 F01971 T71768 N55275 AA883505 AW606824 AA704682 M74193 W57593 AW753776 H60439 
A1076352 R31485BE160931 AW130730 BE160869 AW580256 AW902073 BE702158 AW382079 N76884 BF431013 H38156 AV648937 H80573 
AW902569 AW902557 AW902654 AW902641 AW902650 AW902741 AW902644 
Z25288 R85077 F00323 D82802 T27434 

BF184345 AA364846 BI494468 AA775693 BF1 95801 AA7491 90 AI480223 BE327332 AI470017 AW628345 A1917472 AI972264 AI375571 AI652583 
A1656167 AJ983110 BE550378 AW015085 BE552318 AI631475 AI433447 AA534540 AI865480 AI468119 BF1 10875 AW512059 W58721 AA827482 
R59357 

AW370493 AA923304 AI660828 AA50261 1 BI017370 AA039704 W65393 BE938514 W65295 BF919626 BI759440 BI838879 AA336207 F12441 
BF171187 BE185031 T74062 AW904410 AA280078 BF919529 

AK056605 AK056969 AW380724 AA195950 F21243 Z17422 AA176595 AA1 76955 AA194350 BC005933 BC017866 AA1 96396 Z24810 AA181361 
AA1931 15 AA086465 F25194 F30130 AA1 80013 AJ346143 F20589 F20620 F29371 F32439 F25153 F32146 F261 10 F28506 F25307 F27533 F20375 
F32370 AA653078 BF574897 Z24852 F001 72 T30560 AW449825 AI620346 BG986374 BE706521 C02691 AL596834 F31 902 F2607B AA670099 
BF475555 F30818 F37524 AJ346558 F28050 F17933 F31637 003413 A! 0921 52 AA1 80743 AA085730 F21998 F20854 F18944 F31180 F37937 F37738 
AA193162Z17344AA192546 
AW821260BE162466BE161168 
AA203634 
AA203524W88451 
AA019761 AA017656 AA017374 
AW975133 AA805813 AA729943 

BF223647 AA204668 AA256086 BF574707 W57756 AA249070 BI906494 AA24801 1 

BC003160NWL001610X12548BG386685BI760866BIS59619BG323829 AU135543BI834101 AU1 421 20 AU 124511 AU1 24889 AL5581 71 AU117286 
BI824000 BG386610 BI753285 BI223475 AU134828 BM126369 BE206493 BF751498 AL544274 AY007137 BI828921 BE870130 BF771242 BI835451 
B1765655 BI820955 BI223344 BG015924 BI759894 AL527413 BF310568 N31870 N23974 BE514914 T488S3 AW860257 BF334625 AA883860 
AU144168 AA442562 AU159491 AU148353 AA564123 AU14B667 A1377256 AW664004 AI871712 AI141486 A1332351 AI339094 BE206109 AW519033 
AI817729 AI332490 Al 149455 AI85741 1 AI763154 AI751608 AI377222 AI081956 AW664229 AI275B72 AW168546 AA975270 AI367408 A!687729 
AI269164 AW105344 BM1 93081 BE550930 AI082116 AA854691 AI056249 AI221062 AI2901 13 D51818 AA732409 BG055125 N85878 AU156121 
366136.1 
10123 14 
9792^2 



1160035.1 
1865508.1 
2638.3 



5320.3 
85022.1 

1239008.1 
BF093671 AA053070 T28548 AL570460 BI006862 BI007207 AA373620 W95069 AA629136 AA708164 AI014886 AW1S8697 D51623 AA577168 
AI094208 AA028946 AA975078 R16815 BG151132 AI276297 D51676 

452203 2630.1 BC014081 NM.O00S93 X57522 L21 208 L21207 L21206 L21205 L21204 AL561404 AL548423 ALS60492 AL555882 AL541576 AL550654 BI823519 
BI770023 AL554969 BI489906 AI304693 AW295947 BM146B42 X57521 BGB20143 BE898390 F06770 F12630 BM423610 AL561518 BM009470 
BG742981 AA279685 AA847441 AA313737 BF172639 BF897216 BF914190 BF903647 S70277 AI569694 AW073296 AI361433 AA554544 AA487429 
BE85B232 AAB38810 AI5391 14 AI719375 AI829129 BG057675 AI423422 AU15B860 BE300555 AW170777 AA586956 AL571889 AL556850 AL576404 
AL582800 BI256544 BF342301 BG875994 AA05445B AA353161 AI940434 BE816522 AL577636 A1479650 AW150377 AU154395 AW951271 AI032220 
AI819778 AI346733 AW771150 AW512525 A1249904 AA279809 AI352549 AW512517 BG056280 AA521222 BE271 141 AL581932 AI541575 BI819184 
AV6S0190 AL556475 Al 620020 AW08988BAW0791 79 Z21 518 AA687601 F04651 A17B3961 T57198AI433367T78652AL554968AA36564BAL5B2619 
BE874601 BF804669 AL574458 BM145502 AI266514 A1538823 AI475626 AA948210 AA884054 AA487S37 AA031844 AA535221 AW794256 
AW361447 BE788505 AI682892 AA830989 AA852356 AA653084 BM009154 AA135727 H05927 H23433 R42244 N79997 AW365665 AW365601 
AA678742 AL556474 AA135770 BE774050 BF914200 H88457 AA627746 BI560216 B1753586 

453709 59915J AK057669 AK054977 AL519747 BE893744 BM313248 BG913430 H80793 BF813504 N35311 N39276 H95973 BF791919 BE739392 BE144239 

AA074615 A1291059 AI681053 AA702355 BF439899 AW055166 AI096957 BF223853 AL1 19659 AI692209 BM312961 AI869297 BE466252 AI292024 
AA402764 AI214620 AA765312 BF380770 AA442682 AL519746 AW295039 AI037878 AW473433 AI499437 AI401618 AI130831 AA427406 AI042138 
BG272488 A1828769 AI828764 AI189390 W84635 AA398496 AA761672 AA699520 AJ200406 N68D93 AI143913 AA993133 AA613306 AI050971 
AA661905 AA7226B7 AI749977 AA829345 BG057324 BF001339 AA910169 AA765133 AI360722 AI701849 AI365083 H95974 AI830377 AI312866 
AI370491 BEB58907 N62185 AA705746 BE379632 W93803 AI440333 AW367670 AW367640 N77131 BF993216 AI858263 W52329 N68106 RB31 1 3 
R85153 BE380058 AA082537 AA729731 W23495 W31190 BF995236 BF968827 BF355168 N24508 AA21571 1 BF170735 AA280395 BE738851 
AW367707 AA630879 AA428420 R76236 BG567847 N25931 AA17356B AI073567 AA004957 AI539585 N95093 H99798 H95072 H96853 AA215712 
AA034214 R85096 H82051 H80794 R44954 AA278972 H68352 H68346 R94750 R89010 R67951 H65817 N24891 AA173731 AI693577 BE1B1027 
AA709461 BE181002 W60239 BF987598 BF995279 H17483 R76237 AL1 19133 BF964815 AW663315 H65903 H17591 R201 67 AA310039 R58734 
R58506 

TABLE 58C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7-digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey    Ref      Strand  Nt_position 

400843  9188605  Plus    5863-5970,7653-7784,8892-9023,9673-9807 
404942  7382153  Plus    92095-92252 
400752  7331445  Minus   36215-36461 
400496  9743564  Plus    41515-41695 
404568           Minus   92893-93116 
403912  7710730  Minus   72000-72290,72431-72700,72929-73199 
402328  4464283  Minus   13758-13922,14558-14752 
404891  7329392  Plus    84974-85125 
405036  7543748  Minus   121957-122129 
400845  9188605  Plus    34428-34612 
403212  7630897  Minus   156037-156210 
401927  3873185  Minus   112000-112137 
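
The nucleotide-position column of Table 58C holds comma-separated start-end pairs, one pair per predicted exon. A minimal sketch of reading such a field is given here; the function names are hypothetical, and treating both coordinates as inclusive is an assumption that the table key does not state.

```python
def parse_exon_ranges(nt_position):
    """Split a field such as '72000-72290,72431-72700' into (start, end) pairs."""
    exons = []
    for part in nt_position.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma or stray whitespace
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_exon_ranges("72000-72290,72431-72700,72929-73199")
print(exons)
print(total_exon_length(exons))
```

If the coordinates turn out to be half-open rather than inclusive, only the `+ 1` in `total_exon_length` would need to change.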



Table 59A lists about 1170 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 
GeneChip array such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 8.2. The "average" prostate cancer level was set to the 
75th percentile amongst prostate cancers. The "average" normal adult tissue level was set to the 50th percentile amongst non-malignant tissues. In order to remove gene-specific 
background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the 
ratio was evaluated. 
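
The ratio described above can be sketched in a few lines. This is an illustration under stated assumptions, not the procedure actually used to build the table: the function names are hypothetical, the percentile is computed by linear interpolation between ranked values (the text does not say which percentile definition was used), and returning infinity when the background-corrected denominator is not positive is a choice made only for this sketch.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between ranked values (assumed definition)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one value")
    rank = (pct / 100.0) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def tumor_to_normal_ratio(tumor_levels, normal_levels):
    """Background-corrected ratio of 'average' tumor to 'average' normal expression."""
    background = percentile(normal_levels, 10)               # 10th percentile of normal tissues
    numerator = percentile(tumor_levels, 75) - background    # 75th percentile of tumors
    denominator = percentile(normal_levels, 50) - background # 50th percentile of normal tissues
    if denominator <= 0:
        return float("inf")  # assumption: the source does not say how this case is handled
    return numerator / denominator


# Made-up expression levels for a single probeset, for illustration only.
tumor = [100, 200, 300, 400, 500]
normal = list(range(0, 101, 10))
print(tumor_to_normal_ratio(tumor, normal))
```

A probeset would then be kept when this ratio meets the cutoff quoted in the paragraph above.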

TABLE 59A: ABOUT 1170 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 

R1: Ratio of prostate tumor to normal adult body tissue 



Pkey ExAccn Unigene ID Unigene Title R1 

428330 L22524 Hs.2256 matrix metalloproteinase 7 (matrilysin, 108.1 

426747 AA535210 Hs.171995 kallikrein 3, (prostate specific antigen 102.5 

420923 AF097021 Hs.273321 differentially expressed in hematopoieti 90.5 

416854 H40164 Hs.80296 Purkinje cell protein 4 79.8 

425075 AA506324 Hs.1852 acid phosphatase, prostate 71.6 

431103 M57399 Hs.44 pleiotrophin (heparin binding growth fac 70.6 

434666 AF151103 Hs.112259 T cell receptor gamma locus 69.1 

419551 AW582256 Hs.91011 anterior gradient 2 (Xenopus laevis) hom 65.4 

413859 AW992356 Hs.8364 Homo sapiens pyruvate dehydrogenase kina 66.4 

420154 AI093155 Hs.95420 JM27 protein 63.9 

428336 AA503115 Hs.183752 microseminoprotein, beta- 61.4 

400287 S39329 Hs.181350 kallikrein 2, prostatic 59.7 

416602 NM_006159 Hs.79389 nel (chicken)-like 2 54.6 

428398 AI249368 Hs.98558 ESTs 54.6 

432441 AW292425 Hs.163484 ESTs 54.6 

414569 AF109298 Hs.118258 prostate cancer associated protein 1 54.2 

417771 AA804698 Hs.82547 retinoic acid receptor responder (tazaro 54.2 

448999 AF179274 Hs.22791 transmembrane protein with EGF-like and 53.7 
437145 AF007216 Hs.5462 solute carrier family 4, sodium bicarbon 49.6 

446057 AI420227 Hs.149358 ESTs, Weakly similar to A46010 X-linked 48.0 

431579 AW971082 Hs.222886 ESTs, Weakly similar to TRHY_HUMAN TRICH 46.8 

400299 X07730 Hs.171995 kallikrein 3, (prostate specific antigen 45.0 

427674 NM_003528 Hs.2178 H2B histone family, member Q 44.9 

432374 W68815 Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PL 44.5 

422487 AJ010901 Hs.198267 mucin 4, tracheobronchial 44.5 

415314 N88802 Hs.5422 glycoprotein M6B 43.2 

431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 41.9 

409262 AK000631 Hs.52256 hypothetical protein FLJ20624 41.1 

423582 BE000831 Hs.23837 Homo sapiens cDNA FLJ11812 fis, clone HE 40.5 

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 39.5 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 39.3 

416585 X54162 Hs.79386 leiomodin 1 (smooth muscle) 39.0 

447726 AL137638 Hs.19368 matrilin 2 38.6 

426174 AA547959 Hs.115838 ESTs 38.1 

425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II 38.0 

417067 AJ001417 Hs.81086 solute carrier family 22 (extraneuronal 37.5 

414217 AI309298 Hs.279898 Homo sapiens cDNA FLJ23165 fis, clone L 37.0 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 35.5 

400302 N48056 Hs.1915 folate hydrolase (prostate-specific memb 35.1 

453096 AW294631 Hs.11325 ESTs 35.1 

419239 AA468183 Hs.184598 Homo sapiens cDNA: FLJ23241 fis, clone C 34.9 

427665 AF134803 Hs.180141 cofilin 2 (muscle) 34.9 

415539 AI733881 Hs.72472 BMP-R1B 34.9 

428411 AW291464 Hs.10338 ESTs 34.6 

428819 AL135623 Hs.193914 KIAA0575 gene product 34.3 

401424 NM_001172:Homo sapiens arginase, type II 34.0 

452114 N22687 Hs.8236 ESTs 34.0 

30 426990 AL044315 Hs.173094 Homo sapiens mRNA for KIAA1 750 protein, 33.8 

448045 AJ297436 Hs.20166 prostate stem cell antigen 33.1 

421552 AF026692 Hs.105700 secreted frizzled-related protein 4 32.9 

432101 AI918950 Hs.1 23642 EphA3 32.7 

427398 AW390020 Hs.20415 chromosome 21 open reading frame 1 1 32.4 

35 426716 NMJJ06379 Hs.171921 sema domain, immunoglobulin domain (Ig), 32.4 

434025 AF1 14264 Hs.216381 Homo sapiens clone HH409 unknown mRNA 32.4 

414591 AI888490 Hs.55902 ESTs, Weakly similar to ALU8_HUMAN ALU S 32.3 

453355 AW295374 Hs.31412 Homo sapiens cDNA FU11422 fis, ctone HE 31.9 

408380 AF123050 Hs.44532 diubtauitin 31.6 

40 429500 X78565 Hs.289114 hexabrachion (tenascin C, cytotactin) 31.4 

453370 A1470523 Hs.1 39336 ATP-binding cassette, sub-family C (CFTR 31.3 

408369 R38438 Hs.182575 solule carrier family 15 (H77? transport 31.2 

408001 AA046458 Hs.95296 ESTs 30.8 

440274 R24595 Hs.7122 scrapie responsive protein 1 30.7 

45 417332 AW972717 Hs.288462 hypothetical protein FU21 511 30.3 

442993 BE018682 Hs.166196 ATPase, O ass I, type 8B, member 1 30.2 

433404 T32982 . Hs.102720 ESTs 29.6 

459247 N46243 Hs.110373 ESTs, Highly similar to T42626 secreted 29.6 

418829 AA516531 Hs.55999 NK homeobox (Drosophila), famfly 3, A 29.3 

50 459290 NM 001546 Hs.34853 inhibitor of DNA binding 4, dominant neg 29.2 

418827 BE327311 Hs.47166 KT021 29.0 

410330 AW023630 Hs.46786 ESTs 28.6 

450377 AB033091 Hs.74313 WAA1265 protein 28,6 

424012 AW368377 Hs.137569 tumor protein 63 kDa with strong homolog 28.5 

55 441633 AW958544 Hs.1 12242 normal mucosa of esophagus specific 1 28.2 

429290 AF203032 Hs.198760 neurofilament lieavy polypeptide (200kD) 27.7 

452340 NM 002202 Hs.505 ISL1 transcription factor, LIM/homeodoma 27.7 

421566 NMJJ00399 Hs.1395 earty growth response 2 (Krox-20 (Drosop 27.6 

409361 NM_005982 Hs.54416 s'me ocuEs homeobox (Drosophila) homoto 27.4 

60 446100 AW967109 Hs.1 3804 hypothetical protein dJ462023.2 27.4 

407202 N5B172 Hs.109370 ESTs 27.4 

434078 AW880709 Hs.283683 chromosome 8 open reading frame 4 27.4 

442870 N45018 Hs.8769 hypothefical protein DKFZp761J1 7121 27.3 

403047 NMJ)05656*:Homo sapiens transmembrane pr 27.2 

65 407709 AA456135 Hs.23023 ESTs 27.0 

433444 AW975324 Hs.129816 ESTs 26.8 

415989 AI267700 Hs.317584 ESTs 26.8 

430539 AK001489 Hs.242894 AOP^bosytalion factor-like 1 26.5 

„ 408221 AA912183 Hs.47447 ESTs 26.5 

70 448519 AW175665 Hs.278695 Homo sapiens prostein mRNA, complete cds 26.4 

437179 AA393508 Hs.300642 serologically defined colon cancer anlig 26.4 

426457 AW894667 Hs.1 69965 chimerin {chimaerin) 1 26.2 

444484 AK002126 Hs.1 1260 hypothetical protein FU 11 264 26.2 

400292 AA250737 Hs.72472 BMP-R1B 26.0 

75 433647 AA603367 Hs.222294 ESTs 26.0 

429220 AW207206 Hs.136319 ESTs 25.8 

427871 AW992405 Hs.59622 Homo sapiens, done IMAGE:3507281, mRNA, 25.6 

448106 A1800470 Hs.171941 ESTs 25.5 

0/x 415992 C05837 Hs.1 45807 hypothetical protein FU 13593 25.5 

80 452792 AB037765 Hs.30652 KIAA1344 protein 25.3 

407910 AA650274 Hs.41296 fibronectin teucine rich transmembrane p 25.2 

428862 NM 000346 Hs.2316 SRY (sex determining region YJ-box 9 (ca 24.9 

418961 AW967646 Hs.23023 ESTs 24.8 
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20 
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45 
50 
55 
60 
65 
70 
75 
80 



451027 AW519204 Hs.40808 ESTs 24.5
413517 N76712 Hs.44829 ESTs, Weakly similar to I38022 hypotheti 24.5
409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast 24.4
418396 AI765805 Hs.26691 ESTs 24.1
412088 AI689496 Hs.108932 ESTs 24.1
429673 AA884407 Hs.211595 protein tyrosine phosphatase, non-recept 24.0
411644 H92064 Hs.278626 Arg/Abl-interacting protein ArgBP2 24.0
409619 AK001015 Hs.55220 BCL2-associated athanogene 2 2? Q
433466 AA508353 Hs.105314 relaxin 1 (H1) 23.7
431657 AI345227 Hs.105448 ESTs, Weakly similar to B34087 hypotheti 23.6
440995 T57773 Hs.10263 ESTs
432276 AL137506 Hs.274256 hypothetical protein FLJ23563 23.2


428695 AI355647 Hs.189999 purinergic receptor (family A group 5) 23.1
424720 M89907 Hs.152292 SWI/SNF related matrix associated acti 22.9
440260 AI972867 Hs.7130 22.5
443622 AI911527 Hs HRfK CO IS £1.1
426418 M90464 collagen, type IV, alpha 5 (Alport syndr
425383 D83407 Hs.156007 Down syndrome critical region gene 1-lik 91 Q
450642 R39773 Hs.7130 y\ o
417421 AL138201 Hs.82120 nuclear receptor subfamily 4, group A, m
435981 H74319 Hs.188620 ESTs 91 ft
450693 AW450461 Hs.203965 ESTs 21.7
444784 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 21.7
423242 AL039402 Hs.125783 21.7
408621 AI970672 Hs.46638 chromosome 11 open reading frame 8 21.6
408000 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 91 R
404210 nivi_uvv9J0.nuniD sapiens rnyeioiu/iympnoio 91 c
417622 AW298163 Hs.82318 WAS protein family, member 3 91 C Z1.0
419526 AI821895 Hs.193481 ESTs 91 *;
442799 AI564739 Hs 6SSn*i ESTs ^1.4
424846 AU077324 Hs.1832 neuropeptide Y 91 1
420195 N44348 Hs.26943 Homo sapiens cDNA FLJ11177 fis, clone PL
ESTs, Weakly similar to ALU1_HUMAN ALU S 14.6


409264 NM_014937 Hs.52463 KIAA0966 protein 14£
432729 AK000292 Hs.278732 hypothetical protein FLJ20285 14.5
422270 AF114494 Hs.114062 protein tyrosine phosphatase-like (proli 14.5
410339 AI916499 Hs.298258 ESTs 14.5
431992 NM_002742 Hs^891 protein kinase C, mu 14.5
424432 AB037821 Hs.146858 protocadherin 10 14.5
431933 AI187057 Hs.132554 ESTs 14.5
440749 W22335 Hs.7392 hypothetical protein MGC3199 14.5
416434 AW163045 Hs.79334 nuclear factor, interleukin 3 regulated 14.5
450382 AA397658 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL 14.5
407021 U52077 gb:Human mariner1 transposase gene, comp 14.5
413786 AW613780 Hs.13500 ESTs 14.5
437866 AA156781 Hs.74170 metallothionein 1E (functional) 14.5
454078 AA601518 Hs.22209 secreted modular calcium-binding protein 14.4

411990 AW963524 Hs.31707 ESTs, Weakly similar to YEW4_YEAST HYPOT 14.4
443280 AA299688 Hs.24183 ESTs 14.4
416836 D54745 Hs.80247 cholecystokinin 14.4
436860 H12751 Hs.5327 PRO1914 protein 14.3
425174 D87450 Hs.154978 KIAA0261 protein 14.3
442242 AV647908 Hs.90424 Homo sapiens cDNA: FLJ23285 fis, clone H 14.3
443790 NM_003500 Hs.9795 acyl-Coenzyme A oxidase 2, branched chai 14.3
418994 AA296520 Hs.69546 selectin E (endothelial adhesion molecul 14.2
419563 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HE 14.2
437718 AI927288 Hs.196779 ESTs 14.2
412828 AL133396 Hs.74621 prion protein (p27-30) (Creutzfeld-Jakob 14.2
432712 AB016247 Hs.288031 sterol-C5-desaturase (fungal ERG3, delta 14.2
433332 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 14.2
451533 NM_004657 Hs.26530 serum deprivation response (phosphatidyl 14.2
424701 NM_005923 Hs.151988 mitogen-activated protein kinase kinase 14.1
446494 AA463276 Hs.288906 WW Domain-Containing Gene 14.1
456508 AA502764 Hs.123469 ESTs, Weakly similar to AF208855 1 BM-01 14.0
432251 AW972583 Hs.232165 polycythemia rubra vera 1; cell surface 14.0
426423 NM_012446 Hs.169833 single-stranded-DNA-binding protein 14.0
442281 N34742 Hs.170065 Homo sapiens cDNA FLJ13492 fis, clone PL 14.0
442145 AHJ22650 Hs.8117 erbb2-interacting protein ERBIN 13.9
418848 AI820961 Hs.193465 ESTs 13.9
447072 D61594 Hs.17279 tyrosylprotein sulfotransferase 1 13.9
439518 W76326 gb:zd60d04.r1 Soares_fetal_heart_NbHH19W 13.9
443967 AW294013 Hs.200942 ESTs 13.9
424775 AB014540 Hs.153026 SWAP-70 protein 13.9
411190 AA306342 Hs.69171 protein kinase C-like 2 13.9
447384 AI377221 Hs.40528 ESTs 13.9
444880 AW118683 Hs.154150 ESTs 13.9
433409 AI278802 Hs.25661 ESTs 13.9
423201 NM_000163 Hs.125180 growth hormone receptor 13.9
413724 AA131466 Hs.23767 hypothetical protein FLJ12666 13.9
454219 X75042 Hs.44313 v-rel avian reticuloendotheliosis viral 13.8
429165 AW009885 Hs.118258 prostate cancer associated protein 1 13.8
430598 AK001764 Hs.247112 hypothetical protein FLJ10902 13.8
444172 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypot 13.8
416774 AI005169 Hs.28274 Homo sapiens cDNA: FLJ22049 fis, clone H 13.8
410310 J02931 Hs.62192 coagulation factor III (thromboplastin, 13.8
428738 NM_000380 Hs.192803 xeroderma pigmentosum, complementation g 13.8
400296 AA305627 Hs.139336 ATP-binding cassette, sub-family C (CFTR 13.8
407939 W05608 Hs.312679 ESTs, Weakly similar to A49019 dynein he 13.8
452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 13.7
408622 AA056060 Hs.202577 Homo sapiens cDNA FLJ12166 fis, clone MA 13.7
428054 AI948688 Hs.266619 ESTs 13.7
444636 T96667 Hs.17877 ESTs 13.7
416292 AA179233 Hs.42390 nasopharyngeal carcinoma susceptibility 13.7
442315 AA173992 Hs.7956 ESTs, Moderately similar to ZN91_HUMAN Z 13.7
452260 AA453208 Hs.28726 RAB9, member RAS oncogene family 13.7
428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 13.7
450316 W84446 Hs.226434 hypothetical protein MGC4643 13.7
419839 U24577 Hs.93304 phospholipase A2, group VII (platelet-ac 13.7
443634 H73972 Hs.134460 ESTs 13.7
449474 AA019344 Hs.2055 ubiquitin-activating enzyme E1 (A1S9T an 13.7
422278 AF072873 Hs.114218 frizzled (Drosophila) homolog 6 13.7
445817 NM_003642 Hs.13340 histone acetyltransferase 1 13.7
434629 AA789081 Hs.4029 glioma-amplified sequence-41 13.6
416795 AI497778 Hs.20509 HBV pX associated protein-8 13.6
410001 AB041036 Hs.57771 kallikrein 11 13.6
452242 R50956 Hs.159993 glycosyltransferase 13.6
413231 D87461 Hs.75244 BCL2-like 2 13.6
404641 NM_021965*:Homo sapiens phosphoglucomuta 13.6
428648 AF052728 Hs.188021 potassium voltage-gated channel, subfami 13.6
414279 AW021691 Hs.101067 GCN5 (general control of amino-acid synt 13.6
411573 AB029000 Hs.70823 KIAA1077 protein 13.5
417632 R20855 Hs.5422 glycoprotein M6B 13.5
431467 N71831 Hs.256398 Homo sapiens mRNA; cDNA DKFZp434E0528 (f 13.5
450313 AI038989 Hs.332633 Bardet-Biedl syndrome 2 13.5
445472 AB005631 Hs.12784 Homo sapiens mRNA for KIAA0293 gene, par 13.5
419264 AA877104 Hs.293672 ESTs, Weakly similar to ALUB_HUMAN !!!! 13.5
418663 AK001100 Hs.41690 desmocollin 3 13.5
452277 AL049013 Hs.28783 KIAA1223 protein 13.5
447896 AJ436124 Hs.294069 Homo sapiens cDNA FLJ13384 fis, clone PL 13.5
447082 T85314 Hs.42644 thioredoxin-like 13.5
451468 AW503398 Hs.293663 ESTs, Moderately similar to I38022 hypot 13.4
415443 T07353 Hs.7948 ESTs 13.4
427212 AW293849 Hs.58279 ESTs, Weakly similar to ALU7_HUMAN ALU S 13.4
442113 BE622187 Hs.99670 ESTs, Weakly similar to I38022 hypotheti 13.3
433517 AW022133 Hs.189838 ESTs 13.3
430829 AW451999 Hs.194024 ESTs 13.3
453111 AB014598 Hs.31720 hephaestin 13.3
435839 AF249744 Hs.25951 Rho guanine nucleotide exchange factor ( 13.3
429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 13.3
422728 AW937826 Hs.103262 ESTs, Weakly similar to ZN91_HUMAN ZINC 13.3

427286 AW732802 Hs.2132 epidermal growth factor receptor pathway 13.3
437323 AA371145 Hs.226627 leptin receptor 13.2
421878 AA299652 Hs.111496 Homo sapiens cDNA FLJ11643 fis, clone HE 13.2
433037 NM_014158 Hs.279938 HSPC067 protein 13.2
407938 AA905097 Hs.85050 phospholamban 13.2
400860 Target Exon 13.2
411031 W37943 Hs.34892 KIAA1323 protein 13.2
436797 AA731491 Hs.336454 hypothetical protein MGC14879 13.2
409277 T05558 Hs.156880 ESTs 13.2
434036 AI659131 Hs.197733 hypothetical protein MGC2849 13.2
448520 AB002367 Hs.21355 doublecortin and CaM kinase-like 1 13.2
450755 AA010984 Hs.159464 ESTs 13.1
450649 NM_001429 Hs.25272 E1A binding protein p300 13.1
408495 W68796 Hs.237731 ESTs 13.1
429732 U20158 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 13.1
428839 AI767756 Hs.82302 Homo sapiens cDNA FLJ14814 fis, clone NT 13.1
416987 D86957 Hs.80712 KIAA0202 protein 13.0
453006 AI362575 Hs.167133 ESTs 13.0
421750 AK000768 Hs.107872 hypothetical protein FLJ20761 13.0
415752 BE314524 Hs.78776 putative transmembrane protein 13.0
411529 AA430348 Hs.317596 Homo sapiens cDNA FLJ12927 fis, clone NT 13.0
417318 AW953937 Hs.12891 ESTs 13.0
410763 AF279145 Hs.8966 hypothetical protein FLJ21776 12.9
426034 AI276989 Hs.56123 Homo sapiens cDNA FLJ13443 fis, clone PL 12.9
447966 AA340605 Hs.105887 ESTs, Weakly similar to Homolog of rat Z 12.9
428218 AA424266 Hs.123642 EphA3 12.9
409799 D11928 Hs.76845 phosphoserine phosphatase-like 12.9
446779 BE042877 Hs.177135 ESTs 12.8
448944 AB014605 Hs.22599 atrophin-1 interacting protein 1; activi 12.8
417315 AI080042 Hs.336901 ribosomal protein S24 12.8
429697 AW296451 Hs.24605 ESTs 12.8
432674 AA641092 Hs.257339 ESTs, Weakly similar to I38022 hypotheti 12.8
445309 AL157474 Hs.12504 likely ortholog of mouse Arkadia 12.8
451099 R52795 Hs.25954 interleukin 13 receptor, alpha 2 12.8
424806 AA382523 Hs.105689 MSTP031 protein 12.7
442343 AA992480 Hs.129874 ESTs 12.7
432244 AI669973 Hs.200574 ESTs 12.7
459513 AI032946 gb:ox06g09.s1 Soares_fetal_liver_spleen_ 12.7
452512 AW363486 Hs.337635 ESTs 12.7
415079 R43179 Hs.22895 hypothetical protein FLJ23548 12.7
422365 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic sub 12.7
411067 AI681006 Hs.71721 ESTs 12.7
442501 AA315267 Hs.23128 ESTs 12.7
434466 AB037829 Hs.3862 regulator of nonsense transcripts 2; DKF 12.7
437052 AA861697 Hs.120591 ESTs 12.7
433234 AB040928 Hs.65366 KIAA1495 protein 12.6
453830 AA534296 Hs.20953 ESTs 12.6
409995 AW960597 Hs.30164 ESTs 12.6
414290 AI568801 Hs.71721 ESTs 12.6
417248 AA329449 Hs.247302 twisted gastrulation 12.6
418624 AI734080 Hs.104211 ESTs 12.6
412654 AI093480 Hs.29263 hypothetical protein FLJ11896 12.6
450253 AL133047 Hs.24715 Homo sapiens mRNA; cDNA DKFZp434D0215 (f 12.6
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 12.5
427078 AI676062 Hs.111902 ESTs 12.5
420838 AW118210 Hs.5244 ESTs 12.5
449784 AW161319 Hs.12915 ESTs 12.5
431325 AW026751 Hs.5794 ESTs, Weakly similar to 2109260A B cell 12.5
415276 U88666 Hs.78353 SFRS protein kinase 2 12.5
426110 NM_002913 Hs.166563 replication factor C (activator 1) 1 (14 12.5
400301 X03635 Hs.1657 estrogen receptor 1 12.5
434926 BE543269 Hs.50252 mitochondrial ribosomal protein L32 12.5
426316 NM_002430 Hs.268515 meningioma (disrupted in balanced transl 12.5
420345 AW295230 Hsl5231 ESTs 12.4
432205 AI806583 Hs.125291 ESTs 12.4
451893 AW192083 Hs.290855 ESTs 12.4
435266 AK001942 Hs.4863 hypothetical protein DKFZp566A1524 12.4
427003 U19487 Hs.2090 prostaglandin E receptor 2 (subtype EP2) 12.4
404642 NM_021965*:Homo sapiens phosphoglucomuta 12.4
414241 AA425085 Hs.4007 Sarcolemmal-associated protein 12.4
429716 R25685 Hs.211933 collagen, type XIII, alpha 1 12.4
420871 AA702972 Hs.65300 ESTs 12.4
448072 AI459X6 Hs.24908 ESTs 12.4
441269 AW015206 Hs.178784 ESTs 12.3
427761 AA412205 Hs.140996 ESTs 12.3
425322 U63630 Hs.155637 protein kinase, DNA-activated, catalytic 12.3
417733 AL048678 Hs.82503 H.sapiens mRNA for S'UTR of unknown prot 12.3
436521 AW203986 Hs^13003 ESTs 12.3
408920 AL120071 Hs.48998 fibronectin leucine rich transmembrane p 12.3
446332 AK001635 Hs.14838 hypothetical protein FLJ10773 12.2
453994 BE180964 Hs.165590 ribosomal protein S13 12.2
420397 NM_007018 Hs.97437 centrosomal protein 1 12.2
447471 AF039843 Hs.18676 sprouty (Drosophila) homolog 2 12.2

442562 BE379584 Hs.34789 dolichyl-diphosphooligosaccharide-protei 12.2
400880 NM_000611*:Homo sapiens CD59 antigen p18 12.2
425920 AL049977 Hs.162209 claudin 8 12.2
419743 AW408762 Hs.5957 Homo sapiens clone 24416 mRNA sequence 12.2
414222 AL135173 Hs.878 sorbitol dehydrogenase 12.1
427051 BE178110 Hs.173374 Homo sapiens cDNA FLJ10500 fis, clone NT 12.1
452994 AW962597 Hs.31305 KIAA1547 protein 12.1
425154 NM_001851 Hs.154850 collagen, type IX, alpha 1 12.1
437124 AA554458 Hs.197751 KIAA0666 protein 12.1
411450 H49619 Hs.127301 ESTs 12.1
421559 NM_014720 Hs.105751 Ste20-related serine/threonine kinase 12.1
449625 NM_014253 Hs.23796 odz (odd Oz/ten-m, Drosophila) homolog 1 12.1
419459 AW291128 Hs.278422 DKFZP586G1122 protein 12.1
426252 BE176980 Hs.28917 ESTs 12.1
414171 AA360328 Hs.865 RAP1A, member of RAS oncogene family 12.1
436169 AA888311 Hs.17602 Homo sapiens cDNA FLJ12381 fis, clone MA 12.0
435047 AA454985 Hs.54973 cadherin-like protein VR20 12.0
417625 U59305 Hs.44708 Ser-Thr protein kinase related to the my 12.0
439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 12.0
443646 AI085198 Hs.164226 ESTs 12.0
434874 N62448 Hs.293970 methylmalonate-semialdehyde dehydrogenas 12.0
443837 AI984625 Hs.9884 spindle pole body protein 12.0
451640 AA195601 Hs.26771 Human DNA sequence from clone 747H23 on 12.0
424232 AB015982 Hs.143460 protein kinase C, nu 11.9
414555 N98569 Hs.76422 phospholipase A2, group IIA (platelets, 11.9
446874 AW968304 Hs.56156 ESTs 11.9
454119 BE549773 Hs.40510 uncoupling protein 4 11.9
436746 AA730045 Hs.187866 ESTs 11.9
432359 AA076049 Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 11.9
416508 R39769 Hs.206088 ESTs, Moderately similar to ALU8_HUMAN A 11.8
444758 AL044878 Hs.11899 3-hydroxy-3-methylglutaryl-Coenzyme A re 11.8
413991 H44725 Hs.42683 ESTs 11.8
431645 AF078849 Hs.266483 dynein light chain-A 11.8
421406 AF179897 Hs.104105 Meis (mouse) homolog 2 11.8
408826 AF216077 Hs.48376 Homo sapiens clone HB-2 mRNA sequence 11.8
424903 T26477 Hs.22883 ESTs, Weakly similar to I38022 hypotheti 11.8
412977 AA125910 Hs.191461 ESTs 11.8
426981 AL044675 Hs.173081 KIAA0530 protein 11.8
410853 H04588 Hs.30469 ESTs 11.7
444670 H58373 Hs.332938 hypothetical protein MGC5370 11.7
444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 11.7
434398 AA121098 Hs.3838 serum-inducible kinase 11.7
438032 BE045624 Hs.152992 ESTs 11.7
433212 BE218049 Hs.121820 ESTs 11.6
424330 AW073953 Hs.333396 Homo sapiens cDNA FLJ13596 fis, clone PL 11.6
452679 Z42387 Hs.83883 transmembrane, prostate androgen induced 11.6
430929 AA489166 Hs.156933 ESTs 11.6
423782 AW72209 Hs.323117 ESTs 11.6
425770 NM_014363 Hs.159492 spastic ataxia of Charlevoix-Saguenay (s 11.6
437108 AA434054 Hs.80624 hypothetical protein MGC2560 11.6
430200 BE613337 Hs.234896 geminin 11.5
453857 AL080235 Hs.35861 DKFZP586E1621 protein 11.5
411096 U80034 Hs.68583 mitochondrial intermediate peptidase 11.5
422805 AA436989 Hs.121017 H2A histone family, member A 11.5
422538 NM_006441 Hs.118131 5,10-methenyltetrahydrofolate synthetase 11.5
412677 AW029608 Hs.17384 ESTs 11.5
421896 N62293 Hs.45107 ESTs 11.5
453930 AA419466 Hs.36727 hypothetical protein FLJ10903 11.5
443123 AA094538 Hs.272808 putative transcription regulation nuclea 11.5
415293 R49462 Hs.106541 ESTs 11.5
443161 AI038316 gb:ox48c08.x1 Soares_total_fetus_Nb2HF8_ 11.5
420185 AL044056 Hs.158047 ESTs 11.5
445527 W39694 Hs.83286 ESTs, Weakly similar to S14747 sphingomy 11.5
428829 R14050 Hs.194051 Homo sapiens mRNA; cDNA DKFZp566B213 (fr 11.5
416852 AF283776 Hs.80285 Homo sapiens mRNA; cDNA DKFZp586C1723 (f 11.5
449919 AI674685 Hs.200141 ESTs 11.5
414844 AA296874 Hs.77494 deoxyguanosine kinase 11.5
449543 AF070632 Hs.23729 Homo sapiens clone 24405 mRNA sequence 11.4
429784 M89795 Hs.30 membrane-spanning 4-domains, subfamily A 11.4
407786 AA687538 Hs.38972 tetraspan 1 11.4
414407 AA147026 Hs.76704 ESTs 11.4
441247 AW118681 Hs.128051 Homo sapiens thymic stromal lymphopoieti 11.4
456804 AI421645 Hs.139851 caveolin 2 11.4
422546 AB007969 Hs.301478 KIAA0500 protein 11.4
417350 U50928 Hs.82001 polycystic kidney disease 2 (autosomal d 11.4
417683 AW566008 Hs.239154 ankyrin repeat, family A (RFXANK-like), 11.4
424800 AL035588 Hs.153203 MyoD family inhibitor 11.4
433577 AW007080 Hs.8817 ESTs 11.4
453935 AI633770 Hs.42572 ESTs 11.4
415060 AJ223810 Hs.43213 ESTs, Weakly similar to IEFS_HUMAN TRANS 11.4
425710 AF030880 Hs.159275 solute carrier family, member 4 11.3
448766 AI473827 Hs.31793 ESTs 11.3
444151 AW972917 Hs.128749 alpha-methylacyl-CoA racemase 11.3

410108 AA081659 Hs.318775 OSBP-related protein 6 11.3
418450 R84397 Hs.193651 ESTs, Weakly similar to alternatively sp 11.3
415457 AW081710 Hs.7369 ESTs, Weakly similar to ALU1_HUMAN ALU S 11.3
452110 T47667 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PL 11.3
414212 AA136569 Hs.10848 KIAA0187 gene product 11.3
401519 C15000476*:gi|12737279|ref|XP_012163.1| 11.3
458509 AA654650 Hs.282906 ESTs 11.2
416913 AW934714 gb:RC1-DT0001-031299-011-a11 DT0001 Homo 11.2
452784 BE463857 Hs.151258 hypothetical protein FLJ21062 11.2
436758 AW977167 Hs.155272 ESTs 11.2
409401 AI201895 Hs.181309 proteasome (prosome, macropain) subunit, 11.2
434715 BE005346 Hs.116410 ESTs 11.2
427283 AL119796 Hs.174185 ectonucleotide pyrophosphatase/phosphodi 11.2
428342 AI739168 Hs.131798 Homo sapiens cDNA FLJ13458 fis, clone PL 11.2
419465 AW500239 Hs.21187 Homo sapiens cDNA: FLJ23068 fis, clone L 11.2
430519 AF129534 Hs.49210 F-box only protein 4 11.2
418838 AW385224 Hs.35198 ectonucleotide pyrophosphatase/phosphodi 11.1
429163 AA884766 gb:am20a10.s1 Soares_NFL_T_GBC_S1 Homo s 11.1
434158 T86534 Hs.14372 ESTs 11.1
414341 D80004 Hs.75909 KIAA0182 protein 11.1
414650 AA150435 Hs.72063 ESTs 11.1
448475 BE613134 Hs.247474 hypothetical protein FLJ21032 11.1
437575 AW954355 Hs.36529 hypothetical protein MGC11242 11.1
423566 AW976434 Hs.3623 hypothetical protein FLJ11220 11.1
425289 AW139342 Hs.155530 interferon, gamma-inducible protein 16 11.1
452598 AI831594 Hs.68647 ESTs, Weakly similar to ALU7_HUMAN ALU S 11.1
417433 BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein 11.0
438379 N23018 Hs.171391 C-terminal binding protein 2 11.0
433230 AW136134 Hs.220277 ESTs 11.0
412622 AW664708 Hs.171959 ESTs 11.0
450353 AI244661 Hs.103296 ESTs, Weakly similar to S65657 alpha-1C- 11.0
434349 NM_015678 Hs.3821 neurobeachin 11.0
430261 AA305127 Hs.237225 hypothetical protein HT023 11.0
420544 AA677577 Hs.98732 Homo sapiens Chromosome 16 BAC clone CIT 11.0
424099 AF071202 Hs.139336 ATP-binding cassette, sub-family C (CFTR 11.0
442082 R41823 Hs.7413 ESTs 11.0
422890 Z43784 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G) 11.0
437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr 11.0
450244 AA007534 Hs.125062 ESTs 11.0
417169 R13550 Hs.246773 ESTs 11.0
421481 AW391972 Hs.104696 KIAA1324 protein 10.9
431576 M76665 Hs.275215 hydroxysteroid (11-beta) dehydrogenase 1 10.9
408988 AL119844 Hs.49476 Homo sapiens clone TUA8 Cri-du-chat regi 10.9
439706 AW872527 Hs.59761 ESTs, Weakly similar to DAP1_HUMAN DEATH 10.9
435133 AJ0104B2 Hs.31412 Homo sapiens cDNA FLJ11422 fis, clone HE 10.9
433658 L03678 Hs.156110 immunoglobulin kappa constant 10.9
428364 AA426565 Hs.311103 ESTs, Moderately similar to ALU1_HUMAN A 10.9
441540 C01367 Hs.127128 ESTs 10.9
431154 AW971228 Hs.290259 ESTs, Weakly similar to I38022 hypotheti 10.9
416777 AF146760 Hs.79844 DKFZP564M1416 protein 10.9
439556 AI623752 Hs.163603 ESTs 10.9
428280 H05541 Hs.183428 sarcospan (Kras oncogene-associated gene 10.8
453942 AW190920 Hs.19928 hypothetical protein SP329 10.8

447982 H22953 Hs.137551 ESTs 10.6
8.6
433759 AA680003 Hs.109363 Homo sapiens cDNA: FLJ23603 fis, clone L 8.6
434064 AL049045 Hs.180758 hypothetical protein PRO0082 8.6
401785 NM_002275*:Homo sapiens keratin 15 (KRT1 8.6
418407 AL044818 Hs.84928 nuclear transcription factor Y, beta 8.6
419647 AA348947 Hs.91816 hypothetical protein 8.6
425907 AA365752 Hs.155965 ESTs 8.6
452017 AF109302 Hs.27495 prostate cancer associated protein 7 8.6
424434 H87028 Hs.146861 hypothetical protein FLJ20580 8.6
400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 8.6
414680 AA743331 Hs.272572 hemoglobin, alpha 2 8.6
413992 W26276 Hs.136075 RNA, U2 small nuclear 8.6
415071 AK002197 Hs.284270 Homo sapiens cDNA FLJ11335 fis, clone PL 8.6
451230 BE546208 Hs.26090 hypothetical protein FLJ20272 8.6
441790 AW294909 Hs.132208 ESTs 8.6
441124 T97717 Hs.119563 ESTs 8.6
438160 AA779332 Hs.122671 ESTs 8.6
433264 D85782 Hs.3229 cysteine dioxygenase, type 1 8.6
434851 AA806164 Hs.116502 ESTs 8.6
420608 BE548277 Hs.103104 ESTs 8.6
426535 AU077012 Hs.288582 ESTs, Weakly similar to ubiquitous TPR m 8.6
411667 BE160198 gb:QV1-HTM13410200-059-h03 HT0413 Homo 8.5
454058 AI273419 Hs.135146 hypothetical protein FLJ13984 8.5
435614 R09718 Hs.20403 ESTs 8.5
446595 T57448 Hs.15467 hypothetical protein FLJ20725 8.5
435677 AA694142 Hs.293726 ESTs, Weakly similar to TSGA RAT TESTIS 8.5
447894 AW204253 Hs.21912 ESTs 8.5
452682 AA456193 Hs.9071 progesterone membrane binding protein 8.5
418372 AA311833 Hs.84318 replication protein A1 (70kD) 8.5
447231 AK001293 Hs.330208 crystallin, zeta (quinone reductase)-lik 8.5
416206 AW206248 Hs.111092 hypothetical protein FLJ22332 8.5
434747 AA837085 Hs.220585 ESTs 8.5

415938 BE383507 Hs.78921 A kinase (PRKA) anchor protein 1 8.5
450813 AI739625 Hs.203376 ESTs 8.5
413956 AI821351 Hs.193133 ESTs, Weakly similar to ALU7_HUMAN ALU S 8.5
408699 AA056614 Hs.106200 ESTs, Moderately similar to ZN91_HUMAN Z 8.5
440840 AW629686 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 8.5
407891 AA486620 Hs.41135 endomucin-2 8.5
411019 AW993097 Hs.48617 Homo sapiens cDNA FLJ12540 fis, clone NT 8.5
432620 AA777749 Hs.5978 LIM domain only 7 8.5
424511 BE300512 Hs.193557 ESTs, Moderately similar to ALU7_HUMAN A 8.5
408170 AW204516 Hs.31835 ESTs 8.5
452670 AF068227 Hs.30213 ceroid-lipofuscinosis, neuronal 5 8.5
450516 AA902656 Hs.21943 NIF3 (Ngg1 interacting factor 3, S.pombe 8.5
417560 U73338 Hs.82283 5-methyltetrahydrofolate-homocysteine me 8.5
430606 BE266026 Hs.31476 Homo sapiens cDNA FLJ13872 fis, clone TH 8.4
436953 AW959074 Hs.23648 Homo sapiens cDNA FLJ13097 fis, clone NT 8.4
418120 AA213437 Hs.192249 ESTs 8.4
419436 AA991639 Hs.242413 hypothetical protein DKFZp434K1421 8.4
432600 AI821085 gb:ns95a12.y5 NCI_CGAP_Pr3 Homo sapiens 8.4
448474 AI7920U Hs.13809 hypothetical protein FLJ10648 8.4
434384 AA631910 Hs.162849 ESTs 8.4
452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 8.4
411562 AL050201 Hs.70769 hypothetical protein DKFZp586E1923 8.4
421622 AB037748 Hs.106204 KIAA1327 protein 8.4
431160 AW971174 Hs.324330 ESTs 8.4
449567 AI990790 Hs.188614 ESTs 8.4
411088 BE247593 Hs.145053 ESTs 8.4
452186 AA120761 Hs.28307 WW domain binding protein 4 (formin bind 8.3
437872 AK002015 Hs.5887 RNA binding motif protein 7 8.3
429393 AA383024 Hs.201603 Homo sapiens mRNA; cDNA DKFZp434D0917 (f 8.3
425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 8.3
438440 AA807228 Hs.225161 ESTs 8.3
431962 AL049385 Hs.272251 Homo sapiens mRNA; cDNA DKFZp586M1418 (f 8.3
422295 AF051151 Hs.114408 toll-like receptor 5 8.3
440146 AW014231 Hs.90790 Homo sapiens cDNA: FLJ22930 fis, clone K 8.3
425242 D13635 Hs.155287 KIAA0010 gene product 8.3
432769 AA620814 Hs.144959 ESTs 8.3
433615 AA732982 Hs.269807 ESTs, Weakly similar to ALU1_HUMAN ALU S 8.3
427229 AI799751 Hs.5635 ESTs 8.3
446947 AF146747 Hs.232165 polycythemia rubra vera 1; cell surface 8.3
403149 NM_001450:Homo sapiens four and a half L 8.3
436679 AI127483 Hs.120451 ESTs, Weakly similar to unnamed protein 8.3
421991 NM_014918 Hs.110488 KIAA0990 protein 8.3
436476 AA326108 Hs.33829 bHLH protein DEC2 8.3
434980 AW770553 Hs.14553 sterol O-acyltransferase (acyl-Coenzyme 8.3
407385 AA610150 Hs.272072 ESTs, Weakly similar to I38022 hypotheti 8.3
453121 AI968264 Hs.232286 ESTs 8.3
420630 AL133101 Hs.99508 Homo sapiens mRNA; cDNA DKFZp434O0921 (f 8.3
426101 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112 (fr 8.2
419517 AF052107 Hs.90797 Homo sapiens clone 23620 mRNA sequence 8.2
425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 8.2
440080 AW051597 Hs.143707 ESTs 8.2
422049 W25760 Hs.77631 glycine cleavage system protein H (amino 8.2
430154 AW583058 Hs.234726 serine (or cysteine) proteinase inhibito 8.2
432695 D63480 Hs.278634 KIAA0146 protein 8.2
407300 AA102616 gb:zn43e07.s1 Stratagene HeLa cell s3 93 8.2
408964 AF201468 Hs.49349 beta-site APP-cleaving enzyme 8.2
418051 AW192535 Hs.19479 ESTs 8.2
412125 Y17114 Hs.73393 eyes absent (Drosophila) homolog 4 8.2
441523 AW514263 Hs.301771 ESTs, Weakly similar to ALUF_HUMAN !!!! 8.2
417958 AA767382 Hs.193417 ESTs 8.2
452040 AW973242 Hs.293690 ESTs, Weakly similar to I38022 hypotheti 8.2
430468 NM_004673 Hs.241519 angiopoietin-like 1 8.2
407802 D84145 Hs.39913 novel RGD-containing protein 8.2
420556 AA278300 Hs.124292 Homo sapiens cDNA: FLJ23123 fis, clone L 8.2
414138 AA135884 Hs.3772 ESTs 8.2
438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 8.2
408267 AW380525 Hs.32675 tubulin-specific chaperone e 8.2
429692 N48422 Hs.9977 ESTs, Weakly similar to B34087 hypotheti 8.2
408108 AI580492 Hs.42743 hypothetical protein 8.2
408418 AW963897 Hs.44743 KIAA1435 protein 8.2
430334 AI824719 Hs.328700 ESTs 8.2
422083 NM_001141 Hs.111256 arachidonate 15-lipoxygenase, second typ 8.2



TABLE 59B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number

Accession: Genbank accession numbers

Pkey CAT Number Accessions 

409385 112523J AA071267 T65940T64515 AA071334 
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409752 


115301_1 


/iiiiA^mnn A»n701QC A\M7AQAttt A AfYTIACQ OC1C1C71 AA^7KQ17 

AW963990AA078i9b AW/4940i: AA07/4bo btiblb/l AAJ/Wi/ 


411479 


1247077 J 


AW84B047 AW84o£V£ AVYo4obol AWo4ol4£ AWo4ofU* AWtwuiZi /\vvofloOJiAVVO*»oi*»UMvvo»i03/ 1 «»yohouu3 wvohouo/ mwchoud^ nvvotosua 






AW848214 


41 1667 


1253334J 


BE 1 601 98 AW9358SO 1 1 1 520 AW9 JDyoU AWobbU I & AWob l \io** 


416913 


163001_1 


AiktMA-m arnanm aacncnn A\knAaann a\kt7 aqqca C5C1fi1f">rit; AAlQfLdAQ AWRI^ARR RFlfilnOfi RF1R94QQ 
AW934714 BclblUU/ BfcibcbUu AW74yyuz AW/4y&M Be lb^yt) Bt loiUUo AAiaiwa AVVOto i K)ODCJDiuua dcid4**to 


418666 179788_1 T65754 AA229857 AA229658 

416876 179960_1 AA740616 AA654854 AA229923 


H 1 3D JO 


IOOOOO 1 


AAfiO^n*; AAP44M5 AA9441M 


419544 185760_2 AI909154 AA526337 AA244193 AI909153 

423800 232161_1 AA331156 AA331157 AA331155 

426413 266650_1 AA377823 AW954494 AI022688 

429163 300543_1 AA884766 AW974271 AA592975 M447312 

430848 324621_1 AW021726 AA487752 AA488085 

431121 328275_1 AW971157 AA492575 AA492520 

432189 342819_1 AA527941 AJ810608 AI620190 AA635266 

432600 350959_1 AI821085 AW973464 AA554802 AI821831 AA657438 AA640756 AA650339 

434415 385931_1 BE177494 AW276909 AA632849 

434579 38916_1 T55958 T57205 AF147346 

439518 47334_1 W76326 AF086341 W72300 

443161 561305_1 AI038316 AI344631 AI261653 

447974 745643_1 R76886 AI453674 R77049 
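
A row of Table 59B ties one probeset (Pkey) to a cluster (CAT number) and to the accessions in that cluster. The following is a minimal sketch of how such rows could be indexed by accession. It assumes each row has first been joined into a single line of whitespace-separated fields; the function names `parse_cluster_row` and `build_accession_index` are hypothetical and are not part of the source.

```python
def parse_cluster_row(line):
    """Split one row into (pkey, cat_number, [accessions]).

    Assumes the row is a single line: Pkey, CAT number, then one or
    more Genbank accessions, all separated by whitespace.
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return fields[0], fields[1], fields[2:]


def build_accession_index(lines):
    """Map each accession to the list of Pkeys whose cluster contains it."""
    index = {}
    for line in lines:
        if not line.strip():
            continue  # the table is double-spaced, so skip blank lines
        pkey, _cat_number, accessions = parse_cluster_row(line)
        for accession in accessions:
            index.setdefault(accession, []).append(pkey)
    return index
```

An index of this kind makes it possible to look up which probeset a given accession belongs to without scanning the whole table.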



TABLE 59C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22", Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand Nt_position 


400860 9757499 Minus 151830-152104,152649-152744 

400880 9931121 Plus 29235-29336,36363-36580 

401197 9719705 Plus 176341-176452 

401424 8176894 Plus 24223-24428 

401519 6649315 Plus 157315-157950 

401558 7139678 Plus 103510-104090 

401747 9789672 Minus 118596-118816,119119-1192H119609-119761,12M22-120990,130161-130381^ 131932 132451-132575 133580-134011 

401785 7249190 Minus 165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 

402076 8117410 Plus 128316-128627 

402812 6010110 Plus 25026-25091,25844-25920 

402855 9662953 Minus 59763-59909 

403047 3540153 Minus 59793-59968 

403137 9211494 Minus 92349-92572,92958-93084,93579-93712,93949-94072,94591-94748,95214-95337 

403149 9799833 Plus 25034-25185 

403362 8571772 Plus 64099-64260 

404210 5006246 Plus 169926-170121 

404571 7249169 Minus 112450-112648 

404641 9796610 Minus 32247-32362 

404642 9796810 Plus 102999-103145 

405523 9454643 Plus 114550-114688,117265-117407,119490-119599,123237-123395,131140-131217 
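
The Nt_position column of Table 59C encodes each predicted exon as a `start-end` pair, with pairs separated by commas. A minimal sketch of how such a field could be parsed is given below. The function names are hypothetical, and the length calculation assumes inclusive coordinates, which the source does not state.

```python
def parse_nt_positions(field):
    """Parse a field such as '151830-152104,152649-152744'.

    Returns a list of (start, end) integer pairs, one per predicted exon.
    """
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError("exon end precedes start: " + part)
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)
```

For the first row of the table (Pkey 400860), this would yield two exons of 275 and 96 nucleotides under the inclusive-coordinate assumption. Fields that were damaged in reproduction, such as the one for Pkey 401747, would raise an error rather than parse silently.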



Table 60A lists about 1166 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 
GeneChip array such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 3.0. The "average" prostate cancer level was set to the 
75th percentile amongst prostate cancers. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific 
background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the 
ratio was evaluated. 
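
The selection rule described above can be sketched as follows. This is an illustration only: the function names are hypothetical, the percentile uses linear interpolation between ranks (the source does not say which percentile convention was used), and the handling of a non-positive denominator is an assumption.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between ranks (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no values given")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def upregulation_ratio(tumor_levels, normal_levels):
    """Background-subtracted ratio of 'average' tumor to 'average' normal.

    tumor 'average'  = 75th percentile amongst prostate cancers
    normal 'average' = 85th percentile amongst non-malignant tissues
    background       = 10th percentile amongst non-malignant tissues
    """
    background = percentile(normal_levels, 10)
    numerator = percentile(tumor_levels, 75) - background
    denominator = percentile(normal_levels, 85) - background
    if denominator <= 0:
        # Not specified in the source; treated here as undefined.
        return float("nan")
    return numerator / denominator


def is_selected(tumor_levels, normal_levels, cutoff=3.0):
    """Apply the 'greater than or equal to 3.0' criterion."""
    return upregulation_ratio(tumor_levels, normal_levels) >= cutoff
```

Subtracting the same background value from numerator and denominator means a probeset with a high non-specific signal in every tissue does not reach the cutoff merely because both levels are large.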

TABLE 60A: ABOUT 1166 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of prostate tumor to normal adult body tissue 



Pkey ExAccn UnigeneID Unigene Title R1 

426747 AA535210 Hs.171995 kallikrein 3, (prostate specific antigen 56.6 

419526 AI821895 Hs.193481 ESTs 56.2 

420154 AI093155 Hs.95420 JM27 protein 44.0 

432441 AW292425 Hs.163484 ESTs 41.9 

414569 AF109293 Hs.118256 prostate cancer associated protein 1 39.9 

431616 AA508552 Hs.195839 ESTs, Weakly similar to I38022 hypotheti 37.9 

400299 X07730 Hs.171995 kallikrein 3, (prostate specific antigen 36.0 

446057 AI420227 Hs.149358 ESTs, Weakly similar to A46010 X-linked 32.9 

425075 AA506324 Hs.1852 acid phosphatase, prostate 31.1 

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 30.0 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 29.8 
400302 N48056 folate hydrolase (prostate-specific memb 28.9 

431579 AW971082 Hs.222886 ESTs, Weakly similar to TRHY_HUMAN TRICH 27.9 

432101 AI918950 Hs.123642 EphA3 26.3 

408369 R38438 Hs.182575 solute carrier family 15 (H7?7 transport 26.2 

453096 AW294631 Hs.11325 ESTs 25.7 

400287 S39329 Hs.181350 kallikrein 2, prostatic 25.5 

409262 AK000631 Hs.52256 hypothetical protein FLJ20624 24.6 

428819 AL135623 Hs.193914 KIAA0575 gene product 24.5 

401424 NM_001172:Homo sapiens arginase, type II 24.5 

453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR 22.4 

400292 AA250737 Hs.72472 BMP-R1B 22.3 

407202 N58172 Hs.109370 ESTs 22.3 

415989 AI267700 ESTs 20.2 

407709 AA456135 Hs.23023 ESTs 20.0 

407168 R45175 Hs.117183 ESTs 19.6 

433444 AW975324 Hs.129816 ESTs 19.4 

409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast 19.1 

428336 AA503115 Hs.183752 microseminoprotein, beta- 18.7 

410929 H47233 Hs.30643 ESTs 18.0 

424099 AF071202 Hs.139336 ATP-binding cassette, sub-family C (CFTR 17.9 

418961 AW967646 Hs.23023 ESTs 17.7 

452792 AB037765 Hs.30652 KIAA1344 protein 17.5 

403047 NM_005656*:Homo sapiens transmembrane pr 17.3 

433466 AA508353 Hs.105314 relaxin 1 (H1) 17.3 

418396 AI765805 Hs.26691 ESTs 17.1 

433647 AA603367 Hs.222294 ESTs 16.9 

432240 AI694767 Hs.129179 Homo sapiens cDNA FLJ13581 fis, clone PL 16.7 

430487 D87742 Hs.241552 KIAA0268 protein 16.5 

440260 AI972867 Hs.7130 copine IV 16.0 

429170 NM_001394 Hs.2359 dual specificity phosphatase 4 15.7 

429220 AW207206 ESTs 15.7 

416370 N90470 Hs.203697 ESTs, Weakly similar to I38022 hypotheti 15.3 

418819 AA228776 Hs.191721 ESTs 14.8 

421513 X00949 Hs.105314 relaxin 1 (H1) 14.8 

429918 AW873988 Hs.119383 ESTs 14.5 

420092 AA814043 Hs.88045 ESTs 14.5 

432473 AI202703 Hs.152414 ESTs 14.4 

450693 AW450461 Hs.203965 ESTs 14.4 

431548 AI834273 Hs.9711 novel protein 14.4 

432674 AA641092 Hs.257339 ESTs, Weakly similar to I38022 hypotheti 14.3 

430187 AI799909 Hs.158989 ESTs 14.3 

441690 R81733 Hs.33106 ESTs 14.3 

408621 AI970672 Hs.46638 chromosome 11 open reading frame 8 14.2 

448999 AF179274 Hs.22791 transmembrane protein with EGF-like and 13.7 

450642 R39773 Hs.7130 copine IV 13.5 

430376 AW292053 Hs.12532 chromosome 1 open reading frame 21 13.4 

432966 AA650114 Hs.325198 ESTs 13.4 

410330 AW023630 Hs.159425 ESTs 13.4 

434666 AF151103 Hs.112259 T cell receptor gamma locus 13.1 

400297 AI127076 Hs.306201 hypothetical protein DKFZp564O1278 13.0 

423073 BE252922 Hs.123119 MAD (mothers against decapentaplegic, Dr 12.8 

431474 AL133990 Hs.190642 ESTs 12.6 

418829 AA516531 Hs^5999 NK homeobox (Drosophila), family 3, A 12.5 

453861 AI026838 Hs^0120 ESTs, Weakly similar to NUCL_HUMAN NUCLE 12.5 

434217 AW014795 Hs.23349 ESTs 12.3 

416432 M14156 Hs.65112 insulin-like growth factor 1 (somatomedi 12.3 

428398 AI249368 Hs.98558 ESTs 12.2 

421059 AI654133 Hs.30212 thyroid receptor interacting protein 15 12.2 

447313 U92981 Hs.18081 Homo sapiens clone DT1P1B6 mRNA, CAG rep 12.2 

443960 AI093577 Hs.255416 hypothetical protein FLJ21986 12.1 

448290 AK002107 Hs.20843 Homo sapiens cDNA FLJ11245 fis, clone PL 12.1 

436032 AA150797 Hs.109276 latexin protein 12.0 

432527 AW975028 Hs.102754 ESTs 12.0 

434792 AA649253 Hs.132458 ESTs 12.0 

420424 AB033036 Hs.97594 KIAA1210 protein 11.9 

440529 AW207640 Hs.16478 Homo sapiens cDNA: FLJ21718 fis, clone C 11.9 

424846 AU077324 Hs.1832 neuropeptide Y 11.9 

448519 AW175665 Hs.278695 Homo sapiens prostein mRNA, complete cds 115 

418339 AA639902 Hs.104215 ESTs, Moderately similar to SPCN_HUMAN S 11.8 

432600 AI821085 gb:ns95a12.y5 NCI_CGAP_Pr3 Homo sapiens 11.8 

413597 AW302885 Hs.117183 ESTs 11.8 

446100 AW967109 Hs.13804 hypothetical protein dJ462023.2 11.7 

431448 AL137517 Hs.306201 hypothetical protein DKFZp564O1278 11.6 

424701 NM_005923 Hs.151988 mitogen-activated protein kinase kinase 11.6 

432435 BE218885 Hs.282070 ESTs 11.5 

429290 AF203032 Hs.198760 neurofilament, heavy polypeptide (200kD) 11.5 

410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 11.5 

439444 AI277652 Hs.54578 ESTs, Weakly similar to I38022 hypotheti 11.5 

450382 AA397658 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL 11.5 

452340 NM_002202 Hs.505 ISL1 transcription factor, LIM/homeodoma 11.1 

417332 AW972717 Hs.288462 hypothetical protein FLJ21511 11.0 

407021 U52077 gb:Human mariner1 transposase gene, comp 11.0 

419563 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HE 10.8 
432729 AK000292 Hs.130732 hypothetical protein FLJ20285 10.6 

400301 X03635 Hs.1657 estrogen receptor 1 10.5 

419536 AA603305 gb:np12d11.s1 NCI_CGAP_Pr3 Homo sapiens 10.5 

445472 AB006631 Hs.12784 Homo sapiens mRNA for KIAA0293 gene, par 10.5 

415539 AI733881 Hs.72472 BMP-R1B 10.4 

419264 AA877104 Hs.293672 ESTs, Weakly similar to ALUB_HUMAN !!! 10.4 

424432 AB037821 Hs.146858 protocadherin 10 10.3 

439518 W76326 gb:zd60d04.r1 Soares_fetal_heart_NbHH19W 10.3 

434036 AI659131 Hs.197733 hypothetical protein MGC2849 10.3 

415263 AA948033 Hs.130853 ESTs 10.2 

431217 NM_013427 Hs.250830 Rho GTPase activating protein 6 10.2 

444805 AB007899 Hs.12017 homolog of yeast ubiquitin-protein ligas 10.1 

437252 AI433833 Hs.164159 ESTs, Weakly similar to ALU1_HUMAN ALU S 10.1 

444190 AJ878918 Hs.10526 cysteine and glycine-rich protein 2 10.0 

418293 AI224483 Hs.16063 hypothetical protein FLJ21877 10.0 

432244 AI669973 Hs.200574 ESTs 10.0 

451640 AA195601 Hs.26771 Human DNA sequence from clone 747H23 on 9.9 

450164 AI239923 Hs.63931 ESTs 9.8 

447072 D61594 Hs.17279 tyrosylprotein sulfotransferase 1 9.8 

415079 R43179 Hs.22895 hypothetical protein FLJ23548 9.7 

409799 D11928 Hs.76845 phosphoserine phosphatase-like 9.6 

433234 AB040928 Hs.65366 KIAA1495 protein 9.5 

400296 AA305627 Hs.139336 ATP-binding cassette, sub-family C (CFTR 9.5 

422728 AW937826 Hs.103262 ESTs, Weakly similar to ZN91_HUMAN ZINC 9.5 

425154 NM_001851 Hs.154850 collagen, type IX, alpha 1 9.4 

433927 AI557019 Hs.116467 small nuclear protein PRAC 9.4 

444670 H58373 Hs.332938 hypothetical protein MGC5370 9.4 

449625 NM_014253 odz (odd Oz/ten-m, Drosophila) homolog 1 9.4 

450325 AI935962 Hs.26289 ESTs 9.4 

427761 AA412205 Hs.140996 ESTs 9.4 

453930 AA419466 Hs.36727 hypothetical protein FLJ10903 9.4 

416795 AI497778 Hs.20509 HBV pX associated protein-8 9.3 

422805 AA436989 Hs.121017 H2A histone family, member A 9.3 

418848 AI820961 Hs.193465 ESTs 9.2 

408430 S79876 Hs.44926 dipeptidylpeptidase IV (CD26, adenosine 9.2 

415788 AW628686 Hs.78851 KIAA0217 protein 9.2 

404541 NM_021965*:Homo sapiens phosphoglucomuta 9.1 

432437 W07088 Hs.293685 ESTs 9.1 

436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 9.1 

436962 AW377314 Hs.5364 DKFZP564I052 protein 9.1 

410765 AI694972 Hs.66180 nucleosome assembly protein 1-like 2 9.0 

428342 AI739168 Homo sapiens cDNA FLJ13458 fis, clone PL 9.0 

452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 9.0 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 8.8 

442082 R41823 Hs.7413 ESTs 8.8 

436671 AW137159 Hs.146151 ESTs 8.7 

415786 AW419196 Hs.257924 hypothetical protein FLJ13782 8.7 

440774 AI420611 Hs.153934 ESTs 8.7 

420120 AL049610 Hs.95243 transcription elongation factor A (SII)- 8.7 

427138 N77624 Hs.173717 phosphatidic acid phosphatase type 2B 8.7 

430722 AW968543 Hs.203270 ESTs, Weakly similar to ALU1_HUMAN ALU S 8.6 

422487 AJ010901 Hs.198267 mucin 4, tracheobronchial 8.6 

443646 AI085198 Hs.164226 ESTs 8.6 

414565 AA502972 Hs.183390 hypothetical protein FLJ13590 8.5 

428728 NM_016625 Hs.191381 hypothetical protein 8.5 

434926 BE543269 Hs.50252 mitochondrial ribosomal protein L32 8.5 

437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr 8.5 

437866 AA156781 metallothionein 1E (functional) 8.5 

453006 AI362575 Hs.303171 ESTs 8.4 

418723 AA504428 Hs.10487 Homo sapiens, clone IMAGE:3954132, mRNA, 8.3 

421040 AA715026 Hs.135280 ESTs 8.3 

414212 AA136569 Hs.10848 KIAA0187 gene product 8.3 

417916 NM_006416 Hs.82921 solute carrier family 35 (CMP-sialic aci 8.2 

440749 W22335 Hs.7392 hypothetical protein MGC3199 8.2 

412715 NM_000947 Hs.74519 primase, polypeptide 2A (58kD) 8.2 

428829 R14050 Hs.194051 Homo sapiens mRNA; cDNA DKFZp566B213 (fr 8.2 

447156 AW274731 Hs.157920 ESTs 8.1 

428330 L22524 Hs.2256 matrix metalloproteinase 7 (matrilysin, 8.1 

417412 X16896 Hs.82112 interleukin 1 receptor, type I 8.0 

424692 AA429834 Hs.151791 KIAA0092 gene product 8.0 

435981 H74319 Hs.188620 ESTs 7.9 

442081 AA401663 Hs.22380 ESTs 7.9 

434988 AI418055 Hs.161160 ESTs 7.9 

407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 7.8 

420345 AW295230 Hs.25231 ESTs 7.8 

412324 AW978439 Hs.69504 ESTs 7.8 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 7.8 

408374 AW025430 Hs.155591 forkhead box F1 7.8 

435604 AA625279 Hs.26892 uncharacterized bone marrow protein BM04 7.8 

453160 AI263307 Hs^39884 H2B histone family, member L 7.8 

424051 AL110203 Hs.138411 Homo sapiens mRNA; cDNA DKFZp586J1922 (f 7.7 

416182 NM_004354 Hs.79069 cyclin G2 7.7 

452110 T47667 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PL 7.6 






WO 03/042661 



PCT/US02/36810 



451099 R52795 Hs.25954 interleukin 13 receptor, alpha 2 7.6 

427003 U19487 Hs.2090 prostaglandin E receptor 2 (subtype EP2) 7.5 

443180 R15875 Hs.258576 claudin 12 7.5 

454119 BE549773 Hs.40510 uncoupling protein 4 7.5 

418693 AI750878 Hs.87409 thrombospondin 1 7.5 

426581 AB040956 Hs.135890 KIAA1523 protein 7.5 

417683 AW566008 Hs.239154 ankyrin repeat family A (RFXANK-like), 7.5 

439569 AW602166 Hs.222399 CEGP1 protein 7.4 

416140 AI918035 Hs.301198 roundabout (axon guidance receptor, Dros 7.4 

451230 BE546208 Hs.26090 hypothetical protein FLJ20272 7.4 

400294 N95796 Hs.278695 Homo sapiens prostein mRNA, complete cds 7.4 

436873 N23874 Hs.50477 RAB27A, member RAS oncogene family 7.4 

419743 AW408762 Hs.5957 Homo sapiens clone 24416 mRNA sequence 7.3 

428600 AW863261 Hs.242413 hypothetical protein DKFZp434K1421 7.3 

410252 AW821182 Hs.61418 microfibrillar-associated protein 1 7.3 

424649 BE242035 Hs.151461 embryonic ectoderm development 7.3 

428218 AA424266 Hs.123642 EphA3 7.3 

434158 T86534 Hs.14372 ESTs 7.3 

413249 AF167160 Hs.75251 DEAD/H (Asp-Glu-Ala-Asp/His) box binding 7.3 

430261 AA305127 Hs.237225 hypothetical protein HT023 7.3 

429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 7.3 

423566 AW976434 Hs.3623 hypothetical protein FLJ11220 7.2 

433556 W56321 Hs.111460 calcium/calmodulin-dependent protein kin 7.2 

420871 AA702972 Hs.65300 ESTs 7.2 

418278 AI088489 Hs.83937 hypothetical protein 7.1 

456516 BE172704 Hs.222746 KIAA1610 protein 7.1 

417511 AL049176 Hs.82223 chordin-like 7.1 

411667 BE160198 gb:QV1-HT0413-01020(W)5^h03 HT0413 Homo 7.1 

414422 AA147224 Hs.249195 homeobox A13 7.1 

407118 AA156790 Hs.262036 ESTs, Weakly similar to Z223_HUMAN ZINC 7.1 

421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 7.0 

418564 AA631143 Hs.278695 Homo sapiens prostein mRNA, complete cds 7.0 

437872 AK002015 Hs.5887 RNA binding motif protein 7 7.0 

448148 NM_016578 Hs.20509 HBV pX associated protein-8 7.0 

449523 NM_000579 Hs.54443 chemokine (C-C motif) receptor 5 6.9 

431657 AI345227 Hs.105448 ESTs, Weakly similar to B34087 hypotheti 6.9 

457728 AW974811 gb:EST386916 MAGE resequences, MAGN Homo 6.9 

415910 U20350 Hs.78913 chemokine (C-X3-C) receptor 1 6.9 

421977 W94197 Hs.110165 ribosomal protein L26 homolog 6.9 

432161 AK000400 Hs.341181 ESTs, Weakly similar to envelope [H.sapi 6.9 

432887 AI926047 Hs.162859 ESTs 6.8 

444931 AV652066 Hs.75113 general transcription factor IIIA 6.8 

421823 N40850 Hs.28625 ESTs 6.8 

426981 AL044675 Hs.173081 KIAA0530 protein 6.8 

425170 AU077315 Hs.154970 transcription factor CP2 6.8 

411479 AW848047 gb:IL3-CT0214-29129&452-A12 CT0214 Homo 6.8 

420954 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 6.8 

439492 AF086310 Hs.103159 ESTs 6.8 

449919 AI674685 Hs.200141 ESTs 6.8 

452221 C21322 Hs.288057 hypothetical protein FLJ22242 6.8 

431555 AI815470 Hs.260024 Cdc42 effector protein 3 6.7 

441111 AI806867 Hs.126594 ESTs 6.7 

457498 AI732230 Hs.191737 ESTs 6.7 

415293 R49462 Hs.106541 ESTs 6.7 

429163 AA884766 gb:am20a10.s1 Soares_NFL_T_GBC_S1 Homo s 6.7 

433908 AW298141 Hs.157975 ESTs 6.7 

446896 T15767 Hs.22452 Homo sapiens mRNA for KIAA1737 protein, 6.7 

431770 BE221880 Hs.268555 5'-3' exoribonuclease 2 6.7 

418575 AA225313 Hs.222886 ESTs, Weakly similar to TRHY_HUMAN TRICH 6.6 

443745 AB039670 Hs.9728 ALEX1 protein 6.6 

427958 AA418000 Hs.98260 potassium intermediate/small conductance 6.6 

447805 AW627932 Hs.302421 gemin4 6.6 

440995 T57773 Hs.10263 ESTs 6.6 

422173 BE385828 Hs.250619 phorbolin-like protein MDS019 6.6 

431992 NM_002742 Hs.2891 protein kinase C mu 6.6 

437052 AA861697 Hs.120591 ESTs 6.6 

444030 AW021254 Hs.135055 ESTs 6.6 

416836 D54745 Hs.80247 cholecystokinin 6.6 

447033 AI357412 Hs.157601 ESTs 6.5 

428927 AA441837 Hs.90250 ESTs 6.5 

432189 AA527941 gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 6.5 

448779 BE042877 Hs.177135 ESTs 6.4 

430598 AK001764 Hs.247112 hypothetical protein FLJ10902 6.4 

411630 U42349 Hs.71119 Putative prostate cancer tumor suppresso 6.4 

432682 AI376400 Hs.159588 ESTs 6.4 

441499 AW298235 Hs.101689 ESTs 6.4 

441676 BE564206 Hs.49889 ESTs 6.4 

421077 AK000061 Hs.101590 hypothetical protein 6.4 

452784 BE463857 Hs.151258 hypothetical protein FLJ21062 6.4 

452055 AI377431 Hs.141693 hypothetical protein MGC10856 6.4 

431576 M76665 Hs.275215 hydroxysteroid (11-beta) dehydrogenase 1 6.3 

418994 AA296520 Hs.89546 selectin E (endothelial adhesion molecul 6.3 

452277 AL049013 Hs.28783 KIAA1223 protein 6.3 
412953 Z45794 Hs.238809 ESTs 6.3 

441247 AW118581 Hs.128051 Homo sapiens thymic stromal lymphopoieti 6.3 

453390 AA862496 Hs.28482 ESTs 6.3 

418450 R84397 Hs.193651 ESTs, Weakly similar to alternatively sp 6.3 

407829 AA045084 Hs.29725 hypothetical protein FLJ13197 6.3 

425704 U79293 Hs.159254 Human clone 23948 mRNA sequence 6.3 

433610 AA806822 Hs.112547 ESTs 6.3 

448552 AW973653 Hs.20104 hypothetical protein FLJ00052 6.3 

412977 AA125910 Hs.191461 ESTs 6.3 

441217 AI922183 Hs.213246 ESTs 6.3 

443912 R37257 Hs.184780 ESTs 6.3 

429281 AA830856 Hs.29808 Homo sapiens cDNA: FLJ21122 fis, clone C 6.2 

435021 AA922192 Hs.54709 ESTs 6.2 

425465 L18964 Hs.1904 protein kinase C, iota 6.2 

418821 AA436002 Hs.183161 ESTs 6.2 

424511 BE300512 Hs.193557 ESTs, Moderately similar to ALU7_HUMAN A 6.2 

448106 AI800470 Hs.171941 ESTs 6.2 

408418 AW963897 Hs.44743 KIAA1435 protein 6.2 

450728 AW162923 Hs.25363 presenilin 2 (Alzheimer disease 4) 6.2 

427078 AI676062 Hs.111902 ESTs 6.2 

426931 NM_003416 Hs.2076 zinc finger protein 7 (KOX 4, clone HF.1 6.2 

449907 AA004825 Hs.103281 ESTs 6.1 

458509 AA654650 Hs.282906 ESTs 6.1 

437323 AA371145 Hs.194397 leptin receptor 6.1 

407902 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 6.1 

429302 AU076674 Hs.198899 eukaryotic translation initiation factor 6.1 

412673 AL042957 Hs.31845 ESTs 6.0 

410150 AW382942 Hs.260024 ESTs 6.0 

421863 AI952677 Hs.108972 Homo sapiens mRNA; cDNA DKFZp434P228 (fr 6.0 

446947 AF146747 Hs.232165 polycythemia rubra vera 1; cell surface 6.0 

408968 AI652236 Hs.49376 hypothetical protein FLJ20644 6.0 

409047 AW961434 Hs.31539 ESTs 6.0 

418601 AA279490 Hs.86368 calmegin 6.0 

425710 AF030880 Hs.159275 solute carrier family, member 4 6.0 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 6.0 

414905 H40873 Hs.175971 ESTs 6.0 

450295 AI766732 Hs.210628 ESTs 6.0 

412505 AA974491 Hs.21734 ESTs 6.0 

428730 AA625947 Hs.25750 ESTs 6.0 

445413 AA151342 Hs.12677 CGI-147 protein 6.0 

451424 AI862026 Hs.302810 Novel human gene mapping to chomosome 20 5.9 

425628 NM_004476 Hs.1915 folate hydrolase (prostate-specific memb 5.9 

435677 AA694142 Hs.293726 ESTs, Weakly similar to TSGA RAT TESTIS 5.9 

431359 AW993522 Hs.292934 ESTs 5.9 

404632 NM_022490:Homo sapiens hypothetical prot 5.9 

423748 AI149048 Hs.30211 hypothetical protein FLJ22313 5.9 

449118 R67477 Hs.23103 Bet1 (S. cerevisiae) homolog 5.9 

405523 C8001409*:gi|7441226|pir||S31212 collage 5.9 

448807 AI571940 Hs.7549 ESTs 5.8 

404642 NM_021965*:Homo sapiens phosphoglucomuta 5.8 

452598 AI831594 ESTs, Weakly similar to ALU7_HUMAN ALU S 5.8 

427212 AW293849 Hs.58279 ESTs, Weakly similar to ALU7_HUMAN ALU S 5.8 

435047 AA454985 Hs.54973 cadherin-like protein VR20 5.8 

408622 AA056060 Hs.202577 Homo sapiens cDNA FLJ12166 fis, clone MA 5.8 

419038 AW134924 Hs.190325 ESTs 5.8 

416913 AW934714 gb:RC1-DT0001.031 29941 1-a11 DT0001 Homo 5.7 

432432 AA541323 Hs.115831 ESTs 5.7 

435937 AA830893 Hs.119769 ESTs 5.7 

414528 AA148950 Hs.188836 ESTs 5.7 

432712 AB016247 Hs.288031 sterol-C5-desaturase (fungal ERG3, delta 5.7 

412783 BE276738 Hs.74578 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 5.7 

408826 AF216077 Hs.48376 Homo sapiens clone HB-2 mRNA sequence 5.7 

419083 AI479560 Hs.98613 Homo sapiens cDNA FLJ12292 fis, clone MA 5.7 

420184 AA188408 Hs.95665 hypothetical protein 5.7 

428493 AK001745 Hs.184628 hypothetical protein FLJ10883 5.7 

443250 AI041530 Hs.132107 ESTs 5.7 

443324 R44013 Hs.164225 ESTs 5.7 

448164 R61680 Hs.26904 ESTs, Moderately similar to Z195_HUMAN Z 5.7 

448172 N75276 Hs.135904 ESTs 5.7 

433332 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 5.6 

453455 AA063553 Hs.221931 ESTs, Weakly similar to JC1087 RNA helic 5.6 

452242 R50955 Hs.159993 glycosyltransferase 5.6 

410141 R07775 Hs.287657 Homo sapiens cDNA: FLJ21291 fis, clone C 5.6 

418019 R68911 Hs.176275 ESTs 5.6 

450813 AI739625 Hs.203376 ESTs 5.6 

419239 AA468183 Hs.184598 Homo sapiens cDNA: FLJ23241 fis, clone C 5.6 

411096 U80034 Hs.68583 mitochondrial intermediate peptidase 5.6 

451684 AF216751 Hs.26813 COA14 5.6 

427674 NM_003528 Hs.2178 H2B histone family, member Q 5.6 

407275 AI364186 gb:qw34h07.x1 NCI_CGAP_U14 Homo sapiens 5.6 

417958 AA767382 Hs.193417 ESTs 5.6 

418407 AL044818 Hs.84928 nuclear transcription factor Y, beta 5.6 

435176 AA744875 Hs.189413 ESTs 5.5 
428455 AW970976 Hs.293653 ESTs 5.5 

431316 AA502663 Hs.145037 ESTs 5.5 

434804 AA649530 Hs.348148 gb:ns44f05.s1 NCI_CGAP_Alv1 Homo sapiens 5.5 

411990 AW963624 Hs.31707 ESTs, Weakly similar to YEW4_YEAST HYPOT 5.5 

416653 AA768553 Hs.193145 metallothionein 1E (functional) 5.5 

427359 AW020782 Hs.79881 Homo sapiens cDNA: FLJ23006 fis, clone L 5.5 

432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 5.5 

416239 AI038450 Hs.48948 ESTs 5.4 

421470 R27496 Hs.1378 annexin A3 5.4 

408177 AI241733 Hs.43871 ESTs 5.4 

424084 AI940675 Hs^0914 hypothetical protein FLJ23056 5.4 

414171 AA360328 Hs.865 RAP1A, member of RAS oncogene family 5.4 

448072 AI459306 Hs.24908 ESTs 5.4 

420397 NM_007018 Hs.97437 centrosomal protein 1 5.4 

429250 H56585 Hs.198308 tryptophan rich basic protein 5.4 

453070 AK001465 Hs.31575 SEC63, endoplasmic reticulum translocon 5.4 

429165 AW009886 Hs.118258 prostate cancer associated protein 1 5.4 

418866 T65754 gb:yc11c07.s1 Stratagene lung (937210) H 5.4 

428364 AA426565 Hs.311103 ESTs, Moderately similar to ALU1_HUMAN A 5.4 

431467 N71831 Hs.256398 Homo sapiens mRNA; cDNA DKFZp434E0528 (f 5.4 

436679 AI127483 Hs.120451 ESTs, Weakly similar to unnamed protein 5.4 

447188 H65423 Hs.17631 hypothetical protein DKFZp434E2135 5.4 

457211 AW972565 Hs.32399 ESTs, Weakly similar to S51797 vasodilat 5.3 

412677 AW029608 Hs.17384 ESTs 5.3 

430412 AW341754 Hs.189305 ESTs 5.3 

423250 BE061916 Hs.125849 chromosome 8 open reading frame 2 5.3 

421433 AI829192 Hs.22380 ESTs 5.3 

435572 AW975339 Hs.239828 ESTs, Weakly similar to GAG2_HUMAN RETRO 5.3 

458571 AV653731 Hs.282829 ESTs, Moderately similar to PC4259 ferri 5.3 

441054 AA913591 Hs.126480 ESTs 5.3 

431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 5.3 

433409 AI278802 Hs.25661 ESTs 5.3 

441102 AA973905 intermediate filament protein syncoilin 5.3 

453387 AI990741 Hs.252809 ESTs 5.3 

421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 5.3 

433923 AI823453 Hs.148625 ESTs 5.3 

426503 AA380153 gb:EST93093 Skin tumor I Homo sapiens cD 5.2 

447574 AF162666 Hs.18895 tousled-like kinase 1 5.2 

421896 N62293 Hs.45107 ESTs 5.2 

410870 U81599 Hs.66731 homeobox B13 5.2 

420729 AW964897 Hs.290825 ESTs 5.2 

425066 M82882 Hs.154365 E74-like factor 1 (ets domain transcript 5.2 

429467 NM_004477 Hs.203772 FSHD region gene 1 5.2 

447816 NM_007233 Hs^74329 TP53 target gene 1 5.2 

446553 AB021179 Hs.15299 HMBA-inducible 5.2 

453308 AW959731 Hs.323099 ESTs 5.2 

452576 AB023177 Hs.29900 KIAA0960 protein 5.2 

421437 AW821252 Hs.104336 hypothetical protein 5.2 

422295 AF051151 Hs.114408 toll-like receptor 5 5.2 

453942 AW190920 Hs.19928 hypothetical protein SP329 5.2 

400424 AJ276316 Hs.287374 zinc finger protein 304 5.2 

430387 AW372884 Hs.240770 nuclear cap binding protein subunit 2, 2 5.2 

452588 AA889120 Hs.110637 homeobox A10 5.2 

448944 AB014605 Hs.22599 atrophin-1 interacting protein 1; activi 5.1 

413991 H44725 Hs.176090 ESTs 5.1 

444454 BE018316 Hs.11183 sorting nexin 2 5.1 

446795 AI797713 Hs.156471 ESTs 5.1 

407300 AA102616 Hs.120769 gb:zn43e07.s1 Stratagene HeLa cell s3 93 5.1 

432340 AA534222 gb:nj21d01.s1 NCI_CGAP_AA1 Homo sapiens 5.1 

446469 BE094848 Hs.15113 homogentisate 1,2-dioxygenase (homogenti 5.1 

453293 AA382267 Hs.10653 ESTs 5.1 

408920 AL120071 Hs.48998 fibronectin leucine rich transmembrane p 5.1 

409643 AW450866 Hs.257359 ESTs 5.1 

427723 AI355260 Hs.279789 histone deacetylase 3 5.1 

438157 AW137011 Hs.49576 ESTs 5.1 

420344 BE463721 Hs.97101 putative G protein-coupled receptor 5.1 

432251 AW972983 Hs.232165 polycythemia rubra vera 1; cell surface 5.1 

418051 AW192535 Hs.19479 ESTs 5.1 

419926 AW900992 Hs.93796 DKFZP586D2223 protein 5.1 

425843 BE313280 Hs.159627 death associated protein 3 5.1 

440594 AW445167 Hs.126036 ESTs 5.1 

452449 AW068658 Hs.20943 ESTs 5.1 

429769 NM_004917 Hs.218366 kallikrein 4 (prostase, enamel matrix, p 5.1 

428898 AB033070 Hs.194408 KIAA1244 protein 5.1 

415339 NM_015156 Hs.78398 KIAA0071 protein 5.1 

450671 AI355967 Hs.43086 ESTs, Weakly similar to A46010 X-linked 5.1 

444484 AK002126 Hs.11260 hypothetical protein FLJ11264 5.0 

433023 AW864793 Hs.87409 thrombospondin 1 5.0 

433862 D86960 Hs.3610 KIAA0205 gene product 5.0 

451244 AW008798 Hs.343877 hypothetical protein FLJ20039 5.0 

418624 AI734080 Hs.104211 ESTs 5.0 

430291 AV660345 Hs.238126 CGI-49 protein 5.0 

432261 R42216 Hs.12342 Homo sapiens clone 24538 mRNA sequence 5.0 
429922 Z97630 Hs.226117 H1 histone family, member 0 5.0 

420218 AW958037 Hs.286 rfoosomal prolein L4 5.0 

425242 D13635 Hs.155287 K1AAO0 10 gene product 5.0 

427176 AW381569 Hs.40334 ESTs 5.0 

5 431585 BE2428Q3 Hs.252823 hypolheticd protein FU 10326 5.0 

439699 AF086534 Hs.1 87561 ESTs, Moderately similar to ALU INHUMAN A 5.0 

412350 AI659306 Hs.73826 protein tyrosine phosphatase, non-recept 5.0 

450244 AA007534 Hs.125062 ESTs 5.0 

439731 AI953135 Hs.45140 hypothetical protein FU 14084 5.0 

10 434237 AF1 19908 Hs.235516 hypothetical protein PR02955 5.0 

418700 AI96380B Hs.86970 ESTs, Moderately similar to ALU5_HUMAN A 5.0 

445866 H20B99 Hs, 13399 Homo sapiens done 25032 mRNA sequence 4.9 

408331 NM 007240 Hs.44229 dual specificity phosphatase 12 4.9 

417407 AA923278 Hs.290905 ESTs, Weakly simHar to protease iH.sapi 4.9 

15 431325 AW026751 Hs.5794 ESTs, WeaWy similar to 2109260A B ceil 4.9 

444151 AW972917 Hs.128749 a!pha^nethylacyl-CoA racemase 4.9 

401451 NMJJ04496*:Homo sapiens hepatocyte nucte 4.9 

416434 AW163045 Hs.79334 nuclear factor, inteHeuldn 3 regulated 4.9 

419839 U24577 Hs.93304 phosphofipase A2, group VII (p!alelet-ac 4.9 

20 442281 N34742 Hs.170065 Homo sapiens cDNA FU 13492 fis, done PL 4.9 

420608 BE548277 Hs.103104 ESTs 4.9 

420911 U77413 Hs.100293 O-linked N-aceryiglucosamine (GlcNAc) tr 4.9 

424332 AA338919 Hs.101615 ESTs 4.9 

430523 AW451385 Hs.161954 ESTs 4.9 

25 449300 A1656959 Hs.346514 ESTs 4.9 

426657 NM 015865 Hs.171731 solute carrier family 14 (urea transport 4.9 

422299 AK000181 Hs.1 14556 hypothetical protein FU 20 174 4.9 

438321 AA576635 Hs.6153 CGI-48 protein 4.9 

452744 AI267652 Hs.246107 Homo sapiens mRNA; cONA DKFZp434E0B2 (fr 4.9 

30 418818 AA228899 Hs.101307 Homo sapiens HUT 11 protein mRNA, partial 4.8 

425174 D87450 Hs.154978 KIAA0261 protein 4.8 

430458 AA479300 Hs.225706 ESTs, Weakly similar to 1 38022 hypolheti 4.8 

455497 AA1 12573 Hs.278695 Homo sapiens prostein mRNA, complete cds 4,8 

416208 AW291168 Hs.41295 ESTs. Weakly similar to MUC2JHUMAN MUCIN 4.8 

35 449539 W80363 Hs.58446 ESTs 4.8 

450654 AJ245587 Hs.25275 Kruppel-type zinc finger protein 4.8 

434228 Z42047 Hs.283978 Homo sapiens PR02751 mRNA, complete cds 4.8 

407385 AA610150 Hs.272072 ESTs, WeaWy similar to I38022 hypotheti 4.8 

420210 AI557257 Hs.44811 ESTs 4.8 

40 425905 AB032959 Hs.318584 novel C3HC4 type Zinc finger (ring tinge 4.8 

428647 AA830050 Hs.124344 ESTs 4.8 

434015 AA844518 Hs.300876 hypothetical protein FU13386 4.8 

441879 AI521936 Hs.1 07149 novel protein smnilar to archaeal, yeast 4.8 

408990 AL022395 Hs.49526 tt»x and teucmenich repeat protein 4 4.8 

45 416030 H15261 Hs.21948 ESTs 4.8 

420948 AB016898 Hs.1 00469 myeloid/lymphoid or mixed-lineage leukem 4.8 

423749 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 4.8 

410268 AA316181 Hs.61635 six transmembrane epithelial antigen of 4.7 

452323 W44356 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 4.7 

50 443684 AI681307 Hs.55098 ESTs 4.7 

415621 AI648602 Hs.55468 ESTs 4.7 

420851 AA281062 Hs.29493 hypothetical prolein FU20142 4.7 

424433 H04607 Hs.9218 ESTs 4.7 

436703 AW880514 Hs.146381 RNA binding motif protein, X chromosome 4.7 

55 451806 NM 003729 Hs.27076 RNA 3Merrninal phosphate cyclase 4.7 

433293 AF007835 Hs.32417 hypothetical protein MGC4309 4.7 

434973 AW449285 Hs.313636 EST 4.7 

433560 AI925195 Hs.1 30891 hypothetical protein MGC4400 4.7 

431587 NM.016179 Hs.262360 transient receptor potential channel 4 4.7 

60 452260 AA453208 RAB9, member RAS oncogene family 4.7 

412846 AW961245 Hs.55896 Homo sapiens PAC done RP5-978E1 8 from 7 4 J 

418836 AI655499 Hs.161712 ESTs 4.7 

40155B ENSP00000220478VSECRETOGRANIN 111. 4.7 

441647 AA534210 Hs.285280 Homo sapiens cDNA: FU22096 fis, clone H 4.7 

65 443881 R64512 Hs.237146 hypothetical protein FU 12752 4.7 

417169 R13550 Hs^46773 ESTs 4.7 

423349 AF010258 Hs.127428 homeoboxA9 4.7 

458611 AI268407 Hs.211458 DC^pecific transmembrane protein 4.7 

444951 AI783767 Hs.148635 ESTs, Moderately similar to ALUB.HUMAN I 4.6 

70 416774 AI005169 Hs.28274 Homo sapiens cDNA: FU22049 fis. clone H 4.6 

419465 AW500239 Hs.21187 Homo sapiens cDNA: FU23068 fis, clone L 4.6 

423242 AL039402 Hs.125783 DEME-6 protein 4.6 

452627 AI122843 Hs.184319 ESTs, Weakly similar to KIAA1006 protein 4.6 

419005 T86358 Hs.193931 ESTs, Weakly similar to I54374 gene NF2 4.6 

403046 NM_005656*:Homo sapiens transmembrane pr 4.6 

416602 NM_006159 Hs.79389 nel (chicken)-like 2 4.6 

407426 AF129533 gb:Homo sapiens F-box protein Fbl3b (FBI 4.6 

426011 AW996096 Hs.58924 ESTs, Weakly similar to JC5594 jerky gen 4.6 

442138 AA445973 Hs.13303 Homo sapiens cDNA: FLJ21784 fis, clone H 4.6 

446015 T30968 Hs.13531 hypothetical protein FLJ10971 4.6 

452994 AW962597 Hs.31305 KIAA1547 protein 4.6 

446091 AW022192 Hs.200197 ESTs 4.6 

432908 AI861896 Hs.304505 ESTs 4.6 






WO 03/042661 



PCT/US02/36810 



430226 BE245562 Hs.2551 adrenergic, beta-2-, receptor, surface 4.6 

431673 AW971302 Hs.293233 ESTs 4.6 

446183 AA354991 Hs.14222 Homo sapiens mRNA; cDNA DKFZp761P019 (fr 4.6 

452355 N54926 Hs.29202 G protein-coupled receptor 34 4.6 

432093 H28383 gb:yl52c03.r1 Soares breast 3NbHBst Homo 4.6 

448402 BE244226 Hs.21094 RAB18, member RAS oncogene family 4.6 

409019 AW385412 Hs.9615 myosin regulatory light chain 2, smooth 4.6 

413812 AW186687 Hs.44748 ESTs 4.6 

414343 AL036166 Hs.323378 coated vesicle membrane protein 4.6 

434423 NM_006769 Hs.3844 LIM domain only 4 4.5 

407355 AA846203 Hs.193974 ESTs, Weakly similar to ALU1_HUMAN ALU S 4.5 

421724 AB037832 Hs.107287 KIAA1411 protein 4.5 

412125 Y17114 Hs.73393 eyes absent (Drosophila) homolog 4 4.5 

437330 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 4.5 

446720 AI439136 Hs.140546 ESTs 4.5 

448664 AI879317 Hs.334691 splicing factor 3a, subunit 1, 120kD 4.5 

415083 AI632683 Hs.27179 Homo sapiens cDNA FLJ12933 fis, clone NT 4.5 

423412 AF109300 prostate cancer associated protein 5 4.5 

433507 AI817336 Hs.191791 ESTs 4.5 

428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 4.5 

423782 AI472209 Hs.323117 ESTs 4.5 

456362 AW973003 Hs.179909 hypothetical protein FLJ22995 4.5 

415451 H19415 Hs.268720 ESTs, Moderately similar to ALU1_HUMAN A 4.5 

419745 AF042001 Hs.93005 slug (chicken homolog), zinc finger prot 4.5 

428715 AW293716 Hs.53126 ESTs 4.5 

429857 AF089897 Hs.294030 topoisomerase-related function protein 4 4.5 

459561 AI547306 ESTs 4.5 

416155 AI807264 Hs.205442 ESTs, Weakly similar to T34036 hypotheti 4.5 

414272 AI651603 Hs.46988 ESTs 4.5 

424690 BE538356 Hs.151777 eukaryotic translation initiation factor , 4.5 

417350 U50928 Hs.82001 polycystic kidney disease 2 (autosomal d 4.5 

450094 AI174947 Hs.295789 Homo sapiens mRNA; cDNA DKFZp564D1164 (f 4.5 

459324 AW080953 gb:xc28c12.x1 NCI_CGAP_Co18 Homo sapiens 4.5 

429784 M89796 Hs.30 membrane-spanning 4-domains, subfamily A 4.5 

401519 C15000476*:gi|12737279|ref|XP_012163.1| 4.5 

417576 AA339449 Hs.82285 phosphoribosylglycinamideformyltransfer 4.5 

427484 N32859 Hs.37288 nuclear receptor subfamily 1, group D, m 4.5 

417061 AI675944 Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 4.4 

418858 AW961605 Hs.21145 hypothetical protein RG083M05.2 4.4 

433517 AW022133 Hs.189838 ESTs 4.4 

429559 AI985345 Hs.26425 ESTs 4.4 

452338 AW608920 Hs.29159 zinc finger protein 75 (D8C6) 4.4 

409190 AU076536 Hs.50984 sarcoma amplified sequence 4.4 

444922 AI921750 Hs.144871 Homo sapiens cDNA FLJ13752 fis, clone PL 4.4 

416547 H62914 Hs.268946 ESTs, Weakly similar to PC4259 ferritin 4.4 

430320 BE245290 Hs.239218 uncharacterized hypothalamus protein HCO 4.4 

436953 AW959074 Hs.23648 Homo sapiens cDNA FLJ13097 fis, clone NT 4.4 

438497 AA808725 Hs.291712 ESTs, Weakly similar to I38022 hypotheti 4.4 

434384 AA631910 Hs.162849 ESTs 4.4 

444564 AI167877 Hs.143716 ESTs 4.4 

447500 AI381900 Hs.159212 ESTs 4.4 

425259 AL049280 Hs.155397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr 4.4 

408380 AF123050 Hs.44532 diubiquitin 4.4 

448766 AI473827 Hs.31793 ESTs 4.4 

417421 AL138201 Hs.82120 nuclear receptor subfamily 4, group A, m 4.4 

432810 AA863400 ESTs 4.4 

437812 AI582291 Hs.16846 ESTs, Weakly similar to O4HUD1 debrisoqu 4.4 

447247 AW369351 Hs.287955 Homo sapiens cDNA FLJ13090 fis, clone NT 4.4 

434022 R18374 Hs.117956 ESTs 4.4 

422365 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic sub 4.4 

417728 AW138437 Hs.24790 KIAA1573 protein 4.4 

420407 AA814732 Hs.145010 lipopolysaccharide-specific response 5-D 4.4 

425810 AI923627 Hs.31903 ESTs 4.4 

432882 NM_013257 Hs.279696 serum/glucocorticoid regulated kinase-l 4.3 

448550 AL161983 Hs.21415 Homo sapiens mRNA; cDNA DKFZp761K2024 (f 4.3 

410227 AB009284 Hs.61152 exostoses (multiple)-like 2 4.3 

417845 AL117461 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f 4.3 

452335 AW188944 Hs.61272 ESTs 4.3 

429716 R25685 Hs.211933 collagen, type XIII, alpha 1 4.3 

450316 W84446 Hs.226434 hypothetical protein MGC4643 4.3 

408527 AL135018 Hs.33074 Homo sapiens, clone IMAGE:3606519, mRNA, 4.3 

419852 AW503756 Hs.286184 hypothetical protein dJ551D2.5 4.3 

424534 D87682 Hs.150275 KIAA0241 protein 4.3 

424850 AA151057 Hs.153498 chromosome 18 open reading frame 1 4.3 

435023 AI692552 gb:wd73f12.x1 NCI_CGAP_Lu24 Homo sapiens 4.3 

442914 AW188551 HsJ9519 hypothetical protein FLJ14007 4.3 

452679 Z42387 Hs.83883 transmembrane, prostate androgen induced 4.3 

421163 AA375974 Hs.32450 ESTs, Weakly similar to T23762 hypotheti 4.3 

420405 AA743396 Hs.189023 ESTs 4.3 

450330 AW500775 Hs.24817 hypothetical protein FLJ20136 4.3 

423998 H29138 Hs.157113 coenzyme Q, 7 (rat, yeast) homolog 4.3 

439764 T26535 Hs.22744 hypothetical protein MGC13105 4.3 

446351 AW444551 Hs.35380 x 001 protein 4.3 
431583 AL042613 Hs.262476 S-adenosylmethionine decarboxylase 1 4.3 

414312 AA155694 Hs.191060 ESTs 4.3 

438379 N23018 Hs.171391 C-terminal binding protein 2 4.3 

411928 AA888624 Hs.197289 rab3 GTPase-activating protein, non-cata 4.3 

414680 AA743331 hemoglobin, alpha 2 4.3 

423104 AJ005273 Hs.123647 antigenic determinant of recA protein (m 4.3 

440840 AW629666 ESTs, Weakly similar to S64054 hypotheti 4.3 

449802 AW901804 Hs.23984 hypothetical protein FLJ20147 4.3 

434874 N62448 Hs.293970 methylmalonate-semialdehyde dehydrogenas 4.2 

431429 AF072813 reticulon 3 4.2 

413492 D87470 Hs.75400 KIAA0280 protein 4.2 

419929 U90268 Hs.93810 cerebral cavernous malformations 1 4.2 

435846 AA700870 Hs.14304 ESTs 4.2 

441523 AW514263 Hs.301771 ESTs, Weakly similar to ALUF_HUMAN !!!! 4.2 

450546 AA010200 Hs.175551 ESTs 4.2 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4.2 

416445 AL043004 Hs.79337 KIAA0135 protein 4.2 

409151 AA305105 Hs.50785 SEC22, vesicle trafficking protein (S. c 4.2 

412707 AW206373 Hs.16443 Homo sapiens cDNA: FLJ21721 fis, clone C 4.2 

442991 BE281238 Hs.8886 hypothetical protein FLJ20424 4.2 

447439 AA313565 Hs.145020 ESTs, Weakly similar to KIAA1205 protein 4.2 

449685 AW296669 Hs.66095 ESTs 4.2 

407204 R41933 Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU S 4.2 

450377 AB033091 Hs.74313 KIAA1265 protein 4.2 

419647 AA348947 Hs.91816 hypothetical protein 4.2 

442049 AA310393 Hs.190044 ESTs 4.2 

430669 AW969657 Hs.291029 ESTs 4.2 

424181 AL039482 Hs.142517 Homo sapiens mRNA; cDNA DKFZp434P0810 (f 4.1 

443634 H73972 Hs.134460 ESTs 4.1 

426216 N77630 Hs.13895 Homo sapiens cDNA FLJ11654 fis, clone HE 4.1 

422634 NM_016010 Hs.118821 CGI-62 protein 4.1 

445895 D29954 Hs.13421 KIAA0056 protein 4.1 

447669 AL049985 Hs.19180 Homo sapiens mRNA; cDNA DKFZp564E122 (fr 4.1 

407198 H91679 gb:yv04a07.s1 Soares fetal liver spleen 4.1 

447752 M73700 Hs.105938 lactotransferrin 4.1 

402855 NM_001839*:Homo sapiens calponin 3, acid 4.1 

443161 AI038316 gb:ox48c08.x1 Soares_total_fetus_Nb2HF8_ 4.1 

415827 H17462 Hs.23079 ESTs 4.1 

408161 AW952912 Hs.300383 hypothetical protein MGC3032 4.1 

418365 AW014345 Hs.161690 ESTs 4.1 

429525 N92540 Hs.205353 ectonucleoside triphosphate diphosphohyd 4.1 

435401 R44477 Hs.10056 hypothetical protein FLJ14621 4.1 

446657 AI335191 Hs.260702 ESTs, Weakly similar to 2109260A B cell 4.1 

421141 AW117261 Hs.125914 ESTs 4.1 

430335 D80007 Hs.239499 KIAA0185 protein 4.1 

435020 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 4.1 

445309 AL157474 Hs.12504 likely ortholog of mouse Arkadia 4.1 

443547 AW271273 Hs.23767 hypothetical protein FLJ12666 4.1 

410763 AF279145 Hs.8966 hypothetical protein FLJ21776 4.1 

410592 R94088 Hs.43569 ESTs 4.1 

423698 AA329796 Hs.1098 DKFZp434J1813 protein 4.1 

428634 AA811845 Hs.106290 Kelch motif containing protein 4.1 

430253 AK001514 Hs.236844 hypothetical protein FLJ10652 4.1 

425211 M18667 Hs.1867 progastricsin (pepsinogen C) 4.0 

443273 AI042063 Hs.132156 ESTs 4.0 

428055 AA420564 Hs.101760 ESTs 4.0 

425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 4.0 

451294 AI457338 Hs.29894 ESTs 4.0 

430519 AF129534 Hs.49210 F-box only protein 4 4.0 

441766 R53790 Hs.23294 hypothetical protein FLJ14393 4.0 

431117 AF003522 Hs.250500 delta (Drosophila)-like 1 4.0 

446354 AW449650 Hs.346335 ESTs 4.0 

451468 AW503398 Hs.293663 ESTs, Moderately similar to I38022 hypot 4.0 

409705 BE158773 Hs.213207 ESTs 4.0 

447082 T85314 thioredoxin-like 4.0 

418594 AI732083 Hs.187619 ESTs 4.0 

426501 AW043782 Hs.293616 ESTs 4.0 

436797 AA731491 Hs.334477 hypothetical protein MGC14879 4.0 

416288 H51299 gb:yp07c06.s1 Soares breast 3NbHBst Homo 4.0 

419865 NM_007020 Hs.93502 U1-snRNP binding protein homolog (70kD) 4.0 

425920 AL049977 Hs.162209 claudin 8 4.0 

416658 U03272 Hs.79432 fibrillin 2 (congenital contractural ara 4.0 

452367 U71207 Hs.29279 eyes absent (Drosophila) homolog 2 4.0 

433209 AB040907 Hs.278436 KIAA1474 protein 4.0 

428801 AW277121 Hs.254881 ESTs 4.0 

419629 AB020695 Hs.91662 KIAA0888 protein 4.0 

451061 AW291487 Hs.213659 ESTs, Weakly similar to KIAA1357 protein 3.9 

420544 AA677577 Hs.98732 Homo sapiens Chromosome 16 BAC clone OT 3.9 

400695 C11002514*:gi|11280151|pir||E82756 beta- 3.9 

417173 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 3.9 

419985 H66373 Hs.5856 ESTs, Highly similar to bA393J16.3 [H.sa 3.9 

429340 N35938 Hs.199429 Homo sapiens mRNA; cDNA DKFZp434M2216 (f 3.9 

433759 AA680003 Hs.109363 Homo sapiens cDNA: FLJ23603 fis, clone L 3.9 
428862 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca 3.9 

413450 Z99716 Hs.75372 N-acetylgalactosaminidase, alpha- 3.9 

427615 BE410107 Hs.179817 CGI-82 protein 3.9 

450649 NM_001429 Hs.25272 E1A binding protein p300 3.9 

445798 NM_012421 Hs.13321 rearranged L-myc fusion sequence 3.9 

416198 H27332 Hs.99598 hypothetical protein MGC5338 3.9 

431147 AI767751 Hs.20300 ESTs 3.9 

439192 AW970536 Hs.105413 ESTs 3.9 

446157 BE270828 Hs.131740 Homo sapiens cDNA: FLJ22562 fis, clone H 3.9 

451900 AB023199 Hs.27207 KIAA0982 protein 3.9 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 3.9 

426083 AW962712 Hs.126712 ESTs, Weakly similar to AF191020 1 E21G5 3.9 

442320 AI287817 Hs.129636 ESTs 3.9 

447541 AK000288 Hs.18800 hypothetical protein FLJ20281 3.9 

444636 T96667 Hs.17877 ESTs 3.9 

432103 T15803 Hs.272458 protein phosphatase 3 (formerly 2B), cat 3.9 

412576 AA447718 Hs.107057 ESTs 3.9 

423952 AW877787 Hs.136102 KIAA0853 protein 3.9 

437916 BE566249 Hs.20999 hypothetical protein FLJ23142 3.9 

442242 AV647908 Hs.90424 Homo sapiens cDNA: FLJ23285 fis, clone H 3.9 

447280 BE617907 Hs.97635 ESTs 3.9 

421750 AK000768 Hs.107872 hypothetical protein FLJ20761 3.8 

450580 N40087 ESTs 3.8 

432359 AA076049 Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 3.8 

429323 NM_001649 Hs.2391 apical protein, Xenopus laevis-like 3.8 

407332 AI801565 Hs.200113 Homo sapiens cDNA FLJ11379 fis, clone HE 3.8 

426413 AA377823 gb:EST90805 Synovial sarcoma Homo sapien 3.8 

412652 AI801777 Hs.260024 ESTs 3.8 

426226 AA769045 gb:oa80h07j»1 NCI_CGAP_GCB1 Homo sapiens 3.8 

437816 AI823445 Hs.280699 ESTs 3.8 

444534 AW271626 Hs.42294 ESTs 3.8 

459241 AA032276 Hs.99010 ESTs, Moderately similar to T14342 NSD1 3.8 

408242 AA251594 Hs.43913 PIBF1 gene product 3.8 

443484 AI091458 Hs.134559 ESTs 3.8 

422083 NM_001141 Hs.111256 arachidonate lipoxygenase, second typ 3.8 

428043 T92248 Hs.2240 uteroglobin 3.8 

407192 AA609200 gb:af12e0ls1 Soares_testis_NHT Homo sap 3.8 

421590 AF004715 Hs.105940 jerky (mouse) homolog-like 3.8 

453020 AL162039 Hs.31422 Homo sapiens mRNA; cDNA DKFZp434M229 (fr 3.8 

437372 AA323968 Hs.283631 hypothetical protein DKFZp547G183 3.8 

432278 AL137506 Hs.274256 hypothetical protein FLJ23563 3.8 

407917 U63139 Hs.41587 RAD50 (S. cerevisiae) homolog 3.8 

408547 AA574291 Hs.57837 ESTs 3.8 

427635 BE397988 Hs.179982 tumor protein p53-binding protein 3.8 

430968 AW972830 gb:EST384925 MAGE resequences, MAGL Homo 3.8 

452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 3.8 

453698 AA037615 Hs.42746 ESTs 3.8 

408875 NM_015434 Hs.48604 DKFZP434B168 protein 3.8 

431197 AL038596 Hs.250745 polymerase (RNA) III (DNA directed) (62k 3.8 

434072 H70854 Hs.283059 Homo sapiens PRO1082 mRNA, complete cds 3.8 

438136 R98299 Hs.177502 ESTs 3.8 

447906 AL050062 Hs.19999 DKFZP566K023 protein 3.8 

415992 C05837 Hs.145807 hypothetical protein FLJ13593 3.7 

435655 AW105663 Hs.6947 HSPC069 protein 3.7 

430607 AW973521 Hs.247324 mitochondrial ribosomal protein S14 3.7 

439398 AA284267 Hs.221504 ESTs 3.7 

448515 H68441 Hs.13528 hypothetical protein FLJ14054 3.7 

445242 BE156478 Hs.21108 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7 

418576 AW968159 Hs.289104 Epithelial calcium channel 2, CaT-like A 3.7 

436024 AI800041 Hs.190555 ESTs 3.7 

407783 AW996872 Hs.172028 a disintegrin and metalloproteinase doma 3.7 

436207 AA334774 Hs.12845 hypothetical protein MGC13159 3.7 

442861 AA243837 Hs.57787 ESTs 3.7 

448207 AI475490 Hs.170577 ESTs 3.7 

450628 AW382884 Hs.204715 ESTs 3.7 

432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 3.7 

438613 C05569 Hs.243122 hypothetical protein FLJ13057 similar to 3.7 

443031 AW134696 Hs.49418 ESTs 3.7 

447608 AW205042 Hs.18955 Homo sapiens cDNA FLJ20667 fis, clone KA 3.7 

408312 AF263613 Hs.44198 intracellular membrane-associated calciu 3.7 

412777 AJ335773 Hs.270123 ESTs 3.7 

424602 AK002055 Hs.151046 hypothetical protein FLJ11193 3.7 

424882 AI379461 Hs.153636 far upstream element (FUSE) binding prot 3.7 

425898 AA365649 Hs.269478 ESTs, Weakly similar to PC4259 ferritin 3.7 

443323 BE560621 Hs.9222 estrogen receptor binding site associate 3.7 

452129 AW291379 Hs.212827 ESTs 3.7 

453927 AA082465 Hs.125031 choline/ethanolaminephosphotransferase 3.7 

432336 NM_002759 Hs.274382 protein kinase, interferon-inducible dou 3.7 

425988 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypotheti 3.7 

443123 AA094538 Hs.272808 putative transcription regulation nuclea 3.7 

445467 AI239832 Hs.15617 ESTs, Weakly similar to ALU4_HUMAN ALU S 3.7 

423855 AA331761 Hs.254859 ESTs 3.7 

403790 NM_001334*:Homo sapiens cathepsin O (CTS 3.7 


434470 AA634818 Hs.298138 ESTs 3.7 

452032 BE244005 Hs.27610 retinoic acid- and interferon-inducible 3.7 

420026 AI831190 Hs.166676 ESTs 3.7 

421838 AW881089 Hs.108806 Homo sapiens mRNA; cDNA DKFZp566M0947 (f 3.7 

449916 T60525 Hs.299221 pyruvate dehydrogenase kinase, isoenzyme 3.7 

452270 AW975014 Hs.26 ferrochelatase (protoporphyria) 3.7 

445941 AI267371 Hs.172636 ESTs 3.6 

416882 AI633044 tryptophanyl tRNA synthetase 2 (mitochon 3.6 

412533 AA679863 Hs.69606 ESTs 3.6 

431740 N75450 Hs.183412 ESTs, Moderately similar to AF116721 67 3.6 

442710 AI015631 Hs.23210 ESTs 3.6 

448212 AW75858 gMc87d07.x1 NCI_CGAP_CLL1 Homo sapiens 3.6 

413431 AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo 3.6 

453038 AW292415 Hs.20509 HBV pX associated protein-8 3.6 

427661 AA410292 Hs.104761 ESTs 3.6 

420923 AF097021 Hs.273321 differentially expressed in hematopoieti 3.6 

414844 AA296874 Hs.77494 deoxyguanosine kinase 3.6 

441866 BE464341 Hs.21201 nectin 3; DKFZP566B0846 protein 3.6 

406815 AA833930 Hs.288036 tRNA isopentenylpyrophosphate transferas 3.6 

423482 BE280172 Hs.129228 galactokinase 2 3.6 

424677 U09414 Hs.151689 zinc finger protein 137 (clone pHZ-30) 3.6 

430160 AW968210 Hs.293957 ESTs, Weakly similar to ALUC_HUMAN !!!! 3.6 

433672 BE281165 Hs.288038 TLS-associated serine-arginine protein 1 3.6 

433887 AW204232 Hs.279522 ESTs 3.6 

452670 AF068227 Hs.30213 ceroid-lipofuscinosis, neuronal 5 3.6 

411890 H92738 Hs.75811 N-acylsphingosine amidohydrolase (acid c 3.6 

432728 NM_006979 Hs.278721 HLA class II region expressed gene KE4 3.6 

431724 AA514535 Hs.283704 ESTs 3.6 

435703 AW630133 Hs.83313 GK003 protein 3.6 

445674 BE410347 Hs.13063 transcription factor CA150 3.6 

414178 AW957372 Hs.46791 ESTs, Weakly similar to I38022 hypotheti 3.6 

416111 AA033813 Hs.79018 chromatin assembly factor 1, subunit A ( 3.6 

432840 AK001403 Hs.279521 hypothetical protein FLJ20530 3.6 

439726 AW449893 Hs.293707 ESTs, Weakly similar to 138598 zinc fing 3.6 

445704 AM93742 Hs.167700 ESTs, Moderately similar to I38022 hypot 3.6 

449115 AW959952 Hs.37528 ESTs, Weakly similar to AF090944 1 PRO06 3.6 

439379 AA835002 Hs.125611 ESTs 3.6 

414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 3.6 

434579 T55958 gb:yb35f05.r1 Stratagene fetal spleen (9 3.6 

451367 AA923729 Hs.26322 cell cycle related kinase 3.6 

420285 AA258124 Hs.293878 ESTs, Moderately similar to ZN91_HUMAN 2 3.6 

431263 AW129203 Hs.322915 ESTs 3.6 

431952 Z70695 Hs.272240 Homo sapiens cDNA FLJ11086 fis, clone PL 3.6 

439092 AA830149 gb:oc44f08.s1 NCI_CGAP_GCB1 Homo sapiens 3.6 

411562 AL050201 Hs.70769 hypothetical protein DKFZp586E1923 3.5 

417771 AA804598 Hs.82547 retinoic acid receptor responder (tazaro 3.5 

419440 AB020689 Hs.90419 KIAA0882 protein 3.5 

417333 AL157545 Hs.173179 bromodomain and PHD finger containing, 3 3.5 

419241 AA523939 Hs.165258 ESTs 3.5 

410762 AF226053 Hs.66170 HSKM-B protein 3.5 

413384 NM_000401 Hs.75334 exostoses (multiple) 2 3.5 

424081 NM_006413 Hs.139120 ribonuclease P (30kD) 3.5 

428004 AA449563 Hs.151393 glutamate-cysteine ligase, catalytic sub 3.5 

420390 AA330047 Hs.191187 ESTs 3.5 

416662 T25853 HsJ538 ESTs 3.5 

433280 AA581404 Hs.289037 Homo sapiens cDNA FLJ14135 fis, clone MA 3.5 

433285 AW975944 Hs.237396 ESTs 3.5 

421991 NM_014918 Hs.110488 KIAA0990 protein 3.5 

413950 AA249096 Hs.32793 ESTs 3.5 

431318 AA502700 Hs.293147 ESTs, Moderately similar to A46010 X-Bn 3.5 

400289 X07620 Hs.2258 matrix metalloproteinase 10 (stromelysin 3.5 

411149 N68715 Hs.269128 ESTs 3.5 

417601 NM_014735 Hs.82292 KIAA0215 gene product 3.5 

418334 AA319233 Hs.5521 ESTs 3.5 

422583 AA410506 Hs.27973 KIAA0874 protein 3.5 

425717 X07282 Hs.171495 retinoic acid receptor, beta 3.5 

425856 AA364908 Hs.98927 hypothetical protein FLJ13993 3.5 

434569 AI311295 Hs.344478 KIAA0196 gene product 3.5 

419436 AA991639 Hs.242413 hypothetical protein DKFZp434K1421 3.5 

437296 AA350994 Hs.20281 KIAA1700 3.5 

432697 AW975050 Hs.293892 ESTs, Weakly similar to ALU4_HUMAN ALU S 3.5 

439605 AF086431 Hs.22380 ESTs 3.5 

442145 AI022650 Hs.8117 erbb2-interacting protein ERBIN 3.5 

432589 AL135725 Hs.131708 ESTs 3.5 

425910 AA830797 Hs.184760 CCAAT-box-binding transcription factor 3.5 

412095 AI624707 Hs.5921 Homo sapiens cDNA: FLJ21592 fis, clone C 3.5 

421129 BE439899 Hs.89271 ESTs 3.5 

429433 AA452899 Hs.213586 ESTs, Weakly similar to KIAA1353 protein 3.5 

434839 AI743069 Hs.134736 ESTs 3.5 

435166 AI391470 Hs.158618 ESTs 3.5 

448966 AW372914 Hs.86149 phosphoinositol phosphate-binding prot 3.5 

449318 AW236021 Hs.78531 Homo sapiens, Similar to RIKEN cDNA 5730 3.5 

453311 AW104911 Hs.126707 hypothetical protein FLJ11457 3.5 
455497 AW967956 Hs.123648 ESTs, Weakly similar to AF108460 1 ubinu 3.5 

413786 AW613780 Hs.13500 ESTs 3.5 

430935 AW072916 zinc finger protein 131 (clone pHZ-10) 3.5 

439904 AW892676 gb:CM3-NN0004-280300-131-C12 NN0004 Homo 3.5 

449909 AA004681 Hs.59432 ESTs 3.5 

445817 NM_003642 Hs.13340 histone acetyltransferase 1 3.5 

419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 3.5 

417251 AW015242 Hs.99488 ESTs, Weakly similar to YK54_YEAST HYPOT 3.5 

423545 AP000692 Hs.129781 chromosome 21 open reading frame 5 3.5 

438527 AI969251 Hs.115325 RAB7, member RAS oncogene family-like 1 3.5 

453916 AW974874 Hs.75212 ornithine decarboxylase 1 3.5 

430200 BE613337 Hs.234896 geminin 3.4 

437187 AL080208 Hs.306325 Homo sapiens mRNA; cDNA DKFZp586C1523 (f 3.4 

423645 AI215632 Hs.147487 ESTs 3.4 

432370 AA308334 Hs.274424 N-acetylneuraminic acid phosphate syntha 3.4 

434966 AA657494 gb:nt66f04.s1 NCI_CGAP_Pr3 Homo sapiens 3.4 

404571 NM_015902*:Homo sapiens progestin induce 3.4 

418727 AA227609 Hs.94834 ESTs 3.4 

436374 AA400709 Hs.96716 ESTs, Weakly similar to T17210 hypotheti 3.4 

452721 AJ269529 Hs.301871 solute carrier family 37 (glycerol-3-pho 3.4 

427472 AA522539 Hs.131250 transposon-derived Buster3 transposase-t 3.4 

418245 AA088767 Hs.83883 transmembrane, prostate androgen induced 3.4 

402031 ENSP00000251056*:Plasma membrane calcium 3.4 

407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 3.4 

449803 AI655662 Hs.197698 ESTs 3.4 

425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 3.4 

436805 AA731533 Hs.270751 ESTs 3.4 

437367 AA749316 Hs.271879 ESTs, Moderately similar to ALU1_HUMAN A 3.4 

407908 BE379758 Hs.110853 uncharacterized hematopoietic stem/proge 3.4 

430144 AI732722 Hs.187694 ERGL protein; ERGIC-53-like protein 3.4 

416292 AA179233 Hs.42390 nasopharyngeal carcinoma susceptibility 3.4 

423800 AA331156 gb:EST35034 Embryo, 6 week, subtracted ( 3.4 

439375 AA689526 Hs.344249 steroid dehydrogenase homolog 3.4 

429084 AJ001443 Hs.195614 splicing factor 3b, subunit 3, 130kD 3.4 

432886 BE159028 Hs.279704 chromatin accessibility complex 1 3.4 

408267 AW380525 Hs.343564 tubulin-specific chaperone e 3.4 

408832 AW085690 Hs.63428 ESTs, Weakly similar to Z195_HUMAN ZINC 3.4 

422564 AI146006 Hs.222120 ESTs 3.4 

424842 AA034127 Hs.153487 signal transducing adaptor molecule (SH3 3.4 

429984 AL050102 Hs.227209 hypothetical protein FLJ21617 3.4 

435256 AF193766 Hs.13872 cytokine-like protein C17 3.4 

435137 AI056769 Hs.133512 ESTs 3.4 

440348 AW015802 Hs.47023 ESTs 3.4 

442910 AI365130 Hs.11307 ESTs, Weakly similar to T19326 hypotheti 3.4 

443242 BE243910 Hs.9082 nucleoporin p54 3.4 

445469 AW298370 Hs.153714 complement-c1q tumor necrosis factor-rel 3.4 

449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 3.4 

425354 U62027 Hs.155935 complement component 3a receptor 1 3.4 

433037 NM_014158 Hs^79938 HSPC067 protein 3.4 

423044 AA320829 Hs.97266 protocadherin 18 3.4 

434128 W93170 Hs^84164 protein x 0004 3.4 

449845 AW971183 Hs.9683 DnaJ (Hsp40) homolog, subfamily C, membe 3.4 

423590 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex- 3.4 

440370 AA884000 Hs.8173 hypothetical protein FLJ10803 3.4 

438825 BE327427 Hs.79953 ESTs 3.4 

446874 AW968304 Hs.56156 ESTs 3.4 

408101 AW968504 Hs.123073 CDC2-related protein kinase 7 3.4 

419131 AA406293 Hs.41167 ESTs 3.4 

423178 AI033140 Hs.124983 Homo sapiens mRNA; cDNA DKFZp564C142 (fr 3.4 

433764 AW753676 Hs.39982 ESTs 3.4 

440658 H29142 Hs.143032 ESTs, Weakly similar to neuronal thread 3.4 

411558 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A recepto 3.3 

452815 AA418841 gb:zw01e11.s1 Soares_NhHMPu_S1 Homo sapi 3.3 

421234 AA907153 Hs.190060 ESTs 3.3 

424075 AI807320 Hs.227630 RE1-silencing transcription factor 3.3 

412045 AA099802 Hs.83883 transmembrane, prostate androgen induced 3.3 

420041 AB005142 Hs.94592 Klotho 3.3 

426174 AA547959 Hs.115838 ESTs 3.3 

427083 NM_006363 Hs.173497 Sec23 (S. cerevisiae) homolog B 3.3 

441021 AW578716 Hs.7644 H1 histone family, member 2 3.3 

436664 AW197887 Hs.253353 ESTs 3.3 

441124 T97717 Hs.119563 ESTs 3.3 

444169 AV648170 Hs.58756 ESTs 3.3 

421476 AW953805 Hs.21887 ESTs 3.3 

422165 AL041199 Hs.1481 histidine decarboxylase 3.3 

455100 BE160198 gb:QV1-HT()413-010200-0594i03HT0413Hoino 3.3 

450747 AI064821 Hs.318535 ESTs, Highly similar to 1818357A EWS gen 3.3 
401197 ENSP00000229263*:HSPC213. 3.3 

433404 T32982 Hs.102720 ESTs 3.3 

422546 AB007959 Hs.301478 KIAA0500 protein 3.3 

443884 N20617 Hs.194397 leptin receptor 3.3 

414341 D80004 Hs.75909 KIAA0182 protein 3.3 

425292 NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein 3.3 
414407 AA147026 Hs.76704 ESTs 3.3 

420324 AF163474 Hs.96744 prostate androgen-regulated transcript 1 3.3 

423453 AW450737 Hs.128791 CGI-09 protein 3.3 

444170 AW613879 Hs.102408 ESTs 3.3 

445474 AI240014 Hs.259558 ESTs 3.3 

450582 AI339732 G-rich RNA sequence binding factor 1 3.3 

451246 AW189232 Hs.39140 cutaneous T-cell lymphoma tumor antigen 3.3 

434194 AF119847 Hs.283940 Homo sapiens PRO1550 mRNA, partial cds 3.3 

444324 AI301330 Hs.143838 ESTs 3.3 

427581 NM_014788 Hs.179703 KIAA0129 gene product 3.3 

429258 AA448765 gb:zx10e09.r1 Soares_total_fetus_Nb2HF8_ 3.3 

400880 NM_000611*:Homo sapiens CD59 antigen p18 3.3 

445525 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger pro 3.3 

451968 H56196 Hs.26409 Homo sapiens mRNA; cDNA DKFZp547K204 (fr 3.3 

411190 AA306342 Hs.69171 protein kinase C-like 2 3.3 

410660 AI061118 Hs.65328 Fanconi anemia, complementation group F 3.3 

420493 AI635113 Hs.270366 ESTs, Weakly similar to I78885 serine/th 3.3 

423740 Y07701 Hs.293007 aminopeptidase puromycin sensitive 3.3 

428523 AW974540 Hs.98626 ESTs 3.3 

428743 AL080060 Hs.301549 Homo sapiens mRNA; cDNA DKFZp564H172 (fr 3.3 

432363 AA534489 gbmf76g11.s1 NCI_CGAP_Co3 Homo sapiens 3.3 

435102 AW899053 Hs.76917 F-box only protein 8 3.3 

438147 AW250553 H-2K binding factor-2 3.3 

445808 AV655234 ESTs, Moderately similar to PC4259 ferri 3.3 

453058 AW612293 Hs.288584 Homo sapiens cDNA FLJ11750 fis, clone HE 3.3 

453286 AA034319 Hs.29041 Homo sapiens cDNA FLJ14177 fis, clone NT 3.3 

453802 AL134757 gb:DKFZp547G1090_r1 547 (synonym: hfbr1) 3.3 

422017 NM_003877 Hs.110776 STAT induced STAT inhibitor-2 3.3 

411373 BE326276 Hs.8861 ESTs 3.3 

435114 AA775483 Hs.288936 mitochondrial ribosomal protein L9 3.3 

443674 AI081330 Hs.145008 ESTs 3.3 

427528 AU077143 Hs.179565 minichromosome maintenance deficient (S. 3.3 

433800 AI034361 Hs.135150 lung type-I cell membrane-associated gly 3.3 

420969 AI636310 Hs.28310 ESTs 3.3 

421654 AW163267 Hs.106469 suppressor of var1 (S. cerevisiae) 3-like 3.3 

409542 AA503020 Hs.36563 hypothetical protein FLJ22418 3.3 

410119 F07841 Hs.13926 ESTs 3.3 

417379 AA196390 gb:zp99b10.s1 Stratagene muscle 937209 H 3.3 

423201 NM_000163 Hs.125180 growth hormone receptor 3.3 

442787 W93048 Hs.250723 hypothetical protein MGC2747 3.3 

407930 AA045847 Hs.188361 Homo sapiens cDNA FLJ12807 fis, clone NT 3.3 

423427 AL137612 Hs.285848 KIAA1454 protein 3.3 

424903 T26477 Hs.22883 ESTs, Weakly similar to I38022 hypotheti 3.3 

426775 AA384554 ESTs 3.3 

432378 AI493046 Hs.146133 ESTs 3.3 

438875 AA827640 Hs.189059 ESTs 3.3 

452959 AI933416 Hs.189674 ESTs 3.3 

453124 AI139058 Hs.125790 leucine-rich repeat-containing 2 3.3 

417560 U73338 Hs.82283 5-methyltetrahydrofolate-homocysteine me 3.2 

433002 AF048730 Hs.279906 cyclin T1 3.2 

430929 AA489166 Hs.156933 ESTs 3.2 

437444 H46008 Hs.31518 ESTs 3.2 

446297 AI346930 Hs.149728 ESTs 3.2 

407870 AB032990 Hs.40719 hypothetical protein KIAA1164 3.2 

429377 AA813192 Hs.200596 KIAA0547 gene product 3.2 

426030 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 3.2 

420583 H77859 Hs.65450 reticulon 4 3.2 

425264 AA353953 Hs.20369 ESTs, Weakly similar to gonadotropin ind 3.2 

442562 BE379584 dolichyl-diphosphooligosaccharide-protei 3.2 

418871 NM_001608 Hs.1209 acyl-Coenzyme A dehydrogenase, long chai 3.2 

421029 AW057782 Hs.293053 ESTs 3.2 

452997 N64777 Hs.44656 ESTs 3.2 

435035 BE568487 Hs.47668 x 006 protein 3.2 

408412 AW193033 Hs.124436 ESTs 3.2 

421684 BE281591 Hs.106768 hypothetical protein FLJ10511 3.2 

421887 AW161450 Hs.109201 CGI-86 protein 3.2 

432409 AA806538 Hs.130732 KIAA1575 protein 3.2 

434378 AA631739 Hs.335440 EST 3.2 

409401 AI201895 Hs.181309 proteasome (prosome, macropain) subunit 3.2 

420195 N44348 Hs.26243 Homo sapiens cDNA FLJ11177 fis, clone PL 3.2 

410264 AK001853 Hs.61508 Homo sapiens cDNA FLJ10991 fis, clone PL 3.2 

410296 AW630675 Hs.271946 ESTs 3.2 

413838 AV661185 Hs.75574 mitochondrial ribosomal protein L19 3.2 

414342 AA742181 Hs.75912 KIAA0257 protein 3.2 

414709 AA704703 Hs.77031 Sp2 transcription factor 3.2 

425475 W56339 Hs.107057 ESTs 3.2 

437814 AI088192 Hs.135474 ESTs, Weakly similar to DDX9_HUMAN ATP-D 3.2 

449897 AW819642 Hs.24135 transmembrane protein vezatin; hypotheti 3.2 

452295 BE379936 Hs.28866 programmed cell death 10 3.2 

437517 AI927675 Hs.244482 Homo sapiens, clone IMAGE:3611719, mRNA, 3.2 

432960 AW150945 Hs.144758 ESTs 3.2 

452498 AK000101 Hs.29700 hypothetical protein FLJ20094 3.2 

427299 AA830210 Hs.214263 ESTs, Moderately similar to ALU1_HUMAN A 3.2 
421645 AA974127 Hs.129777 ESTs 3.2 

457489 AI693815 Hs.127179 cryptic gene 3.2 

415691 AW963979 Hs.24723 ESTs 3.2 

433852 AI378329 Hs.126629 ESTs 3.2 

439735 AI635386 Hs.142846 hypothetical protein 3.2 

428279 AA425310 Hs.155766 ESTs, Weakly similar to A47582 B-cell gr 3.2 

427715 BE245274 Hs.180428 KIAA1181 protein 3.2 

408699 AA056614 Hs.106200 ESTs, Moderately similar to ZN91_HUMAN Z 3.2 

415715 F30364 Hs.302204 ESTs 3.2 

426172 AA371307 Hs.125056 ESTs 3.2 

434825 N87549 Hs.125287 zinc finger protein ZNF140-like protein 3.2 

436246 AW450963 Hs.119991 ESTs 3.2 

436401 AI087958 Hs.29088 ESTs 3.2 

408784 AW971350 Hs.63386 ESTs 3.2 

432125 AW972667 Homo sapiens cDNA FLJ12300 fis, clone MA 3.2 

426716 NM_006379 Hs.171921 sema domain, immunoglobulin domain (Ig), 3.2 

405558 Target Exon 3.2 

434747 AA837085 Hs.220585 ESTs 3.2 

414591 AI888490 Hs.55902 ESTs, Weakly similar to ALU8_HUMAN ALU S 3.2 

426991 AK001536 Homo sapiens cDNA FLJ10674 fis, clone NT 3.2 

426171 AI128606 Hs.6557 zinc finger protein 161 3.2 

402802 NM_001397*:Homo sapiens endothelin conver 3.2 

412530 AA766268 Hs.266273 hypothetical protein FLJ13346 3.2 

433050 AI093930 Hs.163440 Homo sapiens cDNA: FLJ21000 fis, clone C 3.2 

439221 AA737106 Hs.32250 ESTs, Moderately similar to I78885 serin 3.2 

440351 AF030933 Hs.7179 RAD1 (S. pombe) homolog 3.2 

445218 T79971 Hs.12432 Homo sapiens clone 24407 mRNA sequence 3.2 

448568 AA149121 Hs.71947 ESTs 3.2 

420131 F08286 Hs.95262 nuclear factor related to kappa B bindin 3.1 

429299 AI620463 Hs.347408 hypothetical protein MGC13102 3.1 

407913 BE393767 Hs.41569 phosphatidic acid phosphatase type 2A 3.1 

415009 C75253 Hs.220950 ESTs 3.1 

406627 T64904 Hs.163780 ESTs 3.1 

438666 AW014493 Hs.126727 ESTs 3.1 

419875 AA853410 Hs.93557 proenkephalin 3.1 

456660 AA909249 Hs.112282 solute carrier family 30 (zinc transport 3.1 

418701 AA814948 Hs.96343 ESTs, Weakly similar to ALUC_HUMAN !!! 3.1 

425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II 3.1 

440300 N39760 Hs.8859 Homo sapiens, Similar to RIKEN cDNA 5830 3.1 

444100 AA383343 Hs.22116 CDC14 (cell division cycle 14, S. cerevi 3.1 

431416 AA532718 Hs.178604 ESTs 3.1 

422662 BE274778 Hs.119007 RAB4, member RAS oncogene family 3.1 

449543 AF070632 Hs.23729 Homo sapiens clone 24405 mRNA sequence 3.1 

437083 AW082597 Hs.244862 ESTs 3.1 

45 430294 AI538226 Hs.32976 guanine nucleotide binding protein 4 3.1 

417380 T06809 Hs.332086 ESTs 3.1 

419965 H16382 Hs.70258 ESTs 3.1 

430039 BE253012 Hs.153400 ESTs, Weakly similar to ALU1.HUMAN ALU S 3.1 

430188 AL049242 Hs.234794 Homo sapiens mRNA; cDNA DKFZp564B083 (fr 3.1 

50 446146 AI287539 Hs.148078 ESTs 3.1 

426647 AA243464 Hs.294101 pre-B-cefl leukemia transcription factor 3.1 

418555 A1417215 Hs.87159 hypothetical protein FU12577 3.1 

453496 AA442103 Hs.33084 solute carrier family 2 (facilitated glu 3.1 

423947 AW451954 Hs.135941 WAA1048 protein 3.1 

55 451893 AW192083 Hs.290855 ESTs 3.1 

447726 AL137638 Hs.19368 matrilln2 3.1 

415068 Z19448 Hs.131887 ESTs, Wealdy similar to T24396 hypotheti 3.1 

453439 A1572438 Hs.32976 guanine nucleotide binding protein 4 3.1 

443331 AI052026 Hs.149995 ESTs 3.1 

60 410340 AW182833 Hs.112188 hypothetical protein FU 131 49 3.1 

414372 AA143554 gb2o65a02.r1 Stratagene pancreas (93720 3.1 

427421 AA402414 Hs.3059 coatomer protein complex, subunit beta 3.1 

429399 AA452244 Hs.16727 ESTs 3.1 

430092 AI821399 Hs.16514 ESTs 3.1 

65 433577 AW007080 Hs.2B4192 ESTs 3.1 

433730 AK002135 Hs.3542 hypothetical protein FU1 1273 11 

436894 H80696 Hs.233313 ESTs 3.1 

437756 AA767537 Hs.197096 ESTs 3.1 

438979 AW97621B Hs.32565 ESTs 3.1 

70 441791 AW372449 Hs.61271 hypothetical protein FU21 159 11 

457718 F18572 Hs.22978 ESTs, Weakly similar to ALU4.HUMAN ALU S 3.1 

452017 AF109302 Hs.27495 prostate cancer associated protein 7 3.1 

419030 T79957 Hs.188466 ESTs 3.1 

__ 438308 AI343469 Hs.127685 KIAA1 627 protein 3.1 

75 433571 AA765256 Hs.135191 ESTs, WeaWy simflar to unnamed protein 3.1 

421685 AF189723 Hs.1 06778 ATPase, Ca transporting, type 2C, member 3.1 

428878 AA4368B4 Hs.48926 ESTs 3.1 

432205 AI806583 Hs.125291 ESTs 3.1 

orj 440099 AL080058 Hs.6909 DKFZP564G202 protein 3.1 

oO 428259 AA424793 Hs.24144 ESTs 3.1 

434614 AI249502 Hs.29669 ESTs 3.1 

420380 AA640891 Hs.102406 ESTs 3.1 

433323 AA805132 Hs.159142 ESTs 3.1 
RAN binding protein 7 3.1

449474 AA019344 Hs.2055 ubiquitin-activating enzyme E1 (A1S9T an 3.1

410595 AW629223 Hs.64794 zinc finger protein 183 (RING finger, C3 3.1

415245 N59650 Hs.27252 ESTs 3.1

418647 AA226198 gb:nc26a07.s1 NCI_CGAP_PM Homo sapiens 3.1

434808 AF155108 Hs.256150 Homo sapiens, Similar to RIKEN cDNA 2810 3.1

453716 AA037675 Hs.152675 ESTs 3.1

432952 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MA 3.0

408705 AA312135 Hs.46967 HSPC034 protein 3.0

430280 AA361258 Hs.237868 interleukin 7 receptor 3.0

445919 T53519 Hs.334692 hypothetical protein MGC14141 3.0

441790 AW254909 Hs.132208 ESTs 3.0

414085 AA114016 Hs.75746 aldehyde dehydrogenase 1 family, member 3.0

452619 AW298597 Hs.61884 Homo sapiens, clone IMAGE:4298026, mRNA, 3.0

422788 AL117352 Hs.77196 Human DNA sequence from clone RP5-876B10 3.0

439703 AF086538 Hs.196245 ESTs 3.0

444489 AI151010 Hs.157774 ESTs 3.0

453878 AW964440 Hs.19025 DC32 3.0

440193 AW902312 Hs.7037 Homo sapiens clone 24923 mRNA sequence 3.0

433680 AI805366 Hs.199945 ESTs 3.0

426363 M58524 Hs.2025 transforming growth factor, beta 3 3.0

447595 AW379130 Hs.18953 phosphodiesterase 9A 3.0

426044 AA502490 Hs.170290 ESTs 3.0

431962 AL049385 Hs.272251 Homo sapiens mRNA; cDNA DKFZp586M1418 (f 3.0

445084 H38914 Hs.250848 hypothetical protein FLJ14761 3.0

452737 AK001680 Hs.30488 DKFZP434F091 protein 3.0

458229 AI929602 Hs.177 phosphatidylinositol glycan, class H 3.0

435712 AA694607 Hs.176956 ESTs 3.0

407378 AA299264 Hs.57776 ESTs, Moderately similar to I38022 hypot 3.0

414993 AW819403 Hs.77724 KIAA0586 gene product 3.0

419459 AW291128 Hs.278422 DKFZP586G1122 protein 3.0

421988 AW450481 Hs.161333 ESTs 3.0

432833 N51075 Hs.110028 ESTs 3.0

445210 H09323 Hs.27133 ESTs 3.0

447620 AW290951 Hs.224965 ESTs 3.0

449375 R07114 Hs.271224 ESTs 3.0

428695 AJ355647 Hs.189999 purinergic receptor (family A group 5) 3.0

443967 AW294013 Hs.200942 ESTs 3.0

441224 AU076964 Hs.7753 calumenin 3.0

448822 BE149845 Hs.289038 hypothetical protein MGC4126 3.0

435688 H72286 Hs.128387 ESTs 3.0

441689 AI090455 Hs.268371 hypothetical protein FLJ20274 3.0

411704 AI499220 Hs.71573 hypothetical protein FLJ10074 3.0

415954 AA171850 Hs.42251 ESTs 3.0

419544 AI909154 gb:QV-BT200-010499-007 BT200 Homo sapien 3.0

420077 AW512260 Hs.87767 ESTs 3.0

443475 AI066470 Hs.132809 ESTs 3.0

447231 AK001293 Hs.330208 crystallin, zeta (quinone reductase)-lik 3.0

438436 AA807168 Hs.271552 ESTs 3.0

427871 AW992405 Hs.59622 Homo sapiens, clone IMAGE:3507281, mRNA, 3.0

443280 AA299688 Hs.24183 ESTs 3.0

448264 AI478933 Hs.188260 ESTs 3.0

428673 AW601325 Hs.337757 Homo sapiens mRNA; cDNA DKFZp566M063 (fr 3.0

453843 D25215 Hs.35804 hect domain and RLD 3 3.0

400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 3.0

445943 AW898533 Hs.181574 ESTs 3.0

432286 AW327432 Hs.255843 ESTs 3.0

431707 R21326 Hs.267905 hypothetical protein FLJ10422 3.0




fu/y 1 03o 


ns, i vooo** 


CO IS 


3 n 

J.U 


443162 T49951 Hs.9029 DKFZP434G032 protein 3.0

448073 W19789 Hs.336635 Homo sapiens, clone IMAGE:4179462, mRNA 3.0

419713 AW968058 Hs.92381 nudix (nucleoside diphosphate linked moi 3.0

447966 AA340505 Hs.105887 ESTs, Weakly similar to Homolog of rat Z 3.0

423583 AL122055 Hs.129836 KIAA1028 protein 3.0

418304 AA215702 gb:zr97g10.r1 NCI_CGAP_GCB1 Homo sapiens 3.0

417206 AA291183 Hs.81648 hypothetical protein FLJ11021 similar to 3.0

410011 AB020641 Hs.57856 PFTAIRE protein kinase 1 3.0

411850 AK002033 Hs.72782 hypothetical protein FLJ11171 3.0

438986 AF085888 Hs.269307 ESTs 3.0

445921 AW015211 Hs.146181 ESTs 3.0

447124 AW976438 Hs.17428 RBP1-like protein 3.0

452953 AI932884 Hs.271741 ESTs, Weakly similar to A46010 X-linked 3.0



TABLE 60B 



Pkey: Unique Eos probeset identifier number 



WO 03/042661 



PCT/US02/36810 



CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey CAT Number
411479 1247077_1
411667 1253334_1
414372 143909_1
414680 147525_1
415989 156454_1
416288 1585983_1
416882 162718_1
416913 163001_1
417379 167238_1
418304 173558_2
418647 177521_1
418866 179788_1
419536 185688_1
419544 185760_2
423412 228001_1
423800 232161_1
426226 262918_1
426413 266650_1
426503 268283_1
426775 271683_1
426991 27415_1
428342 290035_2
429163 300543_1
429220 301384_1
429258 301917_1
430935 325772_1
430968 326269_1
431429 33313_1
432093 341283_1
432125 341776_1
432189 342819_1
432340 345248_1
432363 345469_1
432600 350059_1
432810 354375_1
434579 38916_1
434966 396504_1
435023 398093_1
437866 44433_2
438147 45074_1
439092 468554_1
439518 47334_1
439904 479942_1
440840 50357_2
441102 509604_1
442562 54500_2
443161 561305_1
445808 65133_1
447082 707248_1
448212 755099_1
449625 8113_1
450580 83929_1
450582 83933_1
452260 9074_1
452598 92338_2
452815 93255_1
453802 981589_1
455100 1253334_1



AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW848140 AW848571 AW848009 AW848067 AW848069
AW848905 AW848214

BE160198 AW935898 T11520 AW935930 AW856073 AW861034 
AA143654 AW753140 AA213770 AW970865 AA569075 AA492132 

AA743331 AA837388 AW664540 AA77571 1 AA150955 BE465475 AA968994 AA886905 AI141054 AW194991 AA252147 AA319500 AI184288 
AA708749 AA644620 AA652769 AA242975 AA151074 T19890 

AI267700 A1720344 AA191424 AI023543 AI469633 AA172056 AW958465 AA172235 AW953397 AA355086 
H51299 H44619 H46391 R86024 H51892T72744 

AI633044 AW016212 AW241 143 AA769058 R43272 AW06B958 AA21091 8 AA293774 AI748815 AJ763294 AI3331 14 A1277384 AI088297 AI468477 

AI824624 AW189606 AI631751 Z40749 AI984673 AI671316 AA189024 AW235412 BE178426 R24677 R40535 H05100 R40597 

AW934714 BE161007 BE1625W AW749902 AW749864 BE162498 BE161005 AA190449 AW513465 BE161006 BE162499 

AA196390 AA507837 AA196468 

AA215702 AA368006 AA215703 BE066555 BE006876 

AA2261 98 AA22651 3 AA383773 

T65754 AA229857 AA229658 

AA603305 AA244095 AA244183 

AI9091 54 AA526337 AA244193 AI9091 53 

AF109300 AI299378 AI202654

AA331156 AA331157 AA331155 

AA769045 AA372590 AW963633 

AA377823 AW954494 A1022688 

AA380153 AA380233 AW963529 

AA384564 AW966475 H02121 N41297 063213 AA8B6888 AI922414 AW044240 AW196808 AI076736 AA599294 AI954433 AW117617 AI640323 
H98134 

AK001 536 AA191092 AW510354 AI554256 AL353968 AA134266 

AI739168 M426249 A1199636 AW505198 AW977291 AA824583 AA883419 AA724079 AI015524 AI377728 AW293682 A1928140 AA731438 
A1092404 AI085630 AA731340 
AA884766 AW974271 AA592975 AA447312
AW207206 AW341473 AA448195 A1951341 
AA448765 C04967 C03045 AA658293 

AW07291 6 A! 1 8491 3 AA4891 95 AW466994 AW469044 N 59350 AI81 9642 A1280239 A1220572 AA789302 AI47361 1 AW84 1 1 26 D60937 
AW972830 AA527647 AA489820 AA570362 

AF072813 AF119297 AA362885 AF059524 NM 006054 AA157365 AW163623 AA056148 AA227062 AA418057 AA227076 AA078753 AA233594 

D58629 AA232373 AA233577 T35956 BE618035 AA354497 AA359082 T32010 AA134519 BE299901 BE268096 BE396826 AA324268 AL120308 

AA187561 AA311680 

H28383 AW972670 H28359 AA525B08 

AW972667 AA526539 AI057032 AW1 67842 

AA527941 AI810608 A1620190 AA635266 

AA534222AA632632 T81234 

AA534489 AW970240 AW970323 

AI821085 AW973464 AA554802 A1821 831 AA657438 AA640756 AA650339 

AA863400 AIS91439 AW016017 AW014704 A1367512 H17550 AA744752 R461B7 AW471324 AI126670 AA626033 AI276287 A1094253 A1286003 

A1147163 A1911443 AW512612 AA972102 AA999975 AI6B4428 A1335035 D63102 AI524234 AI539156 AA565542 

T5595BT57205AF147346 

AA657494A1582653AI581639 

AI692552 A1393343 A1B00510 AI37771 1 F24263 AA661876 

AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 

AA837481 AW468444 BE1 85091 AW468002 AA687333 AA81 1 830 AA581806 AI866686 A1572124 AA043777 AA040926 D201 60 A1536733 

AA812489AW874142AI47188 

AW250553 L07876 Z36843 R30693 AI190097 AW965317 

AA830149 AW978407 M85983 AW503637 

W76326AF086341 W72300 

AW892676 AA853877 D44747 

AW629666 AW959831 AW205739 BE620243 AA412367 AW300025 AW051920 AI288591 AW2361 14 AI302852 AI038548 AA534496 AI797207 
AA921877 

AA973905 AI299B88 AA917019 H63235 T90771 

BE379584 R3421 1 BE544768 AW973709 AI653056 A1653173 A1266043 AI656750 H741 80 A1492830 AI376090 AM72184 D59940 AW170056 

AI082443 AW021142 AI167921 A1348677 A1278577 AW130886 AA761517 A1698203 AA1 15535 Al 254790 R34328 D59939 AW205074 AA554902 

DS2102AI0007 

AI038316 AI344631 AI261653 

AV655234 AW966332 AA340239 

T85314 A1360684 T85528T91254 

AI475858AW969013 

NWL014253 AF100772 BE0B8769 AL022718 BE161779 AWB63569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760 
N48674 AI375997 R45432 059344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T61382 AI016320 N45526T61415 AA331485 
N40087 H12925 AA460779 AA095372 BE160847 AW816890 AW816893 AW816577 AW816891 AW816882 AW816868 AW816889 AW816940 
AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA010295 BE160823 Z25353 AA150BB3 AW887764 AW023806 AW022095 
AA164518 AA730973 W00417 W65303 

AI339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118 AA865829 AW936878 

AA453208 NM.0O4251 U44103 AI671547 N57463 AW007521 A1479232 T82809 R67109 AW795099 AI833027 AI765395 AI763029 AI758228 
BE326331 A1934576 AI922378 AW276431 AI71 8466 N36566 AA904753 BE464245 AI338752 AI659875 AW272338 AI4231 36 A1089270 AI160904 
AA664354 BE1 

AI831 594 AW970667 AW027959 A1129800 A1927949 AI650270 AI625105 AW514661 AI708393 AL13B076 BE180510 AI926721 AI399955 AA749139 
AI862160 AW87401 1 AI242763 AA262795 AA039864 H73499 AI093249 BE245661 AI816834 N25206 AA82B301 AI084565 AI302816 AA026905 
AA77255 

AA418841 A1452657 AI768876 AA028973 BE179873C00215AA418930 
AL134757AW079131 

BE160198 AW935898 T11520 AW935930 AW856073 AW861 034 

457728 393853_1 AW974811 AA651634 AA650072
Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7-digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of
human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
400664 8118496 Plus 13558-13721,13942-14090,14554-14679
400695 7249150 Plus 160456-160567,164757-164873
400880 9931121 Plus 29235-29336,36363-36580
401197 9719705 Plus 176341-176452
401424 8176894 Plus 24223-24428
401451 6634068 Minus 119926-121272
401519 6649315 Plus 157315-157950
401558 7139678 Plus 103510-104090
402031 7656761 Plus 33080-33263,33939-34094,36103-36507
402802 3287156 Minus 53242-53432
402855 9662953 Minus 59763-59909
403046 3540153 Minus 55707-55859,56369-55511
403047 3540153 Minus 59793-59958
403790 8084957 Minus 87826-87947,89835-90002
404571 7249169 Minus 112450-112648
404632 9796668 Plus 45096-45229
404641 9796810 Minus 32247-32362
404642 9796810 Plus 102999-103145
405523 9454643 Plus 114550-114688,117265-117407,119490-119599,123237-123395,131140-131217
405558 1621110 Plus 4502-4644,5983-6083
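
The "Nt position" entries above are comma-separated lists of start-end ranges, one range per predicted exon. A minimal sketch of how such a cell might be read is given here. The function names `parse_nt_positions` and `total_exon_length` are hypothetical, and treating the coordinates as inclusive at both ends is an assumption that the table legend does not state.

```python
from typing import List, Tuple


def parse_nt_positions(field: str) -> List[Tuple[int, int]]:
    """Split an 'Nt position' cell into (start, end) exon ranges."""
    exons = []
    for chunk in field.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue  # tolerate a trailing comma
        start_text, end_text = chunk.split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum the exon lengths.

    Assumption: both coordinates are inclusive, so a range a-b
    covers b - a + 1 nucleotides.
    """
    return sum(end - start + 1 for start, end in exons)


# Example using the Nt position value listed for Pkey 400664.
exons = parse_nt_positions("13558-13721,13942-14090,14554-14679")
print(exons)
print(total_exon_length(exons))
```

A range whose end is smaller than its start would yield a non-positive length under this sketch, which may be a convenient way to flag transcription errors in the table.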



Table 61A lists about 440 genes up-regulated in prostate cancer compared to normal prostate and normal adult tissues. These were selected from 59880 probesets on the
Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" prostate cancer to "average" normal prostate tissue was greater than or equal to 10 and the ratio to normal adult
tissues was greater than or equal to 2.0. The "average" prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average" normal prostate tissue level was
set to the 95th percentile amongst prostate tissues. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove
gene-specific background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator
before the ratio was evaluated.
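
The selection statistic described above can be sketched for a single probeset as follows. This is an illustration under stated assumptions, not the procedure actually used: the function names `percentile` and `expression_ratios` are hypothetical, linear interpolation between ranks is assumed because the text does not say how percentiles were computed, and returning NaN for a non-positive denominator is likewise an assumption.

```python
import math
from typing import Sequence, Tuple


def percentile(values: Sequence[float], q: float) -> float:
    """Return the q-th percentile (0-100), interpolating linearly between ranks.

    Assumption: the source does not specify an interpolation rule.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower, upper = math.floor(rank), math.ceil(rank)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def expression_ratios(
    cancer: Sequence[float],
    normal_prostate: Sequence[float],
    non_malignant: Sequence[float],
) -> Tuple[float, float]:
    """Return (R1, R2) for one probeset.

    R1: prostate tumor vs. normal adult body tissue.
    R2: prostate tumor vs. normal prostate tissue.
    """
    tumor = percentile(cancer, 75)               # "average" prostate cancer
    prostate = percentile(normal_prostate, 95)   # "average" normal prostate
    adult = percentile(non_malignant, 85)        # "average" normal adult tissue
    background = percentile(non_malignant, 10)   # gene-specific background

    def ratio(numerator: float, denominator: float) -> float:
        # Assumption: a non-positive denominator is reported as NaN.
        return numerator / denominator if denominator > 0 else float("nan")

    # Background is subtracted from numerator and denominator alike.
    r1 = ratio(tumor - background, adult - background)
    r2 = ratio(tumor - background, prostate - background)
    return r1, r2


# Made-up expression levels, for illustration only.
r1, r2 = expression_ratios(
    cancer=[210.0, 210.0, 210.0, 210.0],
    normal_prostate=[60.0, 60.0, 60.0],
    non_malignant=[float(v) for v in range(0, 101, 10)],
)
print(round(r1, 2), round(r2, 2))
```

A probeset would then be kept when both ratios meet the thresholds stated in the text. Because comparisons with NaN are false, probesets with a non-positive denominator are excluded under this sketch.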

TABLE 61A: ABOUT 440 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE AND NORMAL ADULT TISSUES

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of prostate tumor to normal adult body tissue 

R2: Ratio of prostate tumor to normal prostate tissue 



Pkey ExAccn UnigeneID Unigene Title R1 R2
444151 AW972917 Hs.128749 alpha-methylacyl-CoA racemase 42.2 42.2
421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 3.2 26.5
420729 AW964897 Hs.290825 ESTs 3.7 15.8
401197 ENSP00000229263*:HSPC213. 10 12.6
450096 AI682088 Hs.79375 holocarboxylase synthetase (biotin-[prop 12.4 12.4
449156 AF103907 Hs.171353 prostate cancer antigen 3, non-coding DD 12.3 113
443271 BE568568 Hs.195704 ESTs 11.6 11.6
434078 AW880709 Hs.283683 chromosome 8 open reading frame 4 2.1 11.2
431117 AF003522 Hs.250500 delta (Drosophila)-like 1 9.4 9.4
417315 AI080042 Hs.180450 ribosomal protein S24 2.3 9.0
416182 NM_004354 Hs.79069 cyclin G2 8.4 8.4
421913 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mime 2.2 8.4
434217 AW014795 Hs^3349 ESTs 8.3 8.3
425782 U66468 Hs.159525 cell growth regulatory with EF-hand doma 8.3 8.3
442501 AA315267 Hs.23128 ESTs 2.0 8.3
429984 AL050102 Hs.227209 hypothetical protein FLJ21617 8.1 8.1
449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 3.1 8.0
417407 AA923278 Hs.290905 ESTs, Weakly similar to protease [H.sapi 34.0 7.9
420120 AL049610 Hs.95243 transcription elongation factor A (SII)- 7.5 7.5
419078 M93119 Hs.89584 insulinoma-associated 1 7.4 7.4
425905 AB032959 Hs.318584 novel C3HC4 type Zinc finger (ring finge 9.4 7.3
425018 BE245277 Hs.154196 E4F transcription factor 1 7.2 7.2
452221 C21322 Hs.288057 hypothetical protein FLJ22242 7.0 7.0
421552 AF026692 Hs.105700 secreted frizzled-related protein 4 4.0 6.6
409799 D11928 Hs.76845 phosphoserine phosphatase-like 6.3 6.3
411373 BE326276 Hs.8861 ESTs 3.2 6.3
423445 NM_014324 Hs.128749 alpha-methylacyl-CoA racemase 6.7 6.2
422583 AA410506 Hs.27973 KIAA0874 protein 2.3 6.2
438209 AL120659 Hs.6111 aryl hydrocarbon receptor nuclear transl 6.1 6.1
437147 AL049964 Hs.8358 hypothetical protein FLJ20356 2.6 6.0
432415
446896 
449118 
427472 



T16971 
T15767 
R67477 
AA522539 



431548 AI834273
417153 X57010 



410889 
429597 
422538 
414812 
448552 
416111 
414516 
414343 
451684 
421470 
426350 
436476 
437571 
447072 
444917 
417318 
434170 
432897 
433560 
421823 
451244 
453370 
425242 
439024 
407235 
421853 
437916 
451952 
431676 
452242 
401519 
414342 
426197 
429467 
408380 
425907 
427078 
419465 
429250 
431585 
443250 
423545 
423698 
442281 
428826 
447188 
418701 
429900 
439518 
409151 
420544 
444100 
443622 
410037 
421129 
428966 
448664 
432242 
433887 
436556 
400301 
443646 
430389 
418432 
433409 
447247 
423749 
444755 
431912 
428493 
447482 
450203 
423337 
452588 
422661 
407317 
410330 



X91662 



Hs.289014 

Hs.22452 

Hs.23103 

Hs.131250 

Hs.9711 

Hs.81343 

Hs.66744 



NMJ03816 HsJ2442 
NM_006441 Hs.118131



X72755 
AW973653 
M033813 
AI307602 
AL036166 
AF216751 
R27496 



Hs.77367 

Hs.20104 

Hs.79018 

Hs.135560 

Hs.323378 

Hs.26B13 

Hs.1378 



NM 003245 Hs^022 
AA32610B Hs.33829 



AA760894 

D61594 

R68651 

AW953937 

AA626509 

AF155099 

AI925195 

N40850 

AW008798 

A1470523 

D13635 

R96696 

D20569 

AJ952677 

BE566249 

AL1 20173 

AI685464 

R50956 

AA742181 
AA004410 



Hs.153023 

Hs.17279 

Hs.144997 

Hs.240845 

Hs.122329 

Hs.279780 

Hs.130891 

Hs.28625 

Hs.343877 

Hs.139336 

Hs.155287 

Hs.35598 

Hs.169407 

Hs.108972 

Hs.20999 

Hs.301663 

Hs.159993 

Hs.75912 
Hs.100009 



NM_004477 Hs.203772 
AF123050 Hs.44532 



AA365752 
A1676062 
AW500239 
H56585 
BE242603 
AI041530 
AP000592 
AA329796 
N34742 
AL048842 
H65423 
AA814948 
AA460421 
W76326 
AA306105 
AA677577 
AA383343 
AI911527 
AB020725 
BE439899 
AF059214 
AI879317 
AW022715 
AW204232 
AI364997 
X03635 
AI085198 
All 17429 
M14156 
AI278802 
AW369351 
U09848 
AA431791 
A1660552 
AK001745 
AB033059 
AF097994 



Hs.155965 
Hs.111902 
Hs.21187 
Hs.198308
Hs.262823 
Hs.132107 
Hs.129781 
Hs.109B 
Hs.170065 
Hs.194019 
Hs.17631 
Hs^6343 
Hs^0875 

Hs.50785 
Hs.9B732 
Hs.22116 
Hs.11805 
Hs.5B009 
Hs.89271 
Hs.194687 
Hs.334691 
Hs.162160 
Hs.279522 
Hs.7572 
Hs.1657 
Hs.164226
Hs.240845 
Hs.85112 
Hsi5661 
Hs.287955 
Hs.132390 
Hs.113823 
Hs.76549 
Hs.184628 
Hs.18705 
Hs.301528 
NM_004655 Hs.127337
AA889120 Hs.110637
NM_014700 Hs.119004
AI204033 Hs.271461 
AW023630 Hs.159425 



ESTs, Weakly similar to A43932 mucin 2 p 6.0 6.0

Homo sapiens mRNA for KIAA1737 protein, 6.2 6.0

Bet1 (S. cerevisiae) homolog 5.9 5.9

transposon-derived Buster3 transposase-l 3.3 5.9

novel protein 15.7 5.8

collagen, type II, alpha 1 (primary oste 4.2 5.8

twist (Drosophila) homolog (acrocephalos 3.5 5.8

a disintegrin and metalloproteinase doma 7.9 5.8

5,10-methenyltetrahydrofolate synthetase 5.7 5.7

monokine induced by gamma interferon 2.5 5.6

hypothetical protein FLJ00052 5.6 5.6

chromatin assembly factor 1, subunit A ( 5.5 5.5

ESTs, Weakly similar to T43458 hypotheti 2.7 5.5

coated vesicle membrane protein 5.4 5.4

COA14 3.9 5.4

annexin A3 5.3 5.3

transglutaminase 3 (E polypeptide, prote 5.3 5.3

bHLH protein DEC2 2.8 5.3

ESTs 5.2 5.2

tyrosylprotein sulfotransferase 1 5.2 5.2

ESTs 5.1 5.1

ESTs 4.6 5.1

ESTs 5.1 5.0

NY-REN-18 antigen 3.3 5.0

hypothetical protein MGC4400 4.8 5.0

ESTs 5.0 5.0

hypothetical protein FLJ20039 4.9 4.9

ATP-binding cassette, sub-family C (CFTR 29.8 4.9

KIAA0010 gene product 2.2 4.8

ESTs 5.4 4.8

SAC2 (suppressor of actin mutations 2, y 4.8 4.8

Homo sapiens mRNA; cDNA DKFZp434P228 (fr 4.7 4.7

hypothetical protein FLJ23142 3.6 4.7

ESTs 4.7 4.7

gb:tt88f04.x1 NCI_CGAP_Pr28 Homo sapiens 9.1 4.7

glycosyltransferase 4.7 4.7

C15000476*:gi|12737279|ref|XP_012163.1| 4.6 4.6

KIAA0257 protein 3.4 4.6

acyl-Coenzyme A oxidase 1, palmitoyl 3.3 4.6

FSHD region gene 1 2.1 4.6

diubiquitin 4.2 4.6

ESTs 2.2 4.6

ESTs 4.8 4.5

Homo sapiens cDNA: FLJ23068 fis, clone L 2.7 4.5

tryptophan rich basic protein 4.5 4.5

hypothetical protein FLJ10326 2.5 4.5

ESTs 8.1 4.4

chromosome 21 open reading frame 5 4.4 4.4

DKFZp434J1813 protein 4.4 4.4

Homo sapiens cDNA FLJ13492 fis, clone PL 4.4 4.4

attractin 2.0 4.3

hypothetical protein DKFZp434E2135 4.3 4.3

ESTs, Weakly similar to ALUC_HUMAN !!!! 4.2 4.2

ESTs 4.2 4.2

gb^d60d04j1 Soares_fetal_heart_NbHH19W 2.2 4.2

SEC22, vesicle trafficking protein (S. c 3.5 4.2

Homo sapiens Chromosome 16 BAC clone CIT 10.0 4.2

CDC14 (cell division cycle 14, S. cerevi 2.8 4.2

ESTs 2.2 4.1

KIAA0918 protein 4.1 4.1

ESTs 4.1 4.1

cholesterol 25-hydroxylase 3.9 4.1

splicing factor 3a, subunit 1, 120kD 2.8 4.1

ESTs, Weakly similar to ALU4_HUMAN ALU S 2.0 4.1

ESTs 4.1 4.1

ESTs 4.1 4.1

estrogen receptor 1 4.1 4.1

ESTs 4.1 4.1

DKFZP434D146 protein 2.2 4.0

insulin-like growth factor 1 (somatomedi 4.0 4.0

ESTs 4.0 4.0

Homo sapiens cDNA FLJ13090 fis, clone NT 4.0 4.0

zinc finger protein 36 (KOX 18) 3.9 3.9

ClpX (caseinolytic protease X, E. coli) 3.9 3.9

ESTs, Weakly similar to A56154 Abl subst 2.2 3.9

hypothetical protein FLJ10883 3.3 3.9

KIAA1233 protein 2.2 3.9

L-kynurenine/alpha-aminoadipate aminotra 3.9 3.9

axin 2 (conductin, axil) 3.4 3.8

homeo box A10 6.5 3.8

KIAA0665 gene product 3.8 3.8

ESTs, Weakly similar to I38022 hypotheti 2.3 3.8

ESTs 6.2 3.8
422975 AA347720 Hs.122669 KIAA0264 protein
425878 AW964806 Hs.38085 Homo sapiens, Similar to RIKEN cDNA 0610
414664 AA587775 Hs.66295 multi-PDZ-domain-containing protein
437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr
421828 AW891965 Hs.279789 histone deacetylase 3
418821 AA436002 Hs.183161 ESTs
422049 W25760 Hs.77631 glycine cleavage system protein H (amino
453160 AI263307 Hs.239884 H2B histone family, member L
424906 AJ566086 Hs.153716 Homo sapiens mRNA for Hmob33 protein, 3'
446554 AA151730 Hs.301789 nudix (nucleoside diphosphate linked moi
452259 AA317439 Hs.28707 signal sequence receptor, gamma (translo
448807 AI571940 Hs.7549 ESTs
439209 AF087993 Hs.91954 ESTs
420077 AW512260 Hs.87767 ESTs
451009 AA013140 Hs.115707 ESTs
452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL
452561 AI692181 Hs.49169 KIAA1634 protein
447033 AI357412 Hs.157601 ESTs
427138 N77624 Hs.173717 phosphatidic acid phosphatase type 2B
442991 BE281238 Hs.8886 hypothetical protein FLJ20424
414680 AA743331 hemoglobin, alpha 2
417380 T06809 Hs.332086 ESTs
425710 AF030880 Hs.159275 solute carrier family, member 4
441247 AW118681 Hs.128051 Homo sapiens thymic stromal lymphopoieti



426931 
417873 
432101 
429302 
435513 
440643 
418293 
425900 
431689 
440749 
408374 
400277 



NM_003416 Hs.2076
BE266659 Hs.293659 



AI918950 

AU076674 

AW4O4075 

M78878 

AI224483 

AK000166 

AA305688 

W22335 

AW025430 



Hs.123642 

Hs. 198899 

Hs.42785 

Hs.7314 

Hs.16063 

Hs.288809 

Hs.267695 

Hs.7392 

Hs.155591 



432169 


Y00971 


Hs.2910 


437872 


AK002015 


Hs.5887 


443912 


R37257 


Hs.184780 


414565 


AA502972 


Hs.183390 


448164 


R61680 


Hs.26904 


450094 


AI174947 


Hs.295789 


424800 


AL035588 


Hs.153203 


451367 


AA923729 


Hs.26322 


445472 


AB006631 


Hs.12784 


408101 


AW968504 


Hs.123073 


421246 


AW582962 


Hs.102897 


426227 


U67058 


Hs.154299 


402802 






419168 


AI336132 


Hs.33718 


436207 


AA334774 


Hs.12845 


453864 


AW021407 


Hs.21068 


448944 


AB014605 


Hs.22599 


444489 


AI151010 


Hs.157774 


453124 


AI139058 


Hs.125790 


453078 


AF053551 


Hs.31584 


439444 


AI277652


Hs.54578 


413492 


D87470 


Hs.75400 


434804 


AA649530 


Hs.348148 


445840 


AI277811


Hs.146291 


423566 


AW976434 


Hs.3623 


408161 


AW952912 


Hs.300383


414922 


D00723 


Hs.77631 


430519 


AF129534


Hs.49210 


430487 


D87742 


Hs.241552 


423554 


M90516 


Hs.1674 


430226 


BE245562 


Hs.2551 


400296 


AA305627 


Hs.139336 


420154 


AI093155 


Hs.95420 


453293 


AA382267 


Hs.10653 


457718 


F18572 


Hs.22978 


447476 


BE293466 


Hs.20880


419440 


AB020689 


Hs.90419 


433285 


AW975944 


Hs.237396 


450671 


AI356967


Hs.43086 


445921 


AW015211 


Hs.146181 


416882 


AI633044 




414052 


AW578849


Hs.283552 


450580 


N40087 




447726 


AL137638 


Hs.19368 


447816 


NM_007233 


Hs.274329 


420407 


AA814732 


Hs.145010 


422299 


AK000181 


Hs.114556 



zinc finger protein 7 (KOX 4, clone HF.1
Homo sapiens, Similar to RIKEN cDNA A430
EphA3

eukaryotic translation initiation factor
DC11 protein
KIAA0614 protein
hypothetical protein FLJ21877
hypothetical protein FLJ20159
UDP-Gal:betaGlcNAc beta 1,3-galactosyltr
hypothetical protein MGC3199
forkhead box F1
Eos Control

phosphoribosyl pyrophosphate synthetase

RNA binding motif protein 7

ESTs

hypothetical protein FLJ13590

ESTs, Moderately similar to Z195_HUMAN Z

Homo sapiens mRNA; cDNA DKFZp564D1164 (f

MyoD family inhibitor

cell cycle related kinase

Homo sapiens mRNA for KIAA0293 gene, par

CDC2-related protein kinase 7

CGI-47 protein

Human proteinase activated receptor-2 mR

NM_001397:Homo sapiens endothelin conver

Homo sapiens cDNA FLJ12641 fis, clone NT

hypothetical protein MGC13159

hypothetical protein

atrophin-1 interacting protein 1; activi

ESTs

leucine-rich repeat-containing 2
metaxin 2

ESTs, Weakly similar to I38022 hypotheti
KIAA0280 protein

gbais44f05.s1 NCLCGAPJUvl Homo sapiens
ESTs

hypothetical protein FLJ11220
hypothetical protein MGC3032
glycine cleavage system protein H (amino
F-box only protein 4
KIAA0268 protein

glutamine-fructose-6-phosphate transamin
adrenergic beta-2-, receptor, surface
ATP-binding cassette, sub-family C (CFTR
JM27 protein
ESTs

ESTs, Weakly similar to ALU4_HUMAN ALU S
ESTs, Weakly similar to I38022 hypotheti
KIAA0882 protein
ESTs

ESTs, Weakly similar to A46010 X-linked
ESTs

tryptophanyl tRNA synthetase 2 (mitochon
ESTs, Weakly similar to unnamed protein
ESTs
matrilin 2

TP53 target gene 1

lipopolysaccharide-specific response 5-li
hypothetical protein FLJ20174



2.2
3.8 
3.8 
2.7 
4.1 
2.8 
3.1 

9.2

3.7 

3.7 

3.7 

3.6 

2.1

4.4 

4.1 

3.6 

3.2 

7.7 

3.6 

3.6 

3.5 

3.5 

3.5 

3.5 

2.2 

3.5 

16.5 

3.3 

2.2 

2.1

6.7 

2.1 

2.1 

2.5 

4.3 

2.4

13 

3.3 

3.3 

9.9 

3.3 

4.4 

3.6 

3.3 

11.5 

2.0

3.2 

3.2 

4.3 

2.7

3.2 

3.2 

3.8 

3.2 

3.2 

2.1

10.6

2.8

2.1

3.1

2.2

3.1 

3.1 

3.1 

16.2 

3.1 

9.0 

20.9 

27.4 

3.7 

3.0 

2.4

2.9 

7.7 

3.0 

4.3 

3.5 

2.5 

3.0 

3.9 

7.2 

5.1 

2.6



3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

3.7 

3.7 

3.7 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.1

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.0 

3.0 

3.0

3.0 

3.0 

3.0 

3.0 

3.0 

3.0 

2.9

2.9

2.9

2.9
420297
427701 
437812 
445808
426500 
442320 
418407 
433050 
451796 
407783 
459029 
441676 
439352 
453628 
412324 
408063 
421928 
421247 
433865 
423201 
450832 
416913 
417251 
427144 
444668 
434015 
432689 
423482 
448206 
450546 
452270 
458332 
436109 
422867 
412576 
413336 
433577 
439093 
442833 
447401 
409619 
432729 
452721 
447342 
409205 
422805 
446238 
447313 
411630 
410592 
417173 
417601 
433419 
447509 
424580 
407252 
420061 
424432 
417683 
414312 
417135 
437748 
422728 
453070 
406627 
407151 
425815 
433805 
431474 
436204 
439328 
449115 
437531 
445424 
403047 
416198 
417576 
430607 
436401 
450654 
426216 
431325 
410102 



AI628272 
AM11101 
AI5B2291 
AV655234 
AW863261 
AI287817 
AL044818 
AJ093930 
AL133019 
AW996872 
AA131376 
BE564206 
BE614347 
AW243307 
AW97B439 
BE086548 
AF013758 
BE391727 
N29862 
NM_000163 
AW970802 
AW934714 
AW015242 
X95097 
AA654650 
AAB44518 
AB018320 
BE280172 



Hs.88323 

Hs.243886 

Hs.16846 

Hs.242413 

Hs.129636 

Hs.84928 

Hs.163440 

Hs.27038 

Hs.172028 

Hs.343809 

Hs.49889 

Hs.169615 

Hs.83937 

Hs.69504 

Hs.42346 

Hs.109643 

Hs.102910 

Hs.44104 

Hs.125180 

Hs.105421 

Hs.99488 
Hs.2126 
Hs.282906 
Hs.300876 



AA010200 
AW975014 
AI000341 
M922153 
L32137 
AA447718 
A1569936 
AW007080 
AA534163 
AA328153 
BE618582 
AK001015 
AK000292 
AJ269529 
AI199268 
AI952884 
AA436989 
T95143 
1)92981 
U42349 
R94088 
U61397 
NM_014735 
AI83Q342 
AF 107454 
AA446539 
AA659037 
AW024937 
AB037821 
AW566008 
AA155694 
AA422067 
AF234882 
AW937826 
AK001465 
T64904 
H25836 
R94023 
AA706910 
AL133990 
AK002014 
W07411 
AW959952 
AI400752 
AB028945 

H27332 

AA339449 

AW973521 

AI087958 

AJ2455B7 

K77630 

AW026751 

AW248508 



Hs.129228 
Hs.3731 
Hs.175551 
Hs.26 
Hs.220491 
Hs.132760 
Hs.1584 
Hs.107057 
Hs.296178 
Hs.284192 
Hs.5476 
Hs.88201 
Hs.97661 
Hs.55220 
Hs.130732 
Hs.301871 
Hs.19322 
Hs.14832 
Hs.121017 
Hs.14511 
Hs.18081 
Hs.71119 
Hs.43569 
Hs.81424 
Hs.82292 
Hs.211272 
Hs.107537 
Hs.339024 
Hs.163780 
Hs.29410 
Hs.146858 
Hs.239154 
Hs.191060 
Hs.50547 
Hs.5814 
Hs.103262 
Hs.31575 
Hs.163780 
Hs.301527 
Hs.94560 
Hs.112742
Hs.190642 
Hs.47546 
Hs.118212 
Hs.37528 
Hs.112259 
Hs.12696 

Hs.99598 
Hs.82285 
Hs.247324 
Hs.29088 
Hs.25275 
Hs.13895 
Hs.5794 
Hs.279727 



ESTs, Weakly similar to ALU1_HUMAN ALU S
nuclear autoantigenic sperm protein (his
ESTs, Weakly similar to 04HUD1 debrisoqu
ESTs, Moderately similar to PC4259 ferri
hypothetical protein DKFZp434K1421 
ESTs 

nuclear transcription factor Y, beta 

Homo sapiens cDNA: FLJ21000 fis, clone C

Homo sapiens mRNA; cDNA DKFZp434G2127 (f 

a disintegrin and metalloproteinase doma 

fibroblast growth factor 12B 

ESTs 

hypothetical protein FLJ20989
hypothetical protein 
ESTs 

calcineurin-binding protein calsarcin-1
polyadenylate binding protein-interactin
general transcription factor IIH, polype 
ESTs 

growth hormone receptor 
ESTs 

gb:RC1-DT0001-031299-011-a11 DT0001 Homo
ESTs, Weakly similar to YK54_YEAST HYPOT
vasoactive intestinal peptide receptor 2 
ESTs 

hypothetical protein FLJ13386
Arg/Abl-interacting protein ArgBP2
galactokinase 2

ESTs, Moderately similar to I38022 hypot
ESTs 

ferrochelatase (protoporphyria) 
ESTs 

hypothetical protein MGC15729 
cartilage oligomeric matrix protein (pse 
ESTs 

hypothetical protein FLJ22637
ESTs 

Homo sapiens, clone IMAGE:3530123, mRNA, 
ESTs, Weakly similar to A Chain A, Cryst 
ESTs 

BCL2-associated athanogene 2
hypothetical protein FLJ20285
solute carrier family 37 (glycerol-3-pho
Homo sapiens, Similar to RIKEN cDNA 2010
ESTs, Moderately similar to unnamed prot
H2A histone family, member A
SCO (cytochrome oxidase deficient, yeast
Homo sapiens clone DT1P1B6 mRNA, CAG rep 
Putative prostate cancer tumor suppresso 
ESTs 

ubiquitin-like 1 (sentrin)
KIAA0215 gene product
ESTs 

chromosome 7 open reading frame 2 
ESTs, Weakly similar to A46010 X-linked
ESTs 
ESTs 

protocadherin 10 

ankyrin repeat, family A (RFXANK-like), 
ESTs 
ESTs 

suppression of tumorigenicity 7
ESTs, Weakly similar to ZN91_HUMAN ZINC
SEC63, endoplasmic reticulum translocon
ESTs 

ESTs, Moderately similar to unknown (H.S
ESTs, Moderately similar to I38022 hypot
ESTs 
ESTs 

Homo sapiens cDNA FLJ11458 fis, clone HE
ESTs, Moderately similar to ALU3_HUMAN A
ESTs, Weakly similar to AF090944 1 PRO06
T cell receptor gamma locus 
cortactin SH3 domain-binding protein 
NM_005656*:Homo sapiens transmembrane pr
hypothetical protein MGC5338
phosphoribosylglycinamide formyltransfer
mitochondrial ribosomal protein S14
ESTs 

Kruppel-type zinc finger protein 
Homo sapiens cDNA FLJ11654 fis, clone HE
ESTs, Weakly similar to 2109260A B cell
Homo sapiens cDNA FLJ14035 fis, clone HE
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19 

3.0 

2,9 

4.4 

11 

19 

2.0 

5.0 

6.4 

18 

18 

6.3 

2.1 

2.8 

2.8 

2.8 
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3.2 

2.1 

14 

2.8 

2.2 

3.6 

4.9 

2.7 

11 

2.7 

17 

17 

17 

2.7 

2.7 

17 

12 

3.2 

9.0 

14 

13.6 

11 

8.1 

6.2 

16 

16 

16 

19 

10 

14 

16 

14 

7.9 

17 

16 

16 

16 

9.0 

12 

8.7 

14 

16 

16 

9.3 

12 

3.3 

13 

16 

3.4 

21.1 

15 

15 

15 

15 

15 

3.0 

11.9 

15 



2.9 
19 
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AI632278 

U37519 

AI579909 
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447210 AF035269 

425294 AF033827 

452741 BE392914 

447702 AI420256 

412043 BE156622 

421413 A1826128 

429128 AA446869 

429258 AA448765 



Hs.195922 

Hs.87539 

Hs.105104 

Hs.17752 

Hs.155553

Hs.30503 

Hs.161271 

Hs.333371 

Hs.57637 

Hs.119316



435102 
456161 
407021 



AW899053 Hs.76917 
BE264645 Hs.282093 
U52077 



437517 


A1927675 


Hs.244482 


408267 


AW380525 


Hs.343564 


447713 


A1420733 


Hs.207083 


456848 


AL121087 


Hs.296406 


437617 


AI026701 


Hs.5716 


432378 


AI493046 


Hs.146133 


443180 


R15875 


Hs.258576 


445929 


AI089660 


Hs.323401 


424701 


NM_005923


Hs.151988 


445255 


NM_014841


Hs.12477 


409731 


AA125985


Hs.56145 


421709 


AA159394


Hs.107056 


412707 


AW206373 


Hs.16443 


418882 


NM_004996


Hs.89433 


451071 


AW138807


Hs.204939 


420344 


BE463721


Hs.97101 


433037 


NM_014158 


Hs.279938 


418329 


AW247430 


Hs.84152 


419544 


AI909154 




424602 


AK002055 


Hs.151046


450693 


AW450461 


Hs.203965 


452387 


AI680772


Hs.306094 


412045 


AA099802 


Hs.83883 


428180 


AI129767


Hs.182874


419438 


AA406400


Hs.12482 


443194 


AI954958 


Hs.279009 


414669 


AA157291 


Hs.21479 


407300 


AA102616 


Hs.120769


436396 


AI683487


Hs.152213 


439221 


AA737106 


Hs.32250 


453802 


AL134757 




421059 


AI654133 


Hs.30212 


426413 


AA377823 




431147 


AI767751 


Hs.20300 


442677 


AI557914 




446091 


AW022192 


Hs.200197 


431563 


AKJ27643 


Hs.120912 


439478 


AF049460 


Hs.6574 


413049 


NM_002151


Hs.823 


405685 






407917 


U63139 


Hs.41587 


450378 


AW249181 


Hs.19954 


429303 


AW137635 


Hs.44238 


417248 


AA329449 


Hs.247302 


429277 


AW452016 


Hs.127863 


431555 


AI815470 


Hs.260024 


440099 


AL080058 


Hs.6909 


416653 


AA768553 


Hs.193145 


431542 


H63010 


Hs.5740 


401558 






418281 


U09550 


Hs.1154


422295 


AF051151 


Hs.114408 


432453 


AI885537 


Hs.27172 



436679 
425263 



Hs.22051 
Hs.120451 



447499 AW262580 Hs.147674 

452994 AW962597 Hs.31305 

450325 AI935962 Hs.26289 

422390 AW450893 Hs.121830 
448769 BE539108 
AI127483 

NM.001197 Hs.155419 

451939 U80456 Hs.27311 

410252 AW821182 Hs.61418 
439699 AF086534 
422576 BE548555 
433293 AF007835 
D84284 
Y15221 



410733 
421379 
442995 AA532511 
408833 AW612232 



Hs.187561 
Hs.118554
Hs.32417 
Hs.66052 
Hs.103982 
Hs.288455 
Hs.254835 



ESTs 

aldehyde dehydrogenase 3 family, member 
ESTs 

phosphatidylserine-specific phospholipas

HNK-1 sulfotransferase

Homo sapiens cDNA FLJ11344 fis, clone PL

ESTs

Homo sapiens clone TA40 untranslated mRN
ESTs, Weakly similar to A49354 59 protei 
ESTs 

gb:zx10e09.r1 Soares_total_fetus_Nb2HFB_ 

F-box only protein 8 

hypothetical protein FLJ21918

gb:Human mariner1 transposase gene, comp

Homo sapiens, clone IMAGE:3611719, mRNA,

tubulin-specific chaperone e

ESTs 

KIAA0685 gene product 
KIAA0310 gene product 
ESTs 
claudin 12
dpy-30-like protein 

mitogen-activated protein kinase kinase 
synaptosomal-associated protein, 91 kDa 
thymosin, beta, identified in neuroblast 
CED-6 protein 

Homo sapiens cDNA: FLJ21721 fis, clone C
ATP-binding cassette, sub-family C (CFTR 
ESTs, Highly similar to A54677 homeotic 
putative G protein-coupled receptor 
HSPC067 protein 
cystathionine-beta-synthase 
gb:QV-BT200-010499-007 BT200 Homo sapien
hypothetical protein FLJ11193
ESTs 

trinucleotide repeat containing 12
transmembrane, prostate androgen induced
guanine nucleotide binding protein (G pr
glyceronephosphate O-acyltransferase
matrix Gla protein
ubinuclein 1

gb:zn43e07.s1 Stratagene HeLa cell s3 93
wingless-type MMTV integration site fami
ESTs, Moderately similar to I78885 serin
gb:DKFZp547G1090j1 547 (synonym: hfbrlj 
thyroid receptor interacting protein 15 
gb:EST90B05 Synovial sarcoma Homo sapien 
ESTs 

peroxisomal acyl-CoA thioesterase

ESTs 

ESTs 

deformed epidermal autoregulatory factor
hepsin (transmembrane protease, serine 1
C2002829:gi|4507689|ref|NP_003298.1| tra
RAD50 (S. cerevisiae) homolog
ESTs, Weakly similar to T19873 hypotheti
ESTs, Weakly similar to S65657 alpha-1C
twisted gastrulation

ESTs, Weakly similar to SFR4_HUMAN SPLIC
Cdc42 effector protein 3
DKFZP564G202 protein
metallothionein 1E (functional)
ESTs

ENSP00000220478*:SECRETOGRANIN III.
oviductal glycoprotein 1, 120kD (mucin 9
toll-like receptor 5

ESTs, Moderately similar to PC4259 ferri
protocadherin beta 16 
KIAA1547 protein 
ESTs 

ESTs, Weakly similar to T42682 hypotheti
hypothetical protein MGC15548
ESTs, Weakly similar to unnamed protein
BCL2-interacting killer (apoptosis-induc
single-minded (Drosophila) homolog 2
microfibrillar-associated protein 1
ESTs, Moderately similar to ALU1_HUMAN A
CGI-83 protein 

hypothetical protein MGC4309
CD38 antigen (p45)

small inducible cytokine subfamily B (Cy
Homo sapiens cDNA: FLJ23270 fis, clone C
ESTs 



2.5 
2.5 
2.7 
3.0 
11 
25 
2.1 
24 
2.4 
2.0 
24 
2.4 
3.1 
7.3 
2.1 
4,3 
2.4 
2.2 
2.6 
5.2 
10.1 
2.2 
2.3 
2.5 
27.5 
2.3 
4.7 
2.4 
24 
3.1 
24 
. 4.1 
23 
21 
2.3 
2.3 
4.1 
ZQ 
2.7 
2.1 
2.5 
2.3 
2.3 
Z3 
4.1 
7.9 
VI 
5.8 
2.5 
4.0 
Z2 
Z1 
3.5 
2.3 
Z3 
23 
2.3 
2.3 
Z6 
4.5 
Z9 
3.4 
7.0 
2.2 
2.2 
12 
U 
2.2 
4.0 
11.9 
2.1 
2.6 
11.8 
3.2 
13.6 
4.2 
4.8 
2.1 
11.0 
3.1 
2.1 
2.2 
3.0 
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25 

25 

25 

25 

25 

24 

24 

24 

24 

24 

2.4 

24 

2.4 

24 

24 

24 

2.4 

2.4 

2.4 

24 

2.4 

24 

24 

24 

24 

24 

24 

2.4 

23 

23 

2.3 

2.3 

2.3 

23 

23 

23 

23 

23 

2.3 

2.3 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

2.3 

23 

23 

23 

23 

23 

2.2 

22 

22 

22 

22 

22 

22 

22 

22 

22 

22 

22 

22 

22 

22 

22 
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22 

22 

2.2 

22 

22 

22 
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425628 
424036 
440146 
404641 
407253 
406068 
415162 
423349 
424534 



AI186431 Hs.296638 

AW975324 Hs.129816 

AW449650 Hs.346335 
AF075009 

NMJJ04476 Hs.1915 
AA770688 

AW014231 Hs.90790 

AA411175 Hs.141939 



AF035718 
AF010258 
D87682 



407756 AA1 16021 
440658 H29142 



416547 

432281 

447533 

448968 

449897 

452959 

427083 

408291 

418727 

440491 

433466 

432695 

427982 

445873 

434569 

441791 

438520 

414269 

410660 

456660 

450861 

419193 

423784 

416208 

407182 

421689 

432833 

425170 

436278

438719 

432728 

410310 

429716 

424081 

433800 

413627 

414133 

411096 

450244 

422493 

409219 

419986 

421969 

434128 

434503 

443292 

443555 

448172 

452039 

400290 

449625 

441406 

451369 

434011 

408806 

433388 

432653 

446783 

437323 

410076 

421501 

419733 



H62914 
AK001239 



Hs.78061 

Hs.127428 

Hs.150275 

Hs.38260 

Hs.143032 

Hs.268946 

Hs.274263 



NM_004786 Hs.18792 

AW372914 Hs.86149 

AW819642 Hs.24135 

AI933416 Hs.169674 

NMJJ06363 Hs.173497 

AB023191 Hs.44131 



AA227609 
R35252 
AA508353 
D63480 



Hs.94834 
Hs.130558 
Hs.105314 
Hs.278634 



NM 016156 Hs.181326 
AA25097O Hs.251946 



AI311295 

AW372449 

AA706319 

AA298489 

AI061118 

AA909249 

AI523898 

D29643 

AK000039 

AW291168 

AA312551 

N87820 

N51075 

AU077315 

BE396290 

AA357129 



Hs.344478 

Hs.61271 

Hs.98416 

Hs.303171 

Hs.65328 

Hs.1 12282 

Hs.17617 

Hs.34789 

Hs.1 32826 

Hs.41295 

Hs.230157 

Hs.106826 

Hs.1 10028 

Hs.154970 

Hs.5097 

Hs.239625 



NM_006979 Hs.278721 

J02931 Hs.62192 

R25685 Hs.211933 

NM_006413 Hs.139120 

AKJ34361 Hs.135150 



BE182082 

AW022188 

U80034 

AA007534 

AW474183 

AA393383 . 

A1345455 

AF052217 

W93170 

T96231 

AK000213 

N71710 

N75276 



H18B36 

NM 014253 

Z45957 

AA017321 

AW953437 

AW847814 

AW32672 



AW138343 

AA371145 

T05387 

M29971 

AW362955 



Hs.246973 

Hs41167 

Hs.68583 

Hs.1 25062 

Hs.250173 

Hs.133331 

HsJ8915 

Hs.110099 

Hs.284164 

Hs.17762 

Hs.9196 

Hs.21398 

Hs.135904 

Hs.172510 

Hs.31608 

Hs.7837 

Hs.269691 

Hs.5486 

Hs.289005 

Hs.288539 

Hs.293185 

Hs.141867 

Hs.194397 

Hs.7991 

Hs.1384 



prostate differentiation factor 4.4 2.2 

ESTs H.7 2.2 

ESTs 2.6 12 

gb:Homo sapiens full length insert cDNA 5.9 2.2

folate hydrolase (prostate-specific memb 12.8 2.2 

H2A histone family, member L 3.8 2.2 

Homo sapiens cDNA: FLJ22930 fis, clone K 4.8 2.2

NM_021965*:Homo sapiens phosphoglucomuta 2.2 2.2

ESTs, Moderately similar to S65657 alpha 2.2 2.2

C2002008:gi|7303957|gb|AAF59000.1|(AE00 3.1 2.1

transcription factor 21 2.1 2.1 

homeoboxA9 5-9 11 

KIAA0241 protein 13 2.1

ubiquitin specific protease 18 2.5 2.1

ESTs, Weakly similar to neuronal thread 2.5 2.1 

ESTs, Weakly similar to PC4259 ferritin 2.1 2.1

hypothetical protein FLJ10377 2.1 2.1

thioredoxin-like, 32kD 2.1 2.1

phosphoinositol 3-phosphate-binding prot 2.1 2.1

transmembrane protein vezatin; hypotheti 2.1 2.1

ESTs 2.1 2.1 

Sec23 (S. cerevisiae) homolog B 2.8 2.1

KIAA0974 protein 2.1 2.1

ESTs 2.1 2.1

ESTs, Weakly similar to 2109260A B cell 2.1 11

relaxin 1 (H1) 17.3 H

KIAA0146 protein 13 11 

KIAA1073 protein . 3.2 11 

poly(A)-binding protein, cytoplasmic 1-1 2.5 11 

KIAA0196 gene product 2.1 11

hypothetical protein FLJ21159 11 2.1

ESTs 17 11

olfactory receptor, family 51, subfamily 17.3 2.1

Fanconi anemia, complementation group F 3.5 11 

solute carrier family 30 (zinc transport 7.4 11 

ESTs 14 11 

dolichyl-diphosphooligosaccharide-protei 11 11

Homo sapiens cDNA FLJ14913 fis, clone PL 3.2 11

ESTs, Weakly similar to MUC2_HUMAN MUCIN &4 11 

ESTs H H 

KIAA1696 protein H 2.1

ESTs H 2.1 

transcription factor CP2 16 2.0 

synaptogyrin 2 16 1° 

integral membrane protein 2B 2.3 10 

HLA class II region expressed gene KE4 3.6 10

coagulation factor III (thromboplastin, 2.0 10 

collagen, type Mil, alpha 1 3.6 10 

ribonuclease P (30kD) 10 10

lung type-I cell membrane-associated gly 10 10

ESTs 3.3 2.0 

ESTs 2.3 10 

mitochondrial intermediate peptidase 14.7 10 

ESTs 3.1 10 

hypothetical protein FLJ13158 16 10

ESTs ' H 10

GA-binding protein transcription factor, 10 2.0

core-binding factor, runt domain, alpha 10 10 

protein x 0004 17 10 

ESTs 10 2.0 

hypothetical protein 10 10 

ESTs, Moderately similar to A Chain A, H 2.0 10 

ESTs 6.0 10 

ESTs 10 10 

hypothetical protein FLJ20041 6.2 10

odz (odd Oz/ten-m, Drosophila) homolog 1 10.1 10

phosphoprotein regulated by mitogenic pa 13 10

ESTs H 10 

clone FLB5214 2.1 10 

Homo sapiens cDNA: FLJ21532 fis, clone C 5.4 10

hypothetical protein FLJ22191 17 10

ESTs, Weakly similar to JC732B amino aci 5.5 2.0 

ESTs 4.4 10 

leptin receptor 15 10

ESTs 19 10

O-6-methylguanine-DNA methyltransferase 11 10

Homo sapiens cDNA FLJ14415 fis, clone HE 15 10




TABLE 61B






WO 03/042661 



PCT/US02/36810 




Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers



Pkey 
414680



416913 
419544 
419733 



426413 
429258 
431676 



438859 
439518 
442677 

445808
449625 

450580 

453802 



CAT Number 
147525J 



416882 162718.1 



163001J 
165760.2 
187589J 



424036 23460.1 



266650.1 
301917.1 
336411.1 
35275.1 

46651J 
47334J 
548626.1 

65133.1 
8113.1 



!9_1 
981589J 



a5^31 AA837388AW664540M775711 AA150965BE465475 AA968994 AA886905 AI141054 AW194991 AA2521 47 AA31 9500 AJ1 84288 

AJ824624 AW189606 AI631751 Z40749 A1984673 A1671316 AA189024 AW235412 BE1784 Br . 4MjlM 
AWM4714 BE161007 BE162500 AW749902 AW749854 BE162498 BE161005 AA190449 AW513465 BE161006 BE162499 

AIQ51970 AW044233 N20018 AW663548 T90114 AI139947 AA809643 AA846232 AA581966 AA7690Q2 

SS» AA989300 AA904029 BE254211 AA449148 A1268420 AJ300495 AI215637 AI300494 AI268551 

AA928971 AA179427 AA947684 BE393792 H98018 A1885781 AI188567 AI290658 C15404 
AA377823 AW954494 AI022688 
AA448765 C04967 C03045 AA65B293 

AA002027 AI814851 AA902666 A1039729 AW975053 BE302243 AI240793 AA193203 N5581 
AF075009R63109 R63068 

^fummJm^ AIB14081 WB106B AW182B26 AW173296 AI376594 AI220500 BE257195 BE246486 R55637 C20788 AJ014407 
A1248353AW028015 

H^^Sm^m AL022718 BE161779 AW863569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760 

AVW^ 
AL134757AW079131 
Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22", Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey      Ref        Strand   Nt_position
401197    9719705    Plus     176341-176452
401519    6649315    Plus     157315-157950
401558    7139678    Plus     103510-104090
402802    3287156    Minus    53242-53432
403047    3540153    Minus    59793-59968
404641    9796810    Minus    32247-32362
405685    4508129    Minus    37956-38097
406068    9114084    Plus     382-543
Table 62A lists about 600 genes up-regulated in prostate cancer compared to normal prostate tissues. These were selected
ratio was evaluated.

TABLE 62A: ABOUT 600 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE TISSUES

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of prostate tumor to normal prostate tissue
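
The column definitions above can be illustrated with a small parsing sketch. This is not part of the original disclosure: it assumes a table row has been recovered onto a single line in the order Pkey, ExAccn, UnigeneID, Unigene Title, R1, and the names `Row`, `ROW_RE` and `parse_row` are hypothetical. The accession and UniGene fields are treated as optional because some rows in the table omit them.

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    """One Table 62A record (hypothetical container, not from the source)."""
    pkey: str                   # Eos probeset identifier
    ex_accn: Optional[str]      # exemplar Genbank accession, if present
    unigene_id: Optional[str]   # UniGene cluster such as "Hs.77367", if present
    title: str                  # UniGene title (often truncated in the table)
    r1: float                   # ratio of prostate tumor to normal prostate tissue


# Assumed layout: 6-digit Pkey, optional accession, optional Hs.N, title, ratio.
ROW_RE = re.compile(
    r"^(?P<pkey>\d{6})\s+"
    r"(?:(?P<accn>[A-Z]{1,2}_?\d+)\s+)?"
    r"(?:(?P<unigene>Hs\.\d+)\s+)?"
    r"(?P<title>.+?)\s+"
    r"(?P<r1>\d+(?:\.\d+)?)$"
)


def parse_row(line: str) -> Optional[Row]:
    """Return a Row for a well-formed line, or None if it does not match."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return Row(m["pkey"], m["accn"], m["unigene"], m["title"], float(m["r1"]))


if __name__ == "__main__":
    print(parse_row("414812 X72755 Hs.77367 monokine induced by gamma interferon 5.6"))
```

Lines damaged by text extraction (split identifiers, stray margin numbers) will simply fail to match and return None, so they can be set aside for manual review rather than silently misparsed.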



Pkey 

444151 

421566 

427274 

420729 

412326 

416188 

417696 

424247 

401197 

426793 

450096 

449156 

443271 

434078 

423396 



ExAccn 

AW972917 

NM_000399

NM_005211

AW964897 

R07566 

BE157260 

BE241624 

X14008 

X89887 
AI682088 
AF103907 
BE568568 



AI382555



UnigeneID Unigene Title

Hs.128749 alpha-methylacyl-CoA racemase

Hs.1395 early growth response 2 (Krox-20 (Drosop

Hs.174142 colony stimulating factor 1 receptor, fo

Hs.290825 ESTs

Hs.73817 small inducible cytokine A3 (homologous

Hs.79070 v-myc avian myelocytomatosis viral oncog

Hs.82401 CD69 antigen (p60, early T-cell activati
Hs.234734 lysozyme (renal amyloidosis)

ENSP00000229263*:HSPC213.
Hs.172350 HIR (histone cell cycle regulation defec
Hs.79375 holocarboxylase synthetase (biotin-[prop
Hs.171353 prostate cancer antigen 3, non-coding DD
Hs.195704 ESTs

Hs.283683 chromosome 8 open reading frame 4
Hs.127950 bromodomain-containing 1



R1 

42.2 
26.3
16.0 
15.8 
14.7 
13.6 
13.1 
12.6 
12.6 
12.6 
12.4
12.3 
11.6 
11.2 
11.2 
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419553 
418036 
410023 
421684
426490 
437866 
440274 
431117 
405141 
448520 
417315 
433332 
456614 
427019 
432606 
417381 
425174 
445701 
416182 
445230 
421913 
434217 
417363 
425782 
442501 
429503 
429984 
443837 
449048 
444880 
417407 
436962 
450313 
420120 
438613 
419078 
425905 
451900 
447769 
452436 
452359 
425018 
447737 
424897 
433867 
427308 
432359 
452221 
416051 
421181 
448076 
426759 
416292 
417302 
417022 
451957 
452859 
413431 
408405 
421552 
426108 
429490 
454219 
443020 
415752 
412482 
417880 
444381 
448019 
409799 
424308 
411373 
423445 
422583 
416196 
438209 
437147 
432415 



Y14443 
AB001914 
N34145 
Z37976 
AB017169 
BE281591 
NM_001621
AA156781 
R24595 
AF003522 
Y14443 
AB002367 
AI080042 
AI367347
AV653110 
AA001732 
NM_002104
AF164142 
D87450 
AF055581 
NM_004354
U97018 
AI934365 
AW014795 
AW129357 
U66468 
AA315267 
AA394183 
AL050102 
AI984625 
Z45051 
AW118683 
AA923278 
AW377314 



408242 
447965 



AL049610 
C05569 
M93119 
AB032959 
AB023199 
AW873704 
BE077546 
BE167229 
BE245277 
AK000643 
D63216 
AK000596 
D26067 
AA076049 
C21322 
AA835868 
NM_005574
AJ133123
AI590401
AA179233 
BE245812 
NMJM4737 
AI796320 
AI300555 
AW246428 
AK001332 
AF026592 
AA622037 
AI971131 
X75042 
A135005B 
BE314524 
AI499930
BE241595 
BE387335 
AW947164 
D11928 
AW975531 
BE326276 
NM_014324
AA410506 
AI745649 
AL120659 
AL049954 
T16971 
T15767 
AA251594 
AW292577 



zinc finger protein 200 
Hs.170414 paired basic amino acid cleaving system
Hs.10177 ESTs, Moderately similar to ZN91_HUMAN Z
Hs.83337 latent transforming growth factor beta b
Hs.57929 slit (Drosophila) homolog 3
Hs.106768 hypothetical protein FLJ10511
Hs.170087 aryl hydrocarbon receptor 

metallothionein 1E (functional)
Hs.7122 scrapie responsive protein 1
Hs.250500 delta (Drosophila)-like 1
zinc finger protein 200
doublecortin and CaM kinase-like 1
ribosomal protein S24

Homo sapiens clone TCCCTA00151 mRNA sequ 
hypothetical protein FLJ20533
hypothetical protein FLJ10970
granzyme K (serine protease, granzyme 3;
solute carrier family 23 (nucleobase tra
KIAA0261 protein
lymphocyte adaptor protein
cyclin G2

echinoderm microtubule-associated protei 
osteoglycin (osteoinductive factor, mime 
ESTs 
ESTs 

cell growth regulatory with EF-hand doma 
ESTs 
ESTs 

hypothetical protein FLJ21617
spindle pole body protein
similar to S68401 (cattle) glucose induc
ESTs 

ESTs, Weakly similar to protease [H.sapi 
DKFZP564I052 protein 
Bardet-Biedl syndrome 2
transcription elongation factor A (SII)-
hypothetical protein FLJ13057 similar to
insulinoma-associated 1 
novel C3HC4 type Zinc finger (ring finge 
KIAA0982 protein 

Homo sapiens cDNA FLJ14597 fis, clone NT
ESTs, Moderately similar to A46010 X-lin
hypothetical protein MGC14376 
E4F transcription factor 1 
DKFZP564L0862 protein 
frizzled-related protein
hippocalcin-like 1
KIAA0033 protein 

Homo sapiens cDNA FLJ10229 fis, clone HE
hypothetical protein FLJ22242
mannosidase, alpha, class 1A, member 1
LIM domain only 2 (rhombotin-like 1)
adenylate cyclase 9 
ESTs 
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Hs.290905 
Hs.5364 
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Hs.174905 
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Hs.25253 
Hs.184585 
Hs.20196 
Hs.21213 
Hs.42390 
Hs.8941 
Hs.80905 
Hs.10299 
Hs.288158 
Hs.75355 
Hs.44672 
Hs.105700 
Hs.166468 
Hs.23889 
Hs.44313 
Hs.106129 
Hs.78776 
Hs.334885 
Hs.82846 
Hs.283713 
Hs.195641 
Hs.76845 
Hs.154443 
Hs.8861 
Hs.128749 
Hs.27973 
Hs.26549 
Hs.6111 
Hs.8358 
Hs.289014 
Hs.22452 
Hs.43913 
Hs.94445 



ESTs 

Ras association (RalGDS/AF-6) domain fam
Homo sapiens cDNA FLJ13545 fis, clone PL
hypothetical protein FLJ23591
ubiquitin-conjugating enzyme E2N (homolo
hypothetical protein FLJ10470
secreted frizzled-related protein 4
programmed cell death 5
ESTs, Weakly similar to ALU7_HUMAN ALU S
v-rel avian reticuloendotheliosis viral
ESTs 

putative transmembrane protein 
mitochondrial GTP binding protein 
selectin L (lymphocyte adhesion molecule 
ESTs, Weakly similar to S64054 hypotheti
ESTs, Moderately similar to I38022 hypot
phosphoserine phosphatase-like
minichromosome maintenance deficient (S.
ESTs 

alpha-methylacyl-CoA racemase
KIAA0874 protein
KIAA1708 protein

aryl hydrocarbon receptor nuclear transl

hypothetical protein FLJ20366

ESTs, Weakly similar to A43932 mucin 2 p

Homo sapiens mRNA for KIAA1737 protein,

PIBF1 gene product 

ESTs 
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449118 R67477 Hs.23103 Bell (S. cerevisiae) homolog 5.9 

427472 AA522539 Hs.131250 transposon-derived Buster3 transposase-l 5.9 

431548 AI834273 Hs.9711 novel protein 5.8 

417153 X57010 Hs.81343 collagen, type II, alpha 1 (primary oste 5.8

410889 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 5.8

414734 AA151712 Hs.82572 ESTs 5.8 

429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 5.8

419047 AW952771 Hs.90043 ESTs 5.8 

435080 AI831760 Hs.155111 hypothetical protein FLJ14428 5.8

445715 AB012958 Hs.13137 UV radiation resistance associated gene 5.8

422538 NM_006441 Hs.118131 5,10-methenyltetrahydrofolate synthetase 5.7

431055 AL157645 Hs.48793 sialyltransferase 6 (N-acetyllacosaminid 5.7

434540 NM_016045 Hs.3945 CGI-107 protein 5.7

452144 AA032197 Hs.102558 Homo sapiens, clone MGC:5352, mRNA, comp 5.7

407938 AA905097 Hs.85050 phospholamban 5.7

445467 AI239832 Hs.15617 ESTs, Weakly similar to ALU4_HUMAN ALU S 5.6

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.6 

448552 AW973653 Hs.20104 hypothetical protein FLJ00052 5.6

406672 M26041 Hs.198253 major histocompatibility complex, class 5.5

416111 AA033813 Hs.79018 chromatin assembly factor 1, subunit A ( 5.5

453942 AW190920 Hs.19928 hypothetical protein SP329 5.5

416619 AF013168 Hs.79393 tuberous sclerosis 1 5.5

432485 N90866 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 5.5

414516 AI307802 Hs.135560 ESTs, Weakly similar to T43458 hypotheti 5.5

414343 AL036166 Hs.323378 coated vesicle membrane protein 5.4

412490 AW803564 Hs.288850 Homo sapiens cDNA: FLJ22528 fis, clone H 5.4

416384 AU076903 Hs.79283 selectin P ligand 5.4 

453874 AW591783 Hs.36131 collagen, type XIV, alpha 1 (undulin) 5.4 

458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 5.4 

451684 AF216751 Hs.26813 CDA14 5.4

418758 AW959311 Hs.172012 hypothetical protein DKFZp434J037 5.4 

445757 AW449065 Hs.13264 KIAA0856 protein 5.4 

449907 AA004825 Hs.103281 ESTs 5.4 

419159 AW974945 Hs.268049 hypothetical protein 5.3 

421470 R27496 Hs.1378 annexin A3 5.3

411486 N85785 Hs.181165 eukaryotic translation elongation factor 5.3

429568 AI088691 Hs.208414 Homo sapiens mRNA; cDNA DKFZp564O0472 (f 5.3

403851 C5002154':gi|7299015|gb|AAF54217.1| (AE0 5.3 

428839 AI767756 Hs.82302 Homo sapiens cDNA FLJ14814 fis, clone NT 5.3

417426 NM_002291 Hs.82124 laminin, beta 1 5.3

424624 AB032947 Hs.151301 Ca2+-dependent activator protein for sec 5.3

426350 NM_003245 Hs.2022 transglutaminase 3 (E polypeptide, prote 5.3

436476 AA326108 Hs.33829 bHLH protein DEC2 5.3 

422746 NM_004484 Hs.119651 glypican 3 5.2

437571 AA760894 Hs.153023 ESTs 5.2

407902 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 5.2

446985 AL038704 Hs.156827 ESTs, Weakly similar to ALU1_HUMAN ALU S 5.2

447072 D61594 Hs.17279 tyrosylprotein sulfotransferase 1 5.2

449969 AW295142 Hs.180187 Homo sapiens cDNA FLJ14337 fis, clone PL 5.2

453204 R10799 Hs.191990 ESTs 5.2

452092 BE245374 Hs.27842 hypothetical protein FLJ11210 5.1

438510 AL080220 Hs.6285 DKFZP586P0123 protein 5.1

444917 R68651 Hs.144997 ESTs 5.1 

414212 AA136569 Hs.10848 KIAA0187 gene product 5.1 

451593 AF151879 Hs.26706 CGI-121 protein 5.1

417318 AW953937 Hs.240845 ESTs 5.1 

444172 BE147740 Hs.104558 ESTs, Moderately similar to 138022 hypot 5.1 

433600 R42833 Hs.22232 ESTs 5.0 

434170 AA626509 Hs.122329 ESTs 5.0

424090 X99699 Hs.139262 XIAP associated factor-1 5.0

408331 NM_007240 Hs.44229 dual specificity phosphatase 12 5.0

435706 W31254 Hs.7045 GL004 protein 5.0 

417421 AL138201 Hs.82120 nuclear receptor subfamily 4, group A, m 5.0 

432897 AF155099 Hs.279780 NY-REN-18 antigen 5.0

433560 AI925195 Hs.130891 hypothetical protein MGC4400 5.0

410762 AF226053 Hs.66170 HSKM-B protein 5.0 

421823 N40850 Hs.28625 ESTs 5.0 

417094 NM_006895 Hs.81182 histamine N-methyltransferase 4.9

425397 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 4.9

451244 AW008798 Hs.343877 hypothetical protein FLJ20039 4.9

453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR 4.9

424259 AK001776 Hs.143954 hypothetical protein FLJ10914 4.9

434094 AA305599 Hs.238205 hypothetical protein PRO2013 4.9

452323 W44356 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 4.9

437446 AA788946 Hs.101302 ESTs, Moderately similar to CA1C_RAT COL 4.9

411352 NM_002890 Hs.758 RAS p21 protein activator (GTPase activa 4.8

443062 N77999 Hs.8963 Homo sapiens mRNA full length insert cDN 4.6

425242 D13635 Hs.155287 KIAA0010 gene product 4.8

445800 AA126419 Hs.32944 tnositd polyphosphates-phosphatase, ty 4.8 

439024 R96696 Hs.35598 ESTs 4.8

407235 D20569 Hs.169407 SAC2 (suppressor of actin mutations 2, y 4.8

430935 AW072916 zinc finger protein 131 (clone pHZ-10) 4.7 

412953 Z45794 Hs.238809 ESTs 4.7 
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436576 AI458213 Hs.77542 ESTs 4.1 

443958 BE241880 Hs.10029 cathepsin C 4.1

410037 AB020725 Hs.58009 KIAA0918 protein 4.1

411968 AI207410 Hs.69280 Homo sapiens, clone IMAGE:3636299, mRNA, 4.1

421129 BE439899 Hs.89271 ESTs 4.1

421492 BE176990 Hs.104916 hypothetical protein FLJ21940 4.1

428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 4.1

430383 AI861854 Hs.210778 hypothetical protein FLJ10989 4.1

431475 AI567669 Hs.40342 putative nuclear protein 4.1

446468 AI765890 Hs.16341 MAWD binding protein 4.1

446664 AI879317 Hs.334691 splicing factor 3a, subunit 1, 120kD 4.1

404390 Target Exon 4.1 

408815 AW957974 Hs.25485 hypothetical protein FU22341 4.1 

417512 X76534 Hs.82226 glycoprotein (transmembrane) nmb 4.1 

414888 AL039185 Hs.77558 thyroid hormone receptor interactor 7 4.1

418295 AW970043 Hs.238039 hypothetical protein FLJ11090 4.1

432242 AW022715 Hs.162160 ESTs, Weakly similar to ALU4_HUMAN ALU S 4.1

433887 AW204232 Hs.279522 ESTs 4.1

436556 AI364997 Hs.7572 ESTs 4.1

430157 BE348706 Hs.278543 ESTs 4.1

451259 NM_006052 Hs.26146 Down syndrome critical region gene 3 4.1

458776 AV654978 Hs.19904 cystathionase (cystathionine gamma-lyase 4.1

400301 X03635 Hs.1657 estrogen receptor 1 4.1

443646 AI085198 Hs.164226 ESTs 4.1

451491 AI972094 Hs.286221 Homo sapiens cDNA FLJ13741 fis, clone PL 4.1

452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 4.1

430389 AL117429 Hs.240845 DKFZP434D146 protein 4.0

413275 W94748 Hs.48821 ESTs 4.0 

418432 M14156 Hs.85112 insulin-like growth factor 1 (somatomedi 4.0

432261 R42216 Hs.12342 Homo sapiens clone 24538 mRNA sequence 4.0

433409 AI278802 Hs.25661 ESTs 4.0

452186 AA120761 Hs.28307 WW domain binding protein 4 (formin bind 4.0

417187 AB011151 Hs.334659 hypothetical protein MGC14139 4.0

421921 H83363 Hs.6820 translocase of inner mitochondrial membr 4.0

447247 AW369351 Hs.287955 Homo sapiens cDNA FLJ13090 fis, clone NT 4.0

415710 J04543 Hs.78637 annexin A7 4.0

457794 AA689292 Hs.246850 ESTs 4.0 

451124 AI186203 Hs.31432 cardiac ankyrin repeat protein 4.0 

424401 H67220 Hs.169681 death effector domain-containing 4.0

435905 AW997484 Hs.5003 KIAA0456 protein 4.0

442731 AI868167 Hs.131044 ESTs 4.0 

418650 BE386750 Hs.86978 prolyl endopeptidase 3.9 

423749 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 3.9 

439394 AA149250 Hs.56105 ESTs 3.9

442287 AW952703 Hs.8182 synaptic nuclei expressed gene 1b 3.9

436758 AW977187 Hs.155272 ESTs 3.9

451030 H40697 Hs.181426 ESTs, Moderately similar to YSHUT threon 3.9

405934 Target Exon 3.9

439231 AW581935 Hs.141480 Homo sapiens mRNA; cDNA DKFZp434N079 (fr 3.9

444755 AA431791 Hs.113823 ClpX (caseinolytic protease X, E. coli) 3.9

447276 AL049795 Hs.17987 hypothetical protein MGC1203 3.9

431912 AI660552 Hs.76549 ESTs, Weakly similar to A561 54 AUsubst 3.9 

428493 AK001745 Hs.184628 hypothetical protein FLJ10883 3.9

412978 AW31708 Hs.820 homeobox C6 3.9

414962 AF273304 Hs.235376 XPMC2 protein 3.9

447482 AB033059 Hs.18705 KIAA1233 protein 3.9

449590 AA694070 Hs.268835 ESTs 3.9

450203 AF097994 Hs.301528 L-kynurenine/alpha-aminoadipate aminotra 3.9

450899 T77447 Hs.177664 ESTs 3.9 

423337 NM_004655 Hs.127337 axin 2 (conductin, axil) 3.8

452588 AA889120 Hs.110637 homeobox A10 3.8

422661 NM_014700 Hs.119004 KIAA0665 gene product 3.8

437231 BE303000 Hs.288929 hypothetical protein FLJ13258 similar to 3.8

450506 NM_004460 fibroblast activation protein, alpha 3.8

407317 AI204033 Hs.271461 ESTs, Weakly similar to I38022 hypotheti 3.8

410330 AW023630 Hs.159425 ESTs 3.8 

444143 AW747996 Hs.160999 ESTs, Moderately similar to A56194 throm 3.8

415156 X84908 Hs.78060 phosphorylase kinase, beta 3.8

422975 AA347720 Hs.122669 KIAA0264 protein 3.8
403100 C2001027*:gi|7296271|gb|AAF51562.1|(AE0 3.8

422449 AF168418 Hs.116784 thyroid hormone receptor interactor 4 3.8

424163 NM_007071 Hs.142245 HERV-H LTR-associating 3 3.8

424806 AA382523 Hs.105689 MSTP031 protein 3.8

425878 AW964806 Hs.38085 Homo sapiens, Similar to RIKEN cDNA 0610 3.8

432353 AA534489 gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens 3.8

443441 AW291196 Hs.92195 ESTs 3.8 

432529 A1989507 Hs.162245 ESTs 3.8 

432908 AI861896 Hs.304505 ESTs 3.8

OA 424723 BE409813 Hs.152337 protein arginine N-methyltransferase 3(h 3.8 

oO 424855 AW204725 Hs.25560 ESTs 3.8 

414664 AA587775 Hs.66295 multi-PDZMlorrain^taining protein 3.8 

437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr 3.8 

421828 AW891965 Hs.279789 histone deacetylase 3 3.8 
451524 AK001466 Hs.26516 hypothetical protein FLJ10604 3.8

418821 AA436002 Hs.183161 ESTs 3.8 

422049 W25760 Hs.77631 glycine cleavage system protein H (amino 3.8 

453160 AI263307 Hs.239884 H2B histone family, member L 3.8 

5 416072 AL1 10370 Hs.79000 growth associated protein 43 3.7 

424906 A1566086 Hs.153716 Homo sapiens mRNA for Hmob33 protein, 3 3.7 

446554 AA1 51730 Hs.301789 nudix (nucleoside diphosphate finked moi 3.7 

449523 NM_000579 Hs.54443 chemok'me (C-C motif) receptor 5 3.7 

452259 AA317439 Hs.28707 signal sequence receptor, gamma {translo 3.7 
10 417348 A1940507 Hs.318526 hypothetical protein FU12661 3.7 

430289 AK001952 Hs.238039 hypothetical protein FU1 1090 3.7 

435401 R44477 Hs.10056 hypothetical protein FU 14621 3.7 

448807 A1571940 HsJ549 ESTs 3.7 

413403 M129105 Hs.198425 ESTs, Moderately similar to S65657 alpha 3.7 

15 418323 NM.002118 Hs.1162 mapr histocompatibility complex, class 3.7 

449125 AI671439 Hs.196029 Homo sapiens mRNA tor KIAA1 657 protein, 3.7 

408089 H59799 Hs.42644 thtoredoxin-fike 3.7 

428242 H55709 Hs.2250 leukemia inhibitory factor (cholinergic 3.7 

^_ 439209 AF087993 Hs.91954 ESTs 3.6 

20 434927 H46612 Hs.293815 Homo sapiens HSPC285 mRNA, partial cds 3.6 

431735 AW977724 Hs.75968 thymosin, beta 4, X chromosome 3.6 

456619 AV647917 Hs.107153 inhibitor of growth family, member 1-lik 3.6 

45B082 AW978811 Hs.314451 ESTs, Weakly similar to ALU 1 .HUMAN ALU S 3.6 

411213 AA676939 Hs.69285 neuropDrnl 3.6 

25 414709 AA704703 Hs.77031 Sp2 transcription factor 3.6 

415339 NM_015156 Hs.78398 KIAA0071 protein 3.6 

418203 X54942 Hs.83756 CDC28 protein kinase 2 3.6 

427283 AL1 19796 Hs. 174 185 edonudeotidepyrophosphatase/phosphodi 3.6 

452260 AA453208 RAB9, member RAS oncogene family 3.6 
30 439506 'AI361238 Hs.41136 ESTs 3.6 

. 420077 AW512260 Hs.87767 ESTs 3.6 

451009 AA013140 . Hs.1 15707 ESTs 3.6 

435473 AI193122 Hs.132275 ESTs 3.6 

n _ 413198 AW157712 Hs.47534 ESTs, Weakly similar to I38022 hypotheli 3.6 

35 443067 AI077389 Hs.269818 ESTs, Weakly similar to Z195_HUMAN ZINC 3.6 

452281 T93500 Hs.28792 Homo sapiens cDNAFU 11 041 fis, clone PL 3.6 

452561 AI692181 Hs.49169 WAA1634 protein 3.6 

433269 AI343543 Hs.126890 ESTs 3.6 

445817 NM 003642 Hs.13340 histone aceryttransf erase 1 3.6 

40 447033 AI357412 Hs.157601 ESTs 3.6 

414171 AA360328 Hs.865 RAP1A, member of RAS oncogene family 3.6 

427138 1*77624 Hs.173717 phosphatidic acid phosphatase type 2B 3.6

442991 BE281238 Hs.8886 hypothetical protein FU20424 3.6 

434974 AA778711 Hs.4310 eukaryotic translation initiation factor 3.5 

45 443399 AI452659 Hs.61153 proteasome (prosome, macropain) 26S subu 3.5 

401807 C700l350:gi|6578126|gblAAF17706.1|AF0496 3.5 

413979 BE218090 Hs.279453 ESTs " 3.5 

414680 AA743331 hemoglobin, alpha 2 3.5 

417380 T06809 Hs.332086 ESTs 3.5 

50 425710 AF030880 Hs.159275 sohrte carrier family, member 4 3.5 

441247 AW118681 Hs.128051 Homo sapiens thymic stromal lymphopoleti 3.5 

445183 AB007877 Hs.12385 KIAA0417 gene product 3.5 

426931 NWL003416 Hs.2076 zinc finger protein 7 (KOX 4, done HF.1 3.5 

417873 BE266659 Hs.293659 Homo sapiens, Similar to R1KEN cONA A430 3.5 

55 419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 3.5 

446416 AV658299 Hs.163959 ESTs 3.5 

449008 AW578003 Hs.22826 tropomodunn 3 (ubiquitous) 3.5 

456719 Z43784 ankyrin 3, node of Ranvier (ankyrin G) 3.5 

432101 AI918950 Hs.123642 EphA3 3.5 

60 453565 BE298808 Hs.33363 DKFZP434N093 protein 3.5 

449217 AA278536 Hs.23262 ribonudease, RNase A family, k6 3.5 

412922 M60721 Hs.74870 H2.0 {OrosophHa)4ike homeo box 1 3.5 

450066 H56499 Hs.252692 ESTs, Weakly similar to 138022 hypotheti 3.5 

429302 AU076674 Hs.198899 eukaryotic translation initiation factor 3.5 

65 451246 AW189232 Hs.39140 cutaneous T-ceii lymphoma tumor antigen 3.5 

415528 R17236 gbyg12c04.r1 So ares infant brain 1NIB H 3.5 

435513 AW404075 Hs.42785 DC11 protein 3.5 

440643 M78878 Hs.7314 KIAA0614 protein 3.5 

409068 AW236991 Hs.102495 ESTs, Moderately similar to 138022 hypot 3.5 

70 439132 H72B03 Hs.38363 ESTs 3.5 

451173 AI765082 Hs.48317 ESTs 3.5 

422673 N59027 gfcyv59d11.r1 Soares fetal fiver spleen 3.4 

401597 AA172106 Rag C protein 3.4 

401744 Target Exon 3.4 

75 432723 D29677 Hs.3085 KIAA0054 gene product; Helicase 3.4 

449465 NM.004380 Hs^3598 CREB binding protein (Rubinstein-Taybi s 3.4 

450628 AW382884 Hs^04715 ESTs 3.4 

432125 AW972667 Homo sapiens cDNA FLJ12300 fis, clone MA 3.4

435411 AW444619 Hs.138211 ESTs 3.4

434629 AA789081 Hs.4029 glioma-amplified sequence-41 3.4

437158 AW090198 KIAA1 150 protein 3.4 

416375 H95567 Hs.124700 ESTs 3.4 

419441 AW023731 Hs.274368 MSTP032 protein 3.4 



432741 
439192 
440474 
4284B6 
447397 
426828 
418293 
424779 
415015 
425900 
408113 
452852 
414671 



444480 
403389 
440749 
408374 
410494 



433505 
400277 
411031 
417295 
438459 
432169 
437872 
443912 
445270 
425838 
414565 
418700 
428186 
436499 
448164 
426848 
439653 
450094 
402474 
424800 
430280 
408968 
453548 
451367 
444445 
445472 
400658 
408101 
410366 
411048 
421246 
422763 
426227 
439047 
444451 
420568 
. 406973 
402802 
431304 
406247 
411653 
412639 
419168 
433561 
436207 
438461 
455473 
441683 
417386 
427286 
442355 
422473 
407816 



AI732358 

AW970536 

AI207936 

AF151063 

BE247676 

NMJJ00020 

AI224483 

AL046851 

NMJXJ2109 

AK000166 

T82427 . 

AK001972 

AA832479 

AA305688 

AI150730 * 

W22335 
AW025430 
M36564 
BE620738 
AW504027 



453864 
448944 
435129 
424894 
416959 
410185 
451149 
410531 
411954 



W37943 
AW993524 
T49300 
Y00971 
AK002015 
R37257 
AI762154 
AW955696 
AA502972 
AI963808 
AW504300 
AJ276678 
R61680 
H72531 
AW021103 
A1174947 

AUJ35588 
AA361258 
AI652236 
ALD79983 
AA923729 
AA342329 
ABX6631 

AW968504 
A1267589 
AK001742 
AW582962 
AA033699 
U67058 
AW979177 
AV650179 
F09247 
M34996 

BE157283 

AF070578 
AW961284 
AI336132 
BE540937 
AA334774 
AW075485 
AW9847B8 
BE564214 
AL037228 
AW732802 
AA456539 
U947B0 
AW500857 
AW972512 
AW0214O7 
AB014605 
AI381659 
H83520 
D28459 
BE294068 
AL047586 
AW752953 
AA652523 



Hs.34892 
Hs.43148 
Hs.35304 
Hs.2910 
Hs.5887 



Hs.1 851 1 8 ESTs, Moderately similar to A374 13 calbi 
Hs.105413 ESTs 

Hs.7195 gamma-aminobutyric acid (GABA) A recepto 

Hs.1 84456 hypothelical protein 

Hs.18442 E-1 enzyme 

Hs.1 72670 activin A receptor type ll-like 1 

Hs.16063 hypolheOcal protein RJ21877 

Hs.153053 C037an6gen 

Hs.77798 hislidyURNA synthetase 

Hs.288809 hypothetical protein FU201 59 

Hs.194101 Homo sapiens cDNA: FU20869 lis, done A 

Hs.30822 hypothetical protein FU 1 1 1 1 0 

Hs.72080 ESTs 

Hs.267695 UOP-Gal:betaGteNAc beta l^galactosyltr 
Hs.265540 ESTs, Moderately similar to A47582 B-cei 

C3001393*:gi|3327090|dbi|BAA31613.1!(AB 
Hs.7392 hypothelical protein MGC3199 
Hs.155591 forkhead box F1 
Hs.64016 protein S (alpha) 
Hs.173125 peptidylprolyt Isomerase F (cyclophilin 
Hs.15301 Homo sapiens cDNA FU1 2596 lis. done NT 
Eos Control 
KIAA1323 protein 
ESTs 

Homo sapiens cDNA RJ13655 lis, done PL 
phosphoribosyl pyrophosphate synthetase 
RNA binding motil protein 7 
Hs.184780 ESTs 

Hs.54982 Homo sapiens cDNA FU 1401 4 lis, done HE 
Hs.90960 ESTs 

Hs.1 83390 hypothetical protein FU 13590 
Hs.86970 ESTs, Moderately similar to ALU5_HUMAN A 
Hs.295605 mannosidase, alpha, class 2A, member 2 
Hs.283102 HEFlfte Protein 
Hs.26904 ESTs.ModeraterysimilartoZ195_HUMANZ 
Hs.36190 ESTs 

Hs.6631 hypothetical protein FU20373 
Hs.295789 Homo sapiens mRNA; cDNA DKFZp554D1 164 (I 

NVL004079:Homo sapiens cathepsin S ICTSS 
Hs.153203 MyoO family inhibitor 
Hs.237868 inlerteukin 7 receptor 
Hs.49376 hypothetical protein FU20644 
Hs.1 16774 integrln, alpha 1 
Hs.26322 cell cycle related kinase 
Hs.1 1 5920 Homo sapiens cDNA FU 22816 Its, done K 
Hs.1 2784 Homo sapiens mRNA for KIAA0293 gene, par 

ENSP00000237081*:K1AA1217 PROTEIN (FRAGM 
Hs.123073 COC2-related protein kinase 7 
Hs.302689 hypothelical protein 
Hs.67991 hypothetical protein DKFZp434G0522 
Hs.102897 CGM7 protein 

Hs.83938 ESTs. Moderately similar to MASJLHUMAN M 
Hs.1 54299 Human proteinase activated receptor-2 mR 

gb:EST391287 MAGE resequences, MAGP Homo 
Hs.282431 ESTs 
Hs.247735 protocadherin alpha 10 
Hs.1 98253 major histocompatibility complex, class 

NM 001397:Homo sapiens endothelin conver 
gb:RC4-HTO373-13020tW11-aO3 HT0373 Homo 
Target Exon 

Hs.71 168 Homo sapiens clone 24674 mRNA sequence 
Hs.296235 ESTs Lpr 
Hs.33718 Homo sapiens cONA FU12641 lis, done NT 
Hs.20104 ' hypothetical protein FU00052 
Hs.1 2845 hypothelical protein MGC13159 
Hs .286049 phosphoserlne aminotransferase 

gb:RC1-HN0015-120400-021-c07 HN0015 Homo 
Hs.102946 ESTs 
Hs.82043 D 1 23 gene prod ud 
Hs2132 epidermal growth factor receptor pathway 
Hs.8262 lysosomal-assodated membrane protein 2 
Hs.1 17242 meningioma expressed antigen 6 (coiled-c 
Hs.401 37 anaphase-promoting complex 1; meiotic ch 
sin3*ssociated polypeptide, 30kO 
hypothetical protein 
atrophin-1 interacting protein 1; activi 
Hs!267086 ESTs 
Hs.1 53678 reproductions 

Hs.80612 ubiquitirvconjugatmg enzyme E2A(RAD6h 
Hs.737 immediate early protein 
Hs.10283 RNA binding motif protein 8B 

gb:(M>-CT0224-261099-03Sfl02 CT0224 Homo 
Hs.269496 ESTs, Weakly similar to 138022 hypotheli 
423748 AI149048 Hs.30211 hypothetical protein FLJ22313 3.2

444489 AI151010 Hs.157774 ESTs 3.2 

453124 AI139058 Hs.125790 leucine-rich repeat-containing 2 3.2 

452413 AW082633 Hs.212715 ESTs 3.1 

5 409390 AI927212 Hs.3734 ESTs 3.1 

453078 AF053551 Hs.31584 metaxin2 3.1 

453024 AW846787 gb:QV3-CT0194-181099-004-e01 CT0194 Homo 3.1 

439444 A1277652 Hs.54578 ESTs, Weakly similar to 138022 hypotheli 3.1 

441124 T97717 Hs.119563 ESTs 3.1 

10 416128 AA173632 CDC14 (cell division cycle 14, S. cerevi 3.1 

445101 T75202 Hs.12314 Homo sapiens mRNA; cDNA DKFZp586C101 9 (f 3.1 

407949 W21B74 Hs.247057 ESTs, Weakly similar to 2109260A B cell 3.1 

413492 087470 Hs.75400 WAA0280 protein 3.1 

421044 AF061871 Ks.101302 Human DNA sequence from clone RP1 -2380 15 3.1 

15 422603 BE242587 Hs.1 18651 hematopoietically expressed homeobox 3.1 

427399 NM 014883 Hs.177684 KIAA09 14 gene product 3.1 

434804 AA649530 Hs.348148 gb:ns44K)55l NQ^CGAP.AIvl Homo sapiens 3.1 

442906 AW296888 Hs.170939 ESTs 3.1 

445840 AI277811 Hs.146291 ESTs 3.1 

20 446084 AL13765B Hs.13768 Homo sapiens mRNA; cONA DKFZp434l 1216 (f 3.1 

435293 AI040777 Hs.1 17170 ESTs 3.1 

400750 Target Exon 3.1 

416B52 AF283776 Hs.80285 Homo sapiens mRNA; cONA DKFZp5B6C1 723 (f 3.1 

423566 AW976434 Hs.3623 hypothetical protein FU 11220 3.1 

25 409558 BE253407 Hs.72363 Homo sapiens mRNA for FU001 16 protein, 3.1 

4001 94 NM__003763:Homo sapiens syntaxin 16 (STX1 3.1 

400479 Target Exon 3.1 

402895 ENSP00000252117:Myeiintranscriptionfac 3.1 

403423 Target Exon 3.1 

30 408161 AW952912 Hs.300383 hypothetical protein MGC3032 3.1 

410886 AW809324 gb:MR4-ST01 21-1 41099-01 0-G06J ST01 21 Ho 3.1 

414922 D00723 Hs.77631 glycine cleavage system protein H (amino 3.1 

420121 AW968271 Hs.191534 ESTs, Weakly similar to ALU INHUMAN ALU S 3.1 • 

„ 430519 AF1 29534 Hs.49210 F-box only protein 4 3.1 

35 447147 AA910353 Hs.292815 ESTs, Weakly similar to T23482 hypotheli 3.1 

422481 AL050163 Hs.117339 DNAX-acthration protein 10 3.1 

430487 087742 Hs.241552 WAA0268 protein 3.1 

409549 AB029015 Hs.54886 phosphoBpase C, epsilon 2 3.1 

423554 M90516 Hs.1674 ghjtamire-frtK^ose-6-phosphatetrarisamin 3.1 

40 431286 AW263476 Hs.44268 myeTm gene expression factor 2 3.1 

430226 BE245562 Hs.2551 adrenergic beta-2-, receptor, surface 3.1 

408051 AI623351 Hs.172148 ESTs 3.1 

420090 AA220238 Hs.94986 ribonuctease P (38kD) 3.1 

400296 AA305627 Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1 

45 414279 AW021691 GCN5 {general control of amino-acid synt 3.1 

440426 AI159800 Hs.7181 Homo sapiens cDNA FLJ13663 fis, clone PL 3.1

423872 AB020316 Hs.134015 uronyl 2-sulfotransferase 3.1

420154 AI093155 Hs.95420 JM27 protein 3.1

447560 AF065214 Hs.18858 phospholipase A2, group IVC (cytosolic, 3.1

453293 AA382267 Hs.10653 ESTs 3.1

434398 AA1 21098 Hs.3838 serum-inducible kinase 3.0 

414135 NMJM4419 Hs.2128 dual specificity phosphatase 5 3.0 

441527 W19504 Hs.7884 solute carrier family 21 (organic anion 3.0 

423957 AW978309 Hs.136235 Homo sapiens cDNA FLJ13542 fis, clone PL 3.0

55 444001 A1095087 Hs.152299 ESTs, Moderately similar to S55657 alpha 3.0 

411315 AW836547 gb:P^T0032-O3010r>00^e08LT0032Homo 10 

414178 AW957372 Hs.46791 ESTs, WeaMy similar to 138022 hypothec 3.0 

441401 AI824338 Hs.126891 ESTs 3.0

457718 F18572 Hs.22978 ESTs, Weakly similar to ALU4_HUMAN ALU S 3.0

429500 X78565 Hs.289114 hexabrachion (tenascin C, cytotactin) 3.0

424240 AB023185 Hs.143535 calcium/caJmoounfrdeperuJerrtprot^ 10 

447476 BE293466 Hs.20880 ESTs, Weakly similar to 138022 hypotheli 10 

419440 AB020689 Hs.90419 WAA0882 protein 3.0 

421786 AI188653 Hs.21351 ESTs 3.0

433285 AW975944 Hs.237396 ESTs 3.0

409997 AI906055 Hs.57749 synaptic nuclei expressed gene 2; WAA10 3.0 

417874 BE616160 Hs.82829 protein tyrosine phosphatase, norwecept 3.0 

420956 AA351584 Hs.100543 Homo sapiens clone 24505 mRNA sequence 3.0 

445345 AW003850 Hs.1 2532 chromosome 1 open reading frame 21 3.0 

70 450671 AI356967 Hs.43086 ESTs, WeaWy similar to A46010 X-linked 10 

427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 10 

434953 BE049102 Hs.179982 ESTs, Weakly simflar to TRHY_HUMAN TR1CH 10 

436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.0 

428130 AW444985 Hs.77603 ESTs 3.0

414839 X63692 Hs.77462 DNA (cytosine-5-)-methyltransferase 1 3.0

445784 AI253155 Hs.146065 ESTs 3.0 

446677 A1800311 Hs.156291 ESTs 3.0 

426448 R06054 gb^e69g07j1 Soares fetal Bver spleen 10 

445921 AW015211 Hs.146181 ESTs 10 

80 406673 M34996 Hs.198253 major histocompatibility complex, class 3.0 

430387 AW372884 Hs.240770 nuclear cap binding protein subunH 2. 2 10 

427157 U51166 Hs.173824 thymine-DNA glycosylase 3.0

435907 AW304025 Hs.258397 ESTs 3.0 
451102 AA015683 Hs.41185 Homo sapiens mRNA; cDNA DKFZp564O1262 (f 3.0

416882 AI633044 byptophanyl tRNA synthetase 2 (mitochon 3.0 

414052 AW578849 Hs.283552 ESTs. Weakly similar to unnamed protein 3.0 

450580 ' N40087 ESTs 3.0 



TABLE 62B 



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers



Pkey CAT Number Accession
410531 1207200_1 AW752953 H88044 BE156092
410886 1225822_1 AW809324 BE144977 BE144956
411315 1238570_1 AW836547 AW836513 AW836587
411962 126744_1 AA099050 AA099526 T47733
414279 143227_1 AW021691 AI537404 R45431 AI333439 AI741845 AI674468 R44190 R52535 R52617 AI220925 AI979148 AI744688 AW242437 AA618148 AI983837 AA399623 AI676204 AI420077 N24944 051042 AA282786 AA137264 AW236107 AW769
414680 147525_1 AA743331 AA837388 AW664540 AA775711 AA150965 BE465475 AA968994 AA886905 AI141054 AW194991 AA252147 AA319500 AI184288 AA708749 AA644620 AA652769 AA242975 AA151074 T19890
415528 1539409_1 R17236 R52580 F11642
416128 157163_1 AA173632 AI174858 AA581361 AI700024 M173988 BE165417 AI366964
416882 162718_1 AI633044 AW016212 AW241143 AA769058 R43272 AW068958 AA210918 AA293774 AI748815 AI763294 AI333114 AI277384 AI088297 AI468477 AI824624 AW189606 AI631751 240749 AI984673 AI671316 AA189024 AW235412 BE17B4
422673 219674_1 N59027 AA314694 N53937 R08100
426448 267323_1 R06054 AA378789 AW956453
430935 325772_1 AW072916 AI184913 AA489195 AW466994 AW469044 N59350 AI819642 AI280239 AI220572 AA789302 AI473611 AW841126 D60937
430968 326269_1 AW972830 AA527647 AA489820 AA570352
431304 331286_1 BE157283 BE157287 AA502438
431676 336411_1 AI685464 AW971336 AA513587 AA525142
432125 341776_1 AW972667 AA526539 AI057032 AW167842
432363 345469_1 AA534489 AW970240 AW970323
437158 43392_5 AW090198 AW173544 AW439860 AW007307 AI762577 W86516 M160485 AA974203 AI589521 AW451857 AW450602 AI702529 AA630766 AI801808 AW611634 AI393606 AW235356 AW000736 AW468599 AI582546 AA962057 AA523012 AW51
437666 44433_2 AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 AA837481 AW468444 BE185091 AW468002 AA687333 AA811830 AA581806 AI866686 AI572124 AA04
439047 468139_1 AW979177 AA846994 AA829672
439518 47334_1 W76326 AF086341 W72300
450506 836_1 NM_004460 U09278 U76833 AW630055 AW471133 C02434 W45237 AW793518 BE070112 AI587479 AI624429 AW190535 AI446661 AI478772 AW022667 AA528235 AA599775 AW613820 AI435793 AW594230 AI051768 AI200109 AI680296
450580 83929_1 N40087 H12925 AA460779 M096372 BE160847 AW816890 AW816893 AW816577 AW816891 AW816882 AW816868 AW816889 AW816940 AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA010295 BE160823 Z25353 AA150883 AW8
452260 9074_1 AA453208 NM_004251 U44103 AI671547 N57463 AW007521 AI479232 T82809 R67109 AW795099 AI833027 AI765395 AI763029 AI758228 BE326331 AI934576 AI922378 AW276431 AI718466 N36566 AA904753 BE464245 AI338752 Al
453024 944876_28 AW846787 AW903370 AW903378 AW752032 AW839052 AW839051 AW839054 AW839058 AW903374 AW839045 AW903373 AW903352 AW839043 AW839049 AW903372 AW846755 AW846767 AW903368 AW846766 BE146826 AW839056 AW846802 AW
455473 1292917_1 AW984788 AW984816 AW984811 AW984807 AW984819 AW984790 AW984782 AW984784 AW984780 AW984814 AW984795 AW984793 AW984789 AW984823 AW948021 AW984802 AW984800 AW984799 AW984825 AW984792 AW984821 AW984820 AW
456719 222707_1 Z43784 R13382 AW572911 AA449369 H17037 R19603 AI632565 AW004030 BE502530 Z25032 AA805324 AA449241 AI651825 AI264863 AW196918 AA948267 AI953735 AI263703 AA319159 AW964436 AI903440 AW594171 AI867447 AW2



TABLE 62C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
400479 8439786 Minus 115386-116348
400658 8118459 Minus 73525-73644
400750 8119067 Plus 198991-199168,199316-199548
401197 9719705 Plus 176341-176452
401519 6649315 Plus 157315-157950
401597 3293210 Plus 65838-66031
401744 2576349 Plus 14595-14751
401807 7331536 Plus 152325-152912
402474 7547175 Minus 53526-53628,55755-55920,57530-57757
402802 3287156 Minus 53242-53432
402895 9967547 Plus 85537-85671,86379-86469
403100 8954402 Minus 13683-13874
403389 9438331 Minus 163415-163634
403423 7105492 Plus 69348-69615
403851 7708872 Plus 22733-23007
404390 8887030 Minus 39624-40072
405141 8980911 Plus 99861-100054
405934 6758795 Plus 159913-160605
406038 8389537 Plus 37764-37877
406247 7417725 Minus 46234-46461
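
The Nt_position entries of Table 62C list one or more start-end ranges separated by commas. A minimal sketch of how such an entry might be read is given here for illustration only. The function names are hypothetical, and treating both end coordinates as inclusive is an assumption rather than something the table states.

```python
def parse_nt_position(field):
    """Split an Nt_position entry such as '85537-85671,86379-86469' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"range end precedes start: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum the lengths of the ranges, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Entry for Pkey 400750 as listed in Table 62C.
exons_400750 = parse_nt_position("198991-199168,199316-199548")
print(exons_400750, total_exon_length(exons_400750))
```

Keeping the ranges as integer pairs makes it straightforward to compare predicted exons from the same Ref sequence and strand.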
Table 63A lists about 366 genes significantly down-regulated in prostate cancer compared to normal prostate. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" normal prostate to "average" prostate cancer tissues was greater than or equal to 2.5. The "average" normal prostate level was set to
the 75th percentile of normal prostate tissues. The "average" prostate cancer level was set to the 75th percentile amongst tumor samples. In order to remove gene-specific
background levels of non-specific hybridization, the 10th percentile value amongst all the tissues was subtracted from both the numerator and the denominator before the ratio was
evaluated.
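
The selection rule described above can be illustrated with a short sketch. The Python below is only an example under stated assumptions: the function names, the linear-interpolation percentile, the sample intensities, and the handling of a non-positive denominator (no ratio is reported) are illustrative choices and are not taken from the specification.

```python
def percentile(values, pct):
    """Return the pct-th percentile (0-100) by linear interpolation (an assumed method)."""
    data = sorted(values)
    if not data:
        raise ValueError("percentile() needs at least one value")
    rank = (len(data) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(data) - 1)
    return data[lower] + (data[upper] - data[lower]) * (rank - lower)


def normal_to_tumor_ratio(normal, tumor, all_tissues):
    """Background-subtracted ratio of 75th-percentile normal to 75th-percentile tumor signal."""
    background = percentile(all_tissues, 10)  # 10th percentile over all tissues
    numerator = percentile(normal, 75) - background
    denominator = percentile(tumor, 75) - background
    if denominator <= 0:
        return None  # assumption: no ratio is reported when tumor signal is at background
    return numerator / denominator


def is_down_regulated(normal, tumor, all_tissues, threshold=2.5):
    """Apply the ratio >= 2.5 cutoff stated in the text."""
    ratio = normal_to_tumor_ratio(normal, tumor, all_tissues)
    return ratio is not None and ratio >= threshold


# Hypothetical intensities for one probeset (not data from the tables).
normal_samples = [100, 200, 300, 400, 500]
tumor_samples = [20, 40, 60, 80, 100]
all_samples = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110]
ratio = normal_to_tumor_ratio(normal_samples, tumor_samples, all_samples)
print(round(ratio, 2), is_down_regulated(normal_samples, tumor_samples, all_samples))
```

Subtracting the same background value from both numerator and denominator keeps a probeset with uniformly high non-specific hybridization from appearing differentially expressed.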

TABLE 63A: ABOUT 366 GENES SIGNIFICANTLY DOWN-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal prostate tissue to prostate tumor tissue




Pkey ExAccn UnigeneID Unigene Title R1
425932 M81650 Hs.1968 semenogelin I 89.4
407245 X90568 Hs.172004 titin 35.4
425545 N98529 Hs.158295 Homo sapiens, clone MGC:12401, mRNA, com 32.2
426752 X59490 Hs.172004 titin 29.5
400440 X83957 Hs.83870 nebulin 20.7
412519 AA196241 Hs.73980 troponin T1, skeletal, slow 13.9
413776 AA090235 Hs.75535 myosin, light polypeptide 2, regulatory, 12.0
420813 X51501 Hs.99949 prolactin-induced protein 11.8
415337 Z44881 Hs.9012 ESTs, Weakly similar to S26650 DNA-bindi 10.2
407013 U35637 gb:Human nebulin mRNA, partial cds 9.1
433331 AI738815 Hs.117323 ESTs 8.9
432117 AL036195 Hs.2909 protamine 1 8.3
453863 X02544 Hs.572 orosomucoid 1 8.2
431847 AI791314 gb:ae46g12.y5 Stratagene lung carcinoma 7.4
408855 T83061 Hs.319946 Homo sapiens mRNA for KIAA1727 protein, 7.1
448059 AI459021 Hs.170425 ESTs 6.8
403612 Target Exon 6.4
405001 U58196 interleukin enhancer binding factor 1 6.2
441490 N46901 Hs.266720 ESTs 6.2
435805 AW470260 Hs.48496 ESTs 6.0
401917 AL050149 RAN binding protein 3 5.9
455649 BE065051 gb:RC1-BT0313-110500-017-c04 BT0313 Homo 5.7
450218 R02018 Hs.168640 ankylosis, progressive (mouse) homolog 5.6
404606 Target Exon 5.6
432326 AI280308 Hs.274361 amiloride-sensitive cation channel 2, ne 5.5
459708 AA776881 Hs.168612 Homo sapiens cDNA FLJ14132 fis, clone MA 5.5
440808 AK001339 Hs.7432 hypothetical protein FLJ10477 5.5
458339 AW976853 Hs.172843 ESTs 5.2
454278 AF217525 Hs.49002 Down syndrome cell adhesion molecule 5.2
417032 AA192469 Hs.271638 ESTs 5.2
434352 AF129505 Hs.86492 small muscle protein, X-linked 5.1
439175 AF086021 Hs.271113 ESTs 5.0
439749 AL389942 Hs.157752 Homo sapiens mRNA full length insert cDN 4.9
417364 N73749 Hs.222475 ESTs 4.9
400831 C11000936:gi|3746443|gb|AAC63969.1| (AF0 4.8
416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 4.8
442082 R41823 Hs.7413 ESTs 4.8
452625 AA724771 Hs.61425 ESTs 4.7
440965 AI523646 Hs.169859 ESTs 4.7
456815 NM_013348 Hs.144011 potassium inwardly-rectifying channel, su 4.7
430611 AA837120 Hs.156481 ESTs 4.6
415981 R35694 gb:yg67b04.r1 Soares infant brain 1NIBH 4.6
441040 AW449782 Hs.178803 ESTs 4.6
442764 AI762254 Hs.131122 ESTs 4.6
411426 BE141714 gb:QVO-HT0101-061099-032^HT0101 Homo 4.6
433081 Z85986 Hs.188757 Homo sapiens, clone MGC:5564, mRNA, comp 4.6
401896 Target Exon 4.5
445323 AW207282 Hs.213049 ESTs 4.5
458895 AI535663 Hs.39379 ESTs 4.5
417729 Z43798 Hs.6777 ESTs 4.5
431627 AW609720 Hs.265540 HSPC042 protein 4.5
420721 AA927802 Hs.159471 ZAP3 protein 4.4
449519 W04244 Hs.49829 ESTs 4.4
442089 AI801500 Hs.128457 ESTs 4.4
436781 AI914535 Hs.221377 ESTs 4.4
402797 Target Exon 4.4
404267 NM_004348*:Homo sapiens runt-related tra 4.3
442931 AI024376 Hs.150473 ESTs 4.3
418626 AW299508 Hs.135230 ESTs 4.3
423772 AA306637 EAP30 subunit of ELL complex 4.3
457136 AA428240 Hs.126083 ESTs 4.3
436404 AW968556 Hs.137240 Homo sapiens mRNA for partial 3'UTR, seq 4.3
458840 AI580296 Hs.174782 ESTs, Weakly similar to KIAA1437 protein 4.3
403649 Target Exon 4.2
435866 AA704538 Hs.119740 ESTs 4.2
423871 AA331906 gb:EST35805 Embryo, 8 week I Homo sapien 4.2
436190 AK001059 gb:Homo sapiens cDNA FLJ10197 fis, clone 4.2

400164 Eos Control 4.1 

435519 A1218950 Hs.125461 hypolhetical protein FU 11 539 4.1 

425094 AI955956 Hs.21417 ESTs 4.1 

5 415928 R46799 Hs.23966 ESTs 4.1 

415746 AA157670 Hs.21413 solute carrier family 12, (potassium-chl 4.1 

426563 AI028767 Hs.262603 ESTs 4.1 

453752 AL120800 gb:DKFZp762E152_r1 762 (synonym: hmeB) 4.1 

404260 Target Exon 4.0 

10 453412 AJ003290 gb:AJ003 290 Selected chromosome 21 cDNA 4.0 

401411 ENSP00000247172*:HYPOTHETICAL 126.2 kDa 4.0 

415925 H09474 Hs.202341 Homo sapiens cONA: FU 23573 fis, clone L 4.0 

402817 H24185 hypolhetical protein 4.0 

449233 BE048401 Hs.196511 ESTs 3.9 

15 435457 AA682421 Hs.59125 ESTs 3.9 

444105 AW189097 Hs.1 66597 ESTs 3.9 

430610 AI821465 Hs.188810 ESTs, Weakly similar toAW6_HUMAN ALU S 3.9 

411320 AW836646 gb:PM3-LT0032-09010(W)08^05 LT0032 Homo 3.8 

409679 BE250521 ras homolog gene family, member A 3.8 

20 415386 Z43087 gb:HSC 13A1 21 normalized infant brain cON 3.8 

446175 AL036568 Hs.291 glutamyl aminopeplidase (aminopeptidase 3.8 

459707 AA631362 Hs.120866 gb:np86b01.s1 NCLCGAPJThyl Homo sapiens 3.8 

406917 X65964 gb:H.sapiens nestin gene. 3.8 

455887 BE154173 gb:PM1-HT0340-201299-004-f12HT0340Homo 3.8 

25 443313 AI796730 Hs.55513 ESTs 3.8 

452351 AA025647 gbze85d01.r1 SoaresJetal_heart_NbHH19W 3.8 

400499 C10001858:gi|6679124|ref|NP_032759.1|ne 3.7 

411829 AW865749 gb:QV3-SN0021 -100500-1 85c03 SN0021 Homo 3.7 

408229 AW176091 gb:QVO-BT0107-250899-007-b08 BT0107 Homo 3.7 

30 427318 AF186081 Hs.175783 zinc transporter 3.7 

417620 R02530 Hs.191198 ESTs 3.7 

404660 C9000841 *:gi|12654691|gb|AAH01 185.1 |AAH0 3.7 

409144 AW341187 Hs.279714 ESTs 3.7 

43B524 AA922236 Hs.221037 ESTs 3.7 

35 459200 Y09306 Hs.3014B homeodomairHnteracting protein kinase 3 3.7 

418833 AW974899 Hs.292776 ESTs 3.7 

424887 AA348112 Hs.259461 ESTs 3.7 

404984 Target Exon 3.7 

454549 AW806910 gb:QV4-ST0023-1604QO-1 72404 ST0023 Homo 3.7 

40 426736 AA431615 Hs.130722 ESTs 3.7 

434552 AA639618 Hs.325116 rtosapjerB,doneMGC:2962 t mRNA,comp 3.6 

438040 AI222422 Hs.121846 ESTs 3.6 

443618 T82009 Hs.300700 hypothetical protein FU20727 3.6 

Ae 442097 AW015799 Hs.128474 ESTs 3.6 

45 435186 AL119470 Hs.145631 ESTs 3.5 

429853 AB020625 Hs.225949 butyrophilin-tike 3 3.6 

456521 AW373450 Hs.286212 hypothetical protein FU 11 729 3.6 

453282 AK000043 Hs.32922 hypothetical protein FU20036 3.6 

400749 NM_003105*^omo sapiens sortilitwelated 3.6 

50 450295 AI766732 Hs.210628 ESTs 3.5 

421154 AA284333 Hs.287631 Homo sapiens cDNA FU 14269 fis, done PL 3.5 

434378 AA631739 Hs.335440 EST 3.5 

421342 AA504749 gb:aa63f08.r1 NCI_CGAP_GCB1 Homo sapiens 3.5 

406704 M21665 Hs.929 myosin, heavy polypeptide 7. cardiac mus 3.5 

55 429043 A1824977 Hs.145319 ESTs 3.5 

438211 T08401 Hs.290856 ESTs, Moderately similar to ALU4JWMAN A 3.5 

40525B Target Exon 3.4 

440207 AI371978 Hs.128326 ESTs 3.4 

445045 AJ652676 Hs.147256 ESTs 3.4 

60 406177 Target Exon 3.4 

438535 L09078 gb:Homo sapiens mRNA fragment 3.4 

405735 ENSP00000252164*:WAA1578 protein (Fragm 3.4 

445797 AI253414 gb:aq1 4f 04 x\ Stanley Frontal NS pool 2 3.4 

444286 A1625304 Hs.190312 ESTs 3.4 

65 442027 AI652926 Hs.128395 ESTs 3.4 

447889 AW469180 Hs.346398 ESTs 3.4 

405152 Target Exon 3.4 

402460 C100126V:gi|2695979|emb|CAA70854.1| (Y0 3.4 

448516 AW898595 gb:RC1-NN0073-260400-011-g09NN0073Homo 3.4 

70 450588 AA010319 Hs.60389 ESTs 3.4 

447600 AJ420990 ESTs 3.4 

406085 Target Exon 3.3 

417959 AW977642 Hs.291742 ESTs 3.3 

418672 L44284 Hs.12915 ESTs 3.3 

75 446593 W79572 Hs.13277 hypothetical protein FU22054 3.3 

446516 AW996692 Hs.257557 ESTs, Weakly similar to I38022 hypotheti 3.3 

415896 H08311 Hs. 14822 ESTs, Weakly similar to I78885 serine/th 3.3 

410140 AL134435 Hs.247837 neurexin3 3.3 

orx 458539 AI733837 Hs.145661 ESTs 3.3 

oO 422399 AW410380 Hs.116056 mesenchymal stem cell protein DSC43 3.3 

413381 BE090690 gb:RC1-BT0720-280300-011-g02BT0720Homo 3.3 

429422 AK001494 Hs.202596 Homo sapiens cDNA FU 10632 fis, clone NT 3.3 

428561 AW973652 Hs.283105 ESTs 3.2 
445608 AI830851 Hs.200014 ESTs, Weakly similar to ACIDIC PROLINE-R 3.2

449017 AW002425 Hs.224142 ESTs 3.2 

415908 H08623 Hs.22833 ESTs 3.2 

457733 AW974812 Hs.291971 ESTs 3.2 

5 449340 AW235786 Hs.195359 hypothetical protein MGC10954 3.2 

428134 AA421773 Hs.161008 ESTs 3.2 

434407 AW815333 gb:GV0-ST021 5-0601 00-083-g01 ST02l5Homo 3.2 

429973 AI423317 Hs.164680 ESTs 3.2 

418092 R45154 Hs.338439 ESTs 3.2 

0 411356 H45377 gb.yn99h03.r1 Soares adult brain N2b5HB5 3.1 

443366 AJ053501 Hs.278869 ESTs, Moderately similar to 21 09260A Be 3.1 

422069 AJ010063 Hs.343603 trtin-cap (telelhonin) 3.1 

420778 AW970512 gb:EST382593 MAGE resequences, MAGK Homo 3.1 

403451 Target Exon 3.1 

5 451686 AA059246 Hs.1 10293 ESTs 3.1 

423837 AW937063 Hs.275150 gb:PM3-DT0037-231299401-g11 DT0037 Homo 3.1 

422731 AL138411 gb:DKFZp434A1229_r1 434 (synonym: htes3} 3.1 

450049 AI681234 Hs.258509 EST 3.1 

428656 AB037798 Hs.188790 KIAA1 377 protein 3.1 

'O 401278 Target Exon 3.1 

418087 AA961613 Hs.127838 ESTs 3.1 

421813 BE048255 gb:tz49b05.y1 NCI_CGAP_Brn52 Homo sapien 3.1 

402490 Target Exon 3.1 

._. 448001 AW293237 Hs.202037 ESTs 3.1 

15 445316 AI219833 Hs.166767 ESTs 3.1 

405150 Target Exon 3.1 

413784 BE165819 Hs.207684 ESTs 3.1 

425198 AA352090 Hs.128003 hypothetical protein FU21 213 3.1 

433224 AB040919 Hs.210958 WAA1 486 protein 3.1 

>0 421894 AI418464 Hs.190836 ESTs 3.1 

419386 AA236867 ESTs, Weakly similar to 138022 hypotheli 3.1 

402422 Target Exon 3.1 

435849 BE305242 Hs.16098 claudin2 3.1 

, - 405422 ENSP00000216658 # :HYPOTHETICAL 1 33.5 kDa 3.0 

>D 420543 AA278221 Hs.173344 ESTs 3.0 

410627 AA181339 Hs.929 myosin, heavy polypeptide 7, cardiac mus 3.0 

409079 W87707 Hs.82065 interteukin 6 signal transducer (gp130, 3.0 

406600 TaigetExon 3.0 

iA 408025 AI692784 Hs.41767 PT0002 protein 3.0 

*0 426349 AI308855 Hs.301497 arglnyltransferase 1 3.0 

444576 AI400974 Hs.182045 ESTs 3.0

430661 AC005551 Hs.130714 Homo sapiens HSPC323 mRNA, partial cds 3.0

441335 BE222470 Hs.150825 ESTs, Weakly simflar to antigen NY-CO-33 3.0 

459647 R34107 Hs.321450 pregnancy specific beta-1-grycoprotein 1 3.0 

+5 435713 AA699313 Hs.114071 ESTs 3.0 

407275 AI364186 gb:qw34h07jc1 NCLCGAPJJ14 Homo sapiens 3-0 

456103 239430 Hs.72350 ESTs 3.0 

435200 AA670310 Hs.145903 ESTs 3.0 

449245 A1636539 Hs.224296 ESTs 3.0 

50 428132 AA421765 gb:zu24gOls1 Soares NhHMPu S1 Homo sapi 3.0 

443454 AI057494 Hs.133421 ESTs 2.9

423732 AF058056 Hs.132183 solute carrier family 16 (monocarboxyRc 19 

411944 AW877139 gb:QV2-PT0010-16040Q-133-gQ1 PTOOIOHomo 2.9 

432718 AA563943 Hs.244371 ESTs 19 

JJ 435534 AA830927 Hs.117306 ESTs 19 

430348 AA476915 Hs.189225 ESTs, Weakly simflar to I3B022 hypotheli 19 

410289 AW901618 Hs.61935 Homo sapiens mRNA; cDNAOKFZp761 1071 (fr 19 

453126 AA032155 Hs.61622 ESTs 2.9 

s*f\ 459421 AA233154 Hs.12532 chromosome 1 open reading frame 21 2.9 

□O 441543 AI733014 Hs.269715 ESTs 2.9 

442252 A1733395 Hs.129124 ESTs 19 

419254 AI469453 Hs.222760 ESTs 19 

443727 225389 Hs.18459 ESTs 19 

re 419896 299362 gb:HSZ99362 DKF2phamy1 Homo sapiens cDNA 19 

65 453898 AW003512 Hs.232770 arachidonate lipoxygenase 3 19 

415098 D59687 gb:HUM056E 1 0B Ctontech human fetal brain 19 

415131 D61119 gb:HUM158C11BCtontechrrumanfetalbrain 19 

406398 Target Exon 19 

„ 433942 AW272166 Hs.123465 ESTs 19 

70 400461 Target Exon 19 

442640 AI205646 Hs.147220 ESTs 19 

438814 AA826278 Hs.249596 ESTs 19 

438361 AA805666 Hs.146217 Homo sapiens cDNA: FU23077 lis, clone L 19 

__ 422482 AI439905 Hs.344476 gb:057g08.x1 NCI_CGAPJ.ym1 2 Homo sapien 18 

75 407142 AA412535 gb:zt99b1 &s1 Soares _testis_NHT Homo sap 18 

454319 AW247736 Hs.101617 ESTs, Weakly simflar to T32527 hypotheti 18 

442907 AI023763 Hs.79707 ESTs 18 

456075 N73442 gb:yz31h09.r1 SoaresjnulQple.scferosis, 18 

c 438690 AA815031 Hs.123598 ESTs 18 

OO 452570 AW861293 Hs.336940 Homo sapiens cDNA: FU20861 fis, clone A 2.8 

408493 BE206854 Hs.46039 phosphoglycerate mutase 2 (muscle) 18 

447556 AI885187 Hs.210710 ESTs, Moderately simflar to ALU6 HUMAN A 18 

451632 BE005934 Hs.310625 EST 18 
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5 




426481 
401656 
407269 
435754 
433565 
451004 
425223 
405770 
456227 
454445 
419494 
427639 
428079 
406337 
401884 
406881 
428986 
450044 
403630 
445514 
446362 
432492 
430889 
434316 
413155 
433329 
446523 
449923 
453826 
405678 
432789 
455791 
449109 
436255 
415984 
402844 
456666 
414620 
404979 
412318 
424361 
412542 
441975 
457021 
457861 
439204 
415642 
446847 
443359 
447336 
449045 
422185 
436030 
4495B9 
436092 
415054 
412908 
409583 
410483 
435383 
426629 
415831 
412281 
434898 
422229 
447518 
458546 
438648 
450399 
420833 
453903 
443650 
427419 
423741 
451577 
441358 
402706 
436054 
402749 
442472 
445762 
441420 
405564 



AW963941 

AJ245210 
AA700752 
AA599763 
AA044967 
AA352825 

T84239 

AW749432 

W01060 

AW444530 

AA421020 



D16154 

AA442884 

R66444 

AI241280 

AW612481 

AW275110 

U22491 

AW411330 

BE067952 

AF015041 

NM.003063 

BE258051 

AL138129 

D26361 

BE090689 

AW270992 

F01143 

R19046 

AA452512 
AA150120 

AW936911 

AK001551 

AW961516 

AW173248 

AW968934 

AW450205 

AF087987 

U19878 

T51454 

AI792583 

AW139383 

BE072483 

AL1 17530 

R02287 

AW752437 

AI345995 

AI733907 

AA121913 

AW440117 

BE163567 

R61083 

AI203933 

H15145 

AI810054 

AW500458 

AF134414 

T80061 

AI215667 

M813125 

AW511049 

R47948 

AW299606 

A1698330 

NM_000200 

AA330362 

N69101 

AW173212 



H&117341 
Hs.1 12520 



Hs.146065 

Hs.189788 

Hs.34382 
Hs.105362 
Hs.208919 



Hs.201201 
Hs.51891 

Hs.148906 
Hs.104105 
Hs.271106 
Hs.248117 
Hs.1 18796 

Hs.199291 
Hs.334629 



Hs.3104 

Hs.120949 
Hs.284284 
Hs.5010 

Hs.76719 
Hs.244621 

Hs.326729 

Hs.145944 

Hs.95097 

Hs.344285 

Hs.173108 

Hs.305890 

Hs.42696 

Hs.336224 

Hs.82845 

Hs.135354 

Hs.245437 

Hs.278337 

Hs.1 12822 

Hs.121052 

Hs.135258 

Hs.127383 

Hs^93896 
Hs.256879 



Hs.97142 
Hs.30509 
Hs.14119 
Hs.29956 
Hs.1 13271 

Hs.175044 
Hs.146335 
Hs.202007 
Hs.188732 
Hs.232777 
Hs.151444 
Hs.177888 

Hs.40730 
Hs.129041 



AI076262 Hs.1 19813 



AW806859 

AI734002 

AA932872 



Hs.264590 



gb:EST376014 MAGE resequences, MAGH Homo 2.8 

Targel Exon 2.8 

gb:Homo sapiens mRNA for immunoglobulin 2.8 

ESTs 2.8 

ESTs 2.8 

gb2l53d09.r1 Scares retina N2MHR Homo 2.8 

gb:EST60880 Activated T-celb XX Homo sa £8 

NM_002362Homo sapiens melanoma antigen, 2.8 

ESTs 28 

gb:RC3-BT0385-301299-01 1-a09 BT0386 Homo 2.8 

ESTs 18 

Homo sapiens, clone MGC: 1 8257, mRNA, com 2.8 

ESTs 2.7 

C14000021:gi|7242973|dbj|BAA92547.1j (AB 27 

Target Exon 2.7 

gb:Human gene for cytochrome P-450c11,e 27 

ESTs 2.7 

ESTs 2.7 

C3001708*:gi|4758028lref|NPJ)04360.1| co 2.7 

ESTs 2.7 

ESTs 2.7 

ESTs Z7 

G protein-coupled receptor 7 2.7 

annexin A6 2.7 

gb:CMO-BT0365-061299-122-g09 BT0365 Homo Z7 

numb (Drosophila) homolog-b'ke 2.7 

sarcolipin 2.7 

gb:601 1 1 1034F1 NIH_MGCJ6 Homo sapiens c 2.7 

gb:DKFZp547F152Lr1 547 (synonym- hfbrl) 2.7 

CX001454:gi|8393794IreqNP_058681.1| myo 17 

KIAA0042 gene product 2 7 

gb:RC1-BT0720-280300-01 1-108 BT0720 Homo 2.7 

ESTs, Weakly similar to ALU7_HUMAN ALU S 2.7 

zincfinger1111 2.7 

gb:yg21f 1 1x1 Soares infant brain 1 NIB H 27 

C1000118*:gi|9951913|reqNP_06283Z1| pr Z7 

U6 srtRNA-associated Sm-Bke protein Z7 

ribosomal protein S14 2.7 

Targel Exon 2.7 

hypotheGcal protein MGC11082 Z7 

Homo sapiens cDNA FU10689 fis, clone NT 17 

ESTs 2.7 

EST 2.7 

Homo sapiens cDNA: FU21897 fis, clone H 2.7 

BCL2-like1 2.7 

EST 2.7 

transmembrane protein with EGF-Kke and 17 

Homo sapiens cDNA: FL121 930 fis, done H 2.7 

ESTs 2J 

ESTs 2.7 

Homo sapiens cDNA FU1 1537 fis', clone HE 2.7 

DKFZP434B172 protein 2.7 

ESTs 17 

ESTs, Weakly similar to ALU7_HUMAN ALU S 2.6 

ESTs ~ 26 

gbzo86h09.y5 Stratagene ovarian cancer 26 

pregnancy-associated plasma protein-E IB 

ESTs 26 

gb:QV3-HT0460-230200-101-b08 HT0460 Homo 26 

weel (S. pombe) homolog 26 

ESTs 26 

ESTs 26 

ESTs 2.6 

KIAA0460 protein 2.6 

ABO blood group (transferase A, alpha 1- 26 

gb:yd22f08.s1 Soares fetal Twer spleen 26 

ESTs 26 

ESTs 26 

ESTs 26 

ESTs 26 

ESTs 26 

ESTs 26 

histatin 3 26 

gb:EST34074 Embryo. 12 week II Homo sapi 26 

ESTs 2.6 

ESTs 26 

Target Exon 26 

ESTs 26 

Target Exon 26 

gb:MR0-ST0O2O-O8119W)O4-c03 ST0020 Homo 26 

ESTs, Moderately similar to ALUS HUMAN A 26 

gb:oo57d07^1 NCLCGAPJ_u5 Homo sapiens 26 

Target Exon 26 
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406003 

459S84 

441597 

411280 

420509 

445060 

436260 

428717 

401716 

416628 

443B64 

440702 

456310 

451255 

455737 

445326 

408432 

445874 

441063 

455505 

453491 

455048 

456990 

444130 

422210 

424241 

451606 

400427 

410443 

455210 

429909 

441191 

413489 

448215 

419354 

447610 

450724 

414523 

414776 

447730 

434152 

412671 

456401 

404678 

408520 

411332 

448920 

429906 

433712 

438353 

446224 



AI9108S4 

AW135032 

N50617 

M83554 

AA830811 

BE172762 

T78001 

W03955 

N37059 

AA904178 

AA225522 

M020857 

BE072246 

AI220072 

AW195262 

BE326671 

AA913819 

AW970640 

AL040177 

AW852749 

NMJKM504 

AI125263 

BE269319 

AW995948 

AA018791 

AB044934 

BE062906 

AW866599 

AW977090 

AI693930 

BE1 44228 

N34740 

M62839 

AW296286 

R55428 

AU076633 

AA155598 

A1421251 

AI792665 

AW977734 

W28146 

AA225063 
AW837212 
AW408009 
AL080137 
AF090887 
BE539951 
AW450551 



Hs.346429 

Hs.203625 

Hs.80506 

Hs.1314 

Hs.282908 

Hs.292710 

Hs.133546 



Hs.36250 
Hs.148233 

Hs.90744 

Hs.344672 

Hs.1 70058 
Hs.188025 
Hs.309071 



Hs.171545 

Hs.170410 

Hs.171937 

Hs.182339 

Hs.7945 

Hs.287388 

Hs.28338 

Hs.184860 
Hs.148816 

Hs.6658 
Hs.1252 
Hs.255534 

Hs.76353 

Hs.212839 

Hs,114085 

Hs.291190 

Hs.37931 



Hs.161614 

Hs.22580 

Hs.193743 

Hs.306562 

Hs.306996 

Hs.13308 



Target Exon 2.6 

ESTs 2-6 , 

ESTs 2.6 

small nuclear ribonudeoprolein potypept 2.6 

tumor necrosis factor receptor superfami 2.6 

ESTs 2.6 

ESTs. Weakly similar to ALU5.HUMAN ALU S 26 

hypothetical protein FU21120 2.6 

C16000902^i|403440lgblAAA7316B.1| (MB17 2.6 

gbz362d04.r1 Soares fetal fiver spleen 2.6 

ESTs. Weakly similar to 138022 hypolheti 2.6 

ESTs 2.6 

gb:nc25c05.r1 NCLCGAP_Pr1 Homo sapiens 26 

ESTs 2.6 

p>:QV4-BT0536-271299-059-b12 BT0536 Homo 2.6 

ESTs 2.6 

gban67b05.x1 NCLCGAP.CML1 Homo sapiens 26 

ESTs 2.6 

ESTs 2.6 

ESTs 2.6 

gb:DKFZp434F0213_r1 434 (synonym: htes3) 2.6 

gb:PM1-CT0247-08010OO08-h09 CT0247 Homo 2.6 

HrV-1 Rev binding protein 2.6 

ESTs 2.5 

steroid dehydrogenase-tike 2.5 

Homo sapiens pyruvate dehydrogenase kina 2.5 

AIE-75 binding protein protein 2.5 

histamine H4 receptor 2.5 

KIAA1 546 protein 2.5 

gb:QV4-SN0024-210400-181-b10 SN0024 Homo 2.5 

CGI-203 protein 2.5 
ESTs 2.5 
gb:MRO-HT0165-140200-OOW04 HT0165 Homo 25 
ESTs 2.5 
apolipoprotein H {beta-2-grycoprotem I) 2.5 
ESTs 2.5 
gfcyi79b05.r1 Soares breast 2NbHBst Homo 25 
serine (or cysteine) proteinase inhibito 2.5 
hypothetical protein FU14195; WAA1714 25 
Homo sapiens mRNA for KIAA1755 protein, 25 
ESTs 25 
gb:EST389963 MAGE resequences, MAGO Homo 25 
gb:43f1 1 Human retina cONA randomly prim 25 
Target Exon 25 
ESTs 25 
gb:QV2-LT0038-14030(M)81-c01 LT0038 Homo 25 
alkytgtycerone phosphate synthase 2.5 
ESTs 2.5 
Homo sapiens done HQ0035 25 
Homo sapiens, clone IMAGE:3447073, mRNA, 25 
ESTs 25 



TABLE 63B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession
408229 1048462.1 AW176091 H24234
408432 1058667.1 AW195262 R27868 AW811262 tMnM
409879 114787.1 BE250521 AA076837 BE249870 AA984291 AW502442 AW501551 AI221491 AA194239 D63046 AA193426 AA773243 AA193293
410483 1204995.1 BE163567 BE073689 BE073747 BE073780 BE073739 BE073748 BE163495 AW750178 BE163491 BE073763 BE073671
411320 1238624.1 AW836646 AW836580 AW836610 AW836636 AW836603 AW836632
411332 1239102.1 AW837212 AW837408 AW837265 AW837380 AW837213 AW837411 AW837418
411356 1240273.1 H45377 H21137 AW838640
411426 1245515.1 BE141714 AW845993 AW845989
411829 1260309.1 AW855749 BE179419 BE179492
411944 1266482.1 AW877139 AW877135 AW877018 AW991835 AW877128 AW877108 AW877017 AW877107
413155 1351148.1 BE067952 BE067945 BE067942 BE067943 BE067949 BE067954 BE067944 BE067953 BE067956 BE067946
413381 1365950.1 BE090690 BE090688 BE090681 BE090693 BE090675
413489 1373392.1 BE144228 BE144291
415054 151827.1 AI733907 AA159708 AI732614
415098 1522174.1 D59687 D59694 D59656 D59589
415131 1523680.1 D61119 D815C8 D31734
415386 1535560.1 Z43087 F07410 H15506 H54108 R95033 H98000
415981 1564242.1 R35694 H12035 R53312
416628 1604848.1 W03955 H82332 H69247 H72486
416935 163179.1 AA190712 AA190665 AA252564
419386 184356.1 AA236867 AA237066 AA354236 AW957759 H08961
419895 1888662.1 Z99352 Z99363
420778 1.1 AW970512 AA280251 AI652287 BE466438 AI650725 AA551854 AA281574 AW571481
421342 201427.1 AA504749 AA287498
421613 207654.1 BE048255 AA313083 M298419
422731 220507.1 AL138411 AL138412 AA315860
423741 231582.1 AA330362 AW952525 H87796
423772 23188.1 AA306637 NM_007241 AF156102 BE388339 BE614027 AW935016 AA315261 AW376418 AW304951 AI859820 AA367436 AW516957 BE614519 AI453152 AI453149 AI453139 AW168378 AI139491 AI538368 AW468227 AJ680027 AW090513 AA662830 F30995 AI351985 AM24349 AW009599 C02215 AI6525
423871 232749.1 AA331906 AA332484
426481 267878.1 AW963941 AW963944 AA379825 AA379564
428132 287430.1 AA421765 AA456076 AI290275 AA455579
431847 33840.1 AI791314 AI791434 AA516511
434407 385744.1 AW815333 AW815409 AA632563
435383 405360.1 R61083 R13743 AA679174 AA679193 Z42903
436130 41555.1 AK001059 AA633055
438535 45946.1 L09078 L03145 L09094 L09098 L03165 L09102
441420 516748.1 AA932872 W28068 W28643 T96110
442472 543371.1 AW806859 AW806852 AF049582
445797 650943.1 AI253414 AI366014 R34822
447518 724787.1 T80061 AI382804
447600 728288.1 AI420990 AI399725 AI401757
448516 766241.1 AW898595 AW898588 AW898590 AW898663 AW898592 AI525093
449923 81926.1 BE258051 R45758 AA004732 BE255126
450724 844585.1 R55428 AI820704 AI732283 R54983
451004 85453.1 AA044967 H86327 AA013079 AA058776 BE242713 AA019987
452351 91233.1 AA025647 R45716 AW753786
453412 966264.1 AJ003290 AJ003288 AW276947
453491 969172.-1 AL040177
453752 979899.1 AL120800 BE378580
453826 982669.1 AL138129 AL138179 BE064231
454445 1204468.1 AW749432 AW749434 R47332
454549 1223789.1 AW806910 AW866461 AW866396 AW866373 AW866611 AW866616
455048 1250599.1 AW852749 AW852755 AW852620
455210 1260650.1 AW866599 AW866294 AW866468 AW866467
455649 1348708.1 BE065051 BE155165 BE064764 BE155231 BE064648 BE064671 BE064636
455737 1353892.1 BE072246 BE072229 BE072225 BE072210 BE072221 BE072256 BE072211 BE072242
455791 1365954.1 BE090689 BE090685 BE090697 BE090680 BE090691 BE090696 BE090698 BE090686
455887 1380836.1 BE154173 BE154098 BE154096
456075 1476756.1 N73442 R98100 BE410380
456310 177089.1 AA225522 AA225465 AI820979 AW973985 AI791935 AA558735
456401 1844649.2 W28146 W28187



TABLE 63C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position
400451 9929654 Plus 32727-32846,32929-33051
400499 9796071 Minus 148495-148806
400749 7331445 Minus 9162-9293
400831 8576271 Minus 56502-57407
401278 9799936 Plus 98428-98573
401411 7799787 Minus 144144-144329
401656 9100664 Minus 1-382
401716 6715703 Plus 174722-174911
401884 8140731 Minus 89182-90053
401896 8569194 Plus 115129-115294
401917 9502466 Plus 25054-25229
402422 9796344 Minus 32843-33008
402460 9796884 Minus 108901-109254,110246-110581,113513-113960
402490 9797648 Plus 149982-150929
402706 8894426 Minus 148640-148805
402749 9212740 Minus 68787-68882,76602-76768
402797 3421043 Minus 15758-15930
402817 6822166 Minus 48611-49012
402844 9369286 Plus 54958-55313
403451 9838240 Plus 77382-78300
403612 8469060 Minus 94723-94859
403630 8569999 Minus 13909-14466,15251-15760,16898-17431,41742-42440
403649 8705159 Minus 27141-27247
404260 9366879 Plus 51396-51513
404267 9581792 Minus 12209-12313,18241-18397
404606 9212936 Minus 22310-23269
404660 9797068 Plus 168215-168916
404678 9797204 Plus 115196-115448
404979 4160139 Minus 87762-88217
404984 6939882 Plus 87221-87505
405001 6015406 Minus 104646-104819
405150 9929758 Plus 126475-126773
405152 9965561 Minus 137662-137969
405258 7329310 Plus 129930-130076
405422 7243869 Minus 101938-102079,102261-102443,102896-103202
405564 2114222 Minus 16766-17344
405678 4079670 Plus 151821-152027
405735 9931101 Minus 29854-29976
405770 2735037 Plus 61057-62075
406003 8247800 Plus 42079-42516
406085 9123888 Plus 18665-18843
406177 7279760 Minus 18930-19148
406337 9213455 Plus 90117-90337
406398 9256276 Minus 118691-118959
406600 8248616 Minus 36296-36610


Table 64A lists about 703 genes up-regulated in BPH compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" BPH to "average" normal adult tissues was greater than or equal to 3.0. The "average" BPH level was set to the 75th percentile amongst BPH tissues.
The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization,
the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
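
The selection rule described above can be sketched in a few lines of Python. This is an illustration only, not the software used to generate the table. The function names, the linear-interpolation percentile, and the choice to raise an error when the background-corrected denominator is not positive are all assumptions that the text does not specify.

```python
from typing import Sequence


def percentile(values: Sequence[float], q: float) -> float:
    """q-th percentile (0-100) with linear interpolation (assumed method)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def bph_ratio(bph: Sequence[float], non_malignant: Sequence[float]) -> float:
    """Background-corrected ratio of 'average' BPH to 'average' normal level."""
    # Gene-specific background: 10th percentile amongst non-malignant tissues.
    background = percentile(non_malignant, 10)
    numerator = percentile(bph, 75) - background
    denominator = percentile(non_malignant, 85) - background
    if denominator <= 0:
        # Handling of this case is not described in the text (assumption).
        raise ValueError("background-corrected normal level is not positive")
    return numerator / denominator


def is_up_regulated(bph: Sequence[float], non_malignant: Sequence[float],
                    cutoff: float = 3.0) -> bool:
    """Apply the 'greater than or equal to 3.0' criterion to one probeset."""
    return bph_ratio(bph, non_malignant) >= cutoff
```

Each probeset would be evaluated independently with its own expression values across the BPH samples and the non-malignant samples.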

TABLE 64A: ABOUT 703 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

R1: Ratio of BPH


Pkey ExAccn
420154 AI093155
426747 AA535210
419526 AI821895
432441 AW292425
431616 AA508552
407202 N58172
432101 AI918950
400299 X07730
425075 AA505324
414569 AF109298
410929 H47233
400287 S39329
446057 AI420227
423073 BE252922
415989 AI267700
428336 AA503115
450693 AW450461
400298 AA032279
407168 R45175
408369 R38438
454119 BE549773
428819 AL135623
400302 N48056
400297 AI127076
418432 M14156
432674 AA641092
433444 AW975324
432240 AI694767
428342 AI739168
401424
432435 BE218886
448999 AF179274
446100 AW967109
425154 NM_001851
407275 AI364186
452340 NM_002202
432473 AI202703
410330 AW023630
431579 AW971082
452792 AB037765
418848 AI820961
400292 AA250737
433647 AA603367
453160 AI263307
409262 AK000631
431474 AL133990
429220 AW207206
428134 AA421773
408622 AA056060
456497 AW967956
434792 AA649253
433466 AA508353
439176 AI446444
428398 AI249368



Unigene ID 

Hs.95420 

Hs.171995 

Hs.193481 
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ns. lJZOOo 


CCTc 
CO IS 


4 R 
4.0 


AC4A04 

434UZ4 


A A OQ1CT7 

AAyyooz/ 


U_ •jn»iQ/v7 

nS^yoyW 


nypoineDcal protein ruz\540o 


4 e 
4.0 


44/0U3 


Awoz/yoz 


Uc 0A0A01 

ns.JUZ*lZl 


gernin4 


4 R 


438875 


AAQ27o4u 


ii_ i onncn 

Hs.189059 


ESTs 


4.6 


41QR11 

41331 l 


AA4Zy/3U 


Ue 7C111 


genera) transcription factor II IA 


4 R 


44C040 

44bz44 


QC1 CC/I 70 

be ibb4/o 


Hsu?1108 


ESTs, Weakly similar to ALU INHUMAN ALU o 


4.6 


42821 B 


AA4242oo 


IL. AltClfi 

Hs. 123642 


EphA3 


4.6 


4C0077 

43zz// 


Al A4QA11 

ALu4yUl j 


Hs^o7o3 


KIAA1223 protein 


4.6 


41AR1R 


Albb34by 


Lie 4C1740 

MS.1D1/ U 


ESTs 


4.5 


417601 


Mti mime 
NM_Ul47o3 


Hs.ozzyz 


K1AA0215 gene product 


4.5 


430697 


a a a a a im 

AA484207 


Hs211B57 


ESTs 


4.5 


430701 


Al/bUooo 


Hs^ydy/l 


ESTs 


4.5 


437252 


aI4oooo\5 


Hs.1b4159 


r(>r A _J_!1„ i— Al 114 Ltl II J A LI Ai It *> 

ESTs, Weakly similar to ALU1_HUMAN ALU S 


4.5 


4A79A4 
4U/ZU4 


OA 1 Q11 

K4lyoo 


Uc 1JIA007 

nS.14UzJ/ 


CCTc tA)«t»»lstii rlmH^r Irv At 1 t4 Ul It 1AM Al 1 1 O 

co I s, weakly similar to alui_human alu b 


4.5 


49A111 


AA49R117 
MA4Z01 I / 


Uc ICCOO 


CCTc 
COlS 


4 C 
4.3 


A1174A 
401 /4U 


M7C4CA 
N/343U 


Uc IfllyHO 

nS.10o41z 


CCTc ktnAnrnlnUi ftmWtt tr\ AC1 1R701 C7 

cols, Moaeraiery similar to Ah 110/21 0/ 


4.5 


A1Q7R9 

4oy/oz 


T7R0RR 


Uc 1JM11 


CCTc 
COlS 


4.5 


447R1K 
44/010 


mm nA7in 
r»iYi_uu/zoo 


Uc 074 10O 


TDC1 Inmal nana 1 

I roo largei gene l 


4.5 


40 IUOU 


ACA1Q1fl7 
ArUJyJU/ 


Uc 0A0171 

ns^y 1/1 


ly^uy. W_ v All 

nomeo oox aii 


4 C 
R.3 


4430/ £ 


NJWl/ 


Uc 14AQ0R 

ns.i44yzo 


CCTc 
COlS 


4 R 


43ZU33 


AIO/ /*M1 


Uc 141 coo 


nypouieticai protein Mooiuooo 


4 C 
4.3 


4100UZ 


NIV1_UU0 1 33 


Uc 70000 


not /<*hv«lran\ lil/a O 

nei (crucKeri/MiKe z 


4.4 




PRPQ11 


Uc 17R07C 
ns. 1 /DZ/3 


FCTc 
COIS 


A A 
4.4 


401OR0 
4Z300Z 


AAOZ40UO 


Uc lame 


CCTc 

co IS 


4 4 
4.4 


AIRftAI 
4JOU4Z 


AIATX3KQ71 


Uc IflACm 
nS.laUOlU 


CCTc 
COIS 


4 4 
4.4 


450070 
43Z3/0 


A AAOOOtLI 


Uc C1COO 


CCTc 
COlS 


4 4 
4.4 


410C41 
41ZD43 


A !AfQ7 1 TX} 

Awy/izoy 


Uc 1 0CX 00 

MS. 1054 Si 


ESTs 


4 4 

4.4 


417419 
4 1 /4l£ 


A1O03O 


He 0*51 19 

n5.ozi iz 


Tnlaptm it/In 1 ro/vintnr hmA f 

mieneuAin i receptor, rype i 


A A 

4.4 


445210 


H09323 


Hs.27133 


ESTs 


4.4 


416706 


AA314676 


Hs.288945 


hypothetical protein FU 13448 


4.4 


420608 


BE548277 


Hs.103104 


ESTs 


4.4 


428249 


AA130914 


Hs.183291 


zinc finger protein 268 


4.4 


442242 


AV647908 


Hs.90424 


Homo sapiens cDNA: RJ23285 fis, done H 


4.4 


444030 


AW021254 


Hs.135055 


ESTs 


4.4 


435655 


AW105663 


Hs.6947 


HSPC069 protein 


4.4 


400533 






ENSP00000209376*:PRED65 protein (Fragmen 


4.4 


435285 


AW272603 


Hs.266134 


ESTs 


4.4 


450216 


AA873345 


Hs.60226 


Homo sapiens, clone 1MAGE:3621638. mRMA, 


4.3 


432960 


AW150945 


Hs.144758 


ESTs 


4.3 


459527 


AW977556 


Hs.291735 


ESTs. Weakly similar to I78885 serinerth 


4.3 


426413 


AA377823 




gb:EST90805 Synovial sarcoma Homo sapien 


4.3 


430535 


AW96B485 




gb:EST380561 MAGE resequences, MAGJ Homo 


4.3 
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416312 


W02640 


Hs.1 6247 


ESTs. Weakly similar lo 2004399Achromos 


454159 


AW9580&5 


Hs.44143 


polybromo 1 


435176 


AA744875 


Hs.189413 


ESTs 


451061 


AW291487 


Hs.213659 


ESTs, Weakly similar to KIAA1357 protein 


407910 


AA650274 


Hs.41296 


fibronectin leucine rich transmembrane p 


420111 


AA255652 




gb:zs21n11.n Nu_CGAP_GCoi Homo sapiens 


420757 


X7B592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


459045 • 


N69101 


Hs.4O730 


ESTs 


405348 






C7001664:giJ1<2698061 |dbj|BAB21849.1| (AB 


422906 


U80773 


Hs.121580 


Human EST done 42944 manner transposon 


449603 


AI655662 


Hs.197698 


ESTs 


409705 


M37762 


Hs.56023 


brain-derived neurotrophic factor 


423782 


AI472209 


Hs.323117 


ESTs 


432887 


A1926047 


Hs.162859 


ESTs 


420905 


AA521307 


Hs.186651 


ESTs 


416814 


AW192307 


Hs.80042 


dolichyl-P-Gic:Man9Glcr4Ac2-PP-dolichylgl 


427908 


AA417272 


Hs.24122 


ESTs 


432765 


AJ003429 




gb AI003429 Selected chromosome 21 cDNA 


444246 


H932B1 


Hs.10710 


hypothetical protein FU20417 


450597 


AI701635 


Hs.207077 


ESTs 


434022 


R18374 


Hs.117956 


ESTs 


427761 


AA412205 


Hs.140996 


ESTs 


433209 


AB040907 


Hs.278436 


KIAA1474 protein 


413525 


BE145899 




gb:MRO-HT0208-221299-204-b10 HT0208 Homo 


436679 


AI127483 


1 4 Art A r A 

Hs. 120451 


ESTs, Weakly similar to unnamea proietn 


449655 


AI021987 


Hs.59970 


ESTs 


440774 


A1420611 


Hs.1 53934 


ESTs 


427521 


AW973352 


Hs.290585 


ESTs 


440594 


AW445167 


Hs.126036 


ESTs 


458912 


AI911066 




ESTs 


410790 


AW803357 




gb:IL2-UM0079-090300-050-A08 UM0079 Homo 


415245 


N59650 


Hs.27252 


ESTs 


435743 


T66861 


Hs.12962 


ESTs 


442786 


H50733 


Hs.256261 


ESTs, Moderately similar to ALU8_HUMAN A 


436714 


AA72B964 


Hs.293399 


ESTs 


423789 


AK002084 


Hs.132851 


hypothetical protein FU11222 


408177 


AI241733 


Hs.43871 


ESTs 


424200 


AA337221 




gb:EST41944 Endometrial tumor Homo sapie 



430523 AW451385 
445206 AI350199 



Hs.161954 ESTs 



452335 
438431 
449907 
430487 
448152 
448515 
423242 
412715 
400746 
413081 



AW188944 

AW207860 

AA004825 

D87742 

AI741053 

H68441 

AL039402 

NML000947 

BE064415 



446416 AV658299 
451640 AA195601 



427203 
440840 



AW629517 
AW629666 



Hs.269990 

Hs.61272 

Hs.293116 

Hs.103281 

Hs.241552 



Hs.125783 
Hs.74519 



Hs.26771 
Hs.244855 



442338 


AI761976 


Hs.156080 


446307 


T50083 


Hs.22247 


455276 


BE176479 




415861 


Z43123 


Hs.144513 


418259 


AA215404 




418818 


AA228899 


Hs.101307 


432229 


AW290976 


Hs.143587 


418310 


AA814100 


Hs.86693 


416760 


H85182 


Hs.191327 


419083 


AI479560 


Hs.98613 


421312 


AA824627 


Hs.291670 


448131 


AI675054 


Hs.200481 


439731 


AI953135 


Hs.45140 


418243 


W51873 


Hs.171857 


434072 


H70854 


Hs.283059 


434844 


AF157116 


Hs.22350 


452221 


C21322 


Hs.288057 


432697 


AW975050 


Hs.293892 


446354 


AW449650 


Hs.346335 


447072 


D61594 


Hs.17279 


417333 


AL157545 


Hs.173179 


441568 


AI733322 


Hs.127176 


441736 


AW292779 


Hs.8182 


457498 


AI732230 


Hs.1 91737 


433234 


AB040928 


Hs.65366 


434222 


AF1198B6 


Hs.283941 


44B558 


AA149121 


Hs.71947 


415068 


Z19448 


Hs.131887 


424940 


AA985308 


Hs.283902 



ESTs 
ESTs 
ESTs 
ESTs 

KIAA0268 protein 
Hs.170770 ESTs 

Hs.13528 hypothetical protein FU14054 
DEME-6 protein 
primase, polypeptide 2A (58kO) 
Target Exon 
gb:RC4-BT0311-241199-O12-b03 BT0311 Homo 
Hs.163959 ESTs 

Human DNA sequence from clone 747H23 on 
ESTs 

ESTs, WeaWy similar to S64054 hypotheti 
ESTs 
ESTs 

gb:R(^HTO585-16030O-O22-b09 HT0585 Homo 
ESTs 
ESTs 

Homo sapiens HUT11 protein mRNA, partial 
ESTs 
ESTs 

ESTs, Highly similar to KIAA1 102 protein 
Homo sapiens cDNA FU 1 2292 fis, done MA 
ESTs 
ESTs 

hypotheGca! protein FU 14084 
Homo sapiens testis protein mRNA, partia 
Homo sapiens PRO1082 mRNA, complete cds 
hypothetical protein LOC55757 
hypothetical protein FU22242 
ESTs, Weakly slmBar to ALU4JWMAN ALU S 
ESTs 

tyrosylprotein sulfotransferase 1 
brornodomain and PHD finger containing, 3 
ESTs 
ESTs 
ESTs 

WAA1495 protein 

Homo sapiens PR02591 mRNA. complete cds 
ESTs 

ESTs, WeaWy similar to T24396 hypotheti 
ESTs 
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41S2BB H51299 gb:yp07c06.s1 Soares breast 3NbHBst Homo 3.9 

420301 AA767526 Hs.22030 paired box gene 5 (B-cell lineage specif 3.9 

429066 AA868555 Hs.1 78222 ESTs 3.9 

435878 R08330 Hs.20152 ESTs 3.9 

5 446862 AV660697 Hs.282700 ESTs 3.9 

447530 AW1 92063 Hs.248865 ESTs, Moderately similar to JC5238 galac 3.9 

452619 AW298597 Hs.61884 Homo sapiens, clone IMAGE:429B026, mRNA. 3.9 

401403 Target Exon 3.9 

448779 BE042877 Hs.177135 ESTs 3.9 

10 420533 AI809510 Hs.1 18971 ESTs 3.9 

411084 T18987 Hs.125472 ESTs, Moderately similar to KIAA0877 pro 3.9 

419629 AB020695 Hs.91662 WAA088B protein 3.9 

423453 AW450737 Hs.128791 CGMJ9 protein 3.9 

434833 AF156548 Hs.192969 ESTs, Weakly simflar to AT1A_HUMAN POTEN 3.9 

15 455646 BE064420 gb:RC4-BT031 1-2411 99-01 2-C08 BT03 11 Homo 3.9 

411479 AW848047 gb:IL3-CT0214-291299452-A12CT0214 Homo 3.9 

419038 AW134924 Hs.190325 ESTs 3.9 

421129 BE439899 Hs.89271 ESTs 3.9 

„ 424332 AA338919 Hs.101615 ESTs 3.9 

20 441766 R53790 Hs.23294 hypothetical protein FU14393 3.9 

447033 AI357412 Hs.157601 ESTs 3.9 

439306 BE220199 WD40 protein Gaol 3.8 

410352 AW969725 Hs.150444 WAA0373 gene product 3.8 

407961 AW672939 Hs.41694 origin recognition complex, subunit 2 (y 3.8 

25 410252 AW821182 Hs.61418 microfibrillar-associated protein 1 3.8 

439560 BE565647 Hs.74899 hypothetical protein FU 12820 3.8 

440450 AI333129 Hs.156147 ESTs 3.8 

458611 AI268407 Hs.21145B DC-specific transmembrane protein 3.8 

419589 AW973708 Hs.201925 Homo sapiens cONA FU 13446 fis, clone PL 3.8 

30 431576 M76665 Hs.275215 hydroxysteroid (11-beta) dehydrogenase 1 3.8 

447280 BE617907 Hs.97635 ESTs 3.8 

438379 N23018 Hs.171391 C-terminal binding protein 2 3.8 

416009 Z43062 gb:HSC12E041 normalized infant brain cDN 3.8 

416534 H69043 Hs.224961 Horno sapiens cDNA FU 14366 fis, clone HE 3.8 

35 423044 AA320829 Hs.97266 protocadherln 18 3.8 

424701 NM_0O5923 Hs.1 51988 mitogen-activated protein kinase kinase 3.8 

433507 AI817336 Hs.191791 ESTs 3.8 

437718 AI927288 Hs.195779 ESTs 3.8 

An 419831 AW448930 Hs.5415 ESTs 3.8 

40 424830 AW2705B0 Hs.189311 ESTs. Weakly similar to putative p 150 [H 3.8 

426981 AL044675 Hs.173081 WAA0530 protein 3.8 

431447 AA505138 Hs.291341 ESTs 3.8 

435932 W03928 Hs.114524 ESTs 3.8 

442447 AA999723 Hs.129607 ESTs 3.8 

45 403242 Target Exon 3.8 

433908 AW298141 Hs.157975 ESTs 3.8 

452323 W44356 Hs.292812 ESTs, Weakly similar to T33468 hypothefi 3.7 

412095 AJ624707 Hs.5921 Homo sapiens cDNA:FU21592 fis, clone C 3.7 

418759 AA227879 Hs.187621 ESTs 3.7 

50 422299 AK000181 Hs.11455€ hypothetical protein FU201 74 3.7 

452462 BE173515 gb:RC2-HT0^6O-21020(M)12-«)3HT0560Homo 3.7 

423096 AA732684 Hs.278428 progestin induced protein . 3.7 

454037 AW998716 gb:PM4-BN0067-25030(W)02-f11 BN0067Homo 3.7 

428055 AA420564 Hs.101760 ESTs 3.7 

55 447785 AL041765 Hs.340375 ESTs 3.7 

451746 M86178 Hs.311258 ESTs 3.7 

453293 AA382267 Hs.10653 ESTs 3.7 

436671 AW137159 Hs.146151 ESTs 3.7 
„ 407437 AF220264 gb:Homo sapiens MOST-1 mRNA. complete cd 3.7 

60 408418 AW963897 Hs.44743 WAA1 435 protein 3.7 

420092 AA814043 Hs.88045 ESTs 3.7 

446947 AF146747 Hs.232165 polycythemia rubra vera 1; cell surface 3.7 

441865 AA384726 Hs.5722 hyjwthetical protein FU2331 6 3.7 

419875 AA853410 Hs.93557 proenkephaTin 3.7 

65 431231 AA653552 Hs.1 16532 ESTs 3.7 

418348 AI537167 Hs.96322 hypothetical protein FU23560 3.7 

419261 X07876 Hs.69791 wingless-type MMTV integration site fami 3.7 

422899 D16471 Hs.121571 Human mRNA, Xq terminal portion 3.7 
429163 AA884766 gb:am2Qa10.s1 Soares_NFLJLGBC_S1 Homos 3.7 

70 439075 AF085933 Hs.292620 ESTs 3.7 
440947 AA910403 ESTs 3.7 

404561 trichorh'tnophalangeal syndrome I gene (T 3.6 

430096 U91935 Hs.233321 Retinaderived POU-domain factor-1 3.6 

430320 BE245290 Hs.239218 uncharacterized hypothalamus protein HCD 3.6 

444794 AI419991 Hs.145225 ESTs 3.6
426991 AK001536 Homo sapiens cDNA FLJ10674 fis, clone NT 3.6
431316 AA502663 Hs.145037 ESTs 3.6
414178 AW957372 Hs.46791 ESTs, Weakly similar to I38022 hypotheti 3.6
450630 AA010429 Hs.191939 ESTs 3.6
411067 AI681006 Hs.71721 ESTs 3.6
436326 BE085236 aldo-keto reductase family 1, member B1 3.6
410268 AA316181 Hs.61635 six transmembrane epithelial antigen of 3.6
423590 AW952412 Hs.65874 ESTs, Weakly similar to A40348 FJav/Sex- 3.6
430376 AW292053 Hs.12532 chromosome 1 open reading frame 21 3.6
451193 N29850 Hs.44098 ESTs 3.5
416239 AL038450 Hs.48948 ESTs 3.6
444190 AI878918 Hs.10526 cysteine and glycine-rich protein 2 3.6
407902 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3.6
423349 AF010258 Hs.127428 homeobox A9 3.6
426917 AA913814 Hs.172854 DKFZP586B0923 protein 3.6
429430 AI381837 Hs.155335 ESTs 3.6
433563 AI732637 Hs.277901 ESTs 3.6
425465 L18964 Hs.1904 protein kinase C, iota 3.5
452498 AK000101 Hs.29700 hypothetical protein FLJ20094 3.5
401132 C12000517*:gi|4758712|ref|NP_004659.1| a 3.5
421105 AA766501 Hs.125113 ESTs 3.5
435177 AI018174 Hs.42936 ESTs 3.5
449343 AI151418 protein phosphatase 3 (formerly 2B), cat 3.5
457653 AI820719 Hs.154662 DnaJ (Hsp40) homolog, subfamily A, membe 3.5
419563 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HE 3.5
419994 AA282881 Hs.190057 ESTs 3.5
427304 AA761526 Hs.163853 ESTs 3.5
434763 AA648618 gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens 3.5
447497 AW167254 Hs.205722 ESTs 3.5
449988 AW372068 Hs.201420 ESTs, Moderately similar to ALU7_HUMAN A 3.5
422321 AA906427 Hs.181035 hypothetical protein MGC11296 3.5
418723 AA504428 Hs.10487 Homo sapiens, clone IMAGE:3954132, mRNA, 3.5
438680 AA906121 Hs.173421 ESTs 3.5
452903 AI953425 Hs.345291 ESTs, Weakly similar to I38022 hypotheti 3.5
443273 AI042063 Hs.132156 ESTs 3.5
403510 Target Exon 3.5
435681 AA694192 Hs.148979 ESTs 3.5
451722 H86374 Hs.40861 ESTs 3.5
435981 H74319 Hs.188620 ESTs 3.5
449845 AW971183 Hs.9683 DnaJ (Hsp40) homolog, subfamily C, membe 3.5
436024 AI800041 Hs.190555 ESTs 3.5
429643 AA455889 Hs.167279 FYVE-finger-containing Rab5 effector pro 3.5
443324 R44013 Hs.164225 ESTs 3.5
458332 AI000341 Hs.220491 ESTs 3.5

435688 H72286 Hs.128387 ESTs 3.4
435047 AA454985 Hs.54973 cadherin-like protein VR20 3.4
432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 3.4
409047 AW961434 Hs.31539 ESTs 3.4
404848 ENSP00000240769*:BG153O3.1 (similar to C 3.4
412494 AL133900 Hs.792 ADP-ribosylation factor domain protein 1 3.4
417435 NM_005181 Hs.82129 carbonic anhydrase III, muscle specific 3.4
430403 AF039390 Hs.241382 tumor necrosis factor (ligand) superfami 3.4
436340 R42246 Hs.21606 ESTs 3.4
441596 AA939300 Hs.206768 ESTs 3.4
442231 W02434 Hs.222413 ESTs 3.4
447124 AW976438 Hs.17428 RBP1-like protein 3.4
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 3.4
450297 AW901347 Hs.38592 hypothetical protein FLJ23342 3.4
453682 T79703 gb:yd71e08.r1 Soares fetal liver spleen 3.4
456995 T89832 Hs.170278 ESTs 3.4
444324 AI301330 Hs.143838 ESTs 3.4
400379 NM_018432 Homo sapiens ovarian cancer related prot 3.4
419964 AA811657 Hs.220913 ESTs 3.4
424026 AI798295 Hs.137576 ribosomal protein L34 pseudogene 1 3.4
427033 AI457449 Hs.192817 ESTs 3.4
453942 AW190920 Hs.19928 hypothetical protein SP329 3.4
431421 AW969118 Hs.108144 ESTs, Weakly similar to unnamed protein 3.4
408784 AW971350 Hs.63386 ESTs 3.4
420184 AA188408 Hs.95665 hypothetical protein 3.4
420721 AA927802 Hs.159471 ZAP3 protein 3.4
422728 AW937826 Hs.103262 ESTs, Weakly similar to ZN91_HUMAN ZINC 3.4
436703 AW880614 Hs.146381 RNA binding motif protein, X chromosome 3.4
429165 AW009886 Hs.118258 prostate cancer associated protein 1 3.4
427674 NM_003528 Hs.2178 H2B histone family, member Q 3.4
412505 AA974491 Hs.21734 ESTs 3.4
431408 AA504757 Hs.105738 ESTs 3.4
438801 AA825971 Hs.124284 ESTs 3.4
445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA clone 3.4
418575 AA225313 Hs.222886 ESTs, Weakly similar to TRHY_HUMAN TRICH 3.3
436345 AA873008 Hs.121572 ESTs 3.3
423023 N50128 Hs.173400 ESTs 3.3
427615 BE410107 Hs.179817 CGI-82 protein 3.3
429588 AI080271 Hs.134533 ESTs 3.3
400362 AF068294 Hs.272414 Homo sapiens HDCMB45P mRNA, partial cds 3.3
425523 AB007948 Hs.158244 KIAA0479 protein 3.3
426237 AK001104 Hs.168241 hypothetical protein FLJ10242 3.3
427473 AW274439 Hs.252709 ESTs 3.3
434520 AA205273 Hs.177011 hypothetical protein 3.3
447282 AI989963 Hs.197505 ESTs 3.3
433554 AW957666 Hs.8108 disabled (Drosophila) homolog 1 3.3
414818 BE541217 Hs.23606 ESTs 3.3
418985 AI042330 Hs.87128 hypothetical protein FLJ23309 3.3
436854 AA749167 Hs.173911 ESTs 3.3
430865 AI073424 Hs.5232 HSPC125 protein 3.3
447182 BE241868 Hs.17585 KIAA0801 gene product 3.3
423645 AI215632 Hs.147487 ESTs 3.3
429227 AI961456 Hs.21275 hypothetical protein FLJ11011 3.3
411928 AA888624 Hs.197289 rab3 GTPase-activating protein, non-cata 3.3
418051 AW192535 Hs.19479 ESTs 3.3
418719 AW975590 Hs.161707 ESTs 3.3
437714 AA766346 Hs.293242 ESTs 3.3
455710 BE072049 gb:PM4-BT0532-170100-004-f06 BT0532 Homo 3.3
421590 AF004715 Hs.105940 jerky (mouse) homolog-like 3.3
445707 AI248720 Hs.114390 ESTs 3.3
411436 AW846433 gb:QV0-CT0179-070300-143-b02 CT0179 Homo 3.3
426657 NM_015865 Hs.171731 solute carrier family 14 (urea transport 3.3
423412 AF109300 prostate cancer associated protein 5 3.3
436962 AW377314 Hs.5364 DKFZP564I052 protein 3.3
415319 AA659823 Hs.34955 Homo sapiens cDNA FLJ13485 fis, clone PL 3.3
429318 AW861930 Hs.102500 hypothetical protein dJ511E16.2 3.3
433213 AW665130 Hs.137190 ESTs 3.3
453973 AI291895 Hs.61993 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.3

418245 AA088767 Hs.83883 transmembrane, prostate androgen induced 3.2
428043 T92248 Hs.2240 uteroglobin 3.2
407385 AA610150 Hs.272072 ESTs, Weakly similar to I38022 hypotheti 3.2
423595 R82826 Hs.220702 ESTs 3.2
407021 U52077 gb:Human mariner1 transposase gene, comp 3.2
418986 AI123555 Hs.81796 ESTs 3.2
438118 AW753311 Hs.346690 ESTs 3.2
447516 W05355 Hs.102971 hypothetical protein FLJ14751 3.2
449570 AA001793 gb:zh86c06.r1 Soares_fetal_liver_spleen_ 3.2
450687 AA495800 gb:zw05b07.s1 Soares_NhHMPu_S1 Homo sapi 3.2
425657 T89839 Hs.119471 ESTs 3.2
459546 AW883968 Hs.321190 gb:QV3-OT0063-290300-135-c04 OT0063 Homo 3.2
408480 AI350337 Hs.164568 fibroblast growth factor 7 (keratinocyte 3.2
451025 AW028689 Hs.301985 ESTs 3.2
401416 C14000338*:gi|7459502|pir||S74665 outer 3.2
421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 3.2
454860 AW835767 gb:QV4-LT0016-240200-110-b08 LT0016 Homo 3.2
436995 AI160015 Hs.118112 ESTs 3.2
428736 AK001331 Hs.192662 hypothetical protein FLJ10469 3.2
420120 AL049610 Hs.95243 transcription elongation factor A (SII)- 3.2
422352 AA766296 Hs.99200 ESTs 3.2
429037 X81895 Hs.194765 H.sapiens GENX-5624 mRNA, 3' UTR 3.2
432625 AI243596 Hs.94830 ESTs, Moderately similar to T03094 A-kin 3.2
445218 T79971 Hs.12432 Homo sapiens clone 24407 mRNA sequence 3.2
447597 AI886036 Hs.213675 ESTs 3.2
437866 AA156781 metallothionein 1E (functional) 3.2
416882 AI633044 tryptophanyl tRNA synthetase 2 (mitochon 3.2
410870 U81599 Hs.66731 homeobox B13 3.2
438899 AF085833 Hs.135624 ESTs 3.2
415862 R51034 Hs.144513 ESTs 3.2
420969 AI636310 Hs.28310 ESTs 3.2
415467 R60891 Hs.260274 ESTs 3.2
412666 AL080116 Hs.74420 origin recognition complex, subunit 3 (y 3.2
417380 T06809 Hs.332086 ESTs 3.2
418319 AW611703 Hs.190173 ESTs, Weakly similar to A46010 X-linked 3.2
419088 AI538323 Hs.52620 integrin, beta 8 3.2
422165 AL041199 Hs.1481 histidine decarboxylase 3.2
448044 AI458682 gb:tk13e0U1 NCI_CGAP_Lu24 Homo sapiens 3.2
452862 AW378065 Hs.8687 ESTs 3.2
405548 Target Exon 3.1
439584 AA838114 Hs.221612 ESTs 3.1
419839 U24577 Hs.93304 phospholipase A2, group VII (platelet-ac 3.1
427359 AW020782 Hs.79881 Homo sapiens cDNA: FLJ23006 fis, clone L 3.1
429569 AA454993 Hs.138343 ESTs, Weakly similar to I78885 serine/th 3.1
434512 AW139932 Hs.188941 ESTs 3.1
452102 U04343 Hs.27954 CD86 antigen (CD28 antigen ligand 2, B7- 3.1
425855 AA364908 Hs.98927 hypothetical protein FLJ13993 3.1
416836 D54745 Hs.80247 cholecystokinin 3.1
432589 AL135725 Hs.131708 ESTs 3.1
420512 AA262886 Hs.143817 ESTs 3.1
417206 AA291183 Hs.81648 hypothetical protein FLJ11021 similar to 3.1
410821 AI114811 Hs.92526 ESTs, Weakly similar to T00365 hypotheti 3.1
423855 AA331761 Hs.254859 ESTs 3.1
425905 AB032959 Hs.318584 novel C3HC4 type Zinc finger (ring finge 3.1
427773 AA412290 Hs.98124 ESTs 3.1
441817 AW969706 Hs.293332 ESTs 3.1
456660 AA909249 Hs.112282 solute carrier family 30 (zinc transport 3.1
421823 N40850 Hs.28625 ESTs 3.1
416812 H91010 Hs.44940 ESTs 3.1
445784 AI253155 Hs.146065 ESTs 3.1
434384 AA631910 Hs.162849 ESTs 3.1

416140 AI918035 Hs.301198 roundabout (axon guidance receptor, Dros 3.1
407426 AF129533 gb:Homo sapiens F-box protein Fbl3b (FBL 3.1
416423 H54375 Hs.268921 ESTs 3.1
418037 AI990212 Hs.86447 ESTs 3.1
419197 N48921 Hs.27441 KIAA1615 protein 3.1
420179 N74530 Hs.21168 ESTs 3.1
433610 AA806822 Hs.112547 ESTs 3.1
436295 N73895 gb:za62d06.s1 Soares fetal liver spleen 3.1
444800 AW119071 Hs.153287 ESTs 3.1
418858 AW961605 Hs.21145 hypothetical protein RG083M05.2 3.1
446469 BE094848 Hs.15113 homogentisate 1,2-dioxygenase (homogenti 3.1
431992 NM_002742 Hs.2891 protein kinase C, mu 3.1
405510 ENSP00000233779*:Hypothetical 68.0 kDa p 3.1
407349 AA825449 Hs.83332 Homo sapiens cDNA: FLJ22437 fis, clone H 3.1
410869 AW808361 gb:MR1-ST0111-111099-003-f04 ST0111 Homo 3.1
425354 U62027 Hs.155935 complement component 3a receptor 1 3.1
425480 AB023198 Hs.158135 KIAA0981 protein 3.1
441492 AI149998 Hs.146346 ESTs 3.1
447078 AW885727 Hs.9914 ESTs 3.1
459324 AW080953 gb:xc28c12.x1 NCI_CGAP_Co18 Homo sapiens 3.1
433852 AI378329 Hs.126629 ESTs 3.0
448658 H71739 Hs.200227 ESTs, Moderately similar to A53959 throm 3.0
452242 R50956 Hs.159993 glycosyltransferase 3.0
424690 BE538356 Hs.151777 eukaryotic translation initiation factor 3.0
405264 NM_030813*:Homo sapiens suppressor of po 3.0
407253 AA411175 Hs.141939 ESTs, Moderately similar to S65657 alpha 3.0
452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 3.0
434497 AI821603 Hs.136580 ESTs 3.0
420355 AW968263 Hs.123126 ESTs 3.0

403481
412988
452679
416642
416948
426174
430459
439182
446258
448686
453455
414441
427302
437048
450983
432336
459535
448734
451468
407829
420407
426743
442326
449913
454096
437323
407137
450580
429084
418365
423784
435677
414951
418819
428634
431869
435008
448880
451391
452959



BE046680 

Z42387 

T96118 

AI217097 

AA547959 

BE178539 

AF086030 

AI283476 

AA158659 

AA063553 

AA234759 

AA400540 

AA743240 

AI864668

NM_002759 

AV654907 

BE614070 

AW503398 

AA045084 

AA814732 

AA383833 

H92962 

AA004696 

AW062757 

AA371145 

T97307 

N40087 

AJ001443 

AW014345 

AK000039 

AA694142 

AW794931 

AA228776 

AA811845 

AA521136 

AF150262 

AW205507 

AA017410 

AI933416 



Hs.83883 
Hs.226313 

Hs.115838

Hs.278534 

Hs.21621 

Hs.263478 

Hs.334712 

Hs.221931 

Hs.132950 

Hs.135282 

Hs.91582 

Hs.48832 

Hs.274382

Hs.326416 

Hs.293663

Hs.29725 

Hs.145010 

Hs.245022 

Hs.124813 

Hs.333016 

Hs.194397 



Hs.195614 
Hs.161690 
Hs.132826 
Hs.293726 
Hs.100861 
Hs.191721
Hs.106290 
Hs.190176 
Hs.162898 
Hs.32360 
Hs.40568 
Hs.189674 



Target Exon 3.0
gb:hn42h03.x1 NCI_CGAP_RDF2 Homo sapiens 3.0
transmembrane, prostate androgen induced 3.0
ESTs 3.0
gb:qd43h07.x1 Soares_fetal_heart_NbHH19W 3.0
ESTs 3.0
ESTs 3.0
hypothetical protein DKFZp762O076 3.0
ESTs 3.0
hypothetical protein FLJ14744 3.0
ESTs, Weakly similar to JC1087 RNA helic 3.0
ESTs 3.0
Homo sapiens cDNA FLJ11554 fis, clone HE 3.0
ESTs 3.0
ESTs 3.0
protein kinase, interferon-inducible dou 3.0
gb:AV654907 GLC Homo sapiens cDNA clone 3.0
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 3.0
ESTs, Moderately similar to I38022 hypot 3.0
hypothetical protein FLJ13197 3.0
lipopolysaccharide-specific response 5-li 3.0
ESTs 3.0
hypothetical protein MGC14817 3.0
ESTs 3.0
gb:CM0-CT0103-120899-037-g07 CT0103 Homo 3.0
lectin receptor 3.0
gb:ye53h05.s1 Soares fetal liver spleen 3.0
ESTs 3.0
splicing factor 3b, subunit 3, 130kD 3.0
ESTs 3.0
Homo sapiens cDNA FLJ14913 fis, clone PL 3.0
ESTs, Weakly similar to TSGA RAT TESTIS 3.0
hypothetical protein FLJ14600 3.0
ESTs 3.0
Kelch motif containing protein 3.0
ESTs 3.0
ESTs 3.0
ESTs, Highly similar to I38587 retroviru 3.0
ESTs 3.0
ESTs 3.0



TABLE 64B 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number

Accession: Genbank accession numbers 



Pkey CAT Number Accession 

410790 1221131J AW803357AW803423AW812233 R06814 AtMna ~ M 
410869 1225123 1 AW808361 AW808404 AW808386AW808594 AW808654 AW808813 AW808551 AW808676 AW808350AW808406AW808694 AW808934 

AW808829 AW808385 AW808422 AW808401 AWB08409AW808760AW808863AW808521 AWB08539 AWB08609AW808472AW808739 

AW808704 AW808558 AW808714 AW808420 AW8 
411436 1245660 1 AW846433 AW846159 AW846377 AW846528
439092
439306 
440340 

440947 
442481 
445432 
448044 
449343 



449570 
449625 

450317 
4505B0 



450582 
450687 
452462 
453682 
454037 
454096 
454171 
454457 
454860 
454968 
455276 
455646 
455710 
457374 
458912 



411479 1247077 1 AW848047AW848202AW848631 AW848142 AW848702AW848121 AW848632AW848140 AW848571 AW848009 AW848067 AW848069 ' 

AW848905AW848214 
412988 1342150 1 BE046680 BE046738 BE044958 
413081 1348563 1 BE054415 BE064430 BE064448

413525 1374635 1 BE145899 BE145848 BE145849 BE145853 BE145927 BE145925 

415989 156454 T AJ267700 A1720344 AA191 424 AI023543AI469633M172056 AW956465 AA1 72236 AW953397 AA355085 

416009 1566379 1 Z43062 R13213 H14422 

416288 1585983J H51299 H44619 H46391 R86024 H51892 T72744 

416682 162718 1 AI633044 AW016212 AW241 143 AA76905B R43272 AW068958 AA210918 AA293774 AI748815 AI763294 AI3331 14 AI277384 AI088297 AI468477 

A1824624 AW189506 AI631751 Z40749 AI984673 AIG71316 AA189024 AW235412 BE178426 R24677 R40635 H05100 R40597 
418259 173388J AA215404 AI990909 BE464132 AW271459 N74332 AI262061 
418866 179788 1 T65754 AA229857 AA229658 

418948 180808 1 AI217097 AW886090 W38035 W38792 AA232835AW936043 

419536 185688.1 AA603305 AA244095 AA244183 

420111 190755.1 AA255652 AA280911 AW967920 AA262684

420352 192979 1 BE258835 AW968316 AA258918 AW843305 R14744 A1580388 BE071923 R362B0 
423412 228001 1 AF109300 Al 299378 AI202654 
424200 236595 1 AA337221 AA336756 AW9861 96 
426413 266650.1 AA377823 AW954494 AI022688 
426503 268283.1 AA380153AA380233 AW963529 

426991 27415 1 AK001536 AA191092 AW510354 AI554256 AL353968 AA1 34266 

428002 285602 1 AA418703 AA418711 BE071915 BE071920 BE071912 

428342 290035 2 AI739168M426249AI199636AW505198AW977291M824583MB83419AA724079A^ 

AI092404A1085630AA731340 
429163 300543.1 AA884766 AW974271 AA592975 AA447312 
429220 301384 1 AW207206 AW341473 AA448195AI951341 
430535 319643 1 AW968485 AW958670 AA480922 BE350425 

432600 350959 1 AI821085AW973464AA554802 AI821831 AA657438 AA640756 AA650339 
432765 353907.1 AJ003429 AJ003367 AA564825 

433523 368873 1 H29882 AW665533 AW149901 AI572917 AA598500 AI686466 A1336390 AW864390 AW864320 
434763 392847.1 AA648618 AW974389 H51771 
436295 41733.1 N73895 AJ001872 

436326 41 795 2 BE085236 BE085317 X04236 AA577934 AA578392 AA502836 AA595852 AA57825B AW270791 AA5071 51 AA559152 T57Q40 BE503281 

AW593405 AI825755 AI350499 AI655710 AI972281 AI654949 BE073961 BE073962 BE041399 AW750214 AA228488 BE074016 AI908706 
AW270601 AW873282 

437866 44433 2 AA156781 AW293839U52054AA024963 M778446BE073977AW4449MA\r^ 

AA837481 AW468444 BE185091 AW468002 AA687333 AA81 1830 AA581806 AI866686 AI572124 AA043777 AA040926 D20160 AI536733 
AA812489AW874142AI47188 
AA830149 AW978407 M85983 AW503637 
BE220199 W01 81 3 AF0861 1 8 N70760 BE221405 

AW629666 AW959831 AW205739 BE620243 AA41 2367 AW300025 AW051920 AI288591 AW2361 14 A1302852 AI038548 AA534496 AI797207 
AA921877 

AA910403 AI815593W5B361 AW162520A1816550 
N99828 BE079873 All 10738 AF074645 
AV653771 BE089370 

A1458682 H24240 R14537 R18426 AW867082 

AI151418 W60401 AW631238 A1346936 AAB62855 W60310 N72501 H90060 BE150445 AW380B21 AI540906 C04881 W03542 AA641764 H97053 
AW889353 AA521308 AA001203 W92828 AJ207798 AA746655 R78710 W24617 AA024605 C01747 AW173095 W61229 W92685 AA742467 
H00789 R76925AW1828 
AA001793AA001871 

NM.014253 AF100772 BE088769 AL02271 8 BE161779 AW863569 BE161640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760 
N48874 AI375997 R45432 D59344 AI203107 F07491 R35360 R25094AI9 13631 A1498402 T61382 AI016320 N45526 T61415 AA331486 
A1692689 R14223 R18395 

N40087 H12925 AA460779 AA096372 BE160847 AW816890 AW816893 AW816577 AW816B91 AW816882 AW816868 AW816B89 AW816940 
AW81 6892 AW81 6941 AW81 6578 AA0291 83 T83320 H86850 AA010295 BE1 60823 Z25353 AA150883 AW887764 AW023806 AW022095 
M16451 8 AA730973 W00417 W65303 
83933 1 AI339732 AA010300 AW515041 AA768334 N29860 AA425874 M425118 AA865829 AW936878 
84327 1 AA495800 AA495737 AA010736 AA654716 AA640726 

918580.1 BE173515 BE173560 AI902860 
977454 1 T79703 T96307AL079725 
996287J AW998716 AW022148 N68020 

1007449 1 AW062757 AW176890 AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892 
1049240 1 AW854832 AW854798 AW854857 AW854816 AW854834 AW854817 
1207274.1 AW753456 AW753036 AW854868 AW854862 
1237732.1 AWB35767 AW835537 BE160187 

1 247029 1 AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033 
1272541"1 BE176479 BE176678 BE176357 BE176550 AW886079 BE176676 BE176615 BE176555 BE176489 BE176610 BE176362 
1348557~1 BE064420 BE064435 BE064429 BE064414 BE064400 BE064517 

1352368~1 BE072049 BE069471 BE069489 BE059478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE059464 
328758.1 AM93662 AW897396 BE154814 
8231 04 J AI91 1 066 AI933734 AI680888 AJ003599 



468554.1 
47088.1 
50357 J! 

505904.1 

543588.1 

63943.1 

747196.1 

80517.2 



81018.1 
8113.1 

831956.1 




TABLE 64C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
400533 6981826 Minus 277132-277595
400746 7329328 Minus 147703-147896
401132 8705350 Minus 85679-85795
401403 7710966 Plus 146180-146294
401416 7452889 Minus 121456-121626
401424 8176894 Plus 24223-24428
403242 7637817 Minus 11297-12511
403481 9965004 Plus 93496-93633
403510 7652047 Plus 61866-62027
403667 6850483 Minus 1344-1442,1545-1697
403696 3135242 Minus 143467-143634
404003 8655948 Plus 198349-199096
404561 9795980 Minus 69039-70100
404571 7249159 Minus 112450-112648
404592 9943965 Minus 39067-39225
404848 8248647 Minus 23955-24034,25143-25264
405264 7329374 Plus 28556-28684
405348 2914717 Minus 43310,43462
405510 7630909 Minus 101028-101174
405548 1532158 Plus 11552-11686
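The Nt_position column of Table 64C can hold several comma-separated exon ranges for one probeset. The following is a minimal sketch of how one row might be read into a structured record. The names `PredictedExonRow` and `parse_exon_row` are hypothetical, the Ref field is assumed to be a numeric GI number without spaces, and the ranges are assumed to be inclusive start-end coordinates; none of this is stated in the source.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class PredictedExonRow:
    """One row of a Table 64C style listing (hypothetical container)."""
    pkey: str
    ref: str
    strand: str
    exons: List[Tuple[int, int]]

    def total_length(self) -> int:
        # Assumes inclusive coordinates, so a range a-b spans b - a + 1 bases.
        return sum(end - start + 1 for start, end in self.exons)


def parse_exon_row(line: str) -> PredictedExonRow:
    """Parse 'Pkey Ref Strand Nt_position' where Nt_position is 'a-b[,c-d...]'."""
    fields = line.split()
    if len(fields) != 4:
        raise ValueError(f"expected 4 whitespace-separated fields, got {len(fields)}")
    pkey, ref, strand, positions = fields
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand value: {strand!r}")
    exons = []
    for segment in positions.split(","):
        start_text, sep, end_text = segment.partition("-")
        if not sep:
            # A segment without a hyphen is flagged rather than guessed at.
            raise ValueError(f"segment {segment!r} is not a start-end range")
        exons.append((int(start_text), int(end_text)))
    return PredictedExonRow(pkey, ref, strand, exons)


row = parse_exon_row("403667 6850483 Minus 1344-1442,1545-1697")
print(row.exons, row.total_length())
```

Rejecting a segment that lacks a hyphen is a deliberate choice in this sketch: an entry such as "43310,43462" is surfaced for manual review instead of being silently read as two single positions.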



Table 65A lists about 347 genes up-regulated in BPH compared to prostate cancer tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" BPH to "average" prostate cancer tissues was greater than or equal to 3.0. The "average" BPH level was set to the 75th percentile amongst BPH
tissues. The "average" prostate cancer tissue level was set to the 85th percentile amongst malignant prostate tissues. In order to remove gene-specific background levels of
non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
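The selection rule described above can be sketched in a few lines. This is an illustration under stated assumptions, not the procedure actually used: the function names are hypothetical, the percentile is computed with linear interpolation between order statistics (the source does not name a method), and the `floor` parameter that guards against zero or negative background-subtracted values is an assumption introduced only for this example.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile with linear interpolation between order statistics (assumed method)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (pct / 100.0) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def bph_to_cancer_ratio(bph: Sequence[float],
                        cancer: Sequence[float],
                        normal_body: Sequence[float],
                        floor: float = 1.0) -> float:
    """Background-subtracted ratio of "average" BPH to "average" prostate cancer."""
    # Gene-specific background: 10th percentile amongst non-malignant body tissues.
    background = percentile(normal_body, 10)
    # "Average" BPH is the 75th percentile; "average" cancer is the 85th percentile.
    # The floor is an assumption, not part of the source description.
    numerator = max(percentile(bph, 75) - background, floor)
    denominator = max(percentile(cancer, 85) - background, floor)
    return numerator / denominator


def is_up_regulated_in_bph(ratio: float, threshold: float = 3.0) -> bool:
    """Selection criterion: ratio greater than or equal to 3.0."""
    return ratio >= threshold


ratio = bph_to_cancer_ratio([320, 320, 320, 320], [120, 120, 120], [20, 20, 20])
print(ratio, is_up_regulated_in_bph(ratio))
```

Subtracting the same background from numerator and denominator means a probeset with high non-specific signal in every tissue does not reach the threshold merely because both raw levels are large.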

TABLE 65A: ABOUT 347 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER TISSUES 



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title 



R1: Ratio of BPH to prostate cancer tissues


Pkey ExAccn UnigeneID Unigene Title R1
428134 AA421773 Hs.161008 ESTs 9.4
446336 AW815036 Hs.151251 ESTs 9.3
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 8.9
458072 AI890347 Hs.271923 Homo sapiens cDNA: FLJ22785 fis, clone K 8.8
400533 ENSP00000209376*:PRED65 protein (fragmen 8.7
418310 AA814100 Hs.86693 ESTs 8.7
404592 NM_022739*:Homo sapiens E3 ubiquitin lig 8.1
454457 AW753456 gb:QV2-CT0261-261099-011-d11 CT0261 Homo 7.7
400080 Eos Control 7.4
459646 AW883968 Hs.321190 gb:QV3-OT0063-290300-135-c04 OT0063 Homo 7.4
420352 BE258835 gb:601117374F1 NIH_MGC_16 Homo sapiens c 7.3
438231 AW594539 Hs.155689 ESTs 7.3
418387 R18085 gb:yg16b12.r1 Soares infant brain 1NIB H 7.2
442481 N99828 gb:za32c04.r1 Soares fetal liver spleen 7.1
449249 T52285 Hs.193115 Homo sapiens mRNA for KIAA1764 protein, 7.0
457653 AI820719 Hs.154662 DnaJ (Hsp40) homolog, subfamily A, membe 6.9
404967 Target Exon 6.9
430535 AW968485 gb:EST380561 MAGE resequences, MAGJ Homo 6.9
454860 AW835767 gb:QV4-LT0016-240200-110-b08 LT0016 Homo 6.9
423789 AK002084 Hs.132851 hypothetical protein FLJ11222 6.8
412988 BE046680 gb:hn42h03.x1 NCI_CGAP_RDF2 Homo sapiens 6.8
400440 X83957 Hs.83870 nebulin 6.6
454171 AW854832 gb:QV2-CT0261-201099-011-f05 CT0261 Homo 6.6
400086 Eos Control 6.4
440911 AA909536 Hs.143562 ESTs 6.4
425312 AA354940 Hs.145958 ESTs 6.4
426140 AF131798 Hs.343768 Homo sapiens clone 25119 mRNA sequence 6.4
419015 T79262 Hs.14463 ESTs 6.3
453789 AA628517 Hs.118502 ESTs 6.2
424940 AA985308 Hs.283902 ESTs 6.1
403667 Target Exon 6.1
429014 AI800518 Hs.118158 ESTs 6.0
417758 U27699 Hs.82535 solute carrier family 6 (neurotransmitte 6.0
419999 AI760942 Hs.191754 ESTs 6.0
405348 CT001664*:gi|12698061|dbj|BAB21849.1| (AB 6.0
404003 Target Exon 5.9
453200 AA033832 Hs.212433 ESTs 5.7
428002 AA418703 gb:zv98c03.s1 Soares_NhHMPu_S1 Homo sapi 5.7
432319 AW510770 Hs.128386 ESTs 5.7
445444 AA380876 Hs.270 pleckstrin homology, Sec7 and coiled/coi 5.6
443361 AI792628 Hs.133273 ESTs 5.6
411518 AW850246 gb:IL3-CT0219-291099-021-E07 CT0219 Homo 5.6
439079 AF085937 Hs.38348 ESTs 5.5
422081 AW136820 Hs.196011 ESTs 5.5
408197 AA282262 Hs.107410 ESTs, Weakly similar to A46010 X-linked 5.5
423529 T87318 Hs.120411 ESTs 5.5
436578 AI091435 Hs.134859 ESTs 5.5

450920 AA011626 Hs.133324 ESTs 5.5
435072 AW592176 Hs.116932 ESTs 5.4
414403 AW969551 Hs.76064 ribosomal protein L27a 5.4
443744 AI084326 Hs.271548 ESTs, Weakly similar to I78885 serine/th 5.4
433087 AI720686 Hs.152520 ESTs 5.3
441916 AA993571 Hs.129075 ESTs 5.3
414818 BE541217 Hs.23606 ESTs 5.3
452531 AA429462 Hs.293946 ESTs, Weakly similar to I38022 hypotheti 5.3
454958 AW849046 gb:IL3-CT0214-150300-085-H06 CT0214 Homo 5.2
415890 H08225 Hs.288712 ESTs 5.2
408385 AF055634 Hs.44553 unc5 (C.elegans homolog) c 5.2
437752 AA767376 Hs.291631 ESTs, Moderately similar to S65657 alpha 5.2
446495 D60923 Hs.153450 ESTs 5.2
435375 AI733610 Hs.187832 ESTs 5.2
426748 AL048409 Hs.97177 ESTs, Weakly similar to ALU1_HUMAN ALU S 5.2
424994 AW954525 gb:EST366595 MAGE resequences, MAGC Homo 5.2
405321 Target Exon 5.1
416706 AA314676 Hs.288945 hypothetical protein FLJ13448 5.1
438206 AA780385 Hs.187885 ESTs 5.1
445238 AA883971 Hs.187506 ESTs 5.1
455747 BE074910 gb:RC5-BT0580-170300-021-F12 BT0580 Homo 5.1
420533 AI809510 Hs.118971 ESTs 5.1
457374 AA493662 gb:nh05d12.s1 NCI_CGAP_Thy1 Homo sapiens 5.0
440354 AA889386 Hs.125468 ESTs 5.0
440388 AI693520 Hs.223000 ESTs 4.9
421188 AA284658 Hs.261493 ESTs 4.9
403481 Target Exon 4.8
438132 AA907076 Hs.122060 ESTs 4.8
403333 NM_002518*:Homo sapiens neuronal PAS dom 4.8
450317 AI692589 gb:wd86g05.x1 NCI_CGAP_Lu24 Homo sapiens 4.8
433597 AA708205 Hs.100343 ESTs 4.8
430172 AA468591 Hs.161889 ESTs 4.8
452843 AI796769 Hs.208320 ESTs 4.7
452031 AA741314 Hs.865 RAP1A, member of RAS oncogene family 4.7
427302 AA400540 Hs.135282 Homo sapiens cDNA FLJ11554 fis, clone HE 4.7
442123 AI697790 Hs.159961 EST 4.7
438727 AW978756 Hs.205679 ESTs 4.7
420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 4.6
442160 AI337127 Hs.156325 ESTs 4.5
442295 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 4.6
434589 AF147363 gb:Homo sapiens full length insert cDNA 4.6

404995 ENSP00000251890*:Monocytic leukemia zinc 4.6
422906 U80773 Hs.121580 Human EST clone 42944 mariner transposon 4.5
405549 C7001976*:gi|4758712|ref|NP_004659.1|al 4.5
430697 AA484207 Hs.211867 ESTs 4.5
443998 AI620661 Hs.296276 ESTs 4.5
455801 BE140643 gb:RC0-HT0015-310599-016 HT0015 Homo sap 4.5
444800 AW119071 Hs.153287 ESTs 4.5
403371 Target Exon 4.5
438431 AW207860 Hs.293116 ESTs 4.5
421926 AA300591 gb:EST13437 Testis tumor Homo sapiens cD 4.5
439752 T78968 Hs.14411 ESTs 4.5
414441 AA234759 Hs.132950 ESTs 4.5
454665 AW812866 gb:RC3-ST0186-300100-017-b03 ST0186 Homo 4.5
454585 BE069128 gb:QV3-BT0379-310100-071-g06 BT0379 Homo 4.4
452978 AA029994 Hs.61523 ESTs 4.4
438042 AW296971 Hs.180610 ESTs 4.4
418059 AA211586 gb:zn56d05.s1 Stratagene muscle 937209 H 4.4
451469 NM_014809 Hs.26441 KIAA0319 gene product 4.4
433072 AI928037 Hs.158832 ESTs 4.4
443318 AI051603 Hs.133141 ESTs 4.4
424686 AA345504 gb:EST51529 Gall bladder II Homo sapiens 4.3
416312 W02640 Hs.16247 ESTs, Weakly similar to 2004399A chromos 4.3
415467 R60891 Hs.260274 ESTs 4.3
459045 N69101 Hs.40730 ESTs 4.3
415417 F12038 Hs.140970 ESTs, Weakly similar to ALU6_HUMAN ALU S 4.3
409111 AL043362 Hs.7984 pleckstrin homology, Sec7 and coiled/coi 4.3
441620 R59595 Hs.26675 ESTs 4.2
427908 AA417272 Hs.24122 ESTs 4.2
432765 AJ003429 gb:AJ003429 Selected chromosome 21 cDNA 4.2
448108 AW300021 Hs.170685 ESTs 4.2
436345 AA873008 Hs.121572 ESTs 4.2
409500 U08098 Hs.54576 sulfotransferase, estrogen-preferring 4.2
413525 BE145899 gb:MRf>HT0208-221299-204^)10HT0208Homo 4.2
403305 NM_006825 transmembrane protein (63kD), endoplasmi 4.2
436338 W92147 Hs.118394 ESTs 4.2
427521 AW973352 Hs.290585 ESTs 4.2
430124 AW204994 Hs.253450 ESTs 4.2
410790 AW803357 gb:lU-UM007W>9030(W)5(WU)8 UM0O79 Homo 4.1
434360 AW015415 Hs.127780 ESTs 4.1
416385 H54253 Hs.205241 ESTs, Weakly similar to S65657 alpha-1C- 4.1
442786 H50733 Hs.256261 ESTs, Moderately similar to ALU8_HUMAN A 4.1
445206 AI350199 Hs.269990 ESTs 4.1
424200 AA337221 gb:EST41944 Endometrial tumor Homo sapie 4.1
448152 AI741053 Hs.170770 ESTs 4.1
453713 R20640 Hs.79133 cadherin 8, type 2 4.1
400746 Target Exon 4.1
413081 BE064415 gb:RC4-BT0311-241199-012-b03 BT0311 Homo 4.1
442338 AI761976 Hs.156080 ESTs 4.0
455388 AW936234 gb:QV0-DT0020-090200-105-g05 DT0020 Homo 4.0
455276 BE176479 gb:RC3-HT0585-160300-022-b09 HT0585 Homo 4.0
421312 AA824627 Hs.291670 ESTs 4.0
431421 AW969118 Hs.108144 ESTs, Weakly similar to unnamed protein 4.0
448882 AJ001531 Hs.22404 protease, serine, 12 (neurotrypsin, moto 4.0
441568 AI733322 Hs.127176 ESTs 4.0
455646 BE064420 gb:RC4-BT0311-241199-012-c08 BT0311 Homo 3.9
424765 AA428211 Hs.284256 hypothetical protein FLJ14033 similar to 3.9
454024 AA993527 Hs.293907 hypothetical protein FLJ23403 3.9
453387 AI990741 Hs.252809 ESTs 3.9
434222 AF119886 Hs.283941 Homo sapiens PRO2591 mRNA, complete cds 3.9
454806 AW872430 Hs.273743 ESTs 3.9
429066 AA868555 Hs.178222 ESTs 3.9
435878 R08330 Hs.20152 ESTs 3.9
453055 AW291436 Hs.31917 Homo sapiens, clone MGC:9658, mRNA, comp 3.9
447530 AW192063 Hs.248865 ESTs, Moderately similar to JC5238 galac 3.9
407834 AW084991 Hs.26100 ESTs 3.9
400398 AF137396 Hs.283879 ubiquilin 3 3.9
421353 AW292857 Hs.255130 ESTs 3.9
401459 C14000482*:gi|9790241|ref|NP_062628.1| S 3.9
403433 NM_001622:Homo sapiens alpha-2-HS-glycop 3.9
444911 U06117 Hs.250 xanthene dehydrogenase 3.9
436350 AA713661 Hs.121091 ESTs 3.9
447930 R44574 Hs.107510 ESTs 3.9
410559 AW754192 gb:RC2-CT0321-131299-012-a04 CT0321 Homo 3.8
452320 AA042873 Hs.160412 ESTs 3.8
402145 Target Exon 3.6
458438 AI141520 Hs.151464 ESTs, Weakly similar to ALUC_HUMAN !!!! 3.8
440450 AI333129 Hs.156147 ESTs 3.8
437587 AI591222 Hs.72325 Human DNA sequence from clone RP1-187J11 3.8
416009 Z43062 gb:HSC12E041 normalized infant brain cDN 3.8
434381 AA631834 gb:np77n05.s1 NCLCGAPJTC Homo sapiens 3.8
416534 H69043 Hs.224961 Homo sapiens cDNA FLJ14366 fis, clone HE 3.8
433523 H29882 ESTs 3.8
411436 AW846433 gb:QV0-CT0179-070300-143-b02 CT0179 Homo 3.8
424830 AW270580 Hs.189311 ESTs, Weakly similar to putative p150 [H 3.8
431447 AA505138 Hs.291341 ESTs 3.8
435932 W03928 Hs.114524 ESTs 3.8
442447 AA999723 Hs.129607 ESTs 3.8
419831 AW448930 Hs.5415 ESTs 3.8
403242 Target Exon 3.8
455490 AW953477 gb:EST365547 MAGE resequences, MAGB Homo 3.8
449264 AI637649 Hs.196105 ESTs 3.8
443635 AI080230 Hs.134214 ESTs 3.7
428200 AI039624 Hs.96388 ESTs 3.7
452462 BE173515 gb:RC2-HT0560-210200-012-f03 HT0560 Homo 3.7
418759 AA227879 Hs.187621 ESTs 3.7
450497 H64159 Hs.15328 ESTs 3.7
447785 AL041765 Hs.340375 ESTs 3.7
451746 M86178 Hsl31125B ESTs 3.7
416321 H94331 Hs.34024 ESTs 3.7
448135 AI470874 Hs.343799 ESTs 3.7
405510 ENSP00000233779*:Hypothetical 68.0 kDa p 3.7
407437 AF220264 gb:Homo sapiens MOST-1 mRNA, complete cd 3.7
451859 H44491 Hs.252938 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7
455135 AW857989 gbPM2-CT0328-2B1299^e04CT0328Homo 3.7
409189 AA125984 gb:zn27h06.r1 Stratagene neuroepithelium 3.7
420300 AA258245 Hs.127573 Homo sapiens FKSG41 (FKSG41) mRNA, compl 3.7
427726 AI359144 Hs.143688 Homo sapiens cDNA: FLJ23031 fis, clone L 3.7
434306 AW081757 Hs.44241 Homo sapiens cDNA: FLJ21447 fis, clone C 3.7
456354 X56411 Hs.1219 alcohol dehydrogenase 4 (class II), pi p 3.7
419261 X07876 Hs.89791 wingless-type MMTV integration site fami 3.7
422899 D16471 Hs.121571 Human mRNA, Xq terminal portion 3.7
447458 AI741082 Hs.158961 ESTs 3.7
411421 BE272110 Hs.21177 ESTs 3.7
439075 AF085933 Hs.292620 ESTs 3.7
440947 AA910403 ESTs 3.7
417565 AI203405 Hs.47831 ESTs 3.6
430096 U91935 Hs.233321 Retina-derived POU-domain factor-1 3.6
448658 H71739 Hs.200227 ESTs, Moderately similar to A53959 throm 3.6
404561 trichorhinophalangeal syndrome I gene (T 3.6
429073 AA446167 Hs.47385 ESTs 3.6
419002 T78625 Hs.268594 ESTs 3.6
450630 AA010429 Hs.191939 ESTs 3.6
455067 AW854538 gb:RC3-CT0255-200100-024-b02 CT0255 Homo 3.6

412768 AW996044 Hs.26239 Human DNA sequence from clone RP11-438B2 3.6
439481 AF086294 Hs.125844 ESTs
419622 AA452054 Hs.119338 ESTs
449821 AI671141 Hs.211122 ESTs
451193 N29850 Hs.44098 ESTs
412701 AW984757 gb:RC1-HN0015-040400-011-g10 HN0015 Homo
419611 AB031479 Hs.91600 SEEK1 protein
433563 AI732637 Hs.277901 ESTs
427235 AI126288 Hs.192232 ESTs
449045 BE072483 Hs.278337 Homo sapiens cDNA FLJ11537 fis, clone HE
401132 C12000517*:gi|4758712|ref|NP_004659.1| a
421105 AA766501 Hs.125113 ESTs
405264 NM_030813*:Homo sapiens suppressor of po
417675 AI808607 Hs.3781 similar to murine leucine-rich repeat pr
449988 AW372068 Hs.201420 ESTs, Moderately similar to ALU7_HUMAN A
434763 AA648618 gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens
416379 N38857 Hs.203933 ESTs
407557 Z83803 gb:H.sapiens mRNA for axonemal dynein he
437956 AA773283 Hs.203559 hypothetical protein FLJ12701
403510 Target Exon
451722 H86374 Hs.40861 ESTs
435681 AA694192 Hs.148979 ESTs
427125 AA683362 Hs.97612 ESTs
424287 AL133105 Hs.144633 hypothetical protein DKFZp434F2322
434497 AI821803 Hs.136580 ESTs
453682 T79703 gb:yd71e08.r1 Soares fetal liver spleen
404848 ENSP00000240769*:BG153O3.1 (similar to C
417435 NM_005181 Hs.82129 carbonic anhydrase III, muscle specific
442231 W02434 Hs.222413 ESTs
450297 AW901347 Hs.38592 hypothetical protein FLJ23342
436340 R42246 Hs.21506 ESTs
441596 AA939300 Hs.206768 ESTs
456995 T89832 Hs.170278 ESTs
456304 AI820973 gb:nc21c02.y5 NCI_CGAP_Pr1 Homo sapiens
427033 AI457449 Hs.192817 ESTs
400379 NM_018432 Homo sapiens ovarian cancer related prot
410551 R36730 Hs.21816 ESTs
423357 AI285124 Hs.157505 ESTs
450582 AI339732 G-rich RNA sequence binding factor 1
437662 AA765387 Hs.145095 ESTs
442388 AW663442 Hs.129485 ESTs
445004 AI204616 Hs.148701 ESTs
450597 AI701635 Hs.207077 ESTs
438801 AA825971 Hs.124284 ESTs
445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA clone
450003 AA777609 Hs.191995 ESTs
431408 AA504757 Hs.105738 ESTs
428923 BE047698 Hs.188785 ESTs
407344 AI038025 Hs.271418 gb:ox29f07.x1 Soares_total_fetus_Nb2HF8_
425523 AB007948 Hs.158244 KIAA0479 protein
427473 AW274439 Hs.252709 ESTs
437115 AA744703 Hs.129030 ESTs
434520 AA205273 Hs.177011 hypothetical protein
447282 AI989963 Hs.197505 ESTs
453328 AW292635 Hs.346145 ESTs
418985 AI042330 Hs.87128 hypothetical protein FLJ23309
449256 AA059050 Hs.59847 ESTs


432550 AW297206 Hs.164018 ESTs 

423101 MB3941 Hs.123642 EphA3 

424853 BE549737 Hs.1 32967 Human EST done 122887 mariner transposo 

410700 AA352335 Hs.65641 hypothetical protein FU20073 

418719 AW975590 Hs.161707 ESTs 

402703 Target Exon 

455710 BE072049 gb:PM4BT()532-17010(Wu4^6BT0532Homo 

452011 AW628911 Hs.211429 ESTs 

453973 AI291895 Hs.61993 ESTs. Weakly similar to ALU1.HUMAN ALU S 

420355 AW968263 Hs.123126 ESTs 

418898 T66847 Hs.194040 ESTs, Weakly similar to I38022 hypolheO 

418986 AI123555 Hs.81796 ESTs 

449570 AA001793 gb2h86c06j1 Soaresjetal Jver_spteen_ 

447516 W05355 Hs.102971 hypothetical protein FU 14751 

408304 AW810279 gb:MR4-ST0125-151299-029-a09 ST0125 Homo 

402454 C10025O1*:gi|129092|splP23270lOLF7_RAT O 

411552 AW851255 gb:lL3-CT022Q-160200-066-H02 CT0220 Homo 

407896 D76435 Hs.41154 Zfe family member 1 (odd-paired Drosophi 

435786 H09175 Hs.260&5 ESTs 

447597 A1886036 Hs.213675 ESTs 

432625 AI243596 Hs.94830 ESTs, Moderately similar to T03094 A-kin 

459535 AV654907 gbAV654907 GLC Homo sapiens cDNA done 

447183 A1554733 Hs.173182 ESTs 

426629 A1203933 Hs.97142 ESTs 

447892 AJ435848 Hs.172978 ESTs 

457136 AA42B240 Hs.126083 ESTs 

enamelin



443585 AW466983 Hs.283949 
451399 AL042110 Hs.326728 ESTs 3.1
450687 AA495800 gb:zw05b07.s1 Soares_NhHMPu_S1 Homo sapi 3.1
416812 H91010 Hs.44940 ESTs 3.1
452102 U04343 Hs.27954 CD86 antigen (CD28 antigen ligand 2, B7- 3.1
434512 AW139932 Hs.188941 ESTs 3.1
458251 AL040927 Hs.210422 ESTs 3.1
439950 AW937417 Hs.293561 ESTs 3.1
429559 AA454993 Hs.138343 ESTs, Weakly similar to I78885 serine/th 3.1
403805 Target Exon 3.1
422666 AA677981 Hs.119023 SMC2 (structural maintenance of chromoso 3.1
408799 AA059412 Hs.47986 hypothetical protein MGC10940 3.1
429350 AI754634 Hs.131987 ESTs 3.1
430454 AW469011 Hs.105635 ESTs 3.1
441817 AW969706 Hs.293332 ESTs 3.1
427773 AA412290 Hs.98124 ESTs 3.1
415901 H08396 Hs.76118 ubiquitin carboxyl-terminal esterase L1 3.1
445758 R60715 Hs.25804 ESTs 3.1
403291 Target Exon 3.1
436295 N73895 gb:za62d06.s1 Soares fetal liver spleen 3.1
455772 W28799 gb:52g11 Human retina cDNA randomly prim 3.1
418037 AI990212 Hs.86447 ESTs 3.1
438142 T90309 Hs.269651 ESTs 3.1
416423 H54375 Hs.268921 ESTs 3.1
448765 R15337 Hs.21958 Homo sapiens mRNA; cDNA DKFZp547D086 (fr 3.1
457021 AW968934 Hs.173108 Homo sapiens cDNA: FLJ21897 fis, clone H 3.1
420301 AA767526 Hs.22030 paired box gene 5 (B-cell lineage specif 3.1
420721 AA927802 Hs.159471 ZAP3 protein 3.1
436013 AA703419 Hs.287749 Homo sapiens cDNA: FLJ23593 fis, clone L 3.1
441492 AI149998 Hs.146346 ESTs 3.1
410869 AW808361 gb:MR1-ST0111-111099-003-f04 ST0111 Homo 3.1
419386 AA236867 ESTs, Weakly similar to I38022 hypotheti 3.1
435628 W88732 Hs.36107 ESTs 3.0
444326 AI939357 Hs.270710 ESTs 3.0
413774 AA131782 Hs.182314 ESTs 3.0
434352 AF129505 Hs.86492 small muscle protein, X-linked 3.0
434269 AK001991 Hs.3781 similar to murine leucine-rich repeat pr 3.0
416422 H60457 ESTs, Moderately similar to ZN91_HUMAN Z 3.0
416642 T96118 Hs.226313 ESTs 3.0




AF088030 hypothetical protein DKFZp762O076 3.0


418948 AI217097 gb:qd43h07.x1 Soares_fetal_heart_NbHH19W 3.0
433796 AA810867 Hs.186997 ESTs 3.0
427813 D29833 Hs.2207 salivary proline-rich protein 3.0
405733 NM_021140*:Homo sapiens ubiquitously tra 3.0
426743 AA383833 Hs.245022 ESTs 3.0
449913 AA004696 Hs.333016 ESTs 3.0
454096 AW062757 gb:CM0-CT0103-120899-037-g07 CT0103 Homo 3.0
440129 AA865818 Hs.174936 ESTs, Weakly similar to S71886 Ste20-lik 3.0
410929 H47233 Hs.30643 ESTs 3.0
458187 D56919 Hs.265848 myomegalin 3.0
429430 AI381837 Hs.155335 ESTs 3.0
451391 AA017410 Hs.40568 ESTs 3.0
414951 AW794931 Hs.100861 hypothetical protein FLJ14600 3.0



TABLE 65B 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number

Accession: Genbank accession numbers 



Pkey CAT Number Accession
408304 1050848_1 AW810279 BE146684 BE146693 BE146694 BE146679 AW810472 AW810208 AW810356 AW810193 AW178838 AW178837 AW178857 AW810515 AW810330 AW810514 AW810441 AW810358 AW178852 AW810359 AW810322 AW810327 AW810211 AW178835 AW810635 AW810288 AW810263 AW810325 AW810443 AW8
409189 110687_1 AA125984 AA127189 AA065075 AA070377 AA100017 AA079891 AA113255 AA075168 AA082764 AA083380 N84829 AA084752 AA076512 AA085119 AA085208 AA085045
410559 1208283_1 AW754192 W00554 AW857797 AW754203 AW754197 AW754193
410790 1221131_1 AW803357 AW803423 AW812233 R06814
410869 1225123_1 AW808361 AW808404 AW808386 AW808594 AW808654 AW808813 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934 AW808829 AW808385 AW808422 AW808401 AW808409 AW608760 AW808863 AW808521 AW808539 AW808609 AW808472 AW808739 AW808704 AW808558 AW808714 AW808420 AW8
411436 1245660_1 AW846433 AW846159 AW846377 AW846528
411518 1248692_1 AW850246 AW850251 AW850302
411552 1249255_1 AW851255 AW851432 AW850955
412701 1322288_1 AW984757 AW984797 AW984734 AW984745
412988 1342150_1 BE046680 BE046738 BE044958
413081 1348563_1 BE064415 BE064430 BE064448
413525 1374635_1 BE145899 BE145848 BE145849 BE145853 BE145927 BE145925
416009 1566379_1 Z43062 R13213 H14422
416422 1593811_1 H60457 H68709 H73528 H54335 R87154
418059 171879_1 AA211586 F35799 AA211641 F29720 AW937387 AW937408
418387 174731_1 R18085 AA219028 R17712 Z44345
418948 180808_1 AI217097 AW886090 W38035 W38792 AA232835 AW936043
419386 184356_1 AA236867 AA237066 AA354236 AW957759 H08961
420352 192979_1 BE258835 AW958316 AA258918 AW843305 R14744 AI580388 BE071923 R36280
421926 209246_1 AA300591 AW963893 AA300493
424200 236595_1 AA337221 AA336756 AW966196
424686 242486_1 AA345504 AA345251 AW963243
424994 245786_1 AW954525 AI372685 AA349501 AI372687 H10564
428002 285602_1 AA418703 AA418711 BE071915 BE071920 BE071912
430535 319643_1 AW968485 AW968670 AA480922 BE350425
432765 353907_1 AJ003429 AJ003367 AA564825
433523 368873_1 H29882 AW665533 AW149901 AI572917 AA598500 AI686466 AI336390 AW854390 AW854320
434381 385155_1 AA631834 AA633425 AA632455 AI792312 AI792311
434589 38929_1 AF147363 T47219 T47218
434763 392847_1 AA648618 AW974389 H51771
436295 41733_1 N73895 AJ001872
440947 505904_1 AA910403 AI815593 W58361 AW162520 AI816550
442481 543588_1 N99828 BE079873 AI110738 AF074645
445432 63943_1 AV653771 BE089370
449570 81018_1 AA001793 AA001871
450317 831956_1 AI692689 R14223 R18395
450582 83933_1 AI339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118 AA865829 AW936878
450687 84327_1 AA495800 AA495737 AA010736 AA654716 AA640726
452462 918580_1 BE173515 BE173560 AI902860
453682 977454_1 T79703 T96307 AL079725
454096 1007449_1 AW062757 AW176890 AW062758 AW176895 AW176859 AW176900 AW176897 AW176928 AW176868 AW176892
454171 1049240_1 AW854832 AW854798 AW854857 AW854816 AW854834 AW854817
454457 1207274_1 AW753456 AW753036 AW854868 AW854862
454585 1225852_1 BE069128 BE069023 AW809375
454665 1228599_1 AW812866 AW812746 AW812747 AW812884 AW812763 AW812722
454860 1237732_1 AW835767 AW835537 BE160187
454968 1247029_1 AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033
455067 1252050_1 AW854538 AW854418 AW854412
455135 1254729_1 AW857989 AW858016 AW861677 AW861689 AW861691 AW858056
455276 1272541_1 BE176479 BE176678 BE176357 BE176550 AW886079 BE176676 BE176615 BE176555 BE176489 BE176610 BE176362
455388 1287904_1 AW936234 AW936074 AW936181 AW936179 AW935217 AW936077 AW936227 AW936191
455490 1297826_1 AW953477 Z41970 F12435 T73989 T09367
455646 1348557_1 BE064420 BE064435 BE064429 BE0644U BE064400 BE064517
455710 1352368_1 BE072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464
455747 1355877_1 BE074910 BE074913 BE074911 BE074903 BE074892 BE074935
455772 1363114_1 W28799 BE086078
455801 1370508_1 BE140643 BE140645 BE140644 BE140657 BE140660 BE140659 BE140661
456304 176820_1 AI820973 AI734077 AI820984 AA225796 AA225060 AA225101
457374 328758_1 AA493662 AW897396 BE154814



TABLE 65C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.




Pkey Ref Strand Nt_position
400533 6981826 Minus 277132-277595
400746 7329328 Minus 147703-147896
401132 8705350 Minus 85679-85795
401459 9212270 Minus 182001-183323
402145 8018280 Plus 113086-114800
402454 7534025 Minus 14826-15603
402703 8705069 Minus 15335-15500
403242 7637817 Minus 11297-12511
403291 7230870 Plus 95177-95435
403305 8099945 Plus 114632-114805
403333 8568833 Minus 124794-124941
403371 9087278 Plus 105655-106050
403433 9719611 Minus 72225-72437
403481 9965004 Plus 93495-93633
403510 7652047 Plus 61866-62027
403667 6850483 Minus 1344-1442,1545-1697
403805 8140491 Minus 51483-51742,53429-53511
404003 8655948 Plus 198349-199096
404561 9795980 Minus 69039-70100
404592 9943965 Minus 39067-39225
404648 8248647 Minus 23955-24034,25143-25264
404967 7523744 Minus 89944-90729
404995 6006247 Minus 154015-154123
405264 7329374 Plus 28556-28684
405321 3419846 Minus 44654-45210
405348 2914717 Minus 43310-43462
405510 7630909 Minus 101028-101174
405549 1552494 Plus 10878-11048
405733 9884689 Plus 124832-125051
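The exon coordinates listed for Table 65C are written as start-end pairs, with a comma separating multiple predicted exons (for example 23955-24034,25143-25264). The following is a minimal sketch of how such a value could be read programmatically. The function names are hypothetical, and the treatment of both coordinates as inclusive is an assumption that the table legend does not state.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a position value into (start, end) pairs, one per predicted exon."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start {start} exceeds end {end}")
        exons.append((start, end))
    return exons


def total_exon_length(field: str) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in parse_nt_position(field))


# Two predicted exons on one probeset row.
print(parse_nt_position("23955-24034,25143-25264"))
print(total_exon_length("23955-24034,25143-25264"))
```

If the coordinates are instead half-open, the `+ 1` in `total_exon_length` would need to be dropped.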
Table 66A lists about 370 genes up-regulated in BPH compared to prostate cancer and normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" BPH to "average" prostate cancer and normal adult tissues was greater than or equal to 3.0. The "average" BPH level was set to the
75th percentile amongst BPH tissues. The "average" prostate cancer and normal adult tissue level was set to the 85th percentile amongst malignant prostate tissues and normal adult
tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the
numerator and the denominator before the ratio was evaluated.
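The selection ratio described above can be sketched in a few lines. This is an illustration only: the function names and the sample intensities are hypothetical, and the linear-interpolation percentile is an assumed convention, since the text does not say how percentiles were computed.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (assumed convention)."""
    if not values:
        raise ValueError("at least one value is required")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def bph_ratio(bph: Sequence[float],
              cancer_and_normal: Sequence[float],
              non_malignant: Sequence[float]) -> float:
    """Background-subtracted ratio for one probeset.

    Numerator: 75th percentile amongst BPH tissues.
    Denominator: 85th percentile amongst prostate cancer and normal adult tissues.
    Background: 10th percentile amongst non-malignant body tissues,
    subtracted from both before dividing.
    """
    background = percentile(non_malignant, 10.0)
    numerator = percentile(bph, 75.0) - background
    denominator = percentile(cancer_and_normal, 85.0) - background
    if denominator <= 0:
        raise ValueError("denominator is not positive after background subtraction")
    return numerator / denominator


# Hypothetical intensities for a single probeset.
bph_levels = [100, 200, 300, 400, 500]
comparison_levels = list(range(0, 210, 10))
background_levels = list(range(20, 130, 10))

ratio = bph_ratio(bph_levels, comparison_levels, background_levels)
selected = ratio >= 3.0  # the cutoff stated in the text
print(round(ratio, 2), selected)
```

How a non-positive denominator was handled in the original analysis is not stated; raising an error here is simply one conservative choice.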

TABLE 66A: ABOUT 370 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER AND NORMAL ADULT TISSUES



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title 

R1: Ratio of BPH tissue to prostate tumor and normal body tissue 



Pkey 

410929 

450693 

418432 

432473 

446336 

407275 

428134 

400297 

433466 

415293 

458072 

428927 

420345 

453387 

454457 

441247 

431576 

400080 

431448 

404592 

420352 

438231 

410330 

449300 

449249 

426384 

454171 

408197 

429732 

421154 

431467 

424433 

442481 

425312 

426140 

440911 

400533 

418310 

403667 

436396 

404003 

424853 

438138

424940 

434485 

453200 

428002 

424850 

445444 

443361 

421513 

439079 

430535 

436578 

424051 

421863 

435072 

435375 



AL137517 Hs.306201 



433087 
439092 
437267 
441916 
452784 
452531 



ExAccn UnigeneID Unigene Title R1
H47233 Hs.30643 ESTs 21.1
AW450461 Hs.203965 ESTs 16.7
M14156 Hs.85112 insulin-like growth factor 1 (somatomedi 14.2
AI202703 Hs.152414 ESTs 11.3
AW815036 Hs.151251 ESTs 10.9
AI364186 gb:qw34h07.x1 NCI_CGAP_Ut4 Homo sapiens 10.7
AA421773 Hs.161008 ESTs 10.2
AI127076 Hs.306201 hypothetical protein DKFZp564O1278 9.8
AA508353 Hs.105314 relaxin 1 (H1) 9.5
R49462 Hs.106541 ESTs 9.1
AI890347 Hs.271923 Homo sapiens cDNA FLJ22785 fis, clone K 8.8
AA441837 Hs.90250 ESTs 8.6
AW295230 Hs.25231 ESTs 8.5
AI990741 Hs.252809 ESTs 8.2
AW753456 gb:QV2-CT0261-261099-011-d11 CT0261 Homo 7.7
AW118681 Hs.128051 Homo sapiens thymic stromal lymphopoieti 7.5
M76665 Hs.275215 hydroxysteroid (11-beta) dehydrogenase 1 7.4
Eos Control 7.4
hypothetical protein DKFZp564O1278 7.4
NM_022739*:Homo sapiens E3 ubiquitin lig 7.3
gb:601117374F1 NIH_MGC_16 Homo sapiens c 7.3
ESTs 7.3
ESTs 7.2
ESTs 7.1
Homo sapiens mRNA for KIAA1764 protein, 7.0
hypothetical protein FLJ13189 (FLJ13189) 6.8
gb:QV2-CT0261-201099-011-105 CT0261 Homo 6.6
ESTs, Weakly similar to A46010 X-linked 6.6
lymphocyte cytosolic protein 2 (SH2 doma 6.6
Homo sapiens cDNA FLJ14269 fis, clone PL 6.5
Homo sapiens mRNA; cDNA DKFZp434E0528 (f 6.5
ESTs 6.5
gb:za32c04.r1 Soares fetal liver spleen 1 6.4
ESTs 6.4
Homo sapiens clone 25119 mRNA sequence 6.4
ESTs 6.4
ENSP00000209376*:PRED65 protein (Fragmen 6.2
ESTs 6.2
Target Exon 6.1
wingless-type MMTV integration site fami 6.1
Target Exon 5.9
Hs.132967 Human EST clone 122887 mariner transposo 5.9
Hs.177502 ESTs 5.9
Hs.283902 ESTs 5.8
Hs.118567 ESTs 5.8
Hs.212433 ESTs 5.7
gb:zv98c03.s1 Soares_NhHMPu_S1 Homo sapi 5.7
Hs.153498 chromosome 18 open reading frame 1 5.6
Hs.270 pleckstrin homology, Sec7 and coiled/coi 5.6
Hs.133273 ESTs 5.6
Hs.105314 relaxin 1 (H1) 5.5
Hs.38348 ESTs 5.5
gb:EST380561 MAGE resequences, MAGJ Homo 5.5
Hs.134859 ESTs 5.5
Hs.138411 Homo sapiens mRNA; cDNA DKFZp586J1922 (f 5.4
Hs.108972 Homo sapiens mRNA; cDNA DKFZp434P228 (fr 5.4
Hs.116932 ESTs 5.4
Hs.187832 ESTs 5.4
Hs.278081 ESTs 5.4
Hs.79389 nel (chicken)-like 2 5.4
Hs.152520 ESTs 5.3
gb:oc44f08.s1 NCI_CGAP_GCB1 Homo sapiens 5.3
Hs.258110 ESTs 5.3
Hs.129075 ESTs 5.3
Hs.151258 hypothetical protein FLJ21062 5.3
Hs.293946 ESTs, Weakly similar to I38022 hypotheti 5.3




BE258835 

AW594539 

AW023630 

AI656959

T52285 

AI472078 

AW854832 

AA282262 

U20158 

AA284333 

N71831 

H04607 

N99828 

AA354940 

AF131798 

AA909536 



Hs.155689
Hs.159425 
Hs.346514 
Hs.193115 
Hs.303662 

Hs.107410 

Hs.2488 

Hs.287631 

Hs.256398

Hs.9218 

Hs.145958 
Hs.343768 
Hs.143562 



AA814100 Hs.86693 
AI683487 Hs.152213



BE549737 

R98299 

AA985308 

AI623511 

AA033832 

AA418703 

AA151057 

AA380876 

AI792628 

X00949 

AF085937 

AW968485 

AI091435 

AL110203

AI952677

AW592176 

AI733610 

AW571659 

NM_006159

AI720686

AA830149 

AW511443 

AA993571 

BE463857 

AA429462 
423101
437587 
415890 



M83941 

AI591222 

H08225 






454968 AW849046 
408385 AF055634 



437752 
446495 
422906 



AA767376 

D60923 

U80773 



427726 AI359144
426748 AL048409 



436338 
416706 
440129 



420533 
438447 



Hs.123642
Hs.72325
Hs.268712

Hs.44553
Hs.291631
Hs.153460
Hs.121580
Hs.143686
Hs.97177
Hs.118394
Hs.288945
Hs.174936
Hs.187506

Hs.118971
Hs.30732
Hs.125468
Hs.212875

AA493662 

NM_006042 Hs.48384 
AW292769 Hs.206228 



W92147 

AA314676 

AA865818



445238 AA883971 
450582 AI339732 



AI809510 
AI082883



440354 AA889386 
452891 N75582 
457374 
408829 
448041 
457653 



AI820719 
450497 H64159 
438132 AA907076 



414818 
429433 
433523 
450317 
443635 
430172 
452843 
427302 
452031 
423789 
417173 
420931 
454024 
405348 
419511 
447058 
428216 



BE541217 

AA452899 

H29882 

AI692689 

AI080230 

AA468591 

AI796769 

AA400540 

AA741314 

AK002084 

U61397 

AF044197 

AA993527 

AA429750 
AI939456
AA424266 



430697 AA484207 
433280 AA581404 



449821 
420905 
439752 
452055 
430188
423352 
438042 
452978 
420154 
428249 
442242 



AI671141 

AA521307 

T78968 

AI377431 

AL049242 

AA324808 

AW296971 

AA029994 

AI093155 

AA130914 

AV647908 



452498 AK000101
459527 AW977556 



433597 
416312 
420111 
459045 
409705 
433433 
427908 
432765 
413525 
420355 
427521 
458912 
410790 
415245 
434360 
442786 
450597 
424200 



Hs.154662 

Hs.15328 

Hs.122060 

Hs.23606 

Hs.213586 



Hs.134214

Hs.161889 

Hs.208320 

Hs.135282 

Hs.865 

Hs.132851 

Hs.81424 

Hs.100431 

Hs.293907 

Hs.75113 
Hs.160870 
Hs.123642 
Hs.211867 
Hs.289037 
Hs.211122 
Hs.186651 
Hs.14411 
Hs.141693 
Hs.234794 
Hs.193576 
Hs.180610
Hs.61523 
Hs.95420 
Hs.183291 
Hs.90424 
Hs.29700 
Hs.291735 
Hs.100343 
Hs.16247 



AA7082C5 
W02640 
AA255652 
N69101 
M37762 
AI692623 
AA417272 
AJ003429 
BE145899 
AW968263 Hs.123126 



Hs.40730 
Hs.56023 
Hs.121513
Hs.24122 



AW973352 

AI911066 

AW803357 

N59650 

AW015415 

H50733 

AI701635 

AA337221 



445206 AI350199 
438431 AW207860 



AI741053
AA827640 



448152 
438875 
400746 
413081 
404967 
427203 AW629517 



BE064415 



Hs.290585 



Hs.27252 
Hs.127780 
Hs.256261 
Hs.207077 

Hs.269990 
Hs.293116 
Hs.170770
Hs.189059 



Hs.244855 



EphA3 5.3
Human DNA sequence from clone RP1-187J11 5.2
ESTs 5.2
gb:IL3-CT0214-150300-085-H06 CT0214 Homo 5.2
unc5 (C.elegans homolog) c 5.2
ESTs, Moderately similar to S65657 alpha 5.2
ESTs 5.2
Human EST clone 42944 mariner transposon 5.2
Homo sapiens cDNA: FLJ23031 fis, clone L 5.2
ESTs, Weakly similar to ALU1_HUMAN ALU S 5.2
ESTs 5.2
hypothetical protein FLJ13448 5.1
ESTs, Weakly similar to S71886 Ste20-lik 5.1
ESTs 5.1
G-rich RNA sequence binding factor 1 5.1
ESTs 5.1
hypothetical protein FLJ13409; KIAA1711 5.1
ESTs 5.0
ESTs, Weakly similar to DYH9_HUMAN CILIA 5.0
gb:nh05d12^1 NCI_CGAP_Thy1 Homo sapiens 5.0
heparan sulfate (glucosamine) 3-O-sulfot 5.0
ESTs 5.0
DnaJ (Hsp40) homolog, subfamily A, membe 4.9
ESTs 4.8
ESTs 4.8
ESTs 4.8
ESTs, Weakly similar to KIAA1353 protein 4.8
ESTs 4.8
gb:wd86g05.x1 NCI_CGAP_Lu24 Homo sapiens 4.8
ESTs 4.8
ESTs 4.8
ESTs 4.7
Homo sapiens cDNA FLJ11554 fis, clone HE 4.7
RAP1A, member of RAS oncogene family 4.7
hypothetical protein FLJ11222 4.7
ubiquitin-like 1 (sentrin) 4.6
small inducible cytokine B subfamily (Cy 4.6
hypothetical protein FLJ23403 4.6
CT001664:gi|12698061|dbj|BAB21849.1| (AB 4.6
general transcription factor IIIA 4.6
ESTs 4.6
EphA3 4.6
ESTs 4.5
Homo sapiens cDNA FLJ14135 fis, clone MA 4.5
ESTs 4.5
ESTs 4.5
ESTs 4.5
hypothetical protein MGC10858 4.5
Homo sapiens mRNA; cDNA DKFZp564B083 (fr 4.4
ESTs 4.4
ESTs 4.4
ESTs 4.4
JM27 protein 4.4
zinc finger protein 268 4.4
Homo sapiens cDNA: FLJ23285 fis, clone H 4.4
hypothetical protein FLJ20094 4.4
ESTs, Weakly similar to I78885 serine/th 4.3
ESTs 4.3
ESTs, Weakly similar to 200439 9A chromes 4.3
gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens 4.3
ESTs 4.3
brain-derived neurotrophic factor 4.3
Homo sapiens clone Z3-1 placenta expres 4.2
ESTs 4.2
gb:AJ003429 Selected chromosome 21 cDNA 4.2
gb:MR0-HT0208-221299-204-b10 HT0208 Homo 4.2
ESTs 4.2
ESTs 4.2
ESTs 4.2
gb:tl^-UM0079^0m050-A08 UM0079 Homo 4.1
ESTs 4.1
ESTs 4.1
ESTs, Moderately similar to ALU8_HUMAN A 4.1
ESTs 4.1
gb:EST41944 Endometrial tumor Homo sapie 4.1
ESTs 4.1
ESTs 4.1
ESTs 4.1
ESTs 4.1
Target Exon 4.1
gb:RC4-BT0311-241199-012-b03 BT0311 Homo 4.1
Target Exon 4.0
ESTs 4.0
442338 AI761976 Hs.156080 ESTs 4.0
455276 BE176479 gb:RC3-HT0585-160300-022-b09 HT0585 Homo 4.0
452903 AI953425 Hs.345291 ESTs, Weakly similar to I38022 hypotheti 4.0
435136 R27299 Hs.10172 ESTs 4.0
416760 H85182 Hs.191327 ESTs, Highly similar to KIAA1102 protein 4.0
421312 AA824627 Hs.291670 ESTs 4.0
441558 AI733322 Hs.127176 ESTs 4.0
441736 AW292779 Hs.8182 ESTs 4.0
448882 AJ001531 Hs.22404 protease, serine, 12 (neurotrypsin, moto 4.0
434222 AF119886 Hs.283941 Homo sapiens PRO2591 mRNA, complete cds 3.9
420301 AA767526 Hs.22030 paired box gene 5 (B-cell lineage specif 3.9
429066 AA868555 Hs.178222 ESTs 3.9
435878 R08330 Hs.20152 ESTs 3.9
447530 AW192063 Hs.248865 ESTs, Moderately similar to JC5238 galac 3.9
412988 BE046680 gb:hn42h03.x1 NCI_CGAP_RDF2 Homo sapiens 3.9
410196 AI936442 Hsl59836 hypothetical protein FLJ10808 3.9
411084 T18987 Hs.125472 ESTs, Moderately similar to KIAA0877 pro 3.9
419629 AB020695 Hs.91662 KIAA0888 protein 3.9
455646 BE064420 gb:RC4-BT0311-241199-012-C08 BT0311 Homo 3.9
414441 AA234759 Hs.132950 ESTs 3.9
425810 AI923627 Hs.31903 ESTs 3.9
409111 AL043362 Hs.7984 pleckstrin homology, Sec7 and coiled/coi 3.9
411479 AW848047 gb:IL3-CT0214-291299-052-A12 CT0214 Homo 3.9
440450 AI333129 Hs.156147 ESTs 3.8
428342 AI739168 Homo sapiens cDNA FLJ13458 fis, clone PL 3.8
431421 AW969118 Hs.108144 ESTs, Weakly similar to unnamed protein 3.8
416009 Z43062 gb:HSC12E041 normalized infant brain cDN 3.8
416534 H69043 Hs.224961 Homo sapiens cDNA FLJ14366 fis, clone HE 3.8
407198 H91679 gb:yv04a07.s1 Soares fetal liver spleen 3.8
419831 AW448930 Hs.5415 ESTs 3.8
424830 AW270580 Hs.189311 ESTs, Weakly similar to putative p150 [H 3.8
431447 AA505138 Hs.291341 ESTs 3.8
435932 W03928 Hs.114524 ESTs 3.8
442447 AA999723 Hs.129607 ESTs 3.8
403242 Target Exon 3.8
433908 AW298141 Hs.157975 ESTs 3.8
454037 AW998716 gb:PM4-BN0067-250300-002-f11 BN0067 Homo 3.7
432101 AI918950 Hs.123642 EphA3 3.7
418759 AA227879 Hs.187621 ESTs 3.7
452462 BE173515 gb:RC2-HT0560-210200-012-ro3 HT0560 Homo 3.7
448568 AA149121 Hs.71947 ESTs 3.7
447785 AL041765 Hs.340375 ESTs 3.7
451746 M86178 Hs.311258 ESTs 3.7
436345 AA873008 Hs.121572 ESTs 3.7
446862 AV660697 Hs.282700 ESTs 3.7
419875 AA853410 Hs.93557 proenkephalin 3.7
421040 AA715026 Hs.135280 ESTs 3.7
454860 AW835767 gb:QV4-LT0016-240200-11 04)08 LT0016 Homo 3.7
450687 AA495800 gb:zw05b07.s1 Soares_NhHMPu_S1 Homo sapi 3.7
407437 AF220264 gb:Homo sapiens MOST-1 mRNA, complete cd 3.7
431474 AL133990 Hs.190642 CEGP1 protein 3.7
448004 AW451477 Hs.257456 ESTs 3.7
419261 X07876 Hs.89791 wingless-type MMTV integration site fami 3.7
422899 D16471 Hs.121571 Human mRNA, Xq terminal portion 3.7
439075 AF085933 Hs.292620 ESTs 3.7
440947 AA910403 ESTs 3.7
447458 AI741082 Hs.158961 ESTs 3.7
403481 Target Exon 3.6
404561 trichorhinophalangeal syndrome I gene (T 3.6
417565 AI203405 Hs.47831 ESTs 3.6
430096 U91935 Hs.233321 Retina-derived POU-domain factor-1 3.6
448658 H71739 Hs.200227 ESTs, Moderately similar to A53959 throm 3.6
449655 AI021987 Hs.59970 ESTs 3.6
450630 AA010429 Hs.191939 ESTs 3.6
455067 AW854538 gb:RC3-CT0255-200100-024-b02 CT0255 Homo 3.6
423566 AW976434 Hs.3623 hypothetical protein FLJ11220 3.6
451193 N29850 Hs.44098 ESTs 3.6
420026 AI831190 Hs.166676 ESTs 3.6
426917 AA913814 Hs.172854 DKFZP566B0923 protein 3.6
429430 AI381837 Hs.155335 ESTs 3.6
433563 AI732637 Hs.277901 ESTs 3.6
443744 AI084326 Hs.271548 ESTs, Weakly similar to I78885 serine/th 3.5
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 3.5
419589 AW973708 Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 3.5
401132 C12000517*:gi|4758712|ref|NP_004659.1| a 3.5
421105 AA766501 Hs.125113 ESTs 3.5
435177 AI018174 Hs.42936 ESTs 3.5
434763 AA648618 gb:ns07a11.r1 NCI_CGAP_Ew1 Homo sapiens 3.5
449988 AW372068 Hs.201420 ESTs, Moderately similar to ALU7_HUMAN A 3.5
450216 AA873345 Hs.60226 Homo sapiens, clone IMAGE:3621638, mRNA, 3.5
403510 Target Exon 3.5
435681 AA694192 Hs.148979 ESTs 3.5
451722 H86374 Hs.40861 ESTs 3.5



430865 
444800 
428647
412775 
429643 
443324 
430124 
430701 
436714 
404848 
408480 
412494 
417435 
430403 
436340 
441596 
442231 
447124 
450297 
453682 
456995 
405510 
400379 
419964 
427033 
422321 
430829 
420721 
433628 
436703 
429716 
427235 
459646 
428923 
431408 
438801 
445432 
450003 
448108 
425523 
427473 
434520 
447282 
447182 
432229 
418985 
418719 
455710 
420300 
416662 
411436 
420851 
428715 
429318 
444246 
453973 
434497 
439306 
420608 
418986 
438118 
447516 
449570 
424994 
417675 
402145 
448131 
422352 
432625 
447183 
447597 
459535 
415467 
434408 
456354 
453789 
412666 
418319 
419088 
420397 
422165 
448044 
454665 



AI073424 

AW119071 

AA830050 

AA709046 

AA455889

R44013 

AW204994 

AI760833

AA728964 

AI350337 

AL133900 

NM_005181

AF039390 

R42246 

AA939300 

W02434 

AW976438 

AW901347 

T79703 

T89832 

NM_018432 

AA811657

AI457449 

AA906427 

AW451999 

AA927802 

AI821784 

AW880614 

R25685 

AI126288

AW883968

BE047698

AA504757 

AA825971 

AV653771 

AA777809 

AW300021 

AB007948

AW274439 

AA205273 

AI989963 

BE241868 

AW290976 

AI042330 

AW975590 

BE072049 

AA258245 

T25853 

AW845433 

AA281062 

AW293716 

AW861930 

H93281 

AI291895 

AI821803

BE220199 

BE548277 

AI123555 

AW753311 

W05355 

AA001793 

AW954525 

AI808607 



Hs.5232 

Hs.153287

Hs.124344 

Hs.27552 

Hs.167279 

Hs.164225 

Hs.253450 

Hs.293971 

Hs.293399 

Hs.164568

Hs.792 

Hs.82129 

Hs.241382 

Hs.21606 

Hs.206768 

Hs.222413 

Hs.17428 

Hs.38592 

Hs.170278 



Hs.220913 
Hs.192817
Hs.181035
Hs.194024
Hs.159471
Hs.188578
Hs.146381
Hs.211933
Hs.192232
Hs.321190
Hs.188785
Hs.105738
Hs.124284

Hs.191995 

Hs.170685 

Hs.158244 

Hs.252709 

Hs.177011 

Hs.197505 

Hs.17585 

Hs.143587 

Hs.87128

Hs.161707 

Hs.127573 
HsJ538 

Hs.29493 

Hs.53126 

Hs.102500 

Hs.10710 

Hs.61993 

Hs.136580 

Hs.103104 
Hs.81796 
Hs.346690 
Hs.102971 



Hs.3781 



AI675054 

AA766296 

AI243596 

AI554733 

AI886036

AV654907 

R60891 

AI031771 

X56411 

AA628517 

AL080116 

AW611703 

AI538323 

NM_007018 

AL041199 

AM58682 

AW812866 



Hs.200481 

Hs.99200 

Hs.94830 

Hs.173182 

Hs.213675 

Hs.260274 

Hs.132586 

Hs.1219 

Hs.118502

Hs.74420 

Hs.190173 

Hs.52620 

Hs.97437 

Hs.1481 



HSPC125 protein 3.5
ESTs 3.5
ESTs 3.5
Homo sapiens mRNA; cDNA DKFZp586N2424 (f 3.5
FYVE-finger-containing Rab5 effector pro 3.5
ESTs 3.5
ESTs 3.4
ESTs 3.4
ESTs 3.4
ENSP00000240769*:BG153O3.1 (similar to C 3.4
fibroblast growth factor 7 (keratinocyte 3.4
ADP-ribosylation factor domain protein 1 3.4
carbonic anhydrase III, muscle specific 3.4
tumor necrosis factor (ligand) superfami 3.4
ESTs 3.4
ESTs 3.4
ESTs 3.4
RBP1-like protein 3.4
hypothetical protein FLJ23342 3.4
gb:yd71e08.r1 Soares fetal liver spleen 3.4
ESTs 3.4
ENSP00000233779*:Hypothetical 68.0 kDa p 3.4
Homo sapiens ovarian cancer related prot 3.4
ESTs 3.4
ESTs 3.4
hypothetical protein MGC11296 3.4
ESTs 3.4
ZAP3 protein 3.4
ESTs 3.4
RNA binding motif protein, X chromosome 3.4
collagen, type XIII, alpha 1 3.4
ESTs 3.4
gb:QV3-OT0063-290300-135^:04 OT0063 Homo 3.4
ESTs 3.4
ESTs 3.4
ESTs 3.4
gb:AV653771 GLC Homo sapiens cDNA clone 3.4
ESTs 3.4
ESTs 3.3
KIAA0479 protein 3.3
ESTs 3.3
hypothetical protein 3.3
ESTs 3.3
KIAA0801 gene product 3.3
ESTs 3.3
hypothetical protein FLJ23309 3.3
ESTs 3.3
gb:PM4-BT0532-170100-004-f06 BT0532 Homo 3.3
Homo sapiens FKSG41 (FKSG41) mRNA, compl 3.3
ESTs 3.3
gb:QVIKrT01794J70300-143-b02 CT0179 Homo 3.3
hypothetical protein FLJ20142 3.3
ESTs 3.3
hypothetical protein dJ511E16.2 3.3
hypothetical protein FLJ20417 3.3
ESTs, Weakly similar to ALU1_HUMAN ALU S 3.3
ESTs 3.2
WD40 protein Ciao1 3.2
ESTs 3.2
ESTs 3.2
ESTs 3.2
hypothetical protein FLJ14751 3.2
gb:zh86c06.r1 Soares_fetal_liver_spleen_ 3.2
gb:EST366595 MAGE resequences, MAGC Homo 3.2
similar to murine leucine-rich repeat pr 3.2
Target Exon 3.2
ESTs 3.2
ESTs 3.2
ESTs, Moderately similar to T03094 A-kin 3.2
ESTs 3.2
ESTs 3.2
gb:AV654907 GLC Homo sapiens cDNA clone 3.2
ESTs 3.2
ESTs 3.2
alcohol dehydrogenase 4 (class II), pi p 3.2
ESTs 3.2
origin recognition complex, subunit 3 (y 3.2
ESTs, Weakly similar to A46010 X-linked 3.2
integrin, beta 8 3.2
centrosomal protein 1 3.2
histidine decarboxylase 3.2
gb:tk13e01.x1 NCI_CGAP_Lu24 Homo sapiens 3.2
gb:RC3-ST0186-300100-017-b03 ST0186 Homo 3.1
405321

429569 

434512 

452102 

433444 

410821 

415861 

422299 

432527 

427773 

441817 

416812 

417958 

407426 

416423 

418037 

419197 

420179 

436295 

405548 

423595 

412533 

434072 

405264 

410869 

425354 

441492 

447078 

435021 

408832 

450580 

432319 

453713 

445784 

416642 

418948 

433796 

439182 

404995 

444794 

443634 

420133 

407829 

426743 

442326 

449913 

454096 

419622 

449745 

428412 

428200 

414951 

431869 

451391 

452959 



AA454993 

AW139932

U04343 

AW975324 

AI114811

Z43123 

AK000181 

AW975028 

AA412290 

AW969706 

H91010 

AA767382 

AF129533 

H54375 

AI990212 

N48921 

N74530 

N73895 

R82826 

AA679863 

H70854 

AW808361 

U62027 

AI149998 

AW885727 

AA922192 

AW085690 

N40087 

AW510770 

R20640 

AI253155 

T96118 

AI217097 

AA810867 

AF088030 

AI419991 

H73972 

AA426117 

AA045084 

AA383833 

H92962 

AA004696 

AW062757 

AA452054 

AI658593

AA428240 

AI039624 

AW794931 

AA521136 

AA017410 

AI933416 



Hs.138343 

Hs.188941 

Hs.27954 

Hs.129816 

Hs.92526 

Hs.144513 

Hs.114556 

Hs.102754 

Hs.98124 

Hs.293332 

Hs.44940 

Hs.193417 

Hs.268921 
Hs.86447 
Hs.27441 
Hs.21168 



Hs.220702 
Hs.69606 
Hs.283059 



Hs.155935 

Hs.146346 

Hs.9914 

Hs.54709 

Hs.63428 

Hs.128388
Hs.79133 
Hs.146065 
Hs.226313 

Hs.186997 
Hs^1621 

Hs.145225 

Hs.134460 

Hs.155543 

Hs.29725 

Hs.245022 

Hs.124813 

Hs.333016 

Hs.119338

Hs.126083 

Hs.98388 

Hs.100861 

Hs.190176 

Hs.40568 

Hs.189674 



Target Exon 3.1

ESTs, Weakly similar to I78885 serine/th 3.1

ESTs 3.1

CD86 antigen (CD28 antigen ligand 2, B7- 3.1

ESTs 3.1

ESTs, Weakly similar to T00365 hypotheti 3.1

ESTs 3.1

hypothetical protein FLJ20174 3.1

ESTs 3.1

ESTs 3.1

ESTs 3.1

ESTs 3.1

ESTs 3.1

gb:Homo sapiens F-box protein Fbl3b (FBL 3.1

ESTs 3.1

ESTs 3.1

KIAA1615 protein 3.1

ESTs 3.1

gb:za82d06.s1 Soares fetal liver spleen 3.1

Target Exon 3.1

ESTs 3.1

ESTs 3.1

Homo sapiens PRO1082 mRNA, complete cds 3.1

NM_030813*:Homo sapiens suppressor of po 3.1

gb:MR1-ST0111-111099-003-f04 ST0111 Homo 3.1

complement component 3a receptor 1 3.1

ESTs 3.1

ESTs 3.1

ESTs 3.0

ESTs, Weakly similar to Z195_HUMAN ZINC 3.0

ESTs 3.0

ESTs 3.0

cadherin 8, type 2 3.0

ESTs 3.0

ESTs 3.0

gb:qd43h07.x1 Soares_fetal_heart_NbHH19W 3.0

ESTs 3.0

hypothetical protein DKFZp762O076 3.0

ENSP00000251890*:Monocytic leukemia zinc 3.0

ESTs 3.0

ESTs 3.0

ESTs 3.0

hypothetical protein FLJ13197 3.0

ESTs 3.0

hypothetical protein MGC14817 3.0

ESTs 3.0

gb:CMO-CT0103-120899-037-g07 CT0103 Homo 3.0

ESTs 3.0

gb:yt38a05.x5 Soares breast 3NbHBst Homo 3.0

ESTs 3.0

ESTs 3.0

hypothetical protein FLJ14600 3.0

ESTs 3.0

ESTs 3.0

ESTs 3.0



TABLE 66B 



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers 



Pkey


CAT Number 


410790 


1221131 1 


410869 


1225123J 


411436 


1245660J 


411479 


1247077J 


412988 


1342150J 


413081 


1348563 1 


413525 


1374635 1 


416009 


1566379J 


418948 


1B0808J 


420111 


190755J 


420352 


192979.1 


424200 


236595.1 


424994 


245786.1 


428002 


285602.1 



AW803357 AW803423 AW812233 R06814

AW808361 AW808404 AW808386 AW808594 AW808654 AW808813 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934
AW808829 AW808385 AW808422 AW808401 AW808409 AW808760 AW808863 AW808521 AW808539 AW808609 AW808472 AW808739
AW808704 AW808558 AW808714 AW808420 AW8
AW846433 AW846159 AW846377 AW846528

AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW848140 AW848571 AW848009 AW848067 AW848069
AW848905 AW848214
BE046680 BE046738 BE044958
BE064415 BE064430 BE064448

BE145899 BE145848 BE145849 BE145853 BE145927 BE145925
Z43062 R13213 H14422

AI217097 AW886090 W38035 W38792 AA232835 AW936043
AA255652 AA280911 AW967920 AA262684

BE258835 AW968316 AA258918 AW843305 R14744 AI580388 BE071923 R36280

AA337221 AA336756 AW966196

AW954525 AI372685 AA349501 AI372687 H10564

AA418703 AA418711 BE071915 BE071920 BE071912
428342

430535 
432765 
433523 
434763 
436295 
439092 
439306 
440947 
442481 
445432 
448044 
449570 
449745 
450317 
450580 



450582 
450687 
452462 
453682 
454037 
454096 
454171 
454457 
454665 
454860 
454968 
455067 
455276 
455646 
455710 
457374 
458912 



290035.2 

319643.1 

353907 1 

368873J 

392847.1 

41733J 

468554J 

47088.1 

505904.1 

543588 1 

63943J 

747196 1 

81018J 

814534.1 

831956.1 

83929.1 



83933.1 

84327J 

918580J 

977454.1 

996287J 

1007449.1 

1049240 1 

1207274 1 

1228599.1 

1237732J 

1247029J 

1252050.1 

1272541.1 

1348557.1 

1352368.1 

328758.1 

823104.1 



AI739168 AA426249 AI199636 AW505198 AW977291 AA824583 AA883419 AA724079 AI015524 AI377728 AW293682 AI928140 AA731438
AI092404 AI085530 AA731340
AW968485 AW968670 AA480922 BE350425
AJ003429 AJ003367 AA564825

H29882 AW665533 AW149901 AI572917 AA598500 AI686466 AI336390 AW864390 AW864320

AA648618 AW974389 H51771

N73895 AJ001872

AA830149 AW978407 M85983 AW503637
BE220199 W01813 AF086118 N70760 BE221405
AA910403 AI815593 W58361 AW162520 AI816550
N99828 BE079873 AI110738 AF074645
AV653771 BE089370

AI458682 H24240 R14537 R18426 AW867082
AA001793 AA001871

AI668593 AI820774 R86205 H39971 H22177 H26241
AI692689 R14223 R18395

N40087 H12925 AA460779 AA096372 BE160847 AW816890 AW816893 AW816577 AW816891 AW816882 AW816868 AW816889 AW816940
AW816892 AW816941 AW816578 AA029183 T83320 H86850 AA010295 BE160823 Z25353 AA150883 AW887764 AW023806 AW022095
AA164518 AA730973 W00417 W65303

AJ339732 AA010300 AW515041 AA768334 N29860 AA425874 AA425118 AA865829 AW936878
AA495800 AA495737 AA010736 AA654716 AA640726
BE173515 BE173560 AI902860
T79703 T96307 AL079725
AW998716 AW022148 N68020

AW062757 AW176890 AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892 

AW854832 AW854798 AW854857 AW854816 AW854834 AW854817 

AW753456 AW753036 AW854868 AW854862 

AW812866 AW812746 AW812747 AW812884 AW812763 AW812722 

AW835767 AW835537 BE160187 

AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033 
AW854538 AW854418 AW854412

BE176479 BE176678 BE176357 BE176550 AW886079 BE176676 BE176615 BE176555 BE176489 BE176610 BE176362 
BE054420 BE064435 BE064429 BE064414 BE064400 BE064517 

BE072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464 

AA493662 AW897396 BE154814 

AI911066 AI933734 AI680888 AJ003599



TABLE 66C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey; Ref; Strand; Nt_position
400533; 6981826; Minus; 277132-277595
400746; 7329328; Minus; 147703-147896
401132; 8705350; Minus; 85679-85795
402145; 8018280; Plus; 113086-114800
403242; 7637817; Minus; 11297-12511
403481; 9965004; Plus; 93496-93633
403510; 7652047; Plus; 61866-62027
403667; 6850483; Minus; 1344-1442,1545-1697
404003; 8655948; Plus; 198349-199096
404561; 9795980; Minus; 69039-70100
404592; 9943965; Minus; 39067-39225
404848; 8248647; Minus; 23955-24034,25143-25264
404967; 7523744; Minus; 89944-90729
404995; 6006247; Minus; 154015-154123
405264; 7329374; Plus; 28556-28684
405321; 3419846; Minus; 44654-45210
405348; 2914717; Minus; 43310-43462
405510; 7630909; Minus; 101028-101174
405548; 1532158; Plus; 11552-11686
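
The Nt_position values of Table 66C are either a single start-end range (for example 11552-11686) or several comma-separated ranges (for example 1344-1442,1545-1697). As a minimal sketch of how such a field might be read programmatically, the following Python splits one value into (start, end) pairs. The function names are hypothetical, and the assumption that both coordinates are inclusive is ours; the table legend does not state it.

```python
def parse_nt_position(field):
    """Split an Nt_position string into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum the exon lengths, ASSUMING inclusive coordinates (not stated in the legend)."""
    return sum(end - start + 1 for start, end in exons)


# Value taken from the Pkey 403667 row of Table 66C.
exons = parse_nt_position("1344-1442,1545-1697")
print(exons, total_exon_length(exons))
```

Keeping the ranges as integer pairs leaves the strand column untouched; whether minus-strand coordinates need reversing depends on the downstream tool and is not addressed here.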



TABLE 67A: 689 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES

Table 67A lists about 689 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneChip
array that showed some expression in human xenograft tumors and cell lines. The 689 genes were selected such that the ratio of "average" prostate cancer to "average" normal adult
tissues was greater than or equal to 2.5. The "average" prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average" normal adult tissue level was set
to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst
non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
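
The selection rule described for Table 67A can be sketched in Python as follows. This is an illustration only: the function names, the linear-interpolation percentile convention, and the handling of a non-positive denominator are assumptions of this sketch, since the text gives only the percentiles (75th, 85th, 10th) and the 2.5 cutoff.

```python
def percentile(values, pct):
    """Linear-interpolation percentile (an assumed convention, not stated in the text)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must be non-empty")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(tumor_levels, normal_levels):
    """Background-subtracted ratio of 'average' tumor to 'average' normal expression."""
    background = percentile(normal_levels, 10)           # 10th percentile of normal tissues
    numerator = percentile(tumor_levels, 75) - background
    denominator = percentile(normal_levels, 85) - background
    if denominator <= 0:
        # Assumption: the text does not say how such probesets were treated.
        raise ValueError("normal-tissue signal does not exceed background")
    return numerator / denominator


def is_up_regulated(ratio, cutoff=2.5):
    """Apply the 'greater than or equal to 2.5' selection criterion."""
    return ratio >= cutoff


# Made-up expression values for one probeset, for illustration only.
normal = list(range(101))      # percentiles 10 and 85 are exactly 10 and 85
tumor = [310] * 5              # 75th percentile is 310
ratio = tumor_to_normal_ratio(tumor, normal)
print(ratio, is_up_regulated(ratio))
```

With these illustrative inputs the ratio is (310 - 10) / (85 - 10) = 4.0, so the probeset would pass the 2.5 cutoff.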

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal body tissue
Pkey; ExAccn; UnigeneID; Unigene Title; R1
448999; AF179274; Hs.22791; transmembrane protein with EGF-like and; 73.2
426747; AA535210; Hs.171995; kallikrein 3, (prostate specific antigen; 65.0
432441; AW292425; Hs.163484; ESTs; 56.0
446057; AI420227; Hs.149358; Trp-p8 transient receptor potential cati; 55.0
400302; N48056; Hs.283946; folate hydrolase (prostate-specific memb; 48.1
414569; AF109298; Hs.118258; prostate cancer associated protein 1; 41.4
432240; AI694767; Hs.129179; Homo sapiens cDNA FLJ13581 fis, clone PL; 40.3
419526; AI821895; Hs.193481; ESTs; 35.9
400298; AA032279; Hs.61635; six transmembrane epithelial antigen of; 35.7
400299; X07730; Hs.171995; kallikrein 3, (prostate specific antigen; 34.9
453370; AI470523; Hs.139336; ATP-binding cassette, sub-family C (CFTR; 33.9
409731; AA125985; Hs.56145; thymosin, beta, identified in neuroblast; 32.8
424099; AF071202; Hs.139336; ATP-binding cassette, sub-family C (CFTR; 32.6
400292; AA250737; Hs.72472; BMP-R1B; 28.8
400287; S39329; Hs.181350; kallikrein 2, prostatic; 28.8
425075; AA506324; Hs.1852; acid phosphatase, prostate; 28.6
409361; NM_005982; Hs.54416; sine oculis homeobox (Drosophila) homolo; 27.8
415539; AI733881; Hs.72472; BMP-R1B; 27.0
428819; AL135623; Hs.193914; KIAA0575 gene product; 25.7
409262; AK000631; Hs.52256; hypothetical protein FLJ20624; 25.5
432374; W68815; Hs.301885; Homo sapiens cDNA FLJ11346 fis, clone PL; 24.8
407168; R45175; Hs.117183; ESTs; 24.5
400296; AA305627; Hs.139336; ATP-binding cassette, sub-family C (CFTR; 23.6
428336; AA503115; Hs.183752; microseminoprotein, beta-; 23.5
433043; W57554; Hs.125019; lymphoid nuclear protein (LAF-4) mRNA; 23.4
403047; ; ; NM_005656*:Homo sapiens transmembrane pr; 23.4
444484; AK002126; Hs.11260; hypothetical protein FLJ11264; 22.0
423073; BE252922; Hs.123119; MAD (mothers against decapentaplegic, Dr; 21.6
407709; AA456135; Hs.23023; ESTs; 21.5
401424; ; ; NM_001172:Homo sapiens arginase, type II; 20.6
448290; AK002107; Hs.20843; Homo sapiens cDNA FLJ11245 fis, clone PL; 19.7
434666; AF151103; Hs.112259; T cell receptor gamma locus; 18.5
415989; AI267700; ; ESTs; 17.9
437052; AA861697; Hs.120591; ESTs; 17.8
451939; U80456; Hs.27311; single-minded (Drosophila) homolog 2; 16.8
431548; AI834273; Hs.9711; novel protein; 16.6
425628; NM_004476; Hs.283946; folate hydrolase (prostate-specific memb; 16.5
428862; NM_000346; Hs.2316; SRY (sex determining region Y)-box 9 (ca; 16.5
450096; AI682088; Hs.79375; holocarboxylase synthetase (biotin-[prop; 16.5
413597; AW302885; Hs.117183; ESTs; 16.1
411096; U80034; Hs.68583; mitochondrial intermediate peptidase; 15.7
435677; AA694142; Hs.293726; ESTs, Weakly similar to TSGA RAT TESTIS; 15.3
419743; AW408762; Hs.5957; Homo sapiens clone 24416 mRNA sequence; 15.3
407122; H20276; Hs.31742; ESTs; 15.1
427958; AA418000; Hs.98280; potassium intermediate/small conductance; 14.1
418829; AA516531; Hs.55999; NK homeobox (Drosophila), family 3, A; 13.9
426501; AW043782; Hs.293616; ESTs; 13.9
428898; AB033070; Hs.194408; KIAA1244 protein; 13.7
418961; AW967646; Hs.23023; ESTs; 13.3
448519; AW175665; Hs.278695; Homo sapiens prostein mRNA, complete cds; 13.2
418848; AI820961; Hs.193465; ESTs; 13.1
428398; AI249368; Hs.98558; ESTs; 13.0
429220; AW207206; ; ESTs; 12.7
401451; ; ; NM_004496*:Homo sapiens hepatocyte nucle; 12.4
424012; AW368377; Hs.137569; tumor protein 63 kDa with strong homolog; 12.4
419078; M93119; Hs.89584; insulinoma-associated 1; 12.3
450382; AA397658; Hs.60257; Homo sapiens cDNA FLJ13598 fis, clone PL; 11.9
430376; AW292053; Hs.12532; chromosome 1 open reading frame 21; 11.8
408826; AF216077; Hs.48376; Homo sapiens clone HB-2 mRNA sequence; 11.4
429918; AW873986; Hs.119383; ESTs; 11.3
427212; AW293849; Hs.58279; ESTs, Weakly similar to ALU7_HUMAN ALU S; 11.1
412446; AI768015; ; ESTs; 11.1
433404; T32982; ; ESTs; 10.9
420757; X78592; Hs.99915; androgen receptor (dihydrotestosterone r; 10.9
427674; NM_003528; Hs.2176; H2B histone family, member Q; 10.6
444151; AW972917; Hs.128749; alpha-methylacyl-CoA racemase; 10.4
447342; AI199268; Hs.19322; Homo sapiens, Similar to RIKEN cDNA 2010; 10.3
418278; AI088489; Hs.83937; hypothetical protein; 10.3
430226; BE245562; Hs.2551; adrenergic, beta-2-, receptor, surface; 10.2
433927; AI557019; Hs.116467; small nuclear protein PRAC; 10.2
408000; L11690; Hs.198889; bullous pemphigoid antigen 1 (230/240kD); 10.1
449625; NM_014253; ; odz (odd Oz/ten-m, Drosophila) homolog 1; 10.1
431117; AF003522; Hs.250500; delta (Drosophila)-like 1; 10.0
439444; AI277652; Hs.54578; ESTs, Weakly similar to I38022 hypotheti; 9.8
437718; AI927288; Hs.196779; ESTs; 9.8
440529; AW20764C; Hs.16478; Homo sapiens cDNA: FLJ21718 fis, clone C; 9.7
453160; AI263307; ; H2B histone family, member L; 9.7
432715; AA247152; Hs.200483; ESTs, Weakly similar to KIAA1074 protein; 9.7
450325; AI935962; Hs.91973; ESTs; 9.6
434170; AA626509; Hs.122329; ESTs; 9.6
428600; AW863261; Hs.138860; hypothetical protein DKFZp434K1421; 9.6
447033
401747 
434423 
411887 
457211 
415263 
432729 
421898 
432101 
451640 
428046 
420218 
434926 
424692 
417333 
452594 
446100 
400294 
416182 
431542 
447397 
432674 
456660 
443822 
451684 
437124 
416239 
432415 
424800 
428728 
418564 
442049 
429597 



AI357412 Hs.157601 

NM_006769 Hs.3844

AW182924 Hs.128790 

AW972565 Hs.32399 

AA948033 Hs.130853 

AK000292 Hs.130732

N62293 Hs.45107 

AI918950 Hs.123642 

AA195601 Hs.26771 

AW812795 Hs.337534 
AW958037 

BE543269 Hs.50252 

AA429834 Hs.151791

AL157545 Hs.173179 

AU076405 Hs.29981 

AW987109 Hs.13804

N95796 Hs.278695 
NM_004354 Hs.79069

H63010 Hs.5740 

BE247676 Hs.18442 

AA641092 Hs.257339 

AA909249 Hs.112282

AI087412 Hs.143611 

AF216751 Hs.26813 
AA554458

AL038450 Hs.48948 

T16971 Hs.289014 

AL035588 Hs.153203 



NM_016625 Hs.191381 
AA631143 Hs.278695 
AA310393 Hs.190044
NM_003816 Hs.2442 
X91662 Hs.66744 

443180 R15875 Hs.258576 

433285 AW975944 Hs.237396 

410870 U81599 Hs.66731 

418836 AI655499 Hs.161712 

444922 AI921750 Hs.144871 

431992 NM_002742 Hs.2891 

425905 AB032959 Hs.318584

432586 AA568548 

431325 AW026751 Hs.5794 

435047 AA454985 Hs.54973 

447476 BE293465 Hs.20880 

422728 AW937826 Hs.103262

426108 AA622037 Hs.166468 

433323 AA805132 Hs.159142 

423349 AF010258 Hs.127428 

416815 U41514 Hs.80120 

425154 NM_001851 Hs.154850

438869 AF075009 

418329 AW247430 Hs.84152 

452367 U71207 Hs.29279 

418601 AA279490 Hs.86368 

429769 NM_004917 Hs.218366 

436962 AW377314 Hs.5354 

425465 L18964 Hs.1904

450377 AB033091 

451418 BE387790 Hs.26369 

418250 U29926 Hs.83918 

416111 AA033813 Hs.79018 
426350 NM_003245 Hs.2022
440774 AI420611 Hs.153934 
413992 W26276 Hs.136075 
421928 AF013758 Hs.109643 
433293 AF007635 Hs.32417 
452340 NM_002202 Hs.505
451027 AW519204 Hs.40808 
436063 AK000028 
440749 W22335 HsJ392 
432103 T15803 Hs.272458
431958 X63629 Hs.2877
411019 AW993097 Hs.48617 
421566 NM_000399 Hs.1395
452744 AI267652 Hs.246107 
425259 AL049280 Hs.155397 
442772 AW503580 Hs.5957 
446045 AJ297436 Hs.20166 
428342 AI739168
453439 AI572438 Hs.32976 
433517 AW022133 Hs.189838 
424036 AA770688



ESTs 9.5

Homo sapiens keratin 17 (KRT17) 9.3

LIM domain only 4 9.3

ESTs 9.2

ESTs, Weakly similar to S51797 vasodilat 9.2

ESTs 9.2

hypothetical protein FLJ20285 9.2

ESTs 9.1

EphA3 9.1

Human DNA sequence from clone 747H23 on 9.0

ESTs, Moderately similar to I38022 hypot 8.9

ribosomal protein L4 8.8

mitochondrial ribosomal protein L32 8.6

KIAA0092 gene product 8.6

bromodomain and PHD finger containing, 3 8.6

solute carrier family 26 (sulfate transp 8.5

hypothetical protein dJ462023.2 8.5

Homo sapiens prostein mRNA, complete cds 8.5

cyclin G2 8.5

ESTs 8.4

E-1 enzyme 8.3

ESTs, Weakly similar to I38022 hypotheti 8.1

solute carrier family 30 (zinc transport 8.1

ESTs, Weakly similar to 2004399A chromos 8.1
CDA14 8.1

KIAA0666 protein 8.0

ESTs 8.0

ESTs, Weakly similar to A43932 mucin 2 p 7.9

MyoD family inhibitor 7.9

hypothetical protein 7.9

Homo sapiens prostein mRNA, complete cds 7.8

ESTs 7.8

a disintegrin and metalloproteinase doma 7.8

twist (Drosophila) homolog (acrocephalos 7.7

claudin 12 7.5

ESTs 7.5

homeo box B13 7.4

ESTs 7.4

Homo sapiens cDNA FLJ13752 fis, clone PL 7.3

protein kinase C, mu 7.3

novel C3HC4 type Zinc finger (ring finge 7.3

ESTs 7.3

ESTs, Weakly similar to 2109260A B cell 7.3

cadherin-like protein VR20 7.3

ESTs, Weakly similar to I38022 hypotheti 7.3

ESTs, Weakly similar to ZN91_HUMAN ZINC 7.2

programmed cell death 5 7.1

ESTs 7.1

homeo box A9 7.0

UDP-N-acetyl-alpha-D-galactosamine:polyp 6.9

collagen, type IX, alpha 1 6.9

gb:Homo sapiens full length insert cDNA 6.9

cystathionine-beta-synthase 6.8

eyes absent (Drosophila) homolog 2 6.8

calmegin 6.8

kallikrein 4 (prostase, enamel matrix, p 6.8

DKFZP564I052 protein 6.8

protein kinase C, iota 6.6

KIAA1265 protein 6.6

hypothetical protein FLJ20287 6.6

adenosine monophosphate deaminase (isofo 6.6

chromatin assembly factor 1, subunit A ( 6.5

transglutaminase 3 (E polypeptide, prote 6.5

ESTs 6.5

RNA, U2 small nuclear 6.4

polyadenylate binding protein-interactin 6.4

hypothetical protein MGC4309 6.4

ISL1 transcription factor, LIM/homeodoma 6.4

ESTs 6.4

ribosomal protein S24 6.3

hypothetical protein MGC3199 6.3

protein phosphatase 3 (formerly 2B), cat 6.3

cadherin 3, type 1, P-cadherin (placenta 6.2

Homo sapiens cDNA FLJ12540 fis, clone NT 6.2

early growth response 2 (Krox-20 (Drosop 6.2

Homo sapiens mRNA; cDNA DKFZp434E082 (fr 6.1

Homo sapiens mRNA; cDNA DKFZp564K143 (fr 6.1

Homo sapiens clone 24416 mRNA sequence 6.1

prostate stem cell antigen 6.0

Homo sapiens cDNA FLJ13458 fis, clone PL 6.0

guanine nucleotide binding protein 4 6.0

ESTs 6.0

H2A histone family, member L 5.9
420092 AA814043 Hs.88045

410268 AA316181 Hs.61635

425071 NM_013989 Hs.154424

441866 BE464341 Hs.21201

450244 AA007534 Hs.125062

429165 AW009886 Hs.118258

451952 AL120173 Hs.301663

429084 AJ001443 Hs.195614

445873 AA250970 Hs.251946

412935 BE267045 Hs.75064

415068 Z19448 Hs.131887

434826 AF155661 Hs.22265

415621 AI648602 Hs.55468

416653 AA768553 Hs.193145

433332 AI367347 Hs.44898

455497 AA112573 Hs.278695

418245 AA088767 Hs.83883

428593 AW207440 Hs.185973

425211 M18667 Hs.1867

456508 AA502764 Hs.123469

458809 AW972512 Hs.20985

410762 AF226053 Hs.66170

440594 AW445167 Hs.126036

413328 Y15723 Hs.75295

447805 AW627932 Hs.302421

434808 AF155108 Hs.256150

453308 AW959731 Hs.323099

421470 R27496 Hs.1378

418555 AI417215 Hs.87159

404632

450861 AI523898 Hs.17617

431583 AL042613 Hs.262476

400303 AA242758 Hs.79136

400290 H18836 Hs.31608

410037 AB020725 Hs.58009

410240 AI157424 Hs.61289
403046

419647 AA348947 Hs.91816

450203 AF097994

450164 AI239923 Hs.63931

417318 AW953937 Hs.240845

417153 X57010 Hs.81343

412350 AI659306 Hs.73826

456088 BE177320 Hs.156148

433852 AI378329 Hs.126629

452679 Z42387 Hs.83883

430387 AW372884 Hs.240770

416276 U41060 Hs.79136

441021 AW578716 Hs.7644

453171 R76472 Hs.65646

416795 AI497778 Hs.20509

427615 BE410107 Hs.179817

419440 AB020689 Hs.90419

422048 NM_012445 Hs.288126

453082 H18835 Hs.31608

420948 AB016898 Hs.100469

442592 BE566623 Hs.29899

421685 AF189723 Hs.105778

431724 AA514535 Hs.283704

423242 AL039402 Hs.125783

434485 AI623511 Hs.118567

407103 AA424881 Hs.256301

414085 AA114016 Hs.75746

422486 BE514492 Hs.117487

422083 NM_001141 Hs.111256

439735 AI635386 Hs.142846

420039 NM_004605 Hs.94581

449845 AW971183 Hs.6019

423445 NM_014324 Hs.128749

433730 AK002135 Hs.3542

426006 R49031 Hs.22627

451982 F13036 Hs.27373

415082 AA160000 Hs.137396

409799 D11928 Hs.76845

423583 AL122055 Hs.129836

452017 AF109302 Hs.27495

437162 AW005505 Hs.5464

451468 AW503398 Hs.293653

429467 NM_004477 Hs.203772

451752 AB032997

430294 AI538226 Hs.32976

452259 AA317439 Hs.28707

456362 AW973003 Hs.179909



ESTs 5.9

six transmembrane epithelial antigen of 5.8

deiodinase, iodothyronine, type II 5.8

nectin 3; DKFZP566B0846 protein 5.8

ESTs 5.8

prostate cancer associated protein 1 5.7

ESTs 5.7

splicing factor 3b, subunit 3, 130kD 5.7

poly(A)-binding protein, cytoplasmic 1-l 5.7

tubulin-specific chaperone c 5.7

ESTs, Weakly similar to T24396 hypotheti 5.7

pyruvate dehydrogenase phosphatase 5.7

ESTs 5.7

metallothionein 1E (functional) 5.6

Homo sapiens clone TCCCTA00151 mRNA sequ 5.5

Homo sapiens prostein mRNA, complete cds 5.5

transmembrane, prostate androgen induced 5.4

degenerative spermatocyte (homolog Droso 5.4

progastricsin (pepsinogen C) 5.4

ESTs, Weakly similar to AF208855 1 BM-01 5.4

sin3-associated polypeptide, 30kD 5.4

HSKM-B protein 5.4

ESTs 5.4

guanylate cyclase 1, soluble, alpha 3 5.3

gemin4 5.3

Homo sapiens, Similar to RIKEN cDNA 2810 5.3

ESTs 5.3

annexin A3 5.3

hypothetical protein FLJ12577 5.2

NM_022490:Homo sapiens hypothetical prot 5.1

ESTs 5.1

S-adenosylmethionine decarboxylase 1 5.1

LIV-1 protein, estrogen regulated 5.1

hypothetical protein FLJ20041 5.1

KIAA0918 protein 5.1

synaptojanin 2 5.0

NM_005656*:Homo sapiens transmembrane pr 5.0

hypothetical protein 5.0

L-kynurenine/alpha-aminoadipate aminotra 5.0

ESTs 5.0

ESTs 4.9

collagen, type II, alpha 1 (primary oste 4.9

protein tyrosine phosphatase, non-recept 4.9

hypothetical protein FLJ13231 4.9

ESTs 4.9

transmembrane, prostate androgen induced 4.9

nuclear cap binding protein subunit 2, 2 4.9

LIV-1 protein, estrogen regulated 4.8

H1 histone family, member 2 4.8

ESTs 4.8

HBV pX associated protein-8 4.8

CGI-82 protein, PSDR1 4.8

KIAA0882 protein 4.8

spondin 2, extracellular matrix protein 4.8

hypothetical protein FLJ20041 4.8

myeloid/lymphoid or mixed-lineage leukem 4.8

ESTs, Weakly similar to G02075 transcrip 4.8

ATPase, Ca transporting, type 2C, member 4.7

ESTs 4.7

DEME-6 protein 4.7

ESTs 4.7

hypothetical protein MGC13170 4.6

aldehyde dehydrogenase 1 family, member 4.6

gene near HD on 4p16.3 with homology to 4.6

arachidonate 15-lipoxygenase, second typ 4.6

hypothetical protein 4.6

sulfotransferase family, cytosolic, 2B, 4.6

DnaJ (Hsp40) homolog, subfamily C, membe 4.5

alpha-methylacyl-CoA racemase 4.5

hypothetical protein FLJ11273 4.5

ESTs 4.5

Homo sapiens mRNA; cDNA DKFZp56401763 (f 4.5

ESTs, Weakly similar to JC5238 galactosy 4.5

phosphoserine phosphatase-like 4.5

KIAA1028 protein 4.4

prostate cancer associated protein 7 4.4

thyroid hormone receptor coactivating pr 4.4

ESTs, Moderately similar to I38022 hypot 4.4

FSHD region gene 1 4.4

KIAA1171 protein 4.3

guanine nucleotide binding protein 4 4.3

signal sequence receptor, gamma (translo 4.3

hypothetical protein FLJ22995 4.3
449048 Z45051 Hs.22920
440300 N39760 Hs.8859
427308 D26067 Hs.174905
435706 W31254 Hs.7045
416854 H40164 Hs.80296
451406 AI694320 Hs.6295
410076 T05387 Hs.7991
410275 U85658 Hs.61796
419239 AA468183 Hs.184598
442501 AA315267 Hs.23128
436761 AI817776 Hs.236557
420380 AA640891 Hs.102406
436556 AI364997 Hs.7572
458440 AI095468 Hs.135254
446416 AV658299 Hs.163959
417873 BE266659 Hs.293659
430598 AK001764 Hs.247112
421305 BE397354 Hs.324830
429299 AI620463 Hs.347408
407894 AJ278313 Hs.41143
414664 AA587775 Hs.66295
408418 AW963897 Hs.44743
430945 U80669 Hs.55999
432363 AA534469
447574 AF162666 Hs.18895
447595 AW379130 Hs.18953
453857 AL080235 Hs.35861
444108 R55784 Hs.140942
422890 Z43784
417379 AA196390
418818 AA228899 Hs.101307
410102 AW248508 Hs.279727
418821 AA436002 Hs.183161
438825 BE327427 Hs.79953
433233 AB040927 Hs.301804
415276 U88666 Hs.78353
440146 AW014231 Hs.90790
407819 R42185
450402 BE218027 Hs.89969
432527 AW975028 Hs.102754
419733 AW362955
422134 AW179019 Hs.112110
432435 BE218886 Hs.282070
450680 AF131784 Hs.25318
436420 AA443966 Hs.31595
430122 NM_013342 Hs.233765
411031 W37943 Hs.34892
429259 AA420450 Hs.292911
407813 AL120247 Hs.40109
417067 AJ001417 Hs.81086
410227 AB009284 Hs.61152
417958 AA767382 Hs.193417
427176 AW381569 Hs.40334
448826 AI580252 Hs.293246
447956 AA340605 Hs.105887
432675 AI791855
419713 AW968058 Hs.92381
427479 BE410092 Hs.178471
443162 T49951 Hs.9029
413950 AA249096 Hs.32793
439963 AW247529 Hs.6793
419083 AI479560 Hs.98613
448148 NM_016578 Hs.20509
419465 AW500239 Hs.21187
432274 AK000382 Hs.274251
447620 AW290951
430589 AJ002744 Hs.246315
418004 U37519 Hs.87539
416655 AW968613 Hs.79428
459284 AF155660 Hs.300496
421829 AB018330 Hs.108708
437252 AI433833 Hs.164159
437967 BE277414 Hs.5947
417061 AI675944 Hs.188691
423337 NM_004655 Hs.127337
418838 AW385224 Hs.35198
453469 AB014533 Hs.33010
408063 BE086548 Hs.42346
422072 AB018255 Hs.111136
420297 AI628272 Hs.88323
420522 AW957137 Hs.98541
408833 AW612232 Hs.254835
426647 AA243464 Hs.294101


similar to S68401 (cattle) glucose induc

Homo sapiens, Similar to RIKEN cDNA 5830

KIAA0033 protein

GL004 protein

Purkinje cell protein 4

ESTs, Weakly similar to T17248 hypotheti

ESTs

transcription factor AP-2 gamma (activat

Homo sapiens cDNA: FLJ23241 fis, clone C

ESTs

ESTs

ESTs

ESTs

Homo sapiens clone 1 thrombospondin mRNA
ESTs

Homo sapiens, Similar to RIKEN cDNA A430
hypothetical protein FLJ10902
diptheria toxin resistance protein requi
hypothetical protein MGC13102
phosphoinositide-specific phospholipase
multi-PDZ-domain-containing protein
KIAA1435 protein

NK homeobox (Drosophila), family 3, A

gb:nf76g1U1 NCI_CGAP_Co3 Homo sapiens

tousled-like kinase 1

phosphodiesterase 9A

DKFZP586E1621 protein

ESTs

ankyrin 3, node of Ranvier (ankyrin G)
gb:zp99b10.s1 Stratagene muscle 937209 H
Homo sapiens HUT11 protein mRNA, partial
ESTs; homologue of PEM-3 [Ciona savignyi
ESTs
ESTs

KIAA1494 protein

SFRS protein kinase 2

Homo sapiens cDNA: FLJ22930 fis, clone K

ESTs

ESTs

ESTs

Homo sapiens cDNA FLJ14415 fis, clone HE
mitochondrial ribosomal protein L42
ESTs

Homo sapiens clone 25194 mRNA sequence
ESTs

TCF3 (E2A) fusion partner (in childhood
KIAA1323 protein
Plakophilin
KIAA0872 protein

solute carrier family 22 (extraneuronal
exostoses (multiple)-like 2
ESTs
ESTs

ESTs, Weakly similar to putative p150 (H
ESTs, Weakly similar to Homolog of rat Z
ESTs

nudix (nucleoside diphosphate linked moi
KIAA0798 gene product
DKFZP434G032 protein
ESTs

platelet-activating factor acetylhydrola
Homo sapiens cDNA FLJ12292 fis, clone MA
HBV pX associated protein-8
Homo sapiens cDNA: FLJ23068 fis, clone L
hypothetical protein FLJ20375; KIAA1797
ESTs

UDP-N-acetyl-alpha-D-galactosamine:polyp
aldehyde dehydrogenase 3 family, member
BCL2/adenovirus E1B 19kD-interacting pro
mitochondrial solute carrier
calcium/calmodulin-dependent protein kin
ESTs, Weakly similar to ALU1_HUMAN ALU S
mel transforming oncogene (derived from
Homo sapiens cDNA FLJ12033 fis, clone HE
axin 2 (conductin, axil)
ectonucleotide pyrophosphatase/phosphodi
KIAA0633 protein

calcineurin-binding protein calsarcin-1
KIAA0712 gene product
ESTs, Weakly similar to ALU1_HUMAN ALU S
hypothetical protein
ESTs

pre-B-cell leukemia transcription factor
456177 NM_012391 Hs.79414
425689 W16480 Hs.24283
432370 AA308334 Hs.274424
422424 AI186431 Hs.296638
446157 BE270828 Hs.131740
433339 AF019226 Hs.8036
421887 AW161450 Hs.109201
435114 AA775483 Hs.288936
412520 AA442324 Hs.795
436476 AA326108 Hs.33829
441224 AU076964 Hs.7753
404922
432278 AL137506 Hs.274256
452747 BE153855 Hs.61460
429686 AI871613 Hs.28538
408001 AA046458 Hs.95296
428330 L22524 Hs.2256
418113 AI272141 Hs.83484
428055 AA420564 Hs.101760
418827 BE327311 Hs.47166
431103 M57399 Hs.44
427701 AA411101 Hs.243886
419168 AI336132 Hs.33718
409151 AA306105
439671 AW162840 Hs.6641
452721 AJ269529 Hs.301871
449933 AW157098 Hs.324104
432615 AA557191 Hs.55028
443884 N20617 Hs.194397
403752
427723 AI355260 Hs.279789
421878 AA299652 Hs.111496
440494 BE618768 Hs.7232
426716 NM_006379 Hs.171921
433647 AA603367 Hs.222294
407137 T97307
432886 BE159028 Hs.279704
427715 BE245274 Hs.180428
437617 AI026701 Hs.5716
447818 W79940 Hs.21906
414407 AA147026 Hs.76704
421537 BE383488 Hs.105547
435655 AW105663 Hs.6947
427871 AW992405 Hs^9622
421662 NM_014141 Hs.106552
407204 R41933 Hs.140237
407908 BE379758 Hs.110853
407846 AA426202 Hs.40403
420568 F09247 Hs.247735
436873 N23874 Hs.50477
411145 BE439553 Hs.250528
435017 AA336522 Hs.12854
435020 AW505076 Hs.301855
443991 NM_002250 Hs.10082
409960 BE261944
433891 AA613792
430389 AL117429 Hs.240845
432242 AW022715 Hs.162160
419972 AL041465 Hs.182982
445707 AI248720 Hs.114390
412628 AI972402 Hs.306051
425263 NM_001197 Hs.155419
419879 Z17805 Hs.93564
433345 AI681545 Hs.152982
417515 L24203 Hs.82237
414922 D00723
437898 W81260 Hs.43410
426126 AL118747 Hs.26691
422089 AA523172 Hs.103135
450649 NM_001429 Hs.25272
404210
434954 AF161455 Hs.284295
401519
419517 AF052107 Hs.90797
431585 BE242803 Hs.262823
427461 AA531527 Hs.332040
453288 AW5S3292 Hs.274412
412513 AA322599 Hs.5163
432579 AF043244 Hs.278439
439593 BE073597 Hs.124863
451945 BE504055 Hs.211420
434614 AI249502 Hs.29669
445525 BE149866 Hs.14831



prostate epithelium-specific Ets transcr

ESTs, Moderately similar to reduced expr

N-acetylneuraminic acid phosphate syntha

prostate differentiation factor

Homo sapiens cDNA: FLJ22562 fis, clone H

glioblastoma overexpressed 

CGI-86 protein

mitochondrial ribosomal protein L9
H2A histone family, member O
bHLH protein DEC2
calumenin

NM_003071:Homo sapiens SWI/SNF related,

hypothetical protein FLJ23563

Ig superfamily receptor LNIR

Homo sapiens cDNA: FLJ21086 fis, clone C

ESTs

matrix metalloproteinase 7 (matrilysin,
SRY (sex determining region Y)-box 4 
ESTs 
HT021 

pleiotrophin (heparin binding growth fac
nuclear autoantigenic sperm protein (his
Homo sapiens cDNA FLJ12641 fis, clone NT
SEC22, vesicle trafficking protein (S. c
kinesin family member 5C
solute carrier family 37 (glycerol-3-pho
Human DNA sequence from clone RP1-63M2 o
ESTs, Weakly similar to I54374 gene NF2
leptin receptor

NM_002753*:Homo sapiens mitogen-activate
histone deacetylase 3

Homo sapiens cDNA FLJ11643 fis, clone HE
acetyl-Coenzyme A carboxylase alpha 
sema domain, immunoglobulin domain (Ig), 
ESTs 

gb:ye53h05.s1 Soares fetal liver spleen 

chromatin accessibility complex 1 

KIAA1181 protein

KIAA0310 gene product

Homo sapiens clone 24670 mRNA sequence

ESTs 

neural proliferation, differentiation an 
HSPC069 protein 

Homo sapiens, clone IMAGE:3507281, mRNA, 
cell recognition molecule Caspr2 
ESTs, Weakly similar to ALU1_HUMAN ALU S
uncharacterized hematopoietic stem/proge
Cbp/p300-interacting transactivator, wit
protocadherin alpha 10
RAB27A, member RAS oncogene family
Homo sapiens, clone IMAGE:4098694, mRNA,
angiotensin II, type 1 receptor-associat
DiGeorge syndrome critical region gene 8
potassium intermediate/small conductance
hexokinase 1

gb:no97h03.s1 NCI_CGAP_Pr2 Homo sapiens

DKFZP434D146 protein

ESTs, Weakly similar to ALU4_HUMAN ALU S 

golgin-67 

ESTs 

hypothetical protein MGC2648 
BCL2-interacting killer (apoptosis-induc
Homer, neuronal immediate early gene, 2
hypothetical protein FLJ13117
ataxia-telangiectasia group D-associated
glycine cleavage system protein H (amino 
ESTs 
ESTs 

ESTs, Weakly similar to SFR4_HUMAN SPLIC

E1A binding protein p300 

NM_005936:Homo sapiens myeloid/lymphoid 

Homo sapiens HSPC337 mRNA, partial cds 

C15000476*:gi|12737279|ref|XP_012163.1|

Homo sapiens clone 23620 mRNA sequence

hypothetical protein FLJ10326

hypothetical protein MGC13010

similar to yeast Upf3, variant A

ESTs, Weakly similar to AF151840 1 CGI-8 

nucleolar protein 3 (apoptosis repressor 

ESTs 

ESTs 

ESTs 

Homo sapiens, Similar to zinc finger pro 
419991 AJ000098 Hs.94210 eyes absent (Drosophila) homolog 1 3.2

408101 AW968504 Hs.123073 CDC2-related protein kinase 7 3.2

425810 AI923627 Hs.31903 ESTs 3.2

443123 AA094538 Hs.272808 putative transcription regulation nuclea 3.2

447439 AA313565 Hs.145020 ESTs, Weakly similar to KIAA1205 protein 3.2

452167 N75238 Hs.13075 Homo sapiens cDNA: FLJ23013 fis, clone L 3.1

440789 AB007857 Hs.7416 KIAA0397 gene product 3.1

436087 BE300296 Hs.5054 CGI-133 protein 3.1

414222 AL135173 sorbitol dehydrogenase 3.1

419749 X73608 Hs.93029 sparc/osteonectin, cwcv and kazal-like d 3.1

418559 AA225048 Hs.104207 ESTs 3.1

408784 AW971350 Hs.63386 ESTs 3.1

424285 BE207168 Hs.144630 nuclear receptor subfamily 2, group F, m 3.1

417193 AI922189 Hs.288390 hypothetical protein FLJ22795 3.1

428738 NM_000380 Hs.192803 xeroderma pigmentosum, complementation g 3.1

435489 AJ272269 Hs.121429 zinc-binding protein Rbcc728 3.1

433006 BE242758 Hs.190223 ESTs, Moderately similar to T29285 hypot 3.1

426991 AK001536 Homo sapiens cDNA FLJ10574 fis, clone NT 3.1

445943 AW898533 Hs.181574 ESTs 3.1

434194 AF119847 Homo sapiens PRO1550 mRNA, partial cds 3.1

432426 AW973152 Hs.31050 ESTs 3.1 

416974 AF010233 Hs.80667 RALBP1 associated Eps domain containing 3.1 

451131 AI267586 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 3.1

423453 AW450737 Hs.128791 CGI-09 protein 3.1

417215 BE253181 Hs.81687 non-metastatic cells 3, protein expresse 3.1

436278 BE396290 Hs.5097 synaptogyrin 2 3.1

432908 AI861896 ESTs 3.1

432952 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MA 3.1

438705 AI049624 Hs.283390 ESTs, Weakly similar to 2109260A B cell 3.1

427982 NM_016156 Hs.181326 KIAA1073 protein 3.1

431578 AB037759 Hs.261587 GCN2 eIF2alpha kinase 3.1

409757 NM_001898 Hs.123114 cystatin SN 3.1

407136 T64896 Hs.287420 Homo sapiens cDNA FLJ11533 fis, clone HE 3.1

434293 NM_004445 Hs.3796 EphB6 3.1

410001 AB041036 Hs.57771 kallikrein 11 3.1

434970 AW272262 Hs.225767 ESTs 3.1 

432800 BE391046 Hs.278962 AIM-1 protein 3.1 

423392 AA195037 Hs.169341 HTPAP protein 3.1 

432205 AI806583 Hs.125291 ESTs 3.1 

448807 AI571940 Hs.7549 ESTs 3.1

439979 AW600291 Hs.6823 hypothetical protein FLJ10430 3.1

435561 AA351978 Hs.4943 hepatocellular carcinoma associated prot 3.0

427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 3.0

410082 AA081594 Hs.158311 Musashi (Drosophila) homolog 1 3.0

432621 AI298501 Hs.21192 ESTs, Weakly similar to T46428 hypotheti 3.0

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 3.0

452908 AB001451 Hs.30965 neuronal Shc adaptor homolog 3.0

426030 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 3.0

407688 W25317 Hs.37616 Human D9 splice variant B mRNA, complete 3.0

434958 T99949 Hs.303428 Homo sapiens cDNA FLJ14832 fis, clone OV 3.0

418610 AW245993 Hs.223394 hypothetical protein MGC2742 3.0

451815 AW974911 Hs.184793 Homo sapiens cDNA: FLJ21880 fis, clone H 3.0

433556 W56321 Hs.111460 calcium/calmodulin-dependent protein kin 3.0

413010 AA393273 Hs.75133 transcription factor 6-like 1 (mitochond 3.0

449459 BE546846 Hs.195048 ESTs 3.0

438523 H66220 Hs.278177 ESTs 3.0 

434263 N34895 Hs.44648 ESTs 3.0 

446825 BE266822 Hs^44097 filamin A, alpha (actin-binding protein- 3.0

408681 AW953853 Hs.292833 ESTs, Weakly similar to I38022 hypotheti 3.0

429966 BE081342 Hs.283037 HSPC039 protein 3.0

412652 AI801777 ESTs 3.0

442409 BE208843 Hs.129544 hypothetical protein MGC15438 3.0

450832 AW970602 Hs.105421 ESTs 3.0

440074 AA863045 Hs.10669 ESTs, Weakly similar to T00050 hypotheti 3.0

436032 AA150797 Hs.109276 latexin protein 3.0

419594 AA013051 HsJ1417 topoisomerase (DNA) II binding protein 3.0

421674 T10707 Hs.296355 hypothetical protein FLJ23138 2.9

432302 AA345857 Hs.274307 KIAA1442 protein 2.9

415172 AF079529 Hs.78106 phosphodiesterase 8B 2.9

412926 AI879076 Hs.75061 macrophage myristoylated alanine-rich C 2.9

413142 M81740 Hs.75212 ornithine decarboxylase 1 2.9

437179 AA393508 serologically defined colon cancer antig 2.9

418372 AA311833 Hs.84318 replication protein A1 (70kD) 2.9

439609 AW971945 Hs.293236 ESTs 2.9

427398 AW390020 Hs.20415 chromosome 21 open reading frame 11 2.9

447958 AW796524 Hs.68644 Homo sapiens microsomal signal peptidase 2.9

409340 BE174629 Hs.321130 hypothetical protein MGC2771 2.9

420061 AW024937 Hs.29410 ESTs 2.9

431663 NM_016569 Hs.267182 TBX3-iso protein 2.9

417622 AW298163 Hs.82318 WAS protein family, member 3 2.9

420344 BE463721 Hs.97101 putative G protein-coupled receptor 2.9

446791 AI632278 Hs.195922 ESTs 2.9

433313 W20128 Hs.296039 ESTs 2.9
419829

437396 

413125 

401785 

433495 

452099 

437296 

421254 

445109 

423551 

421154 

410193 

435854

423396 

442202 

441345 

444367 

422522 

443337 

418166 

448734 

413550 

426170 

444101 

425320 

431631 

448804

431416 

447881 

445309 

453186 

439778 

425010 

432840 

450546 

431674 

422017 

443181 

448913 

440193 

452941 

451338 

411190 

434629 

407192 

434747 

428171 

426657 

447147 



407748 

445596 

452268 

446494 

438157 

452260 

409648 

401666 

400301 

453390 

426514 

425294 

405387 

441266 

432388 

450937 

447210 

445098 

451404 

409650 

452707 

451134 

452826 

420818 

438510

406627 

442315 

421717 

425910 



ESTs 2.9 

cryptic gene 2.9 

hypothetical protein FLJ11210 2.9

Homo sapiens cDNA FLJ14934 fis, clone PL 2.9

ESTs, Moderately similar to PC4259 ferri 2.9

hypothetical protein DKFZp762O076 2.9

glyoxalase I 2.9

NM_002275*:Homo sapiens keratin 15 (KRT1 2.9

alpha-2-glycoprotein 1, zinc 2.9

hypothetical protein FLJ10607 similar to 2.9

KIAA1700 2.9

coat protein gamma-cop 2.9

ectonucleoside triphosphate diphosphohyd 2.9

ESTs 2.9

Homo sapiens cDNA FLJ14269 fis, clone PL 2.9

zinc finger protein 281 2.9

putative ankyrin-repeat containing prote 2.9

bromodomain-containing 1 2.8

hypothetical protein FLJ22625 2.8

Homo sapiens mRNA; cDNA DKFZp564A072 (fr 2.8

hypothetical protein FLJ22390 2.8

ESTs, Highly similar to S94541 1 clone 4 2.8

non-metastatic cells 4, protein expresse 2.8

Cdc42 effector protein 3 2.8 

Homo sapiens mRNA; cDNA DKFZp564H1916 (f 2.8 

MSTP043 protein 2.8 

methylcrotonoyl-Coenzyme A carboxylase 2 2.8 

ribosomal protein L32 2.8

fatty acid synthase 2.8

ESTs 2.8

ADP-ribosylation factor-like 5 2.8

ESTs 2.8

GCN1 (general control of amino-acid synt 2.8

likely ortholog of mouse Arkadia 2.8

hypothetical protein FLJ10846 2.8

putative transmembrane protein 2.8 

ESTs 2.8 

hypothetical protein FLJ20530 2.8

ESTs 2.7

G-protein coupled receptor 2.7

STAT induced STAT inhibitor-2 2.7

ESTs 2.7

myosin VI 2.7

Homo sapiens clone 24923 mRNA sequence 2.7

N-sulfoglucosamine sulfohydrolase (sulfa 2.7

transcription factor Dp-2 (E2F dimerizat 2.7

protein kinase C-like 2 2.7

glioma-amplified sequence-41 2.7

gb:af12e02.s1 Soares_testis_NHT Homo sap 2.7

ESTs 2.7

ribosomal protein L35 2.7

solute carrier family 14 (urea transport 2.7

ESTs, Weakly similar to T23482 hypotheti 2.7

transmembrane 7 superfamily member 2 2.7

KIAA0606 protein; SCN Circadian Oscillat 2.7

vesicle trafficking protein 2.7

H2A histone family, member L 2.7

AA463276 Hs.288906 WW Domain-Containing Gene 2.7

AW137011 Hs.49576 ESTs 2.7

RAB9, member RAS oncogene family 2.7

ESTs 2.7 

Target Exon 2.7 

estrogen receptor 1 2.7 

ESTs 2.7 

bone morphogenetic protein 7 (osteogenic 2.7 

HNK-1 sulfotransferase 2.7 

NM_022170*:Homo sapiens Williams-Beuren 2.7

Homo sapiens, clone IMAGE:3502329, mRNA, 2.7

v-ski avian sarcoma viral oncogene homol 2.7

ATP-dependant interferon response protei 2.7

phosphatidylserine-specific phospholipas 2.7

DKFZP566B183 protein 2.7

ESTs, Weakly similar to T17248 hypotheti 2.7

HSCARG protein 2.7

ESTs 2.7

hypothetical protein FLJ22195 2.7

peroxisomal biogenesis factor 6 2.7

ESTs, Weakly similar to 1207289A reverse 2.7

DKFZP586P0123 protein 2.7

ESTs 2.7

ESTs, Moderately similar to ZN91_HUMAN Z 2.7

divalent cation tolerant protein CUTA 2.6 

2.6 



AW970976 Hs.293653 

AI693815 Hs.127179

BE245374 Hs.27842

AI811807 Hs.108646

AI924228 Hs.115185

BE140396 Hs.21621

BE244589 Hs.75207 

AW373784 Hs.71 
BE612992 Hs.27931 
AA350994 Hs.20281 
AK001724 Hs.102950 
AF039916 Hs.12330 
AA327598 Hs.89633 
AA284333 Hs.287631 
AJ132592 Hs.59757 
AJ278120 Hs.4996 
AI382555 Hs.127950
BE272862 Hs.106534 
AW068579 Hs.7780 
H54892 Hs.10974 
AI023428 Hs.34549 
Y07604 Hs.9235 
AI754416 

BE614070 Hs.326416 
W03011 Hs.306881 
BE161065 Hs.167531 
R19175 Hs.169793 
U29344 Hs.83190 
AA548906 Hs.122244
AW512213 Hs.342849
AA532718
BE620886

AL157474 Hs.12504 
AK001708 Hs.32271 
AL109729 Hs.99364 
T16837 Hs.4241 
AK001403 Hs.279521
AA010200 Hs.175551
AA098901 Hs.301642
NM_003877 Hs.110776
AI039201 Hs.283316 
AA194422 Hs.22564 
AW902312 Hs.7037 
AL110347 Hs.31074 
AW612322 Hs.19131 
AA306342 Hs.69171 
AA789081 Hs.4029 
AA609200 
AA837085 

AA489323 Hs.182825 
NM_015865 Hs.171731
AA910353 Hs.75432 
AW390282 Hs.31130 
AL079409 Hs.38176 
R89543 Hs.12942 
NM_003512 Hs.28777 



AA453208 Hs.330994 

AW451449 Hs.57749 

X03635 Hs.1657 

AA862496 Hs.28482 

BE616633 Hs.170195 

AF033827 Hs.155553 

H15968 Hs.293845 

X15218 Hs.2969 

R49131 Hs^6267 

AF035269 Hs.17752 

AL050272 Hs.12305 

AA460775 Hs.6295 

T08490 Hs.288969 

AI093823 Hs.45070

AA318315 Hs.25999

BE245286 Hs.301636

AW969635 Hs.33032 

AL080220 Hs.6285 

T64904 Hs.163780 

AA173992 Hs.7956 

AF230924 Hs.107187 



AA830797 Hs.184760 CCAAT-box-binding transcription factor
440695
416941 
422717 

426110 

422027 

401197 

442607 

410297 

447963 

433101 

437546 

443725 

436213 

408157 

420805 

454144 

426761 

428695 

431725 

425174 

410467 

413435 

428293 

428180 

439680 

424395 

406670 

413762 

429922 

449518 

445919 

429343 

438552 

431472 

409299 

434672 

443015 

452576 

412843 

430375 

408196 

424339 

416292 

451040 

445636 

419175 

422000 

441128 

446237 

431797 

406789 

423217 

423115 

420460 

436165 

430542 

452827 

431609 

431108 

410310 

430333 

439864 

438821 

421091 

439414 

400263 

451428 

449051 

431615 

433037 

409504 

409330 

436299 

450628 

426793 

431122 

407236 

421109 

408770 

416737 

414869 

431222 

403532 



AW088363 Hs.246240 ESTs 2.6 

BE000150 Hs.48778 niban protein 2.6 

AJ557623 Hs.119475 cold inducible RNA-binding protein 2.6

NM_002913 Hs.165563 replication factor C (activator 1) 1 (14 2.6

AL043100 Hs.306319 fatty acid amide hydrolase 2.6

ENSP00000229263*:HSPC213. 2.6

AA507576 Hs.288361 Homo sapiens cDNA: FLJ22696 fis, clone H 2.6

AA148710 lumican 2.6

AI452973 Hs.165900 ESTs, Weakly similar to ALU1_HUMAN ALU S 2.6

AW572317 Hs.12082 Homo sapiens mRNA; cDNA DKFZp566L203 (fr 2.6

AW074836 Hs.173984 T-box 1 2.6

AW245680 Hs.9701 growth arrest and DNA-damage-inducible, 2.6

AA325512 Hs.71472 hypothetical protein FLJ10774; KIAA1709 2.6

AA047685 Hs.62946 ESTs 2.6

L10333 Hs.99947 reticulon 1 2.6

BE280478 Hs.182695 hypothetical protein MGC3243 2.6

AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f 2.6

AI355647 Hs.189999 purinergic receptor (family A group 5) 2.6

X65724 Hs.2839 Norrie disease (pseudoglioma) 2.6

D87450 Hs.154978 KIAA0261 protein 2.6

AF102546 Hs.63931 dachshund (Drosophila) homolog 2.6 

X51405 Hs.75360 carboxypeptidase E 2.6 

BE250944 Hs.183556 solute carrier family 1 (neutral amino a 2.6

AI129767 Hs.182874 guanine nucleotide binding protein (G pr 2.6

AW245741 Hs.58461 ESTs, Weakly similar to A35659 krueppel- 2.6

AA165082 Hs.146388 microtubule-associated protein 7 2.6

W79632 Hs.256301 hypothetical protein MGC13170 2.6

AW411479 Hs.846 FK506-binding protein 4 (59kD) 2.6

Z97630 Hs.226117 H1 histone family, member 0 2.6

BE395253 Hs.30861 ESTs 2.6

T53519 Hs.334692 hypothetical protein MGC14141 2.6

AK000785 Hs.199480 Homo sapiens, Similar to epsin 3, clone 2.6 

AJ245820 Hs.6314 type I transmembrane receptor (seizure-r 2.6 

AK001023 Hs.256549 nucleotide binding protein 2 (E.coli Min 2.6 

AA045650 Hs.53125 small nuclear ribonucleoprotein D2 polyp 2.6

AW294020 Hs.117721 ESTs 2.6

R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 2.6

AB023177 Hs.29900 KIAA0960 protein 2.6

AF007555 Hs.74624 protein tyrosine phosphatase, receptor t 2.6

AW371048 Hs.93758 H4 histone family, member H 2.6

AL034548 Hs.43627 SRY (sex determining region Y)-box 22 2.6

BE257148 endoglycan 2.6

AA179233 Hs.42390 nasopharyngeal carcinoma susceptibility 2.6

AA324743 Hs.40808 ESTs 2.6

AW105401 ribosomal protein U9 2.6

AW270037 KIAA0779 protein 2.6

M30599 Hs.110637 homeobox A10 2.6

AA570256 ESTs, Weakly similar to T23273 hypotheti 2.6

AW270515 Hs.149596 Homo sapiens, Similar to RIKEN cDNA 2310 2.6

BE169641 Hs.270134 hypothetical protein FLJ20280 2.5

AI041403 ribosomal protein L29 2.5

NM_000094 Hs.1640 collagen, type VII, alpha 1 (epidermolys 2.5

AA421973 Hs.169119 ESTs, Weakly similar to T25731 hypotheti 2.5 

AA262331 Hs.48376 Homo sapiens clone HB-2 mRNA sequence 2.5 

AI373544 Hs.331328 intermediate filament protein syncoilin 2.5

AI557486 Hs.119122 ribosomal protein L13a 2.5

AI571835 Hs.55468 ESTs 2.5

AW792792 Hs.264330 N-acylsphingosine amidohydrolase (acid c 2.5

AA991508 Hs.105317 ESTs 2.5

J02931 Hs.62192 coagulation factor III (thromboplastin, 2.5

S70114 Hs^39489 TIA1 cytotoxic granule-associated RNA-bi 2.5

AI720078 Hs^91997 ESTs, Weakly similar to A47582 B-cell gr 2.5

AA826425 Hs.192375 ESTs 2.5

W22821 ribosomal protein L26 2.5

NM_001183 Hs.6551 ATPase, H transporting, lysosomal (vacuo 2.5

Eos Control 2.5

Hs.11067 ESTs, Highly similar to T46395 hypotheti 2.5

Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.5

Hs.235860 ESTs 2.5

Hs.279938 HSPC067 protein 2.5

Hs.699 peptidylprolyl isomerase B (cyclophilin 2.5

Hs.53940 hypothetical protein FLJ10369 2.5

Hs.5111 hypothetical protein FLJ20729 2.5

Hs.204715 ESTs 2.5

Hs.172350 HIR (histone cell cycle regulation defec 2.5

Hs.250535 Homo sapiens mRNA; cDNA DKFZp434N2412 (f 2.5

Hs.173980 nuclear matrix protein NMP200 related to 2.5

Hs.101842 AT-binding transcription factor 1 2.5

Hs.170195 bone morphogenetic protein 7 (osteogenic 2.5

Hs.79691 LIM domain protein 2.5

Hs.21479 ubinuclein 1 2.5

Hs.273790 zona pellucida glycoprotein 3A (sperm re 2.5

NM_024638:Homo sapiens hypothetical prot 2.5 
AW083384

AW961400 

AW295859 

NM_014158

AA304961 

AK001231 

AK000767 

AW382884

X89887 

AI267593 

W79485 

L32832 

AW270608 

AF154335 

M157291 

X56777 
hypothetical protein MGC13170
NG22 protein
449458 AI805078 Hs.208261 ESTs 2.5
428013 AF151020 Hs.181444 hypothetical protein 2.5
424369 R87622 Hs.26714 KIAA1831 protein 2.5
431604 Hs.47234 ESTs 2.5
TABLE 67B



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: GenBank accession numbers
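
The three columns defined above can be read programmatically once a row has been cleaned up. The following is only a minimal sketch under stated assumptions: it presumes each cluster has been joined onto a single whitespace-delimited line and that the CAT number is written with an underscore (as in 7392_2). The names `ClusterRow` and `parse_cluster_row` are hypothetical and are not part of the original disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    pkey: int              # unique Eos probeset identifier number
    cat_number: str        # gene cluster number, kept as text (e.g. "52134_1")
    accessions: List[str]  # GenBank accession numbers belonging to the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split one whitespace-delimited table row into its three columns."""
    fields = line.split()
    if len(fields) < 2:
        raise ValueError(f"expected at least a Pkey and a CAT number: {line!r}")
    # First field is the Pkey, second the CAT number, the rest are accessions.
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


# Illustrative row, assuming the CAT number is normalised to "52134_1".
row = parse_cluster_row("438869 52134_1 AF075009 R63109 R63068")
print(row.pkey, row.cat_number, len(row.accessions))
```

Keeping the CAT number as a string avoids guessing how the suffix after the underscore should be interpreted.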



Pkey CAT Number Accession 

415989 10194_1 BC013389 BC017398 AI023543 AA191424 AI267700 AI469633 AW958465 AW953397 AA172056 BE940298 BF909208 BF909980 BF095153
BG285837 AI720344 BF541715 AA355086 AA172236
429220 15103 7 AW341473 AA448195 AW207206 AI951341 AA969259

412446 63467_1 BC021735 AI669212 AL120184 AI769949 BE701002 BE184363 BE819031 BG702238 BF090049 BF963318 BF951912 BF943013 AA934514
AA151245 BF960659 AA987907 Z41449 BF908059 BF908053 BF908049 BE699424 BF908060 BF962832 BF952020 BF963134 BI035538
BF908052 BF908057 BF090026 BF943158 AI632924 BF512340 BF952021 BF960776 BF943437 BF942847 AI768015 F09778 F04816 F02721
AA102645 AI633838 AA617929 BF947001 BI035448 BE935876 AW890837 AW898604 BF957405 BF963433 BG704815

433404 7392_1 BC022980 BF224081 BG149908 AW672842 BE670687 AI702161 AW341832 BE222503 N71836 AI026061 AW953116 AW083132 AI979261
AW25377 AI423298 AI640707 AW675518 AI032611 AI818044 AI299508 AI911386 AI270418 BE219257 BM141826 AA826491 Z25159 AA587421
N59447 Z39436 T32982 R54110 BF115783 F09044 BF808433

449625 249224_1 BI918168 AW779760 N48674 AI375997 AA235370 BG699146 AJ913631 AI498402 AI016320 AA323193 R49021 D59344 BG986750 N45526
BG986917 T61382 R49391 R45432 AI203107 R35004 F07491 R25094 R35360

453160 6028_5 BC009612 NM_003526 BI597616 AV761592 AV760377 AL601008 BI604131 BE645918 BG187760 BG181525 BG210634 BG192999 AI263307
AA344186 AW952966 AA033609 AA037562 AA722183 R79452 H70775 BF674991 BE769437 BG007856 AA037483 AW572535 AJ143991
AA084561 AA033610 AV742510 AV735788 R08336

420218 191547_1 AW958037 R42557 AI337047 AA948360 AI638005 AA459950 AI624915 AI638047 AI467856 AI521826 AA860305 AI932315 AW003092 AW271756
AW779380 AA609879 AI634791 AI493770 AI565211 Z41145 AI627952 AA303734 BE349457 AW196765 AA256527 BE089727

437124 59408_1 AL050013 BG939500 AW969191 AA769925 AJ377973 AI625545 AA811365 AA521114 N24705 AI379579 AA424899 AI684671 AA829715 AI453010
N35401 AA677452 AA504340 AI209149 AA883574 AI379062 AI084455 AI280147 AA644327 BF432508 N27873 N47364 N34880 AI147024 T86860
AI219716 AA960926 H25544 BI857123 AW960489 AA599099 AA765246 AA328537 BG434703 AA890373 AA424765 AI292318 AA829886 N95742
AI218758 H25588 N36282 AA024987 N36687 BI919187 N49471 AA889970 AW166152 AA468546 AI262504 AJ452782 AA554458 AA807080
AA724542 AW102730 AA909978 AW118134 AW827241 W56431 BM127381 BF436987 AI016509 AW663972 BM127686 C15552 N63435 N51744
T98800 N56980 BG108636 N49381 R49886 D61278 BI756612 AA508234 R49885 BF850422

432586 6533_1 BC022881 AU150944 BG750783 AW754175 AW867737 AI911659 AI050036 AA554053 AI826259 AA568548

438869 52134_1 AF075009 R63109 R63068

450377 12109_1 AB033091 AL520743 BE811813 N53332 N99716 AL561910 AA280655 BE710392 AV705100 AW293978 AW444556 AA281459 AI679751 AI873695
BG700891 BI553517 R80518 BG779771 BG534451 AA479402 AW961580 BF061430 AI857643 AI768486 AW512118 AA479302 AW770384
AW072470 AI041596 AI049699 AW592865 AA976261 BE879747 BG114119 AW183811 AA909938 BF571621 BF350794 BF351375 BE925699
AL050294 BC010371 BF982270 AL042656 BF095732 AW812618 BF095731 BG212397 BF678765 BI038602 BG388664 AW675337 BG289398
BE939598 AW805570 AA527097 AW150540 AI693720 AA743364 AI915793 N48185 AI573107 AA043474 AI351615 AJ969490 AI910763 R50866
AI699181 N73808 H08164

436063 5483_1 AK000028 AA494483 AI298674 AA720773 AV761529 AJ884670 AI936202 AW294235 D61652 BF881184 AV711384 N27154 AI926970 AV734970
N40094 N28596 AA884747 AA512890 BG436593

428342 6712_1 AK056315 AJ015524 AA724079 BI713619 AI377728 AW293682 AI928140 AI092404 AI085630 AA731340 BM469629 AW968804 AA425658
AA769094 BF446026 AW118719 AI332765 AW500888 AW576556 AI859571 AW499564 AW614573 AW629495 AW505314 W74704 AI356361
AI923640 AW070509 AI521500 AL042095 AA609309 AA761319 AI381489 H45700 AA761333 AW265424 AA909524 AA635311 AA649040
AI392620 Z40708 AI985564 AW263513 AA913892 AI693486 AW263502 AI806164 AW291137 BI061872 BI059498 AA134476 AW084888 AA036967
AW370823 T55263 BI002756 AA489664 BF827261 W74741 BF963166

424036 6226_1 NM_033445 BC001193 AI885781 BF794032 AA476620 AA810906 AA810905 AI291244 AI885097 AI359708 AI335629 H97396 AI344589 AA300377
AA457566 AW771833 BE465621 AI364068 AI364452 AI648505 AJ918342 AJ928670 AA886580 AL531029 AA886344 AI186419 BG329096
BM045465 AL531028 BG437151 BE868021 AA179427 _ o _ r AlcenoAa

450203 19009_1 AK055952 BG182168 BG220105 BG191569 BG188964 BG187388 BG220104 BG183714 BE645998 AI819354 AW974068 AI393635 AI580846
AI024796 AW020098 BI491127 AI393644 K74993 AW472959 BM478854 BI597437 H12165 BI458612 BE543192

451752 10408_5 AB032997 AJ141678 AW978722 BE467119 AI761408 BF727385 AW237035 AI934521 BF436248 AI479668 Z40632 AA832081 AW295901
BF057835 BE465977 AI621269 BE465983 BF756369 N74056 AI817896 AA716567 AA934774 H62600 H09497 BF943762 BE395335 BE883333

432363 1234917_1 AW970240 AA534489 AW970323
422890 61426.1



417379 1610005J 
407819 7392_2 

419733 7612.3 



426991 29771J 



432675 1237917J 
447620 687223.1 
409151 4123_1



409960 39576.1



433891 647290.1
414922 1563.2 



414222 18695.1




434194 

432908 452541.1 
412652 18858.2 



437179 12239.1 



418166 18858.1 



431416 120918.1 
447881 44623.1 



407192 2200202.1 
434747 117643.1 
410297 2990.1 



424339 50559.1 



AK057805 AW162343 AI190479 AI093318 BE048820 A119B397 AA654667 BE219303 Z39851 F02655 Z28734 T16575 F10145 Z45266 AW572911 
AW964436 AW004030 A1632565 BE502530 BI7923B3 BF056928 AA449241 AI651B25 AAB05324 AJ264863 AW196918 AA948267 AJ953735 
AJ263703 BF056387 AW594171 AI867447 AA319159 A1903440 AW9561 10 AI366013 AI867923 BG91 1906 D81142 C15616 AL538697 22S032 
243784 R13382 AV746924 AA449369 AA318815 BF364265 H17038 H10064 F04161 T87230 R40898 AW204071 BI81942B AA683393 AA683376 
Z43192 T74078 T05103 F12527 T77951 H101 18 H17037 BF855407 R19603 
AA1 96390 AA507837 AA1 96468 

AK056626 AIB00896 BF939022 BE644718 AI954754 BE218177 BE348567 AI962406 AW293122 A1968798 AI457321 BE327228 BG913531 
AW939055 T30280 R54166 Z43366 BGB19153 BF003119 BE646274 BF940881 R18246 R42185 

AK027321 W63676 W63789 BE046412 BF114614 BE646183 BM126230 AW044233 A1951970 AW663548 AH 39947 AA5 14302 AA846232 
BM126251 AA789002 AA581966 AA809643 AW1 88870 BE706664 BE706539 BE153177 BF084925 AL1 33779 AW961 788 AA659693 AA347970 
AA295134 AA526037 AA449282 BG190454 H61476 T91396 N20018 T901 14 H75644 BE710736 BF687723 H28581 AA249370 BF726698 
BE841 554 BI045099 T84625 AW1 29678 BG770826 
AW973834 AI791932 AI791855 A1732640 AA558833 AA559897 AI821610 

A1973051 A1400921 A17961 54 AW241 817 AW290951 _ 
NM 004892 AF047442 BE275338 BF724863 BI917206 BE276993 AL602308 AA306105 BM152505 BC001 364 AW993471 AW993481 AW993283 
AW992919 AW992921 AW992980 AW992861 AW993220 BG573124 AA456385 AA234796 AI902726 AA354813 BI092644 BG778400 BI260001 
BG007325 A1267455 AA42&74 Al 160782 AI4721 86 AA255500 AA434006 BG435520 A1356111 H00525 AV749060 BG944497 BG292031 
AA902153 C04925 AA902160 AA383100 AW073533 AA256706 AA1 50809 R65766 AW958448 BE090972 BF693195 AV73B979 R65855 R80136 
AA484677 

BE644758 AI082238 BF940027 A1201079 AI436035 AW275966 AI085394 AI291655 AW070441 AI474134 AI268978 AI769279 AI567682 AA693941 
BF477668 AW664149 AA283782 BF509538 AW296868 AI268977 AI168133 BM352065 A1262769 BF941976 AI056920 AA481861 BF763697 
AL565B88 BM352383 M427768 AA385346 /U1B6988 M931831 AA134972 BF 

BF364291 AL515057 AV702312 AA377395 AL544217 AI34 1000 AW 193583 AI350789 AA888338 BF9453B0 AW879092 AA1 30839 T91066 N92326 
AJ004389 AA078832 AL572370 W04622 BE314003 AW960808 BM360872 AA319160 AA130778 AL514257 
AW1 82329 AA61 3792 T05304 AW858385 

BG107484 AA632009 A1432670 A1656660 AI650884 AI521919 A1264653 AW150793 AW61 1 894 AJ91 7098 BI091245 AI651454 BF434889 AI580286 
A1880735 BE301995 A1392959 AW61 3965 BM023628 AW515374 AI4601 02 BM02331B BE328188 AI952820 AI581 363 AA557165 AI695677 
AL562079 AI700926 A1470561 BF063058 AW196387 AU132984 BI064046 AI970157 R02122 H55924 AI521721 AA808206 AA725223 A1766003 
AW339821 AA805951 AI287969 AW664827 

BC021085 AL527B72 AL526296 AL557087 BI255090 AU143499 AL560356 BG823170 BE736988 AU141388 AL580262 91764173 AU120299 
AU141 755 AU141988 AU142941 AU141330 AU141784 BI770885 B191 1394 BE901426 BI918039 BG760842 BE883026 BI254740 BE732690 
BG430761 BE792868 AW328267 AL135173 AA102674 BF726986 AL564735 BE155962 BE155979 BF741679 H67776 H59234 H89665 AW117774 
AW274435 L29008NM.003104 U07361 BF002824 BI222949AI458045AW951537 BF 1 96474 AI669049 AI042523 BE453928 AU1 601 25 AU 160580 
AW069877 AW015214 AI948718 BE21970B A1953505 BE21 7755 AI991382 AI245005 BF431 179 AI521843 BE04B908 AI420449 AW275385 
AA336950 BE501521 BF740566 AA311404 AA384639 B1772535 BG473076 BE891298 BE246928 BM01 2986 BE242693 BE901342 BG746358 
BF374053 AL564430 AU143835 A1635707 BF195492 AI280559 BF741685 AA385257 BE247655 W94974 BE163702 AI025167 A18271 18 N78641 
AL581093 AU1 58964 AU158917 AI282516 AU146399 AA713947 AI285028 AA1 01228 A1338522 AA832316 AI284986 AA857926 BF372568 
AA570172 AJ753825 AA171556 AU159257 BM194320 W93390 AW132101 AA550898 BI259678 AA522554 N55172 AW01 3929 Al 826274 A1871 237 
C75260 AA934846 AA555036 AA526579 AA526466 N80270 A1538347 AW615805 AU158875 AU158883 AU159396 A1016567 AU159238 AI282517 
AA406317 AI285043 N53050 AA969446 N57718 H93323 W93374 AI873751 H5601 1 AI936174 AA937830 AW438877 AI800550 AW328268 
AJ244886 AA948424 AA700604 AA832317 AA01 1555 H97671 AA98871 1 AW 150529 N70983 H44325 AV657614 AA902238 AA644018 AA034050 

AkSb ^ AW581996 AW812945 BE882302 AA134266 BI043873 AA019433 BI862088 BM468657 

AU128438 BE384458 AL353967 B1857117 BF686525 B1465223 BM460132 AU1 29877 BI222283 BG171592 B1043544 BG496295 BG750710 
BI256542BG108520AU150719AW510354AI554256AL353968AA191092 BF132635 
AF1 19847 AA437261 AA436987 AI132965 

Mmm\ BK77762 AWoS A AW0^1 BE350994 BE207949 AI091475 A1802774 A1827533 AI075363 AI659979 AA687855 AI0781 25 A1090285 
AA670058 AA602411 AA683472 AI436058 AA612826 AI038932 BG057726 AI167355 BF449023 AI289476 AW074381 BF972912 AI991780 
M889119AI537472Z39730AIB68953AI192337BE812978BE812939AA115248H99006A1915784 F0897^ 
AA115336 BE812876 BE812972 BE812982 BE813006 BE813019 R43883 BE812981 

AK055109 BC019085 AA187684 BG656226 BM023227 AI932311 AW264381 AA398371 BM021483 A1432433 AI375777 AI129580 AW262782 
M134107BM023515M977504A!859222A1348454 R69725M975268BM021207AL080074M129218AVV207^ 
AI02841 6 A10741 14 BG656536 BE501677 AW19341 9 AA917040 W90430 A1342984 AI378957 AL038486 AW020066 BI491 093 BF476021 R41 226 
R69631 F04125 C02343 AA1 1 5589 R56480 Al 400988 R54266 R31422 

AK055915 BE867252 A1523348 AA765350 BF446858 Z43675 R19529 AL133837 BG389444 AW382942 BE702956 AA081961 BE835247 BE835308 
BE835295 AA376302 BE645790 AA375690 AA376592 AW954423 NB5732 AA249770 AI754416 AA213816 BF592044 AA811729 AW514842 
AI633486 A1096810 AW183016 AI635738 N27524 BE645916 AI247842 AW991 230 AI754277 H1 6814 AI766892 BF108422 AI800361 T951 29 
F1 1 313 R97946 R93989 AA375242 BF109388 AI860939 Al 680060 AW953B99 BF971486 AI972337 AW953841 AA372437 AI216746 H11384 
R38484 AA249043 AA249732 

AW969151 AA532718 AA504784 AI791 194 AI821930 AI821485 F37127 AA654206 FZ7974 

AK074291 AW293424 BE676135 AI832125 BE019146 BE465019 AI761 124 AA617778 AI279232 AW575897 A1672039 F28618 BF924261 
AA722184 BF934174 BE004328 AV749301 BE880282 BI019798 BI019389 BF928776 AW813409 AV726604 AA077560 BE272975 BF949119 
AW814195 BE879126 AI697926 BF594155 BE205787 BF063513 N3582B A1948557 AJ433839 A1379679 BG056182 A1589094 N23123 AA588805 
AW316581 AI080272 AI421980 A1493318 BF194830 N87590 AA495993 N32996 AA699844 H96845 H95592 N28741 BI035539 BF747723 
BF171 066 W01350 H05495 A1243785 Z39622 AA887432 A1350659 R46102 
AA602964AA609200 

BwS 7 m4M6 5 90 B^2lfA^93AA843540 BG938644 AU185628 BI495842 AW173255 BM052709 AI743999 A1690144 AJ922209 A1740907 
AW340368 AA928759 AW118737 BF513970 AA707807 BF435295 Al 339463 AI373842 A1433809 BE222392 AA602308 AA428261 A1460355 
AW662760 AI888087 AI342098 AA722418 W78151 N64382 BE221B48 AW025901 AA452120 A1150479 AI016166 AA779515 AA66 1791 BMJ74307 
BI911169BG575154AW953303 T33504 D59141 AA385785AA148648BM461961 BG5B1168 B1602483 BE889592 AW954311 BM052986 BI962893 
AI989299 Z42328 BF029504 T35658 BG402602 AU185770 AI023271 AA147719 A1434079 AI569000 Al 276488 AA992453 AA342821 AA648303 
AI349364AI05100BAA926941 AA350B94 AWD71451 N22249 AI784138 AA0B3847 N222SBAW440825 AA661570 AA376687 AA659125 AQ56299 
R70463 AI383586 AA827189 BI494872 AWQ21094 BI494871 AA905500 AA460923 BI492041 AW028965 A162461 1 BG271780 AI497723 H88862 
059858 N89979 AA658425 N81 154 D62341 AI274437 N66697 H96993 AI370663 AA728850 H05232 R59374 H12223 AI935759 > AIS2553 D60006 
N29572A1916833N75273 AA148710 B1597117 BG740471 AA332671 AA333874AA643052AW020175 R70550 BG623469 AA452342 AW965441 
W19723R32966BE883841R61003AI910374A1865262 R55325A1468927R34681 H96211Z39807BF954386 mtt ~ mm 
NM 015720 AF219137 AL534420 AL524055 AU537346 AL538442 BG7658B8 AL530054 AL525377 BG474596 BG473144 BE251553 BG706099 
AL538039 BG703131 BE255806 BF805256 F12128 AL566773 BI82B686 BF761480 AJ204971 BGB18B1 8 Bl 199246 AL534816 BF529941 AA324163 
AL523285 BG914330 H07952 AL534815 BE769903 AI867802 BM310135 AL533702 BE254484 BF528B52 BE867462 BE740130 AI134164 
AL5671 15 AL533701 AL524054 AL515904 AL523284 AL568203 AL534419 BF981 162 BE2571 48 AL561833 
445636 8561_5
419175 35058J 



441128 2093ZJ 



406789 0.0 
421091 24941.2 



400263 18977J 



BF339388 AI345516 BG391657 BE708967 BG026034 BE261703 H56716 H65572 H93801 T48830 R96953 R969B9 R39707 BE867593 AA090310 
AAO90672 

AB018322 BC012480 BI524873 AW665554 AI934469 AI479916 BF096179 BF096162 BF096132 AA744972 A1951988 AI858339 BE076331 
AA886998 AI570585 AI916688 AI678811 A1693109 AI308135 AA669046 AA961054 AI018082 H80618 BE221942 R52609 AI915164 AA365626 
244671 B1052776 BF882486 BG286184 A1589558 AA931663 AA534979 A1275392 AI273455 R52553 AA829920 H80652 AA360728 F10618 
AW953656 AW176773 H85527 AA765570 AA081927 BF093262 BG743753 AL037576 AA534314 BE814964 BE973713 N49493 BE00S634 
BE006630 AW270037 AA234765 AI334004 BF057 179 A! 857450 AI341 191 AI434143 A1917449 AW517207 AA255424 AW008334 AA847572 
AA99421 1 AA861901 AA581873 AI580157 AI364363 AW242357 AW235291 N55645 AA319869 R36911 AA256551 AW044188 A1203159 N49403 
F02090 AJ187299 AI609644 Z40516 AW952314 

BC014072 BE328850 AI356567 AI148171 AI022165 BG149661 BF000671 AA233101 AA573721 AA447991 AW016855 A1005068 AA554071 
BF478215 AA906902 AW014761 BE905651 BE512923 BM047129 AA243852 AA232991 AA127550 AA127551 AA570256 AI473237 BF033706
N90525 AW973623 AI359627 BG674574 BE903322
AI041403 Z49148

AK057700 BC015899 BE867108 AL526926 AL527435 BG913023 AI884887 BE858461 AI8B5227 A1935218 BE645596 A1922406 AA77B161 
BF345973 AW195853 AI687121 AI336147 A1091364 AI769203 AW627859 BE677432 BF439954 BG820230 AA283686 AA812396 AI358396 
AA932022 BE222881 AW016109 BM352667 BF528099 BG818616 AI935916 AA912295 BG910887 AL568301 AL567278 B1522445 BI754384 
BG81 9375 BF835950 AA323718 AA860955 A1089847 AL520776 AL526045 AL538507 AL567798 AL536876 AL574332 BF834531 BF3401 16 
BF835954 AL573997 AL5B1252 BF957667 AL580170 AW952324 AL5693B2 AW1 48695 AL559234 BG034270 BI913839 AL581327 A1565842 
H43568 BF945188 AL559686 AL539326 T15481 AW895092 AL582684 BI519896 

Z1 1692 X51466 NM_001961 M19997 81224253 BG830478 AU122147 AU123437 8G113591 BG752624 BE886804 BI868669 BG337216 AW629935 
BM016525 AI560409 AL562866 AI909178 BF849556 AA371735 BF038841 BF727115 BC006547 BG757526 AL555664 B1261304 BG770095 
BI033486 B1517580 BG876486 BI01 1828 AI313235 BG831724 BF869862 BG998348 B1011834 BFB88337 BF898627 BF092380 AW803215 F01241 
BF805719 BG876487 AW49B536 BF988866 BG998849 AA248724 BG829202 BG756456 BG032392 BI8592B7 BM01 6990 BG332369 BE933685 
BE166758 BM452445 AI937808 AW026128 N23684 AW006O41 AI337621 F33111 BF344301 BG105450 BG387343 BF569547 BF154671 
BM007368 BF569385 BE772007 BI199487 BF761700 BI261519 BF944452 BF898506 AI038390 BM044934 AW381142 BG743618 BE769206 
BE893973 BI015047 BF886479 BF761350 BE769769 BG766117 BF847365 BE397834 AW371 121 BF089125 BE082996 BF183193 BG1B0964 
BF089940 BIO00274 BG2S5503 BG674499 BG774174 B1015084 



TABLE 67C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
403047 3540153 Minus 59793-59968
401424 8176894 Plus 24223-24428
401451 6634068 Minus 119926-121272
401747 9789672 Minus 118596-118816,119119-119244,119609-11976
404632 9796668 Plus 45096-45229
403046 3540153 Minus 55707-55859,56369-56511
404922 7341893 Plus 13248-13428
403752 7678857 Plus 33704-33828
404210 5006246 Plus 169926-170121
401519 6649315 Plus 157315-157950
401785 7249190 Minus 165776-165996,166189-166314,166408-16656
401866 8018106 Plus 73126-73623
405387 6587915 Minus 3769-3833,5708-5895
401197 9719705 Plus 176341-176452
403532 8076842 Minus 81750-81901
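
The Nt_position entries in Table 67C are comma-separated start-end ranges, one per predicted exon. As an illustration only, the following minimal Python sketch shows one way such a field could be parsed. The function names are hypothetical, and treating each range as inclusive at both ends is an assumption that the table legend does not state. Entries that are cut off in the table (where the last end coordinate is smaller than its start) are rejected rather than guessed at.

```python
def parse_nt_position(field):
    """Parse a field such as '55707-55859,56369-56511' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            # Likely a truncated entry; refuse to guess the missing digits.
            raise ValueError(f"end before start in range {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, ASSUMING inclusive coordinates (not stated in the source)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("55707-55859,56369-56511")
    print(exons, total_exon_length(exons))
```

Raising an error on a reversed range is a deliberate choice in this sketch: it flags rows whose last coordinate was cut off instead of silently producing a wrong exon length.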



TABLE 68A: 995 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES

Table 68A lists about 995 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneChip
array that showed some expression in human xenograft tumors and cell lines. The 995 genes were selected such that the ratio of "average" prostate cancer to "average" normal adult
tissues was greater than or equal to 2.5. The "average" prostate cancer level was set to the 85th percentile amongst prostate cancers. The "average" normal adult tissue level was set
to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst
non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
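
The selection rule described above can be sketched in a few lines of Python. This is an illustrative sketch rather than the procedure actually used: the function names are hypothetical, the percentile is computed by linear interpolation between ranks (the text does not say which percentile convention was applied), and returning None when the background-subtracted denominator is not positive is an assumption made here to avoid division by zero.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between closest ranks (an assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty list is undefined")
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def tumor_to_normal_ratio(tumor, normal, level_pct=85, background_pct=10):
    """Ratio of 'average' tumor to 'average' normal expression for one probeset.

    Both 'average' levels are the 85th percentile of their group; the 10th
    percentile of the normal group is subtracted from numerator and denominator.
    """
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, level_pct) - background
    denominator = percentile(normal, level_pct) - background
    if denominator <= 0:
        return None  # assumption: treat the ratio as undefined
    return numerator / denominator


def is_up_regulated(tumor, normal, threshold=2.5):
    """Apply the >= 2.5 cutoff stated in the text."""
    ratio = tumor_to_normal_ratio(tumor, normal)
    return ratio is not None and ratio >= threshold


if __name__ == "__main__":
    tumor = [10.0 * x for x in range(11)]   # made-up expression values
    normal = [float(x) for x in range(11)]  # made-up expression values
    print(tumor_to_normal_ratio(tumor, normal), is_up_regulated(tumor, normal))
```

The input values in the usage lines are invented solely to exercise the functions and do not come from the tables.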



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue


Pkey ExAccn UnigeneID Unigene Title R1
448999 AF179274 Hs.22791 transmembrane protein with EGF-like and 85.5
426747 AA535210 Hs.171995 kallikrein 3, (prostate specific antigen 68.6
446057 AI420227 Hs.149358 Trp-p8 transient receptor potential cati 65.6
400302 N48056 Hs.283946 folate hydrolase (prostate-specific memb 61.9
432441 AW292425 Hs.163484 ESTs 60.4
419526 AI821895 Hs.193481 ESTs 45.9
414569 AF109298 Hs.118258 prostate cancer associated protein 1 45.7
424099 AF071202 Hs.139336 ATP-binding cassette, sub-family C (CFTR 45.2
432240 AI694767 Hs.129179 Homo sapiens cDNA FLJ13581 fis, clone PL 44.9
400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 40.9
453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR 39.6




AA125985 Hs.56145 thymosin, beta, identified in neuroblast
X07730 Hs.171995 kallikrein 3, (prostate specific antigen
AA305627 Hs.139336 ATP-binding cassette, sub-family C (CFTR
AA250737 Hs.72472 BMP-R1B
AA506324 Hs.1852 acid phosphatase, prostate
AK000531 Hs.52256 hypothetical protein FLJ20624
AL135623 Hs.193914 KIAA0575 gene product
NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo
R45175 Hs.117183 ESTs
S39329 Hs.181350 kallikrein 2, prostatic
AI733881 Hs.72472 BMP-R1B

W68815 Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PL
W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA
BE252922 Hs.123119 MAD (mothers against decapentaplegic, Dr

NM_005656*:Homo sapiens transmembrane pr
AK002126 Hs.11260 hypothetical protein FLJ11264
AA503115 Hs.183752 microseminoprotein, beta-

NM_001172:Homo sapiens arginase, type II
Hs.23023 ESTs
Hs.31742 ESTs

Hs.283946 folate hydrolase (prostate-specific memb
ESTs

Hs.20843 Homo sapiens cDNA FLJ11245 fis, clone PL
Hs.120591 ESTs

Hs.68583 mitochondrial intermediate peptidase
Hs.27311 single-minded (Drosophila) homolog 2
Hs.79375 holocarboxylase synthetase (biotin-[prop
Hs.5957 Homo sapiens clone 24416 mRNA sequence
Hs.98280 potassium intermediate/small conductance
Hs.9711 novel protein
Hs.112259 T cell receptor gamma locus
Hs.293616 ESTs
Hs.117183 ESTs

Hs.2316 SRY (sex determining region Y)-box 9 (ca
Hs.293726 ESTs, Weakly similar to TSGA RAT TESTIS
ESTs

Homo sapiens clone HB-2 mRNA sequence
ESTs

KIAA1244 protein

androgen receptor (dihydrotestosterone r
ESTs

insulinoma-associated 1
Homo sapiens cDNA FLJ13598 fis, clone PL
odz (odd Oz/ten-m, Drosophila) homolog 1
NM_004496*:Homo sapiens hepatocyte nucle
ESTs

solute carrier family 26 (sulfate transp
Homo sapiens prostein mRNA, complete cds
NK homeobox (Drosophila), family 3, A
chromosome 1 open reading frame 21
ESTs
ESTs

tumor protein 63 kDa with strong homolog
hypothetical protein

bullous pemphigoid antigen 1 (230/240kD)
ESTs, Weakly similar to 2004399A chromos
EphA3

Homo sapiens, Similar to RIKEN cDNA 2010
ESTs
ESTs
ESTs

ESTs, Weakly similar to unknown protein
ESTs, Weakly similar to ALU7_HUMAN ALU S
a disintegrin and metalloproteinase doma
alpha-methylacyl-CoA racemase
ESTs, Moderately similar to 138022 hypot
H2B histone family, member L
small nuclear protein PRAC
delta (Drosophila)-like 1
MyoD family inhibitor
427674 NM_003528 Hs.2178 H2B histone family, member Q
424692 AA429834 Hs.151791 KIAA0092 gene product
415263 AA948033 Hs.130853 ESTs
416182 NM_004354 Hs.79069 cyclin G2
420218 AW958037 ribosomal protein L4

411887 AW182924 Hs.128790 ESTs
425154 NM_001851 Hs.154850 collagen, type IX, alpha 1
430226 BE245562 Hs.2551 adrenergic, beta-2-, receptor, surface
450325 AI935962 Hs.91973 ESTs

439444 AI277652 Hs.54578 ESTs, Weakly similar to 138022 hypotheti
440529 AW207640 Hs.16478 Homo sapiens cDNA: FLJ21718 fis, clone C
457211 AW972565 Hs.32399 ESTs, Weakly similar to S51797 vasodilat
428600 AW863261 Hs.138860 hypothetical protein DKFZp434K1421



409731 
400299 
400296 
400292 
425075 
409262 
428819 
409361 
407168 
400287 
415539 
432374 
433043 
423073 
403047 
444484
428336 
401424 
407709 
407122 
425628 
415989 
448290 
437052 
411096 
451939 
450096 
419743 
427958 
431548 
434666 
426501 
413597 
428862 
435677 
429220 
408826 
418961 
428898 
420757 
428398
419078 
450382 
449625 
401451 
447033 
452594 
448519 
418829 
430376 
418848 
429918 
424012 
418278 
408000 
443822 
432101 
447342 
412446 
437718 
433404 
431089 
427212 
429597 
444151 
428046 
453160 
433927 
431117 



AA456135 
H20276 
NM_004476
AI267700 
AK002107 
AA861697 
U80034 
U80456 
AI682088 
AW408762 
AA418000 
AI834273 
AF151103 
AW043782 
AW302885 
NM_000346
AA694142 
AW207206 
AF216077 
AW967646 
AB033070 
X78592 
AI249368
M93119
AA397658
NM_014253

AI357412

AU076405 

AW175665 

AA516531 

AW292053 

AI820961 

AW873986 

AW368377 

AI088489

L11690 

AI087412 

AI918950 

AI199268

AI768015

AI927288 

T32982 

BE041395 

AW293849 

NM_003816

AW972917 

AW812795 

AI263307 

AI557019 

AF003522 

ALTJ35588 

NM_003528

AA429834

AA948033

NM_004354

AW958037

AW182924

NM_001851

BE245562 



Hs.48376 

Hs.23023 

Hs.194408 

Hs.99915 

Hs.98558 

Hs.89584 

Hs.60257 



Hs.157601 

Hs.29981 

Hs.278695 

Hs.55999 

Hs.12532 

Hs.193465 

Hs.119383 

Hs.137569 

Hs.83937 

Hs.198689

Hs.143611 

Hs.123642 

Hs.19322 

Hs.196779 

Hs.156110 

Hs.58279 

Hs.2442 

Hs.128749 

Hs.337534 

Hs.116467

Hs.250500

Hs.153203

Hs.2178 

Hs.151791 

Hs.130853 

Hs.79069 



37.7 

34.9 

31.7 

31.4 

31.4 

31.4 

31.2 

30.3 

29.6 

29.6 

28.8 

28.8 

28.1 

27.5 

27.4 

25.7 

25.0 

24.9 

24.7

24.6 

23.2 

23.1 

22.7 

22.6 

22.5 

21.2 

20.8 

20.5 

20.5 

19.8 

19.4 

19.3 

18.9 

18.7 

18.0 

17.9 

17.6 

17.3 

17.3 

16.8 

16.3 

15.8 

15.4 

15.2 

14.9 

14.9 

14.7 

14.6 

14.5 

14.4 

14.3 

14.2 

14.1 

14.1 

13.9 

13.3 

13.3 

13.1 

12.7 

12.6 

12.6 

12.6 

12.5 

12.5 

12.4 

12.4 

12.2 

12.1 

12.1 

11.9 

11.9 

11.7 

11.3 

11.3 

11.3 

11.2 

11.2 

11.2 

11.1 

11.1 

11.0 

11.0 

11.0 
422728 AW937826 Hs.103262 ESTs, Weakly similar to ZN91_HUMAN ZINC 11.0

417333 AL157545 Hs.173179 bromodomain and PHD finger containing, 3 10.7
434423 NM_006769 Hs.3844 LIM domain only 4 10.6
434170 AA626509 Hs.122329 ESTs 10.6
432729 AK000292 Hs.130732 hypothetical protein FLJ20285 10.5
446100 AW967109 Hs.13804 hypothetical protein dJ462023.2 10.4
432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1074 protein 10.4
421896 N62293 Hs.45107 ESTs 10.3
401747 Homo sapiens keratin 17 (KRT17) 10.1
451640 AA195601 Hs.26771 Human DNA sequence from clone 747H23 on 10.1
431542 H63010 Hs.5740 ESTs 10.0
447397 BE247676 Hs.18442 E-1 enzyme 10.0
433285 AW975944 Hs.237396 ESTs 10.0
431325 AW026751 Hs.5794 ESTs, Weakly similar to 2109260A B cell 9.9
410889 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 9.8
432674 AA641092 Hs.257339 ESTs, Weakly similar to 138022 hypotheti 9.6
434926 BE643269 Hs.50252 mitochondrial ribosomal protein L12 9.6
442049 AA310393 Hs.190044 ESTs 9.6
443180 R15875 Hs.258576 claudin 12 9.5
406964 M21305 FGENES predicted novel secreted protein 9.5
426350 NM_003245 Hs.2022 transglutaminase 3 (E polypeptide, prote 9.5
456660 AA909249 Hs.112282 solute carrier family 30 (zinc transport 9.5
448045 AJ297436 Hs.20166 prostate stem cell antigen 9.5
451684 AF216751 Hs.26813 CDA14 9.5
440594 AW445167 Hs.126036 ESTs 9.4
432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 9.3
418601 AA279490 Hs.86368 calmegin 9.3
433332 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 9.2
437124 AA554458 KIAA0666 protein 9.2
428728 NM_016625 Hs.191381 hypothetical protein 9.2
438869 AF075009 gb: Homo sapiens full length insert cDNA 9.1
431992 NM_002742 Hs.2891 protein kinase C, mu 8.9
413992 W26276 Hs.136075 RNA, U2 small nuclear 8.9
428342 AI739168 Homo sapiens cDNA FLJ13458 fis, clone PL 8.9
418836 AI655499 Hs.161712 ESTs 8.8
409799 D11928 Hs.76845 phosphoserine phosphatase-like 8.8
400294 N95796 Hs.278695 Homo sapiens prostein mRNA, complete cds 8.7
434826 AF155661 Hs.22265 pyruvate dehydrogenase phosphatase 8.7
418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 8.7
425905 AB032959 Hs.318584 novel C3HC4 type Zinc finger (ring finge 8.7
418329 AW247430 Hs.84152 cystathionine-beta-synthase 8.7
450203 AF097994 L-kynurenine/alpha-aminoadipate aminotra 8.7
425465 L18964 Hs.1904 protein kinase C, iota 8.5
416239 AL038450 Hs.48948 ESTs 8.5
436962 AW377314 Hs.5364 DKFZP564I052 protein 8.4
450164 AI239923 Hs.63931 ESTs 8.4
452744 AI267652 Hs.246107 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 8.4
440774 AI420611 Hs.153934 ESTs 8.3
444922 AI921750 Hs.144871 Homo sapiens cDNA FLJ13752 fis, clone PL 8.3
410870 U81599 Hs.66731 homeo box B13 8.3
435047 AA454985 Hs.54973 cadherin-like protein VR20 8.2
418564 AA631143 Hs.278695 Homo sapiens prostein mRNA, complete cds 8.2
451027 AW519204 Hs.40808 ESTs 8.2
445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 14 8.2
423349 AF010258 Hs.127428 homeo box A9 8.1
421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 8.1
432586 AA568548 ESTs 8.0
426108 AA622037 Hs.166468 programmed cell death 5 8.0
417153 X57010 Hs.81343 collagen, type II, alpha 1 (primary oste 8.0
416653 AA768553 Hs.193145 metallothionein 1E (functional) 7.9
447476 BE293466 Hs.20880 ESTs, Weakly similar to 138022 hypotheti 7.9
450377 AB033091 KIAA1265 protein 7.8
429769 NM_004917 Hs.218366 kallikrein 4 (prostase, enamel matrix, p 7.7
416815 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polyp 7.7
433323 AA805132 Hs.159142 ESTs 7.7
451952 AL120173 Hs.301663 ESTs 7.7
452367 U71207 Hs.29279 eyes absent (Drosophila) homolog 2 7.7
421470 R27496 Hs.1378 annexin A3 7.7
420092 AA814043 Hs.88045 ESTs 7.6
416111 AA033813 Hs.79018 chromatin assembly factor 1, subunit A ( 7.6
440749 W22335 Hs.7392 hypothetical protein MGC3199 7.6
441866 BE464341 Hs.21201 nectin 3; DKFZP566B0846 protein 7.6
411019 AW993097 Hs.48617 Homo sapiens cDNA FLJ12540 fis, clone NT 7.5
421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 7.4
451418 BE387790 Hs.26369 hypothetical protein FLJ20287 7.4
433517 AW022133 Hs.189838 ESTs 7.4
415621 AI648602 Hs.55468 ESTs 7.4
442592 BE566623 Hs.29899 ESTs, Weakly similar to G02075 transcrip 7.3
452340 NM_002202 Hs.505 ISL1 transcription factor, LIM/homeodoma 7.3
450244 AA007534 Hs.125062 ESTs 7.2
433293 AF007835 Hs.32417 hypothetical protein MGC4309 7.2
425259 AL049280 Hs.155397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr 7.2
434485 AI623511 Hs.118567 ESTs 7.2
442772 AW503680 Hs.5957 Homo sapiens clone 24416 mRNA sequence 7.1

425211 M18667 Hs.1867 progastricsin (pepsinogen C) 7.1
431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 7.0
408418 AW963897 Hs.44743 KIAA1435 protein 7.0
436063 AK000028 ribosomal protein S24 7.0
412350 AI659306 Hs.73826 protein tyrosine phosphatase, non-recept 6.9
414664 AA587775 Hs.66295 multi-PDZ-domain-containing protein 6.9
428465 AW970976 Hs.293653 ESTs 6.9
436476 AA326108 Hs.33829 bHLH protein DEC2 6.8
431585 BE242803 Hs.262823 hypothetical protein FLJ10326 6.8
434808 AF155108 Hs.256150 Homo sapiens, Similar to RIKEN cDNA 2810 6.7
432103 T15803 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.7
456088 BE177320 Hs.156148 hypothetical protein FLJ13231 6.6
429084 AJ001443 Hs.195614 splicing factor 3b, subunit 3, 130kD 6.6
422486 BE514492 Hs.117487 gene near HD on 4p16.3 with homology to 6.6
410227 AB009284 Hs.61152 exostoses (multiple)-like 2 6.6
453439 AI572438 Hs.32976 guanine nucleotide binding protein 4 6.5
410037 AB020725 Hs.58009 KIAA0918 protein 6.5
420297 AI628272 Hs.88323 ESTs, Weakly similar to ALU1_HUMAN ALU S 6.4
409648 AW451449 Hs.57749 ESTs 6.4
448148 NM_016578 Hs.20509 HBV pX associated protein-8 6.4
410268 AA316181 Hs.61635 six transmembrane epithelial antigen of 6.4
422134 AW179019 Hs.112110 mitochondrial ribosomal protein L42 6.3
415068 Z19448 Hs.131887 ESTs, Weakly similar to T24396 hypotheti 6.3
404632 NM_022490:Homo sapiens hypothetical prot 6.3
412935 BE267045 Hs.75064 tubulin-specific chaperone c 6.3
453308 AW959731 Hs.323099 ESTs 6.3
449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 6.3
410762 AF226053 Hs.66170 HSKM-B protein 6.3
436032 AA150797 Hs.109276 latexin protein 6.3
456508 AA502764 Hs.123469 ESTs, Weakly similar to AF208855 1 BM-01 6.3
413328 Y15723 Hs.75295 guanylate cyclase 1, soluble, alpha 3 6.2
425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II 6.2
424036 AA770688 H2A histone family, member L 6.2
418245 AA088767 Hs.83883 transmembrane, prostate androgen induced 6.2
450861 AI523898 Hs.17617 ESTs 6.1
418821 AA436002 Hs.183161 ESTs 6.1
458809 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 6.1
432527 AW975028 Hs.102754 ESTs 6.1
433730 AK002135 Hs.3542 hypothetical protein FLJ11273 6.1
449845 AW971183 Hs.6019 DnaJ (Hsp40) homolog, subfamily C, membe 6.1
420948 AB016898 Hs.100469 myeloid/lymphoid or mixed-lineage leukem 6.1
425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.0
458440 AI095468 Hs.135254 Homo sapiens clone 1 thrombospondin mRNA 6.0
419647 AA348947 Hs.91816 hypothetical protein 6.0
455497 AA112573 Hs.278695 Homo sapiens prostein mRNA, complete cds 6.0
428593 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso 5.9
400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 5.9
446416 AV658299 Hs.163959 ESTs 5.9
407819 R42165 ESTs 5.9
430387 AW372884 Hs.240770 nuclear cap binding protein subunit 2, 2 5.9
410102 AW248508 Hs.279727 ESTs; homologue of PEM-3 [Ciona savignyi 5.9
429165 AW009886 Hs.118258 prostate cancer associated protein 1 5.9
417873 BE266659 Hs.293659 Homo sapiens, Similar to RIKEN cDNA A430 5.8
410240 AL157424 Hs.61289 synaptojanin 2 5.8
421305 BE397354 Hs.324830 diptheria toxin resistance protein requi 5.8
451406 AI694320 Hs.6295 ESTs, Weakly similar to T17248 hypotheti 5.8
416795 AW97778 Hs.20509 HBV pX associated protein-8 5.7
427701 AA411101 Hs.243886 nuclear autoantigenic sperm protein (his 5.7
451468 AW503398 Hs.293663 ESTs, Moderately similar to 138022 hypot 5.7
433852 AI378329 Hs.126629 ESTs 5.7
432882 NM_013257 Hs.279696 serum/glucocorticoid regulated kinase-li 5.6
419440 AB020689 Hs.90419 KIAA0882 protein 5.6
435706 W31254 Hs.7045 GLJ004 protein 5.6
431583 AL042613 Hs.262476 S-adenosylmethionine decarboxylase 1 5.6
442409 BE208843 Hs.129544 hypothetical protein MGC15438 5.6
453171 R76472 Hs.65646 ESTs 5.6
437252 AI433833 Hs.164159 ESTs, Weakly similar to ALU1_HUMAN ALU S 5.6
422083 NM_001141 Hs.111256 arachidonate 15-lipoxygenase, second typ 5.6
429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 5.5
403046 NM_005656*:Homo sapiens transmembrane pr 5.5
447805 AW627932 Hs.302421 gemin4 5.5
443162 T49951 Hs.9029 DKFZP434G032 protein 5.4
418555 AI417215 Hs.87159 hypothetical protein FLJ12577 5.4
437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr 5.4
419465 AW500239 Hs.21187 Homo sapiens cDNA: FLJ23068 fis, clone L 5.4
440146 AW014231 Hs.90790 Homo sapiens cDNA: FLJ22930 fis, clone K 5.4
427308 D26067 Hs.174905 KIAA0033 protein 5.4
409151 AA306105 SEC22, vesicle trafficking protein (S. c 5.4
432840 AK001403 Hs.279521 hypothetical protein FLJ20530 5.4
423242 AL039402 Hs.125783 DEME-6 protein 5.3
453082 H18835 Hs.31608 hypothetical protein FLJ20041 5.3
441021 AW578716 Hs.7644 H1 histone family, member 2 5.3
415276 U88665 Hs.78353 SFRS protein kinase 2 5.3

400290 H18836 Hs.31608 hypothetical protein FLJ20041 5.3
432435 BE218886 Hs.282070 ESTs 5.3
417318 AW953937 Hs.240845 ESTs 5.3
429467 NM_004477 Hs.203772 FSHD region gene 1 5.3
416276 U41060 Hs.79136 LIV-1 protein, estrogen regulated 5.3
ESTs 5.3
sulfotransferase family, cytosolic, 2B, 5.2
alpha-methylacyl-CoA racemase 5.2
436420 AA443966 Hs.31595 ESTs 5.2
aldehyde dehydrogenase 1 family, member 5.1
spondin 2, extracellular matrix protein 5.1
ESTs 5.1
ESTs 5.1
448807 AI571940 Hs.7549 ESTs 5.1
protocadherin alpha 10 5.1
ATPase, Ca transporting, type 2C, member 5.0
CGI-82 protein, PSDR1 5.0
hypothetical protein FLJ10902 5.0
452576 AB023177 Hs.29900 KIAA0960 protein 5.0
ESTs 5.0
cystatin SN 5.0
ESTs, Weakly similar to ALU4_HUMAN ALU S 5.0
hypothetical protein MGC13170 5.0
421154 AA284333 Hs.287631 Homo sapiens cDNA FLJ14269 fis, clone PL 5.0
estrogen receptor 1 5.0
ESTs 5.0
transmembrane, prostate androgen induced 4.9
Homo sapiens cDNA FLJ14934 fis, clone PL 4.9
444108 R55784 Hs.140942 ESTs 4.9
signal sequence receptor, gamma (translo 4.9
Homo sapiens cDNA FLJ12292 fis, clone MA 4.9
Purkinje cell protein 4 4.9
ankyrin 3, node of Ranvier (ankyrin G) 4.9
428330 L22524 Hs.2256 matrix metalloproteinase 7 (matrilysin, 4.9
Homo sapiens cDNA FLJ12641 fis, clone NT 4.9
prostate cancer associated protein 7 4.9
hypothetical protein FLJ22995 4.9
ESTs 4.8
439735 AI635386 Hs.142846 hypothetical protein 4.8
KIAA1171 protein 4.8
KIAA1028 protein 4.8
Homo sapiens cDNA FLJ22562 fis, clone H 4.8
gbTif76g11.s1 NCI_CGAP_Co3 Homo sapiens 4.8
433313 W20128 Hs.296039 ESTs 4.8
CDC2-related protein kinase 7 4.8
DKFZP586E1621 protein 4.8
Homo sapiens mRNA; cDNA DKFZp56401763 (f 4.7
tousled-like kinase 1 4.7
429299 AI620463 Hs.347408 hypothetical protein MGC13102 4.7
hypothetical protein 4.7
gb:zp99b10.s1 Stratagene muscle 937209 H 4.7
KIAA1323 protein 4.7
Norrie disease (pseudoglioma) 4.7
441224 AU076964 Hs.7753 calumenin 4.7
KIAA0872 protein 4.7
ESTs, Weakly similar to T00050 hypotheti 4.7
guanine nucleotide binding protein 4 4.7
ESTs, Weakly similar to JC5238 galactosy 4.7
414922 D00723 glycine cleavage system protein H (amino 4.7
NK homeobox (Drosophila), family 3, A 4.7
ESTs 4.7
KIAA0261 protein 4.7
ESTs 4.7
410193 AJ132592 Hs.59757 zinc finger protein 281 4.7
Homo sapiens, Similar to RIKEN cDNA 5830 4.6
ESTs 4.6
transcription factor AP-2 gamma (activat 4.6
NM_003071:Homo sapiens SWI/SNF related, 4.6
436556 AI364997 Hs.7572 ESTs 4.6
eukaryotic translation initiation factor 4.6
phosphodiesterase 9A 4.6
ESTs, Moderately similar to T29285 hypot 4.5
phosphoinositide-specific phospholipase 4.5
418818 AA228899 Hs.101307 Homo sapiens HUT11 protein mRNA, partial 4.5
433556 W56321 Hs.111460 calcium/calmodulin-dependent protein kin 4.5
443123 AA094538 Hs.272808 putative transcription regulation nuclea 4.5
419991 AJ000098 Hs.94210 eyes absent (Drosophila) homolog 1 4.5
419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.5
453390 AA862496 Hs.28482 ESTs 4.5
417061 AI675944 Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 4.5
419239 AA468183 Hs.184598 Homo sapiens cDNA: FLJ23241 fis, clone C 4.5
436873 N23874 Hs.50477 RAB27A, member RAS oncogene family 4.5
408001 AA046458 Hs.95296 ESTs 4.4

413950 AA249096 Hs.32793 ESTs 4.4
416655 AW968613 Hs.79428 BCL2/adenovirus E1B 19kD-interacting pro 4.4
418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 4.4
419733 AW362955 Homo sapiens cDNA FLJ14415 fis, clone HE 4.4
433233 AB040927 Hs.301804 KIAA1494 protein 4.4
439979 AW600291 Hs.6823 hypothetical protein FLJ10430 4.4
430589 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 4.4
417958 AA767382 Hs.193417 ESTs 4.4
450680 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence 4.4
435020 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 4.4
437396 BE140396 Hs.21621 hypothetical protein DKFZp762O076 4.4
425810 AI923627 Hs.31903 ESTs 4.4
426028 NM_001110 Hs.172028 a disintegrin and metalloproteinase doma 4.3
452747 BE153855 Hs.61460 Ig superfamily receptor LNIR 4.3
429259 AA420460 Hs.292911 Plakophilin 4.3
452092 BE245374 Hs.27842 hypothetical protein FLJ11210 4.3
437967 BE277414 Hs.5947 mel transforming oncogene (derived from 4.3
459284 AF155660 Hs.300496 mitochondrial solute carrier 4.3
417067 AJ001417 Hs.81086 solute carrier family 22 (extraneuronal 4.3
423453 AW450737 Hs.128791 CGI-09 protein 4.3
407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 4.3
448826 AI580252 Hs.293246 ESTs, Weakly similar to putative p150 (H 4.3
419713 AW968058 Hs.92381 nudix (nucleoside diphosphate linked moi 4.3
427359 AW020782 Hs.79881 Homo sapiens cDNA: FLJ23006 fis, clone L 4.3
450402 BE218027 Hs.89969 ESTs 4.3
428055 AA420564 Hs.101760 ESTs 4.3
429441 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 4.3
432675 AI791855 ESTs 4.2
439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 4.2
445707 AI248720 Hs.114390 ESTs 4.2
410297 AA148710 lumican 4.2
416737 AF154335 Hs.79691 LIM domain protein 4.2
430255 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 4.2
421629 AB018330 Hs.108708 calcium/calmodulin-dependent protein kin 4.2
447620 AW290951 ESTs 4.2
426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 4.2
432886 BE159028 Hs.279704 chromatin accessibility complex 1 4.2
446237 AW270515 Hs.149596 Homo sapiens, Similar to RIKEN cDNA 2310 4.2
421878 AA299652 Hs.111496 Homo sapiens cDNA FLJ11643 fis, clone HE 4.2
423337 NM_004655 Hs.127337 axin 2 (conductin, axil) 4.2
435854 AJ278120 Hs.4996 putative ankyrin-repeat containing prote 4.2
427723 AI355260 Hs.279789 histone deacetylase 3 4.2
430122 NM_013342 Hs.233765 TCF3 (E2A) fusion partner (in childhood 4.2
416140 AI918035 Hs.301198 roundabout (axon guidance receptor, Dros 4.2
428231 U17989 Hs.183105 nuclear autoantigen 4.2
447966 AA340605 Hs.105887 ESTs, Weakly similar to Homolog of rat Z 4.1
432274 AK000382 Hs.274251 hypothetical protein FLJ20375; KIAA1797 4.1
432621 AI298501 Hs.21192 ESTs, Weakly similar to T46428 hypotheti 4.1
408063 BE086548 Hs.42346 calcineurin-binding protein calsarcin-1 4.1
407192 AA609200 gb:af12e02.s1 Soares_testis_NHT Homo sap 4.1
426716 NM_006379 Hs.171921 sema domain, immunoglobulin domain (Ig), 4.1
427479 BE410092 Hs.178471 KIAA0798 gene product 4.1
434194 AF119847 Homo sapiens PRO1550 mRNA, partial cds 4.1
422072 AB018255 Hs.111138 KIAA0712 gene product 4.1
411145 BE439553 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, 4.1
449459 BE546846 Hs.195048 ESTs 4.1
421662 NM_014141 Hs.106552 cell recognition molecule Caspr2 4.1
445309 AL157474 Hs.12504 likely ortholog of mouse Arkadia 4.1
412520 AA442324 Hs.795 H2A histone family, member O 4.0
448663 BE614599 hypothetical protein MGC14797 4.0
425689 W16480 Hs.24283 ESTs, Moderately similar to reduced expr 4.0
433647 AA603367 Hs.222294 ESTs 4.0
426030 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 4.0
418838 AW385224 Hs.35198 ectonucleotide pyrophosphatase/phosphodi 4.0
439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 4.0
422634 NM_016010 Hs.118821 CGI-62 protein 4.0
453469 AB014533 Hs.33010 KIAA0633 protein 4.0
418827 BE327311 Hs.47166 HT021 4.0
446791 AI632278 Hs.195922 ESTs 4.0
443884 N20617 Hs.194397 leptin receptor 4.0
416857 AA188775 Hs.292453 ESTs 4.0
401519 C15000476*:gi|12737279|ref|XP_012163.1| 4.0
407846 AA426202 Hs.40403 Cbp/p300-interacting transactivator, wit 4.0
431689 AA305688 Hs.267695 UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 3.9
408784 AW971350 Hs.63386 ESTs 3.9
435655 AW105663 Hs.6947 HSPC069 protein 3.9
432615 AA557191 Hs.55028 ESTs, Weakly similar to 154374 gene NF2 3.9
448804 AW512213 Hs.342849 ADP-ribosylation factor-like 5 3.9
430389 AL117429 Hs.240845 DKFZP434D146 protein 3.9
432278 AL137506 Hs.274256 hypothetical protein FLJ23563 3.9
414869 AA157291 Hs.21479 ubinuclein 1 3.9
429163 AA884766 gb:am20a10.s1 Soares_NFL_T_GBC_S1 Homo s 3.9
444190 AI878918 Hs.10526

443991 NM_002250 Hs.10082

433339 AF019226 Hs.8035 

435021 AA922192 



412834 R77123 
AA507576 



Hs.73962 
Hs.79881 
Hs.288361 
AW612232 Hs.254835 



408833 

432952 AA813887 Hs.188173 
456177 NMJH2391 HsJ9414 



cysteine and glycine-rich protein 2
potassium intermediate/small conductance
glioblastoma overexpressed
ESTs

Homo sapiens cDNA: FLJ23008 fis, clone L
Homo sapiens cDNA: FLJ22696 fis, clone H
ESTs

Homo sapiens cDNA FLJ12187 fis, clone MA
prostate epithelium-specific Ets transcr



433345 
421887 
434614 AI249502 



AI681545 Hs.152982 hypothetical protein FLJ13117
AW161450 Hs.109201 CGI-86 protein 



425910 
432370 
413010 



AA830797 
AA308334 
AA393273 



435114 AA775483 
439024 
426991 
427871 
439671 

418113 AI272141 

422424 A1186431 

447439 AA313565 

414178 AW957372 Hs.46791 

427099 AB032953 

407908 BE379758 

440695 AW088363 Hs.246240 

429686 AI871613 Hs.28538 

447533 NM.004786 Hs,18792 

411190 AA306342 Hs.69171 
AL1 18747 



AK001536 

AW992405 Hs.59622 
AW162840 Hs.6641 
Hs.83484 
Hs.296638 
Hs.145020 



426126 

434263 N34895 

414407 AA147026 

432426 AW973152 

425770 NMJM363 Hs.1 59492 

432579 AF043244 Hs.278439 

444101 R19175 

432908 AI861896 

432800 BE391046 

429966 BE081342 



419972 AL041465 
410001 AB041036 



Hs^9669 ESTs 
Hs.184760 CCAAT-box-binding transcription factor
Hs.274424 N-acetylneuraminic acid phosphate syntha
Hs.75133 transcription factor 6-like 1 (mitochond
Hs.288936 mitochondrial ribosomal protein L9
Hs.35598 ESTs

Homo sapiens cDNA FLJ10674 fis, clone NT
Homo sapiens, clone IMAGE:3507281, mRNA,
kinesin family member 5C
SRY (sex determining region Y)-box 4
prostate differentiation factor
ESTs, Weakly similar to KIAA1205 protein
ESTs, Weakly similar to 138022 hypotheti
Hs.173560 odd Oz/ten-m homolog 2 (Drosophila, mous
Hs.110853 uncharacterized hematopoietic stem/proge
ESTs

Homo sapiens cDNA: FLJ21086 fis, clone C
thioredoxin-like, 32kD
protein kinase C-like 2
ESTs
ESTs
ESTs
ESTs

spastic ataxia of Charlevoix-Saguenay (s
nucleolar protein 3 (apoptosis repressor
ribosomal protein L32
ESTs

AIM-1 protein
HSPC039 protein



Hs.26691
Hs.44648 
Hs.76704 
Hs.31050 



Hs.169793 

Hs.278962 
Hs.283037 



Hs.182982 golgin-67
Hs.57771 kallikrein 11
Hs.125291 



432205 AI806583 Hs.125291 ESTs
401197 ENSP00000229263*^SPC213.
420061 AW024937 Hs.29410 ESTs

431103 M57399 Hs.44 pleiotrophin (heparin binding growth fac

433577 AW007080 Hs.284192 ESTs

435017 AA336522 Hs.12854 angiotensin II, type I receptor-associat
420805 L10333 Hs.99947 reticulon 1
Target Exon 

Hs.118569 Dvrt-binding protein I0AX (inhibition of 
NM_016156 Hs.181326 KIAA1073 protein

NML02217(T:Homo sapiens Williams-Beuren 
Human DNA sequence from clone RP1-63M2 o
432388 X15218 Hs.2969 v-ski avian sarcoma viral oncogene homol
438157 AW137011 Hs.49576 ESTs
451945 BE504055 Hs.211420 ESTs

452721 AJ269529 Hs.301871 solute carrier family 37 (glycerol-3-pho
454229 AW957744 Hs.278469 lacrimal proline rich protein 



H91882 



401866 
457183 
427982 
405387 

449933 AW157098 Hs.324104 
X15218 



416974 AF010233 
421537 BE383488 



Hs.80667 
Hs.105547 



RALBP1 associated Eps domain containing 
neural proliferation, differentiation an 



428738 NM_000380 Hs.192803 xeroderma pigmentosum, complementation g

440494 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha

407748 AL079409 Hs.38176 KIAA0606 protein; SCN Circadian Oscillat

453049 BE537217 Hs.30343 ESTs

Hs.103135 ESTs, Weakly similar to SFR4_HUMAN SPLIC

Hs.83765 dihydrofolate reductase

Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU S

NM_001429 Hs.25272 E1A binding protein p300

Hs.121429 zinc-binding protein Rbcc728

Hs.21906 Homo sapiens clone 24670 mRNA sequence

Hs.144630 nuclear receptor subfamily 2, group F, m

Hs.330994 RAB9, member RAS oncogene family

Hs.97101 putative G protein-coupled receptor

Hs.180428 KIAA1181 protein

Hs.172350 HIR (histone cell cycle regulation defec

Hs.93564 Homer, neuronal immediate early gene, 2

NM_002753*:Homo sapiens mitogen-activate

Hs.104207 ESTs

438523 H66220 Hs.278177 ESTs

438705 AI049624 Hs.283390 ESTs, Weakly similar to 2109260A B cell

Hs.255416 hypothetical protein FLJ21986

Hs.5163 ESTs, Weakly similar to AF151840 1 CGI-8



440494 BE618768 

407748 AL079409

453049 BE537217 

422089 AA523172 

408096 BE250162 

407204 R41933 
450649 

436489 AJ272269 

447818 W79940 

424285 BE207168 

452260 AA453208

420344 BE463721 



427715 
426793 
419879 
403752 
418559 



BE245274 

X89887 

Z17805 

AA225048 



443960 AI093577
412513 AA322599 
428695


AI355647 


Hs.189999


419517 


AF052107 


Hs.90797 


446494 


AA463276 


Hs.288906 


417515 


L24203 


Hs.82237 


433891 


AA613792 




427315 


AA179949 


Hs.175563 


445525 


BE149866 


Hs.14831 


434672 


AW294020 


Hs.117721


453288 


AW583292 


Hs.274412 


434970 


AW272262 


Hs.225767 


417193 


AI922189 


Hs^88390 


437617 


AJ026701 


Hs.5716 


445943 


AW898533 


Hs.181574 


412628 


AJ972402 


Hs.306051 


452167 


N75238 


Hs.13075


410467 


AF102546 


Hs.63931


440193 


AW902312 


Hs.7037 


451615 


AW974911 


Hs.184793 


429170 


NMJJ01394 


Hs.2359 


410082 


AA081594 


Hs.158311 


436087 


BE300296 


Hs.5054 


437898 


W81260 


Hs.43410 


451131 


AI267586


Hs.268012


439509 


AW971945 


Hs.293236 


443555 


N71710 


Hs.21398 


418610 


AW245993 


Hs.223394 


409960 


BE261944 




425263 


NMJXJ1197 


Hs.155419 


404210 






442323 


AW016669 


Hs.29190 


452707 


AI093823 


Hs.45070 


421437 


AW821252 


Hs.104336 


426514 


BE616633 


Hs.170195



450628 AW382884 Hs.204715 

446825 BE266822 Hs.344097 

421867 AA481078 Hs.109045

434954 AF161455 Hs.284295 

452908 AB001451 Hs.30965

429922 Z97630 Hjl226117 

448734 BE614070 Hs.326416 

434293 NM_004445 Hs.3796



414222 


AL135173 




421674 


T10707 


Hs.296355 


438279 


AA805166 


Hs.154762 


411078 


AI222020


Hs.182364 


419749 


X73608 


Hs.93029 


432302 


AA345857 


Hs.274307 


407944 


R34008 


Hs.239727 


431674 


AA098901 


Hs^01642 


408291 


AB023191 


Hs.44131 


431578 


AB037759 


Hsl261587 


440789 


AB007857 


Hs.7416 


434958 


T99949 


Hs.303428 


452295 


BE379936 


Hs.28866 


417229 


AA975096 


Hs.19522 


439593 


BE073597 


Hs.124863 


407136 


T64896 


Hs.287420 


421044 


AF061871 


Hs.101302 


427461 


AA531527 


Hs.332040 


417315 


AI080042


Hs.180450 


423392 


AA195037 


Hs.169341 


412652 


AI801777 




415172 


AF079529 


Hs.78106 


418372 


AA311833 


Hs.84318 


447958 


AW796524 


Hs.68644 


425010 


T16837 


Hs.4241 


420460 


AA262331 


Hs.48376 


443181 


AI039201 


Hs.283316 


442064 


AI422867 


Hs.88594 


431663 


NM_016569


Hs.267182 


412926 


AI879076 


Hs.75061 


420818 


AW969635 


Hs.33032 


408829 


NM_006042


Hs.48384 


408681 


AW953853 


Hs.292833 


436278 


BE396290 


Hs.5097 


424560 


AA158727 


Hs.150555


417215 


BE253181 


Hs.81687 


418166 


AI754416 




434629 


M7890B1 


Hs.4029 


427398 


AW390020 


Hs.20415 


457489 


AI693815


Hs.127179 


434747 


AA837085




413125 


BE244589 


Hs.75207 



purinergic receptor (family A group 5)
Homo sapiens clone 23620 mRNA sequence
WW Domain-Containing Gene
ataxia-telangiectasia group D-associated
gfcno97h03^1 NO CGAPJtt Homo sapiens
Homo sapiens mRNA; cDNA DKFZp564N0763 (f
Homo sapiens, Similar to zinc finger pro
ESTs

similar to yeast Upf3, variant A
ESTs

hypothetical protein FLJ22795
KIAA0310 gene product
ESTs

hypothetical protein MGC2648

Homo sapiens cDNA: FLJ23013 fis, clone L

dachshund (Drosophila) homolog

Homo sapiens clone 24923 mRNA sequence

Homo sapiens cDNA: FLJ21880 fis, clone H

dual specificity phosphatase 4

Musashi (Drosophila) homolog 1

CGI-133 protein

ESTs

fatty-acid-Coenzyme A ligase, long-chain
ESTs

ESTs, Moderately similar to A Chain A, H
hypothetical protein MGC2742
hexokinase 1

BCL2-interacting killer (apoptosis-induc
NM_005936:Homo sapiens myeloid/lymphoid
ESTs
ESTs

hypothetical protein

bone morphogenetic protein 7 (osteogenic
ESTs

filamin A, alpha (actin-binding protein-
hypothetical protein FLJ10498
Homo sapiens HSPC337 mRNA, partial cds
neuronal Shc adaptor homolog
H1 histone family, member 0
Homo sapiens mRNA; cDNA DKFZp564H1916 (f
EphB6

sorbitol dehydrogenase
hypothetical protein FLJ23138
HIV-1 rev binding protein 2
CocoaCrisp

sparc/osteonectin, cwcv and kazal-like d
KIAA1442 protein
desmocollin 2
G-protein coupled receptor
KIAA0974 protein
GCN2 eIF2alpha kinase
KIAA0397 gene product
Homo sapiens cDNA FLJ14832 fis, clone OV
programmed cell death 10
hypothetical protein PRO2849
ESTs

Homo sapiens cDNA FLJ11533 fis, clone HE
Human DNA sequence from clone RP1-238D15
hypothetical protein MGC13010
ribosomal protein S24
HTPAP protein
ESTs

phosphodiesterase 8B
replication protein A1 (70kD)
Homo sapiens microsomal signal peptidase
ESTs

Homo sapiens clone HB-2 mRNA sequence
ESTs
ESTs

TBX3-iso protein

macrophage myristoylated alanine-rich C
ESTs, Weakly similar to 1207289A reverse
heparan sulfate (glucosamine) 3-O-sulfot
ESTs, Weakly similar to 138022 hypotheti
synaptogyrin 2

protein predicted by clone 23733
non-metastatic cells 3, protein expresse
Cdc42 effector protein 3
glioma-amplified sequence-41
chromosome 21 open reading frame 11
cryptic gene
ESTs

glyoxalase I
435561


AA351978 


Hs.4943 


447210 


AF035269 


Hs.17752 


422522 


AI023428 


Hs.34549 


423396 


AI382555


Hs.127950 


401785 






441345 


AW068579


Hs.7780 


451404 


AA460775 


Hs.6295 


422173 


BE385828 


Hs.250619 


452099 


BE612992 


Hs.27931 


452826 


BE245286 


Hs.301636 


419829 


AI924228 


Hs.115185


407688 


W25317 


Hs.37616 


406214 






416292 


AA179233 


Hs.42390 


438714 


AA814859 


Hs.160074 


433213 


AW665130 


Hs.137190 


412170 


016532 


Hs.73729 


409340 


BE174629 


Hs.321130 


413142 


M81740 


Hs.75212 


451338 


AW612322 


Hs.19131 


437179 


AA393508 




418700 


AI963808


Hs.86970 


438361 


AA805666 


Hs.146217 


450546 


AA010200 


Hs.175551 


409995 


AW960597 


Hs.129206 


408739 


W01556 


Hs.44685 


438821 


AA826425 


Hs.192375 


403532 






452941 


AL110347 


Hs.31074 


414341 


D80004 


Hs.75909 


423044 


AA320829 


Hs.97266 


429716 


R25685 


Hs.211933 


444367 


H54892 


Hs.10974 


423551 


AA327598 


Hs.89633 


453186 


AK001708 


Hs.32271 


413550 


W03011 


Hs.306881 


431933 


AJ187057 


Hs.132554 


439778 


AL109729 


Hs.99364 



422017 

421254 

424339 

445109 

425320 

426170 

408770 

442202 

417622 

439864 

433036 

447050 

436299 

408380 

407910 

445098 

431797 

437295 

452627 

448913 

415079 

408157 

443337 

410310 

437546 

451134 

423932 

433495 

453293 

428171 

428293 



406627 
418259 
426110 
411089 
431416 
431631 
409251 
434974 
447147 
450937 
432728 
433037 



AJ458682 
NM_003877
AK001724 
BE257148 
AF039916 
U29344 
BE161065 
AW270608 
BE272862 
AW298163 
AJ720078 
AA574091 
NMJ16314 
AK000767 
AF123050 
AA650274 
AL050272 
BE169641 
AA350994 
AI122843 
AA194422 

R43179 
M047685 

Y07604 

J02931 

AW074836 

AA318315 

T95633 

AW373784 

AA382267 

AA489323 

BE250944 

NMJH5865 

T64904 

AA215404 

NMJXJ2913 Hs.166563 
AA456454 



Hs.110776 
Hs.102950 

Hs.12330 
Hs.83190 
Hs.167531 
Hs.170195 
Hs.106534 
Hs.82318 
Hs.291997 
Hs.105964 
Hs.17200
Hs.5111 
Hs.44532 
Hs.41296 
Hs.12305 
Hs.270134 
Hs.20281 
Hs.184319 
Hs.22564 
Hs.22895 
Hs.62946 
Hs.9235 
Hs.62192 
Hs.173984 
Hs.25999 
Hs.189703 
Hs.71 
Hs.10653 
Hs.182825 
Hs.183556 
Hs.171731 
Hs.163780 



AA532718 

AA548906 

R10723 

AA778711 

AA910353 

R49131 

NM_006979 

NM_014158



Hs.122244 
Hs.20573 



Hs.75432 
Hs.26267 
Hs.278721 
Hs.279938 



hepatocellular carcinoma associated prot
phosphatidylserine-specific phospholipas
ESTs, Highly similar to S94541 1 clone 4
bromodomain-containing 1
NM_002275*:Homo sapiens keratin 15 (KRT1
Homo sapiens mRNA; cDNA DKFZp564A072 (fr
ESTs, Weakly similar to T17248 hypotheti
phorbolin-like protein MDS019
hypothetical protein FLJ10607 similar to
peroxisomal biogenesis factor 6
ESTs, Moderately similar to PC4259 ferri
Human D9 splice variant B mRNA, complete
NM_016371:Homo sapiens hydroxysteroid (1
nasopharyngeal carcinoma susceptibility
ESTs
ESTs

very low density lipoprotein receptor
hypothetical protein MGC2771
ornithine decarboxylase 1
transcription factor Dp-2 (E2F dimerizat
serologically defined colon cancer antig
ESTs, Moderately similar to AWS.HUMAN A
Homo sapiens cDNA: FLJ23077 fis, clone L
ESTs
ESTs

ESTs, Moderately similar to I38022 hypot
ESTs

NM_024638:Homo sapiens hypothetical prot

N-sulfoglucosamine sulfohydrolase (sulfa

KIAA0182 protein

protocadherin 18

collagen, type XIII, alpha 1

hypothetical protein FLJ22390

ESTs

hypothetical protein FLJ10846

MSTP043 protein

ESTs

putative transmembrane protein
gb1k13e01 j(1 NCLCGAPJJU24 Homo sapiens
STAT induced STAT inhibitor-2
coat protein gamma-cop
endoglycan

ectonucleoside triphosphate diphosphohyd
fatty acid synthase

methylcrotonoyl-Coenzyme A carboxylase 2
bone morphogenetic protein 7 (osteogenic
hypothetical protein FLJ22625
WAS protein family, member 3
ESTs, Weakly similar to A47582 B-cell gr
ESTs

STAM-like protein containing SH3 and ITA
hypothetical protein FLJ20729
diubiquitin

fibronectin leucine rich transmembrane p
DKFZP566B183 protein
hypothetical protein FLJ20280
KIAA1700

ESTs, Weakly similar to KIAA1006 protein
myosin VI

hypothetical protein FLJ23548
ESTs

non-metastatic cells 4, protein expresse
coagulation factor III (thromboplastin,
T-box 1

hypothetical protein FLJ22195
ESTs

alpha-2-glycoprotein 1, zinc
ESTs

ribosomal protein L35
solute carrier family 1 (neutral amino a
solute carrier family 14 (urea transport
ESTs
ESTs

replication factor C (activator 1) 1 (14
cell division cycle 2-like 1 (PITSLRE pr
ESTs
ESTs
ESTs

eukaryotic translation initiation factor
ESTs, Weakly similar to T23482 hypotheti
ATP-dependant interferon response protei
HLA class II region expressed gene KE4
HSPC067 protein
411598 BE336554 Hs.70937 H3 histone family, member A 3.0
430268 AK000737 Hs.237480 hypothetical protein FLJ20730 3.0
443725 AW245580 Hs.9701 growth arrest and DNA-damage-inducible, 3.0
431155 AW971213 gb:EST383301 MAGE resequences, MAGL Homo 3.0
407355 AA846203 Hs.193974 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.0
438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 3.0
434392 AW983709 Hs.250824 Homo sapiens cDNA: FLJ23435 fis, clone H 3.0
432810 AA863400 Hs.23054 ESTs 3.0
430333 S70114 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi 3.0
445596 R89543 Hs.12942 vesicle trafficking protein 3.0
406670 W79632 Hs.256301 hypothetical protein MGC13170 3.0
452779 AA418775 Hs.47234 ESTs 3.0
439680 AW245741 Hs.58461 ESTs, Weakly similar to A35659 krueppel- 3.0
408298 AI745325 Hs.271923 Homo sapiens cDNA: FLJ22785 fis, clone K 3.0
424395 AA165082 Hs.146388 microtubule-associated protein 7 2.9
423472 AF041260 Hs.129057 breast carcinoma amplified sequence 1 2.9
425920 AL049977 Hs.162209 claudin 8 2.9
409927 T69981 gb:yc19d03.r1 Stratagene lung (937210) H 2.9
447881 BE620886 GCN1 (general control of amino-acid synt 2.9
439584 AA838114 Hs.221612 ESTs 2.9
417601 NM_014735 Hs.82292 KIAA0215 gene product 2.9
452268 NM_003512 Hs.28777 H2A histone family, member L 2.9
422576 BE548555 Hs.118554 CGI-83 protein 2.9
438510 AL080220 Hs.6285 DKFZP586P0123 protein 2.9
423217 NM_000094 Hs.1640 collagen, type VII, alpha 1 (epidermolys 2.9
427581 NMJM788 Hs.179703 KIAA0129 gene product 2.9
447963 AI452973 Hs.165900 ESTs, Weakly similar to ALU1_HUMAN ALU S 2.9
443015 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 2.9
429351 AK001490 Hs.200016 hypothetical protein FLJ10628 2.9
425294 AF033827 Hs.155553 HNK-1 sulfotransferase 2.9
452955 AW390282 Hs.31130 transmembrane 7 superfamily member 2 2.9
416439 AA180363 Hs.118769 ESTs 2.9
442315 AA173992 Hs.7956 ESTs, Moderately similar to ZN91_HUMAN Z 2.9
446911 N27605 Hs.16492 DKFZP564G2022 protein 2.9
428801 AW277121 Hs.254881 ESTs 2.9
430462 AI584156 Hs.105640 Homo sapiens, clone IMAGE:4139775, mRNA, 2.9
420552 AK000492 Hs.98806 hypothetical protein 2.9
428180 AI129767 Hs.182874 guanine nucleotide binding protein (G pr 2.9
418222 AI675881 Hs.86538 ESTs 2.9
433009 AA761668 gb:nz24c08.s1 NCI_CGAP_GCB1 Homo sapiens 2.9
432140 AK000404 Hs.272688 hypothetical protein FLJ20397 2.9
434517 AA635690 Hs.337251 hypothetical protein MGC2467 2.9
448718 AA220235 Hs.153959 hypothetical protein MGC1543S 2.9
408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 2.9
419849 BE041436 Hs.93379 eukaryotic translation initiation factor 2.9
452256 AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 fis, clone HE 2.9
430375 AW371048 Hs.93758 H4 histone family, member H 2.9
409650 T08490 Hs.288969 HSCARG protein 2.9
413588 AA971014 Hs.75432 IMP (inosine monophosphate) dehydrogenas 2.9
412719 AW016610 ESTs 2.9
438552 AJ245820 Hs.6314 type I transmembrane receptor (seizure- 2.9
439621 AI200281 Hs.123910 ESTs, Highly similar to B34087 hypotheti 2.9
437050 AA766420 ESTs 2.9
414256 AW410035 Hs.75862 MAD (mothers against decapentaplegic, Dr 2.9
429343 AK000785 Hs.199480 Homo sapiens, Similar to epsin 3, clone 2.9
416941 BE000150 Hs.48778 niban protein 2.9
421594 R45689 Hs^1889 Homo sapiens cDNA FLJ12978 fis, clone NT 2.9
436213 AA325512 Hs.71472 hypothetical protein FLJ10774; KIAA1709 2.9
436823 AW749865 ESTs, Weakly similar to I38022 hypotheti 2.8
441266 H15968 Hs^93845 Homo sapiens, clone IMAGE:3502329, mRNA, 2.8
426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f 2.8
413746 AA133243 Hs.171553 ESTs, Moderately similar to CLC3_HUMAN C 2.8
457733 AW974812 ESTs 2.8
400860 Target Exon 2.8
438165 AJ373544 Hs.331328 intermediate filament protein syncoilin 2.8
433312 AI241331 Hs.131765 ESTs, Moderately similar to 138937 DNA/R 2.8
431604 AF175265 Hs.264190 vacuolar protein sorting 35 (yeast homol 2.8
421717 AF230924 Hs.107187 divalent cation tolerant protein CUTA 2.8
453912 AL121031 SWI/SNF related, matrix associated, acti 2.8
435703 AW630133 Hs.83313 GKD03 protein 2.8
431108 AA991508 Hs.105317 ESTs 2.8
451040 AA324743 Hs.40808 ESTs 2.8
443837 AI984625 Hs.9884 spindle pole body protein 2.8
418196 AI745649 Hs^6549 KIAA1708 protein 2.8
450447 AF212223 Hs^5010 hypothetical protein P15-2 2.8
409330 AK001231 rte^3940 hypothetical protein FLJ10369 2.8
413762 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8
433586 T85301 Hs.194397 gb:yd78d06.s1 Soares fetal liver spleen 2.8
424720 M89907 Hs.152292 SWI/SNF related, matrix associated, acti 2.8
422027 AL043100 Hs.306319 fatty acid amide hydrolase 2.8
422717 AI557623 Hs.119475 cold inducible RNA-binding protein 2.8
412843 AF007555 Hs.74624 protein tyrosine phosphatase, receptor t 2.8
434237 AF119908 Hs.235516 hypothetical protein PRO2955 2.8
Hs.256301
Hs.256549 
423318 AW467064 Hs.5740 
454144 BE280478 
452827 AI571835 
452040 AW973242 Hs.293690 
418830 BE513731 Hs.88959 
417517 AF001176 
419286 AA236005 



Hs.182695 
Hs.55468 



Hs.82238 
Hs.128764 



AW182459 Hs.125759 
U77413 Hs.100293 
Hs.266619 



430287 
420911 

428054 A1948588 

409504 AA304961 

440080 AW051597 

431609 AW792792 Hs.264330 

447896 AI436124 Hs.294069 

431525 AA506656 

442995 AA532511 



Hs.6185 
Hs.288455 

433101 AW572317 Hs.12082 
449052 AW029507 Hs.161102 ESTs 
418663 AK001100 Hs.41690 desmocdlin3 
Hs.10600 



hypothetical protein MGC13170
nucleotide binding protein 2 (E.coli Min
ESTs

hypothetical protein MGC3243
ESTs

ESTs, Weakly similar to 138022 hypotheti
hypothetical protein MGC4816
POP4 (processing of precursor , S. cerev
ESTs

ESTs, Weakly similar to LEU5_HUMAN LEUKE
O-linked N-acetylglucosamine (GlcNAc) tr
ESTs

peptidylprolyl isomerase B (cyclophilin
ESTs

N-acylsphingosine amidohydrolase (acid c
Homo sapiens cDNA FLJ13384 fis, clone PL
KIAA1557 protein

Homo sapiens cDNA: FLJ23270 fis, clone C
Homo sapiens mRNA; cDNA DKFZp566I203 (fr



442013 AA506476 
449051 
453038 

419175 AW270037 
426788 U66615 
421091 W22821 

413435 X51405 Hs.75360 

432651 AW973744 Hs.293100 

452700 AI859390 Hs.2B8940 

454148 AW732837 Ks.42390 

419452 U33635 Hs.90572 

425491 AA883316 Hs.255221 

414556 AW975063 Hs.343443 

435126 A1393666 Hs.42315 

AF217515 Hs.283532 
AA570256 

440340 AW895503 Hs.125276 
H&298258 
Hs.119122 

406789 AI041403 
424130 AL050136 
449518 BE395253 
410099 AA081630 
422000 M30599 
AA018042 



435602 
441128 



410339 AI916499 
430542 A1557486 



Hs.172280 



Hs.140945 
Hs.30861 



407110 

415443 T07353 
432745 AI821926 



Hs.1 10637 
Hs.252085 
Hs.7948 



AA045650 
424369 R87622 



417831 
424723 



Human DNA sequence from clone RP11-353C1
AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2,
AW292415 Hs.20509 HBV pX associated protein-8
KIAA0779 protein

SWI/SNF related, matrix associated, acti
ribosomal protein U6
carboxypeptidase E
ESTs

five-span transmembrane protein M83
nasopharyngeal carcinoma susceptibility
PTK7 protein tyrosine kinase 7
ESTs

ribosomal protein L36
p10-binding protein

uncharacterized bone marrow protein BM03
ESTs, Weakly similar to T23273 hypotheti
ESTs
ESTs

ribosomal protein L13a
ribosomal protein L29

Homo sapiens mRNA; cDNA DKFZp586L141 (fr
ESTs

KIAA0036 gene product
homeobox A10

Prader-Willi/Angelman syndrome^
ESTs

gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens
Hs.53125 small nuclear ribonucleoprotein D2 polyp
Hs^67U KIAA1831 protein
Hs.82685 CD47 antigen (Rh-related antigen, integr
Hs.152337 protein arginine N-methyltransferase 3 (h
ESTs, Highly similar to T46395 hypotheti
Eos Control
KIAA1089 protein

ESTs, Weakly similar to I38022 hypotheti
CED-6 protein
hypothetical protein

hypothetical protein DKFZp762N2316; KIAA
phosphodiesterase 4B, cAMP-specific (dun
ESTs

KIAA0776 protein

nuclear matrix protein NMP200 related to
Homo sapiens cDNA FLJ12300 fis, clone MA
polymerase (DNA directed), beta
ribosomal protein L29
Hs.253594 intron of:trichorhinophalangeal syndro

ribosomal protein L4
Hs.179516 Integral type I protein
Hs.182285 ESTs
Hs.18103 ESTs
Hs.121820 ESTs

Hs.336425 Homo sapiens, clone MGC:17296, mRNA, com
Hs.296141 ESTs
Hs.1437 glucosidase, alpha; acid (Pompe disease,
Hs.70769 hypothetical protein DKFZp586E1923
Hs.116502 ESTs

Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ
Hs.101842 AT-binding transcription factor 1
Hs.158184 ORF
Hs.115838 ESTs

Hs.334692 hypothetical protein MGC14141
Hs.169119 ESTs, Weakly similar to T25731 hypotheti



H16423 
BE409813 



451428 AW083364 Hs.11067 



Hs.4990 

Hs.255883 

Hs.107056 

Hs.101590 

Hs.16493 

Hs.188 



435750 AB029012 

443804 AL135352 

421709 AA159394 

421077 AK000061 

416359 AL042210 

442104 L20971 

411605 AW006831 

437133 AB018319 

407236 W79485 

432125 AW972657 Hs,183006 

427657 AV652249 Hs.180107 

445636 AW105401 
AW206942 



Hs^450 
Hs.173980 



443801 

420223 N27807 

427515 T79526 

447673 AI823987 

411960 R77776 

433212 BE218049 

448244 BE613416 

441551 AA318224 

421883 X55079 

411562 AL050201 

434851 AA806164 

442875 BE623003 

421109 L32832 

432450 A1990739 

426174 AA547959 

445919 T53519 

423115 AA421973 
2.8
2.8
2.7
2.7
2.7
2.7
431222
451558 
431615 
418849 
431122 
438682 
424954 
458082 
427087 
412524 
435176 
419390 
428013 
441094 
451356 
408461 
418516 
423476 
444034 
404913 
407378 
441054 
428500 
430512 
433301 
444381 
443666 
427378 
439414 
421866 
421605 
426797 
410935 
407887 
430341 
424956 
452748 
408212 
407893
425976 
448424 
420164 
443444 
412774 
445922 
429925 
419222 
417327 
420337 
408743 
434011 
434976 
417454 
448789 
441049 
437271 
425017 
440333 
434551 
413132 
409619 
444550 
446258 
440191 
453187 
412133 
426841 
435937 



456161 
455236 
435025 
416751 
458946 
424762 
449458 
432409 
425184 
442336 
445622 
440409 
444610 
425863 



X56777 Hs.273790 zona pellucida glycoprotein 3A (sperm re
NM_001089 Hs.26630 ATP-binding cassette, sub-family A (ABC1
AW295859 Hs.235860 ESTs

AKJ96485 Hs.169341 ESTs, Moderately similar to S65657 alpha
AI267593 Hs.250535 Homo sapiens mRNA; cDNA DKFZp434N2412 (f
AA354489 Hs^22103 EBP50-PDZ interactor of 64 kD
NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome)
AW978811 Hs.314451 ESTs, Weakly similar to ALU1_HUMAN ALU S
BE073913 Hs.173515 uncharacterized hypothalamus protein HT0
AA417813 Hs.44208 hypothetical protein FLJ23153
AA744875 Hs.189413 ESTs
AJ701162 Hs.90207 hypothetical protein MGC11138
AF151020 Hs.181444 hypothetical protein
U33819 Hs.7647 MYC-associated zinc finger protein (puri
AA748418 Hs.164577 ESTs
AB037756 Hs.45207 hypothetical protein KIAA1335
NM_006218 Hs.85701 phosphoinositide-3-kinase, catalytic, al
AL035633 Human DNA sequence from clone RP5-1046G1

AL161957 Hs.10177 pleckstrin homology domain interacting p

NM_024408*:Homo sapiens Notch (Drosophil
Hs.57776 ESTs, Moderately similar to 138022 hypot
ESTs

Hs.184641 fatty acid desaturase 2
Hs.241578 U6 snRNA-associated Sm-like protein LSm8
Hs.152016 Homo sapiens cDNA: FLJ22140 fis, clone H
Hs.283713 ESTs, Weakly similar to S64054 hypotheti
Hs.41528 ESTs, Weakly similar to neuronal thread
Hs.177556 melanoma antigen, family D, 1
Hs.6551 ATPase, H transporting, lysosomal (vacuo
Hs.1435 guanosine monophosphate reductase
Hs.106127 RNA polymerase I 16 kDa subunit
Hs.342849 ADP-ribosylation factor-like 5
Hs.65881 dynein, cytoplasmic, intermediate polype
Hs.41072 serine (or cysteine) proteinase inhibito
Hs.239631 golgi transport complex 1 (90 kDa subuni
Hs.18724 ESTs, Weakly similar to granule cell mar
Hs.30512 Homo sapiens mRNA for KIAA0556 protein,
Hs.43728 hypothetical protein
Hs.43621 Homo sapiens, Similar to hypothetical pr
Hs.334514 NG22 protein
Hs.31924 ESTs
Hs.34908 ESTs

Hs.17235 Homo sapiens clone TCCCIA00176 mRNA sequ
Hs.23136 ESTs
Hs.147998 ESTs

cytochrome P450, 51 (lanosterol 14-alpha
Hs.89718 spermine synthase
Hs.81964 SEC24 (S. cerevisiae) related gene famil
Hs.14555 Homo sapiens cDNA: FLJ21513 fis, clone C
Hs.47367 KIAA1785 protein
Hs.5486 clone FLB5214
ESTs

Hs.172458 iduronate 2-sulfatase (Hunter syndrome)
Hs.22051 hypothetical protein MGC15548
Hs.29341 hypothetical protein FLJ22376
Hs.28846 Homo sapiens mRNA; cDNA DKFZp5660134 (fr
Hs.26409 ESTs

Hs^88761 hypothetical protein FLJ21749
Hs.280a58 ESTs, Highly similar to A35661 DNA excis
Hs.75209 protein kinase (cAMP-dependent, catalyti
Hs.55220 BCL2-associated athanogene 2
Hs.87614 ESTs
Hs.263478 ESTs

tubulin, beta 5

ESTs, Highly similar to S94541 1 clone 4
solute carrier family 31 (copper transpo
ESTs
ESTs

hypothetical protein DKFZp434H247
hypothetical protein FLJ21918
regulator of G-protein signalling 10
anchor attachment protein 1 (Gaa1p, yeas
Homo sapiens mRNA; cDNA DKFZp586P1622 (f
ESTs



AA299264 
AA913591 
AI815395 
AF182294 
AW296280 
BE387335 
AI458179 
BE515037 
NM_001183 
M24470 
BE440108 
AW936258 
BE067395 
AA579668 
NMJM6348 
AW198103 
AB011128 
AA297567 
BE408359 
C75094 
AW009892 
AW339037 
AW952619 
AA120855 
AI337316 
NM_000786 
AD001528 
NM.004922 
AW295840 
AL110246 
AW953437 
AI963821 
NMJ00202 
BE539108 
W88920 
AL137445 
AL119305 
AI378424 
BE387162 
NM_006823 
AK001015 
BE250716 
AI283476 
A1990417 
AI161383 
U83460 
AI052358 
AA830893 
AW410646 
BE264645 
AF045229 
T08990 
T48130 
AA009715 
AL119442 
AI805078 
AA806538 
BE278288 
AW340958 
AF106697 
AW294315 
AI174783 
U43604 



Hs.34549 
Hs.73614 
Hs.193726 
Hs.1 19769 
Hs.164649 
Hs.282093 
Hs.82280 
Hs.4742 
Hs.5897 
Hs.42311 
Hs.183684 
Hs.208261 
Hs.130732 
Hs.155048 
Hs.7572 
Hs.12971 



Hs.159901 



eukaryofic translation iruBation factor 
ESTs 

KIAA1575 protein 

Lutheran blood group (Auberger b antigen 
ESTs 

thloredoxin reductase beta 
ESTs 

gb:HA2501 Human fetal liver cONA library 
Human unidentified mRNA, partial sequenc 



Z7 

2.7 

2.7 

Z7 

Z7 

Z7 

Z7 

Zl 

Z7 

Z7 

Z7 

2.7 

Z7 

2.7 

2.7 

2.7 

Z7 

Z7 

Z7 

Z7 

2.7 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

Z6 

Z6 

Z6 

Z6 

2.6 

Z6 

2.6 

2.6 

Z6 
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2.6 

2.6 

Z6 

2.6 

2.6 

Z6 

Z6 

2.6 

Z6 

2.6 

2.6 

2.6 

2.6 

Z6 
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Z6 

Z6 

Z6 

2.6 

Z6 

Z6 
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Z6 
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Z6 

Z6 
436972
426809 
431183 
457638 
442821 
435522 
406764 
422032 
409125 
416801 
446091 
419848 
415015 
431836 
414987 
447087 
456050 
426178 
414528 
426494 
407656 
437330 
449139 
448153 
417386 
409420 
408796 
407627 
421535 
415862 
426418 
436557 
413517 
449500 
457090 
445389
410006 
452906 
450341 
419430 
440099 
452518 
431843 
424795 
448019 
450704 
400262 
435266 
418039 
429491 
456844 
413781 
437450 
424005 
429332 
438572
425692 
431188 
408803 
414482 
437762 
425722 
433681 
451585 
424959 
407151 
446054 
452556 
421026 
407687 
433683 



AA284679 
BE313114 
NMJM6B55 
AI792670 
BE391929 
N64214 
AA429825 
AA476966 
R17268 
X98834 
AW022192 
AA251242 
NMJ02109 
AF178532 
AA524394 
AW403870 
R79445 
AA371409 
M148950 
AL119528 
AW747986 
AL353944 
BE26B315 
Y10805 
AL037228 
215008 



AI419020 
AB002359 
R51034 
M90464 
W15573 
N76712 
AW956345 
AL080243 
NMJ016831 
AW732308 
BE207039 
N90956 
AI878942 
AL080058 
AA280722 
AA516420 
AW102850 
AW947164 
H85157 

AK001942 
R06859 
NMJH2111 
AI264155 
J05272 
AL390154 
AB033041 
AF030403 
BE267017 
D90041 
W05656 
AF248953 
S57496 
T78028 
AI659076 
AI004377 
AK001171 
NNL005781 
H25B36 
AB014537 
H78517 
AL047332 
AK002011 
AI817723 



Hs.25640 claudin 3
Hs.29706 ESTs

Hs.250696 KDEL (Lys-Asp-Glu-Leu) endoplasmic retic
Hs.144405 ESTs
Hs.8752 transmembrane protein 4
Hs.9774 synovial sarcoma translocation gene on c
Hs.343443 ribosomal protein L36
Hs.110857 polymerase (RNA) III (DNA directed) poly
Hs^43567 axonal transport of synaptic vesicles
Hs.79971 sal (Drosophila)-like 2
ESTs

Hs.103238 ESTs

Hs.77798 histidyl-tRNA synthetase

Hs.271411 beta-site APP-cleaving enzyme 2

Hs.294022 hypothetical protein FLJ14950

Hs.301872 hypothetical protein MGC4840

Hs.76230 ribosomal protein S10

Hs.105664 hypothetical protein FLJ22294

Hs.188836 ESTs

Hs.170098 KIAA0372 gene product

Hs.37443 Homo sapiens mRNA; cDNA DKFZp434B2119 (f

Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f

HsJ23111 phenylalanine-tRNA synthetase*

Hs.20521 HMT1 (hnRNP methyltransferase, S. cerevi

Hs.82043 D123 gene product

Hs.54451 laminin, gamma 2 (nicein (100kD), kalini

Hs.170345 ESTs

Hs.62620 chromosome 6 open reading frame 1
Hs.105478 phosphoribosylformylglycinamidine syntha
Hs.144513 ESTs

Hs.169825 collagen, type IV, alpha 5 (Alport syndr
Hs.5027 ESTs, Weakly similar to A47582 B-cell gr
Hs.44829 ESTs, Weakly similar to 138022 hypotheti
Hs.12926 ESTs
Hs.180920 ribosomal protein S9
Hs.12592 period (Drosophila) homolog 3
Hs.57783 eukaryotic translation initiation factor
Hs.306123 MAGEF1 protein
Hs.17230 hypothetical protein FLJ22087
Hs.90336 ATPase, H transporting, lysosomal (vacuo
Hs.6909 DKFZP564G202 protein
Hs^4758 ESTs, Weakly similar to 138022 hypotheti
ESTs, Weakly similar to 138022 hypotheti
Hs.153177 ribosomal protein S28
Hs.195641 ESTs, Moderately similar to 138022 hypot
Hs.40696 ESTs

Eos Control

Hs.4863 hypothetical protein DKFZp566A1524
Hs.193172 ESTs, Weakly similar to 138022 hypotheti
Hs.204041 chromosome 14 open reading frame 3
Hs.152981 CDP-diacylglycerol synthase (phosphatida
Hs.850 IMP (inosine monophosphate) dehydrogenas
Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr
Hs.137507 vang (van gogh, Drosophila)-like 2
Hs.199263 Ste-20 related kinase
Hs.6315 acetylserotonin O-methyltransferase-like
Hs.155956 N-acetyltransferase 1 (arylamine N-acety
Hs.169755 ESTs

Hs.24049 golgi autoantigen, golgin subfamily a, 2
Hs.76252 endothelin receptor type A
Hs.154679 synaptotagmin I
Hs.97031 hypothetical protein MGC13047
Hs.200360 Homo sapiens cDNA FLJ13027 fis, clone NT
Hs.326422 hypothetical protein MGC4549
Hs.153937 activated p21cdc42Hs kinase
Hs.301527 ESTs, Moderately similar to unknown [Rs
Hs.13504 KIAA0637 gene product
Hs.33905 ESTs

Hs.101067 GCN5 (general control of amino-acid synt
Hs.37558 hypothetical protein FLJ11149
244671 BI052776 BF882486 BG286184 AI589558 AA931663 AA534979 AI275392 AI273455 R52553 AA829920 H80552 AA360728 F10618
421091 24941.2



441128 20932.1 



406789 0 0 
410099 16732.1 



432745 112643J 
400253 18977.1 



411605 10026.3 



445636 

420223 
423476 

441054 
429925 



434976 
440191 



8551_5 

191648 1 
32437.1 

2641490.1 
33135.1 



121716.1 
MH790.2 



440409 
444610 
446091 
431843 



588375.1 
2145292.1 
515091J 
445334.1 



400262 16977.1 



AW953666 AW176773 H85527 AA765570 AA081 927 BF093262 BG743753 AL037576 AA534314 BE814964 BE97371 3 N49493 BE006634 
BE006630 AW270037 AA234765 A1334004 BF057179 A1857450 A1341 191 AI434143 AI917449 AW517207 AA255424 AW008334 AA847572 
AA994211 AA861901 AA581873 A1580157 A064363AW242357 AW235291 N55545AA3 19869 R3691 1 AA256551 AW044188 AI203159 N49403 
FO2090 AI187299 A1609644 Z40516 AW952314 

AK057700 BC015899 BE867108 AL526926 AL527436 BG913023 AI884867 BE858461 AI885227 AI935218 BE645596 AI922406 AA778161 
BF345973 AW195853 A1687121 A1336147 AI091364 AI769203 AW627859 BE677432 BF439954 BG820230 AA283686 AA812396 AI358396 
AA932022 BE222881 AW016109 BM352667 BF528099 BG818616 A1935916 AA912295 BG910B87 AL56B301 AL567278 BI522445 BI754384 
BG81 9375 BF835950 AA32371 8 AA860955 AI089847 AL520776 AL526045 AL538507 AL567798 AL536876 AL574332 BF834531 BF3401 16 
BF835954 AL573997 AL581252 BF957667 AL580170 AW952324 AL569382 AW148695 AL559234 BG034270 BI913839 AL581327 AL565842 
H43568BF945188AL559686AL539326T15481 AW895092 AL582684 BI519896 

BC014072 BE328850 AI356567 A1148171 AI022165 BG149661 BF000671 AA233101 AA573721 AA447991 AW016855 AI005068 AA554071 
BF476215AA906902AW014761 BE905651 BE512923BM047129AA243852AA232991 AA127550 AA1 27551 AA570256 AI473237 BF033706 
N90525 AW973623 AI359627 BG674574 BE903322 
AJ041403Z49148 

AK055674 AW965247 AV751598 AA290926 R53043 AA331387 AK056148 B1917678 BG819395 BG91 1971 BG820167 A1174254 AA348720 
AA364503 BG714279 AW893230 AA081774 H24222 AV727176 BF875715 AA081630 BE000834 AA334880 AL563737 BG029709 W52882 
AI43965B BE551 237 AA283724 BF109530 AI457096 AI805992 BE467735 AA693467 AI697593 AI887863 A1167419 AW901980 AW90176B 
BE702179 AA484549 T2381 1 BE327043 AA716027 AA917004 AA167714 BF339675 AA084618 AI418534 T31586 AA436630 AJ366472 AA706191 
A1422304 AI204899 AI041 169 AA21 1 402 AW827081 AA788593 T32736 AI767935 AA747914 T03534 AW959843 AL1 19527 BE327037 AW901982 
AW993370 AW901977 AW902071 W60090 N79906 D52685 T07735 BE702069 BE702172 T08671 BE767121 BE7671 17 BE7671 13 
AA65B826 AI821926AI791191 AA635129AA564492 

Z1 1 692 X51466 NM.001951 M19997 BI224253 BG830478 AU1 22147 AU123437 BG1 1 3591 BG752624 BE886804 BI868669 BG337216 AW629935 
BM016525 AJ560409 AL562866 A1909178 BF849556 AA371735 BF038841 BF7271 15 BC006547 BG757526 AL555664 BI261304 BG770095 
BI033466 B15175B0 BGB76486 BI011828 AI313235 BG831724 BF869862 BG998348 BI011834 BF888337 BF898627 BF092380 AW803215 F01241 
BF805719 BG876487 AW498536 BF988866 BG998849 AA248724 BG829202 BG756456 BG032392 BI859287 BM016990 BG332369 BE933685 
BE166758 BM452445A1937808AW026128 N23684AW006041 AI337621 F33111 BF344301 BG105450 BG387343 BF569547 BF1 54671 
BM007368 BF569385 BE772007 BH99487 BF761700 BI261519 BF944452 BF898506 A1038390 BM044934 AW381 142 BG743618 BE769205 
BE893973 BI015047 BF8B6479 BF761350 BE769769 BG766117 BF847365 BE397834 AW371121 BF089125 BE082996 BF183193 BG180964 
BF089940 BI000274 8G255503 BG674499 BG7741 74 BIO15084 

BG256892 H10532 N46614 R52610 AW97769B BM460488 W56819 BI042183 BG977498 BE767451 BF870009 BG477472 R61 137 R14274 R20259 
R09686 BI838226 BF034269 AA429173 BE741829 AW867495 Al 123683 AW006831 BE831 1 62 AW452753 AV742717 W86152 BF1 15102 AI63381 5 
BF921 562 AA094230 BE092587 W86151 AA526153 AI672156 BF914498 R12579 BF852352 AA699780 T57386 BF903022 R09933 AA678298 
BF339388 AJ345516 BG391657 BE708967 BG026034 BE261703 H56716 H65572 H93801 T48830 R96953 R96989 R39707 BE867593 M090310 
AA090672 

N27807AA256634BE276324 

AF263306 BF951698 T66089 F11783 F11794 H29379 R19493 H18042 AL133995 AW1 34660 AI299437 AA057405 AA917450 A1002692 T09262 
R43839 H29290 T65008 N78357 
A1221207 A1659B56 AA913591 AI220302 

NM.000786 U23942 BI601050 BG771947 BG773455 BI561558 BM60206 BG714348 BM1 26447 AIM 29411 AU129401 AL119303 AV724389 
AU099323 AU127089 BG701614 BM150364 D55653 AV702235 BE090408 AU151526 AW1 49661 BG054754 W42624 W52098 AV709923 W79867 
W74235 AV709052 AI217668 AA399409 BE182318 BM128040 AI693998 AW615411 AW070426 AI124550 AW778736 AA477781 AW263013 
AI459619 AA660513 BF809548 AA643635 AA864975 H42020 AW901189 AA904786 BF432722 BI916393 BM470755 BI333211 AA095636 BI256415 
U47727 BG773392 N58531 AA226450 BG679564 AA292178 W56883 BE813131 C03646 BG287974 AA386261 H47580 R48858 T91611 H42019 
BI869421 BG502073 BG425943 W37290 W31363 BE004451 BF208311 BI048717 N78122 AA226597 A1525334 
AI963821 AA657925 AA935438 AW975068 

BC0D0222 AL136871 NM.032261 BC009497 BM461705 BG470749 BG826905 BG285127 BI253235 BE397026 BG704967 AW961225 BM352817 
BI227161 BG820180 AA454463 AA255685 N31 549 AA326504 BC019924 BG257230 BI0923S8 B1869896 BG251883 BF034444 BG286577 
BE260391 AA599912 BF686779 AI755222 AA452272 AW241170 AU156565 F28259 T16319 AA362506 N64153 BM016416 BM458963 BG739972 
AV729565 BE268285 BE857433 BM011110 BG2B5856 AI922439 BE270975 AU 19339 BF956085 AI565178 AL554305 AL573240 AL572917 
AI129627AL546640BE392285AJ092843AI371057BE302410A1608753AW674261 AI750057AI052649N47822AL516249AI589903BG258439 
A1123662 AI126014 AA778101 AA243218 AW498837 AA748311 AI754395T15728 AA775369 AA858195 W73625 AA216784 AW513778 AW24395B 
A1061112 A1783806 AL569622 AA070466 BF229936 N5B159 H80288 N32598 H80293 H80279 AL58 1253 AW571884 AI361 698 AW073321 
AA876464H95640 T34421 AA331419N99897 H80274B1262120AA205826 AA070716 AA653206AA653483BG389811 W20432AA670295 
AA6531 97 D51888 AA362527 BG056668 AW1 18162 AW073071 BE293868 BF792321 BF792258 BG107176 T56604 BM1 93566 H03238 AA618045 
AA906004 BI085686 AA564566 AI221630 D52045 C1451 0 AA029390 W60153 H98743 AI682641 H28485 AA723093 AJ081730 AA641309 
AA687083 BI22481 8 AW204722 AI309186 AI215122 AI200785 BE467373 BM352502 AI304400 AI193071 A1742483 AW003408 AI400201 AI656740 
AW6651 73 AI215120 AI147599 AI803429 AI0761 10 AJ754349 AW205103 AI262491 AI808243 AC81007 AI051273 AI004801 AW768918 AW103289 
AI474637 AI264446 AI699509 BE704420 AA989278 AA918256 AA830956 AA989425 AA91 1929 AA262598 BE740563 BE727592 BE781 003 
BG030940 BE256750 W23528 AL517059 H95714 W00970 H74304 W32684 K38210 BE617658 AW593584 AA580593 H94863 N69544 AA181762 
D20132 T55734 T60692 R76858AA022948 AW770291 R50934AA205918 R97811 H40328 R86252AA936029 AI81 3809 AA933607AA1 29695 
AA548261 AA714393 AA775006 AA653439 AA983808 AA743251 AA401150 AA581651 AA555005 AA554408 D51494 C13991 051478 D52007 
AA489663 D52138 D51696 D55942 D52740 AI0001 18 AL516304 AL534259 N54940 AL579194 AI669399 A1342925 BE939201 AA633000 BC22963 
A1619576 AW190306 BF03501 0 AW087B97 AI864969 T57243 R4821 1AA1 13880 R26594 C14467 C14444 BJ195459 BE89S346 BE270780 
AL568073BG389833BE891549BI223147AW381001/^8464AW799744AM12194/^81^ BF752571 BE731304 

AI380443 A1240179 AA977516 AA884643 AW079380 AW29431 6 AI913755 AI864320 AJ685770 H25135 AI972654 AJ538592 
AI174783R12271 R83569 
A1274757 AJS59500 AW022192 

AW970134 AA516420 AA543007 BG057526 BI001430 AI498371 060181 D81004 D60382 C15876 N91070 C14815 C15068 D80763 C14818 
C15161 D60184 D60656 

211692 X51466 NM.001961 M19997 BI224253 BG830478 AU122147 AU123437 BG113591 BG752624 BE886804 B1868669 BG337216 AW629935 
BM016525 A1560409 AL562866 A1909178 BF849556 AA371735 BF038841 BF727115 BC006547 BG757526 AL555664 BI261304 BG770095 
6I033486 BI517580 BG876486 BI011828 AI313235 6G831724 BF869862 BG998348 B1011834 BF688337 BF698627 BF092380 AWB03215 (=01241 
BFB05719 BGB76487 AW498536 BF988866 BG998849 AA248724 BG829202 BG756456 BG032392 BI859287 6M016990 BG332369 BE933685 
BE166758BM452445AI937808AW026128N23684AW006041 At 337621 F33111 BF344301 BG1 05450 BG387343 BF569547 BF1 54671 
BM007368 BF569385 BE772007 BI199487 BF761700 B1261519 BF944452 BF898506 A1038390 BMQ44934 AW381 142 BG743618 BE769205 
BE893973 BI015047 BF886479 BF761350 BE769769 BG766117 BF847365 BE397834 AW371 121 BF089125 BE082996 BF183193 8G1609S4 
BF089940 BI000274 BG255503 BG674499 BG774174 B1015084 



TABLE 66C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495. 


Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey; Ref; Strand; Nt_position 
403047; 3540153; Minus; 59793-59968 
401424; 8176894; Plus; 24223-24428 
401451; 6634068; Minus; 119926-121272 
401747; 9789672; Minus; 118596-118816,119119-119244,119609-11976 
404632; 9796668; Plus; 45096-45229 
403046; 3540153; Minus; 55707-55859,56369-56511 
404922; 7341893; Plus; 13248-13428 
401519; 6649315; Plus; 157315-157950 
401197; 9719705; Plus; 176341-176452 
401866; 8018106; Plus; 73125-73623 
405387; 6587915; Minus; 3769-3833,5708-5895 
403752; 7678857; Plus; 33704-33828 
404210; 5006246; Plus; 169926-170121 
401785; 7249190; Minus; 165776-165996,166189-166314,166408-16656 
406214; 7342036; Plus; 86320-86523 
403532; 8076842; Minus; 81750-81901 
400860; 9757499; Minus; 151830-152104,152649-152744 
404913; 7341740; Plus; 97717-97976 
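
The nucleotide-position entries of Table 66C are comma-separated "start-end" ranges, one range per predicted exon. As an illustration only, the following Python sketch shows one way such an entry could be split into numeric ranges. The function names `parse_exon_ranges` and `total_exon_length` are hypothetical, and treating both endpoints as inclusive is an assumption that the table legend does not state. Entries that were cut off at the column width (for example, a final range ending in "-11976") are rejected rather than guessed at.

```python
from typing import List, Tuple


def parse_exon_ranges(nt_position: str) -> List[Tuple[int, int]]:
    """Split a 'start-end,start-end' string into (start, end) integer pairs.

    Raises ValueError for a malformed or truncated range (end < start).
    """
    ranges = []
    for part in nt_position.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            # Likely a range truncated at the printed column width.
            raise ValueError(f"suspicious range: {part!r}")
        ranges.append((start, end))
    return ranges


def total_exon_length(ranges: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming both endpoints are inclusive."""
    return sum(end - start + 1 for start, end in ranges)


if __name__ == "__main__":
    # Entry for Pkey 403046 in Table 66C.
    exons = parse_exon_ranges("55707-55859,56369-56511")
    print(exons, total_exon_length(exons))
```

Raising an error on a reversed range keeps truncated table entries from being silently counted as complete exons.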



Table 69A lists about 200 genes that are upregulated in human umbilical vein endothelial cells (HUVEC) compared to normal body tissues. These genes were selected from 59680 
probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

Table 70A lists about 148 genes that are downregulated using the mean of vessels (veins and arteries) compared to the mean of HUVEC. These genes were selected from 59680 
probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

TABLE 69A: 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of the mean of HUVEC AIs to the mean of the normal body tissue AIs 
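
The R1 column is defined as a ratio of two mean average-intensity (AI) values. A minimal Python sketch of that arithmetic follows. The function name `r1_ratio` and the sample intensities are hypothetical and are not taken from the array data; the sketch also assumes the AI values have already been normalized as described above.

```python
from statistics import mean
from typing import Sequence


def r1_ratio(huvec_ai: Sequence[float], tissue_ai: Sequence[float]) -> float:
    """Return mean(HUVEC AIs) / mean(normal body tissue AIs)."""
    if not huvec_ai or not tissue_ai:
        raise ValueError("both sample groups must contain at least one AI value")
    denominator = mean(tissue_ai)
    if denominator == 0:
        raise ValueError("mean tissue AI is zero; ratio is undefined")
    return mean(huvec_ai) / denominator


if __name__ == "__main__":
    # Hypothetical AI values for a single probeset.
    huvec = [300.0, 340.0]
    tissues = [200.0, 210.0, 230.0]
    print(round(r1_ratio(huvec, tissues), 2))
```

Under this definition a value above 1 means the probeset has a higher mean intensity in HUVEC than in the normal body tissues.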



Pkey; ExAccn; UnigeneID; Unigene Title; R1 
424806; AA382523; Hs.105689; MSTP031 protein; 1.51 
418036; Z37976; Hs.83337; latent transforming growth factor beta b; 1.94 
444381; BE367335; Hs^83713; ESTs, Weakly similar to S64054 hypotheti; 1.60 
442802; AL133035; Hs.8728; hypothetical protein DKFZp434G171; 1.73 
417944; AU077196; Hs.82985; collagen, type V, alpha 2; 1.48 
410480; R97457; Hs.63984; cadherin 13, H-cadherin (heart); 1.91 
452516; AA058630; Hs.29759; RNA POLYMERASE I AND TRANSCRIPT RELEASE; 1.59 
425139; AW630488; Hs^5338; protease, serine, 23; 1.63 
402463; ; ; NM_014624:Homo sapiens S100 calcium-bind; 1.50 
423798; AF047033; Hs.132904; solute carrier family 4, sodium bicarbon; 1.31 
412568; AI878826; Hs.74034; caveolin 1, caveolae protein, 22kD; 1.53 
419948; AB041035; Hs.93847; NM_016931:Homo sapiens NADPH oxidase 4 (; 2.06 
447526; AL048753; Hs.303649; small inducible cytokine A2 (monocyte ch; 2.10 
446500; U78093; Hs.15154; sushi-repeat-containing protein, X chrom; 1.67 
441457; AW996651; Hs.43838; ESTs; 1.30 
408298; AL117452; Hs.44155; DKFZP586G1517 protein; 1.61 
418994; AA296520; Hs.89546; selectin E (endothelial adhesion molecul; 1.87 
413795; AL040178; Hs.142003; ESTs; 1.64 
441689; AI123705; Hs.289068; ESTs; 1.44 
412567; AI750979; Hs.74034; Homo sapiens clone 24651 mRNA sequence; 1.75 
424432; AB037821; Hs.146858; protocadherin 10; 1.65 
432673; AB028859; Hs^78605; DnaJ (Hsp40) homolog, subfamily B, membe; 1.31 
418683; U90908; Hs.87241; hypothetical protein from clones 23549 a; 1.42 
453085; AW954243; ; KIAA0251 protein; 1.47 
438887; R68857; Hs.265499; ESTs; 1.49 
436729; BE621807; ; transmembrane 4 superfamily member 1; 1.91 
400494; ; ; ENSP0000023897(r:C1G30 (Fragment).; 1.34 
442506; BE566411; ; ESTs; 1.54 
425023; AW956889; Hs.154210; endothelial differentiation, sphingolipi; 1.55 
414476; AA301867; Hs.76224; EGF-containing fibulin-like extracellula; 2.65 
452933; AW391423; Hs.288555; Homo sapiens cDNA: FLJ22425 fis, clone H; 1.62 
422369; AF240635; Hs.115897; protocadherin 12; 1.38 
417124; BE122762; Hs.25338; ESTs; 2.13 
433681; AI004377; Hs.200360; Homo sapiens cDNA FLJ13027 fis, clone NT; 1.31 
419983; W55956; Hs.94030; Homo sapiens mRNA; cDNA DKFZp586E1624 (f; 1.54 
414776; AA155598; Hs^12839; hypothetical protein FLJ14195; KIAA1714; 1.51 
450534; AI570189; Hs.35132; KIAA0470 gene product; 1.43 
449618; AI076459; Hs.15978; KIAA1272 protein; 1.42 
446098; AW072215; Hs.208470; ESTs; 1.53 
413281; AA861271; Hs.222024; transcription factor BMAL2; 1.40 
448674; W31178; Hs.154140; ovary-specific acidic protein; 1.47 
407051; X97748; ; gb:H.sapiens PTX3 gene promoter region.; 1.33 
436772 AW975688 

453789 AA628517 

424259 AK001776 

441020 W79283 
401234 

420174 AI824144 

445684 AK001696 

418693 AI750878 

426535 AU077012 



433376 AI249361 

422099 AA156022 

449394 AA004368 

445119 AF035121 

438142 T90309 

437478 AL390172 

422809 AK001379 

453459 BE047032 

442711 AF151073 

428428 AL037544 

400288 X06256 

417640 D30857 

407252 AA659037 

418825 AA228881 



417805 
426828 
415714 



421218 



U3B545 



434845 AW295389 Hs.119768 ESTs 

408570 AL046406 Hs.103483 KIAA1798 protein 

metallothionein 1E (functional) 
Hs.118502 ESTs 

Hs.143954 hypothetical protein FLJ10914 
Hs.35962 ESTs 

mitogen-activated protein kinase 8 inter 
Hs.199749 ESTs 
Hs.13109 Ran binding protein 11 
Hs.67409 thrombospondin 1 
Hs.288582 ESTs, Weakly similar to ubiquitous TPR m 
412330 NM_005100 Hs.788 A kinase (PRKA) anchor protein (gravin) 
412646 NM_006825 Hs.74368 transmembrane protein (63kD), endoplasmi 
434523 AA703709 Hs.23410 translocase of inner mitochondrial membr 
Hs.74122 caspase 4, apoptosis-related cysteine pr 
Hs.111518 hypothetical protein 
Hs.18160 Homo sapiens cDNA FLJ11550 fis, clone HE 
Hs.12337 kinase insert domain receptor (a type II 
Hs.269651 ESTs 

Hs.317432 branched chain aminotransferase 1, cytos 
Hs.121028 hypothetical protein FLJ10549 
Hs.257789 ESTs 
Hs.8645 hypothetical protein 
Hs.184298 cyclin-dependent kinase 7 (homolog of Xe 
Hs.149609 integrin, alpha 5 (fibronectin receptor, 
Hs.82353 protein C receptor, endothelial (EPCR) 
Hs.163780 (AB075832) KIAA1952 protein [Homo sapien 
Hs.22394 hypothetical protein FLJ10893 
Hs.82587 phospholipase D1, phophatidylcholine-spe 
NM_000020 Hs.172670 activin A receptor type II-like 1 
NM_002290 Hs.78672 laminin, alpha 4 
446984 AB020722 Hs.16714 Rho guanine exchange factor (GEF) 15 
421778 AA428000 Hs.283072 actin related protein 2/3 complex, subun 
NM_000499 Hs.72912 cytochrome P450, subfamily I (aromatic c 

Hs.278585 NM_024756:Homo sapiens hypothetical pro 

gb:yb65g12.s1 Stratagene ovary (937217) 
Hs.73828 T-cell acute lymphocytic leukemia 1 
Hs.304471 EST 

Hs.196384 prostaglandin-endoperoxide synthase 2 (p 
423099 NM_002837 Hs.123641 protein tyrosine phosphatase, receptor t 
435705 AA782114 Hs.28043 ESTs 

AA319798 Hs.298581 eukaryotic translation elongation factor 
Hs.16450 ESTs 

Target Exon 

NM_002205 Hs.149609 integrin, alpha 5 (fibronectin receptor, 
452924 AW580939 Hs.97199 complement component C1q receptor 
426696 AW363332 Hs.171844 NM_006505:Homo sapiens poliovirus recept 
Hs.239527 E1B-55kDa-associated protein 5 
Hs.22968 intron of VEGFR 
Hs.149420 ESTs 
Hs.22245 ESTs 
Hs.322710 ESTs 
Hs.61289 synaptojanin 2 
NM_005795 Hs.152175 calcitonin receptor-like 
449318 AW236021 Hs.78531 Homo sapiens, Similar to RIKEN cDNA 5730 
407862 BE548267 Hs.337986 Homo sapiens cDNA FLJ10934 fis, clone OV 
Hs.268107 multimerin 
Hs.16032 KIAA0523 protein 

cardiac ankyrin repeat protein 
Hs.243010 Homo sapiens ras homolog gene family, me 
Hs.71832 angiopoietin-2 
Hs.163780 ESTs 

Bone morphogenetic protein 6 
Hs.118893 Melanoma associated gene 
Hs.30822 hypothetical protein FLJ11110 
Hs.181097 tumor necrosis factor (ligand) superfami 
Hs.2894 placental growth factor, vascular endoth 
Hs.233955 hypothetical protein FLJ20401 
Hs.198612 G protein-coupled receptor 51 
Hs.78768 malignant cell expression-enhanced gene/ 
Hs.194477 E3 ubiquitin ligase SMURF2 
Hs.3838 serum-inducible kinase 
Hs.85618 ESTs 

Hs.73853 bone morphogenetic protein 2 
Hs.79067 MAD (mothers against decapentaplegic, Dr 
Hs.243010 Homo sapiens cDNA FLJ14445 fis, clone HE 
Hs.79432 fibrillin 2 (congenital contractural ara 
Hs.112497 Homo sapiens cDNA: FLJ22743 fis, clone H 
Hs.183006 KIAA0836 protein 
Hs.179891 ESTs, Weakly similar to P4HA_HUMAN PROLY 
Hs.169758 hypothetical protein FLJ20245 
Hs.169758 hypothetical protein FLJ20245 



452298 AI039243 

434596 T59538 

412351 AL135960 

433374 AI821409 

429113 D28235 



456999 
453983 H94997 
406506 



424503 



418941 
450152 



AA452970 
AI138635 
437259 AA334384 
445279 R41900 
409509 AL036923 
410240 AU57424 
424711 



410057 R66634 

446727 AB011095 

412564 X83703 

414786 AI246482 



AI554545 
T64904 



410276 
406627 
405025 
422648 

452852 AK001972 

427912 AL022310 

432004 BE018302 

430129 BE301708 

429276 AF056085 

407704 BE315072 

448694 AA478756 

434398 AA121098 

416851 AW963951 

412420 AL03566B 

416179 R19015 

450689 AI369275 

416658 U03272 

425235 AA353113 

428206 AB020643 
412755 BE144305 
456977 AK000252 



456977 
427897 



AK000252 

NM_017413 Hs.303084 apelin; peptide ligand for APJ receptor 



1.29 

1.47 

1.81 

1.41 

1.35 

1.62 

1.48 

1.75 

1.49 

1.89 

1.77 

2.02 

1.64 

1.26 

1.49 

1.56 

1.54 

1.40 

1.45 

1.44 

1.42 

1.51 

1.42 

1.32 

2.67 

2.47 

1.76 

1.40 

1.35 

1.55 

1.90 

1.71 

1.88 

1.75 

1.86 

1.52 

1.61 

1.45 

2.14 

1.50 

1.53 

1.52 

1.94 

1.97 

2.32 

2.36 

1.61 

1.41 

1.70 

1.60 

1.59 

1.52 

1.61 

1.96 

1.81 

1.91 

1.85 

1.67 

1.51 

1.71 

1.91 

1.76 

1.69 

2.33 

1.78 

1.79 

2.02 

1.93 

1.65 

1.48 

2.01 

1.83 

1.65 

1.69 

1.71 

1.86 

1.65 

1.83 

1.77 

2.00 

1.61 

1.61 

1.94 
425234; AW152225; Hs.165909; ESTs, Weakly similar to I38022 hypotheti; 1.74 
444409; AI792140; Hs.49265; ESTs; 1.88 
421340; F07783; Hs.1359; decay accelerating factor for complement; 1.74 
445701; AF055581; Hs.13131; lymphocyte adaptor protein; 2.18 
444009; AI380792; Hs.135104; ESTs; 1.90 
409220; BE243323; Hs£1233; tumor necrosis factor receptor superfami; 2.16 
416940; N75620; Hs.43157; ESTs; 2.03 
418922; AW956580; Hs.42699; ESTs; 109 
431548; AI834273; Hs*711; novel protein; 1.81 
434927; H46612; Hs.293815; Homo sapiens HSPC285 mRNA, partial cds; 2.14 
434558; AW264102; Hs.39168; ESTs; 1.93 
438183; BE263252; Hs.6101; hypothetical protein MGC3178; 2.05 
428438; NM_001955; Hs.2271; endothelin 1; 2.46 
453365; AA035211; Hs.17404; SOX7 SRY (sex determining region Y)-box; 2.20 
418058; AW161552; Hs.83381; guanine nucleotide binding protein 11; 2.26 
439410; AA632012; Hs.188746; ESTs; 1.93 
453467; AI535997; Hs.30089; ESTs; 2.39 
420209; AA256444; Hs.126485; hypothetical protein FLJ12604; KIAA1692; 1.84 
417933; X02308; Hs.82962; thymidylate synthetase; 1.70 
436420; AA443966; Hs.31595; ESTs; 1.97 
414430; AI346201; Hs.76118; ubiquitin carboxyl-terminal esterase L1; 1.93 
412140; AA219691; Hs.73625; RAB6 interacting, kinesin-like (rabkines; 1.80 
442104; L20971; Hs.168; phosphodiesterase 4B, cAMP-specific (dun; 2.03 
425397; J04088; Hs.156346; topoisomerase (DNA) II alpha (170kD); 1.58 
418506; AA084248; Hs.85339; G protein-coupled receptor 39; 1.81 
413745; AW247252; Hs.75514; nucleoside phosphorylase; 1.99 
405121; ; ; mitogen-activated protein kinase 8 inter; 2.99 
425811; AL039104; Hs.159557; karyopherin alpha 2 (RAG cohort 1, impor; 1.76 
415765; NM_005424; Hs.78824; tyrosine kinase with immunoglobulin and; 2.13 
431089; BE041395; Hs.156110; ESTs, Weakly similar to unknown protein; 2.71 
421937; AI878857; Hs.109706; hematological and neurological expressed; 1.65 
406669; AJ493591; Hs.78146; platelet/endothelial cell adhesion molec; 2.39 
416815; U41514; Hs.80120; UDP-N-acetyl-alpha-D-galactosamine:polyp; 1.77 
418203; X54942; Hs.83758; CDC28 protein kinase 2; 1.67 
408243; Y00787; Hs.624; interleukin 8; 2.09 
418054; NM_002318; Hs.83354; lysyl oxidase-like 2; 2.42 
407891; AA486620; Hs.41135; endomucin-2; 2.34 
407891; AA486620; Hs.41135; endomucin-2; 2.34 
436032; AA150797; Hs.109276; latexin protein; 2.24 
439382; BE247684; Hs.103070; ESTs; 2.07 
419172; AW338625; Hs.22120; ESTs; similar to TRANSMEMBRANE 4 SUPERF; 2.00 
442006; AW975183; ; ESTs, Weakly similar to S72482 hypotheti; 2.33 
447560; AF065214; Hs.18858; phospholipase A2, group IVC (cytosolic,; 2.11 
432128; AA127221; Hs.296502; ESTs; 2.27 
417426; NM_002291; Hs.82124; laminin, beta 1; 2.11 
434217; AW014795; Hs.23349; ESTs; 2.03 
439265; AL134430; Hs.6906; Homo sapiens cDNA: FLJ23197 fis, clone R; 2.08 
442923; AW248322; Hs.95835; ESTs, Weakly similar to unnamed protein; 1.57 
433001; AF217513; Hs.279905; clone HQ0310 PRO0310p1; 1.60 
418741; H83265; Hs.8881; ESTs, Weakly similar to S41044 chromosom; 2.37 
442573; H93366; Hs.7567; branched chain aminotransferase 1, cytos; 2.55 
422544; AB018259; Hs.118140; KIAA0716 gene product; 2.09 
449722; BE280074; Hs.23960; cyclin B1; 1.76 
413794; AF234532; Hs.61638; myosin X; 2.01 
422603; BE242587; Hs.118651; hematopoietically expressed homeobox; 2.15 
406964; M21305; ; gb:Human alpha satellite and satellite 3; 3.08 
448231; AI701916; Hs.202509; ESTs; 2.27 
408989; AW361666; Hs.49500; KIAA0746 protein; 1.43 
429663; M68874; Hs.211587; phospholipase A2, group IVA (cytosolic,; 2.67 
418738; AW388633; Hs.6682; solute carrier family 7, (cationic amino; 2.18 
418185; AW958272; Hs.347326; intercellular adhesion molecule 2; 2.60 
417308; H60720; Hs.61892; KIAA0101 gene product; 1.86 
407975; X89426; Hs.41716; endothelial cell-specific molecule 1; 2.92 
420256; U84722; Hs.76206; cadherin 5, type 2, VE-cadherin (vascula; 2.92 
400666; ; ; NM_002425:Homo sapiens matrix metallopro; 2.59 
431728; NM_007351; Hs.268107; multimerin; 2.72 
440086; NM_005402; Hs.288757; v-ral simian leukemia viral oncogene hom; 2.25 
413132; NM_006823; Hs.75209; protein kinase (cAMP-dependent, catalyti; 1.76 
451979; F06972; Hs.27372; endothelial tyrosine kinase (Etk) (BMX),; 3.37 
415138; C18356; Hs.295944; tissue factor pathway inhibitor 2; 3.11 
414577; AJ056548; Hs.72116; hypothetical protein FLJ20992 similar to; 3.11 
444330; AI597655; Hs.49265; ESTs; 3.08 
422424; AI186431; Hs.296638; prostate differentiation factor; 2.96 
400289; X07820; Hs.2258; matrix metalloproteinase 10 (stromelysin; 4.84 
418007; M13509; Hs.83169; matrix metalloproteinase 1 (interstitial; 11.91 

TABLE 69B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
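
Each Table 69B record pairs one Pkey with one CAT (gene cluster) number and a list of Genbank accession numbers. As a hedged illustration, the Python sketch below splits such a record into its three parts. The function name `parse_cat_row` is hypothetical, and the sketch assumes a record has already been joined onto a single whitespace-separated line with the accession numbers cleanly separated.

```python
from typing import Dict, List, Union


def parse_cat_row(row: str) -> Dict[str, Union[int, str, List[str]]]:
    """Split 'Pkey CAT_number accession accession ...' into named fields."""
    fields = row.split()
    if len(fields) < 2:
        raise ValueError("expected at least a Pkey and a CAT number")
    return {
        "pkey": int(fields[0]),      # Eos probeset identifier
        "cat_number": fields[1],     # kept as text, e.g. '1239046.1'
        "accessions": fields[2:],    # Genbank accession numbers
    }


if __name__ == "__main__":
    record = parse_cat_row("442006 1239046.1 AW975183 AA973583")
    print(record["pkey"], record["cat_number"], record["accessions"])
```

The CAT number is kept as a string because its trailing ".1"-style suffix is an identifier component rather than a decimal fraction.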



Pkey CAT Number Accession 
453085 



436729 



100171 



6624 1 



BC017336 B6716430 BG501286 BI45B528 A1582223 N98532 AI338138 AI273442 AW1 02617 AA831177 AA745642 AA412583AA355375 
BG547492AW954243BI766546BG057641 Al 192435 A! 338935 A131 2651 AI708679 Al 1911 25 Al 206832 AA676899 AI0780 10 AI8887 18 AA452830 
BF445542 N69930 AA715017 BF446713 BE046852 AW771909 AA907729 AI143749 AI761290 M890233 BF925759 AI783713 AI767267 AA814538 
W56778 AA91B481 BG743526 BE645242 AI025328 AI298436 AI290445 H27710 AI475034 BG740023 A1090348 AI340003 BI602481 W38495 
AI183314 AI927418 BG397181 AA878310 W19369 W56507 C05751 AW380760 AW380770 AW380790 BF930729 H28425 AA037326 AA375805 
X75684 AL573167 A1445461 AI453743 AI983555 AI564644 AA977180 AI6941 1 1 AI591 358 AW071625 A167871 2 AI720939 AI927769 BE439796 
AI963432 AA292956 AW192593 AI865838 A1696905 A1424384 AM61312 AI911921 AI597801 BI494959 AI2409B8 A1492554 AW262737 BE044033 
AW008570 AW629505 BI494958 AA088439 AA706057 BF222820 BF593608 BE501957 AA524526 BE044134 AW572531 AW015724 BE349186 
AA043217 BE219784 AI799814 AA129575 A1671727 AM70033 BE646195 AW779725 AA903050 M147228 M404570 AI075878 W38161 A1972739 
AW673152 AA723200 C06123 BF057147 AA627686 AA1 57944 AI990245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 AI990827 
AW275048 AA182640 AA478328 AI298935 AW085158 AW471421 AW103470 AW300456 AW191997 AI823466 AA962397 AA136658 AI251817 
AW339104 AA724739 AA411 100 AA191349 AA757735 AA037696 AI769516 AW772283 AA010631 AI692846 AI061065 H80983 R79933 AI950693 
AI245632 A1349390 AA148284 AI798502 AA487893 AI621320 AW194272 C06365 AA953883 BE858936 A1918523 A1872628 AI927217 A1453453 
AI189356 AW338678 AI261 359 AI500576 BF477735 A1032569 AI972899 AI985583 Z28771 AI363829 AI693030 AA603586 BE77348B AW339301 
BE773489 BE773462 BE773495 AI650338 BE773499 AI745717 BE81 1475 BE81 1470 BE81 1464 BE81 1418 BE81 141 5 BEB11400BE81139B 
BE811388 BE81 1352 BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773461 BE811350 BE811337 BF593847 BG055071 
AW675302 BF003068 AA719173 BE81 1 348 A1582462 AI686240 BE773500 AI244845 AI565439 AI918453 AI472527 AI446740 AA035576 AA191 414 
AW674145 C05782 A1589264 D57556 AI468237 AI432033 AA989662 R21752 BF002457 AA988297 AL574095 AL576200 AL571 074 AL574525 
AL578810 BG498381 AI928364 BE879732 AA479834 AA479712 C17732 BM091258 BF843901 AW820230 C17476 BE327120 AA129S74 
AA1 36645 BF843900 AW806193 AA502832 AA649494 AL56B520 AL547960 BE706937 BE81 1350 BE773498 BE81 1401 BE773484 BE81 1437 
BE81 13B0 BE81 1399 BF997171 BF757734 BE926037 AI377596 C061 1 1 AW08B96B BE81 1404 BE81 1472 AI86591 2 A1925607 AI871950 A109351 0 
BE905927 BE811435 AA191387 AW772000 BE811453 BE614379 BF844522 BKM4896 AI744233 AW9B4527 C17504 BF843883 AI248307 
BE773483 A1567995 W60075 BF941 183 AI738844 BE811458 BE773481 AI262930 AA948565 BE706942 BE156360 T65026 AW242958 AW197954 
BE9Q5184 AA722206 AI344943 A1348877 AI334860 BE621857 BE156280 AA454099 AA037722 BF843897 AW806183 AAQ43216 BG482896 
AA182734 AA877242 AW372926 H27252 R381 14 BF851858 BE156214 AA190427 T91762 AA035067 AA837326 T10930 BF906587 BI755027 
BG506731 BC008442 BC010166 AL550134 AL553096 AL548700 AL550751 AL547978 AL545285 AL540643 AU1 18627 AL601 379 BI259821 
BG741766 B1868522AU 135866 BI552770 BI259210 BI256520 BI255569 BG485098 BI258228 BG498501 BM04451 2 AU 133984 AL556586 
BE7451 1 1 BI222633 AU133917 BG288151 BI260715 BI550550 BG500773 BI551761 BG707601 BI81 8593 BF691 383 BG721 129 BG541578 
BE906666 BG751098 BI224135 BG400746 BG478065 BE790436 AW080238 AU137549 BG429896 BE392486 AW961686 BG721056 BE908365 
BE546656 BG541235 AW583735 BG528290 BI260895 AW651691 BM048974 BM043805 BG142185 AA315188 AI446615 C06300 BG497644 
AA088544AI815987 BG528631 BE619182 AW239185 AW062910 AW062902 AA347236 F11933 AA488005 AA301631 AA376800 D56120 
AA343532 AA308636 F00242 AA376086 AA316968 AA343799 BI870221 BE910282 BG538748 AW960564 AV732879 D16654 AA192519 BF922148 
AA216013 BG624091 BE544387 BG507008 AW176446 BF790033 BE088925 BE08B854 AA921353 R21800 AA01 1222T97525 
NM 002640 L40377 BG227962 AI925897 W60100 BG980023 AAB53496 AA360401 AW9561 68 W61361 A1074846 AA373798 BG193973 BG49941 8 
BF816267 AJ675932 BG611734 BG777123 AW449485 BF912016 BE56641 1 BMQ68728 AW976385 AL121 194 AI767324 BM054718 AW366882 
AA156151 Z25109 C05177 AW975688 AA731063 N67084 
AF1 47374 T59538 T59589 T59598 T59542 

X83703 NM 014391 BC018667 AU139209 BE924924 BE924899 BE924977 BE924985 BE924947 BF229055 BE924917 BE924938 BE812621 
BE924937 AA969184 BE924898 BF229066 BF229054 BF081 177 BE924934 BF229040 BE925037 BE92501 1 BG943776 AA488Q72 AA486364 
Z36249 AW455806 AW582753 BF081131 BE924910 BF229058 BF081 172 BE924956 AW378093 BF081132 BF081168 BE812736 BE812738 
BF081 165 C041 60 C04483 BF229048 
442006 1239046.1 AW975183 AA973583 AI385103 AI699495 AI301787 



442506 

436772 
434596 
412564 



29197J 

1239464J 

14701J 

18571J 



TABLE 69C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of 

human chromosome 22" Dunham, et al. (1999) Nature 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey; Ref; Strand; Nt_position 
402463; 9796896; Minus; 8818-8952 
400494; 9714719; Plus; 169845-170272 
401234; 9929642; Plus; 120173-120337 
406506; 7711374; Minus; 6843-8077 
405025; 7107727; Plus; 105267-105343,106184-106294,106387-10653 
405121; 8102330; Minus; 35816-36004,36567-36684 
400666; 8118496; Plus; 17982-18115,20297-20456 



TABLE 70A: 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of the mean of the vessel AIs to the mean of the HUVEC AIs 

Pkey 



439180 
412636 
426158 
414320 
417878 
414572 
415314 
431103 
406973 
456974 
430560 
406828 
422048 



ExAccn 



AJ393742 
NM_004415 
NM_001982 
U13616 



AU077174 

N88802 

M57399 

M34996 

M12529 

Z28942 

AA419202 

NM_012445 



UnigeneID 
Hs.194657 
Hs.199057 

Hs.199067 

Hs.75893 

Hs.82845 

Hs.288181 

Hs.5422 

Hs.44 

Hs.198253 

Hs.169401 

Hs.243960 

Hs.84298 

Hs.288126 



Unigene Title; R1 
cadherin 1, type 1, E-cadherin (epitheli; 0.99 
v-erb-b2 avian erythroblastic leukemia v; 1.06 
desmoplakin (DPI, DPII); 1.25 
v-erb-b2 avian erythroblastic leukemia v; 1.36 
ankyrin 3, node of Ranvier (ankyrin G); 1.50 
Homo sapiens cDNA: FLJ21930 fis, clone H; 1.55 
cathepsin H; 1.64 
glycoprotein M6B; 1.70 
pleiotrophin (heparin binding growth fac; 1.75 
major histocompatibility complex, class; 1.88 
apolipoprotein E; 1.90 
N-myc downstream-regulated gene 2; 1.91 
CD74 antigen (invariant polypeptide of m; 1.97 
spondin 2, extracellular matrix protein; 1.99 
405326; AW516005; Hs.84298 
453023; AW028733; Hs.31439 
413391; AI223328; Hs.75335 
406824; AW515961; Hs.84298 
418299; AA279530; Hs.83968 
437536; X91221; Hs.144465 
442402; NM_000954; Hs.8272 
446945; AI193115; Hs.16611 
422639; AI929377; Hs.173724 
427451; AI690916; Hs.178137 
454042; H22570; 
416283; S79895; Hs.83942 
424247; X14008; Hs.234734 
422103; AA984330; Hs.111676 
415994; NM_002923; Hs.78944 
427393; AB029018; Hs.177635 
414217; AI309298; Hs.279898 
414063; H26904; Hs.75736 
426716; NM_006379; Hs.171921 
414803; X03100; Hs.914 
410532; T53088; Hs.155376 
431369; BE184455; Hs.251754 
437952; D63209; Hs.5944 
453655; AW960427; Hs.342874 
453655; AW960427; Hs.342874 
453655; AW960427; Hs.342874 
427980; AA418305; Hs.303205 
442695; AC002425; Hs.8603 
425751; T19239; Hs.1940 
414194; BE175494; Hs.75811 
443907; AU076484; Hs.9963 
453464; AI884911; Hs.32989 
410677; NM_003278; Hs.65424 
452685; AI634651; Hs.30250 
425869; AA524547; Hs.160318 



Hs.347939 
Hs^5094 
Hs.151242 
Hs.10587 
Hs.24395 
Hs.8364 
Hs.74615 

Hs.196547 
AA430348 Hs.317596
NM_000426 Hs.323511
AW360847 Hs.16578



406543 N77976 

453767 AB011792 

424614 X54486 

444195 AB002351 

417233 W25005 

442498 U54617 

412810 M21574 

440820 AL031846

429123 AB011099 
411529 
413305 
425622 
420195 



N44348 



446584 U53445 

407869 AI827976 

423915 AF039018 

410036 R57171 



412047 
414840 
424651 
443932 
420859 



AA934589 

R27319 

AI493206 

AW888222 

AW468397 



434025 AF114264 
421834 BE543205 



423961 
447384
425383 



D13666 

AI377221 

D83407 



447023 AA356764 
420136 AW801090 



Hs.251385 

Hs.15432 

Hs.24391 

Hs.135281 

Hs.57975 

Hs.49696 

Hs.23823 

Hs.9973 

Hs.100000 

Hs.216381 

Hs.288771 

Hs.136348

Hs.40528 

Hs.156007 

Hs.17109 

Hs.195851 



406972 M32053 

410132 NM_003480 Hs.300946

430998 AF128847 Hs.204038 

456898 NM_001928 Hs.155597 

443623 AA345519 Hs.9641 

410614 AI091195

451529 AI917901

430310 U60115 

424897 D83216 

427111 AA351026 

443604 C03577 

421913 AJ934365 

451331 AK002039 



416908 AA333990 

407938 AA905097 

418005 AI186220

452877 AI250789 



419577 
404277 



L36531 



Hs.65029 
Hs.208641
Hs.239069 
Hs.153684 
Hs.173594 
Hs.9615 
Hs.109439 
Hs.251385 
Hs.80424 
Hs.85050 
Hs.83164 
Hs.32478 
Hs.91295 



CD74 antigen (invariant polypeptide of m
serine protease inhibitor, Kunitz type,
glycine amidinotransferase (L-arginine:g
CD74 antigen (invariant polypeptide of m
integrin, beta 2 (antigen CD18 (p95), ly
ESTs

prostaglandin D2 synthase (21kD, brain)
tumor protein D52-like 1
creatine kinase, brain
transducer of ERBB2, 1
hypothetical protein FLJ20093
cathepsin K (pycnodysostosis)
lysozyme (renal amyloidosis)
protein kinase H11; small stress protein
regulator of G-protein signalling 2, 24k
KIAA1095 protein

Homo sapiens cDNA: FLJ23165 fis, clone L
apolipoprotein D

sema domain, immunoglobulin domain (Ig),
Human mRNA for SB classII histocompatibi
hemoglobin, beta

secretory leukocyte protease inhibitor (
solute carrier family 11 (proton-coupled
transforming growth factor, beta recepto
transforming growth factor, beta recepto
transforming growth factor, beta recepto
EST

p8 protein (candidate of metastasis 1)
crystallin, alpha B

N-acylsphingosine amidohydrolase (acid c
TYRO protein tyrosine kinase binding pro
receptor (calcitonin) activity modifying
tetranectin (plasminogen-binding protein
v-maf musculoaponeurotic fibrosarcoma (a
FXYD domain-containing ion transport reg
hemoglobin, alpha 2
extracellular matrix protein 2, female o
serine (or cysteine) proteinase inhibito
KIAA0353 protein

small inducible cytokine subfamily B (Cy
Homo sapiens pyruvate dehydrogenase kina
platelet-derived growth factor receptor,
plakophilin 4
KIAA0527 protein

Homo sapiens cDNA FLJ12927 fis, clone NT
Homo sapiens cDNA: FLJ23176 fis, clone L
ESTs

Homo sapiens cDNA FLJ11177 fis, clone PL
downregulated in ovarian cancer 1
hypothetical protein FLJ13612
alpha-actinin-2-associated LIM protein
calsequestrin 2 (cardiac muscle)
ESTs

hairy/enhancer-of-split related with YRP

ESTs

tensin

S100 calcium-binding protein A8 (calgran
Homo sapiens clone HH409 unknown mRNA
DKFZP586A0522 protein
periostin (OSF-2os)
ESTs

Down syndrome critical region gene 1-lik
integral membrane protein 2A
actin, alpha 2, smooth muscle, aorta
gb:Human H19 RNA gene, complete cds.
Microfibril-associated glycoprotein-2
indolethylamine N-methyltransferase
D component of complement (adipsin)
complement component 1, q subcomponent
growth arrest-specific 1
ESTs

four and a half LIM domains 1
frizzled-related protein
serine (or cysteine) proteinase inhibito
myosin regulatory light chain 2, smooth
osteoglycin (osteoinductive factor, mime
Homo sapiens cDNA FLJ11177 fis, clone PL
coagulation factor XIII, A1 polypeptide



collagen, type XV, alpha 1
ESTs

Integrin, alpha 8

NM_019111*:Homo sapiens major histocompa



2.02
2.03
2.12
2.15
2.17
2.21
2.26
2.27
2.29
2.31
2.37
2.38
2.39
2.41
2.43
2.47
2.49
2.50
2.51
2.51
2.56
2.58
2.58
2.59
2.59
2.59
2.59
2.62
2.63
2.63
2.66
2.67
2.67
2.67
2.73
2.73
2.74
2.80
2.81
2.88
2.92
2.93
2.94
2.97
3.01
3.01
3.01
3.04

3.04 
3.06 
3.07 
3.10 
3.12 
3.14 
3.17 
. 3u16 
3.20 
3.20 
3.20 
3.24 
3.24 
3.25 
3.25 
3.25 
3.26 
3.27 
3.28 
3.29 
3.33 
3.35 
3.35 
3.37 
3.37 
3.38 
3.41 
3.43 
3.44 
3.46 
3.48 
3.51 
3.52 
3.52 
3.54 



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407815 AW373860 Hs.183860 hypothetical protein FU20277 

421853 AL117472 Hs.108924 SH3-domain protein 5 (ponsin)
447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal)
426935 NM_000088 Hs.172928 collagen, type I, alpha 1
418371 M13560 Hs.84298 CD74 antigen (invariant polypeptide of m
440274 R24595 Hs.7122 scrapie responsive protein 1
427818 AW511222 Hs.193765 ESTs
433465 AV657778 Hs.3314 selenoprotein P, plasma, 1
458568 AI769067 Hs.127824 ESTs, Weakly similar to T28770 hypotheti
459290 NM_001546 Hs.34853 inhibitor of DNA binding 4, dominant neg
424206 NM_003734 Hs.198241 amine oxidase, copper containing 3 (vasc
453874 AW591783 Hs.36131 collagen, type XIV, alpha 1 (undulin)
428957 NM_003881 Hs.194579 WNT1 inducible signaling pathway protein
434868 R50032 Hs.159263 collagen, type VI, alpha 2
417043 NM_004369 Hs.80988 collagen, type VI, alpha 3
417512 X76534 Hs.82226 glycoprotein (transmembrane) nmb
452669 AA216363 Hs.262958 hypothetical protein DKFZp434B044
421314 BE440002 Hs.180324 Homo sapiens, clone IMAGE:4183312, mRNA,
449925 AJ342493 Hs.24192 Homo sapiens cDNA FLJ20767 fis, clone CO
453500 AI478427 Hs.43125 esophageal cancer related gene 4 protein
430572 U33114 Hs.245188 tissue inhibitor of metalloproteinase 3
421823 N40850 Hs.28625 ESTs
422550 BE297626 Hs.296049 microfibrillar-associated protein 4
443060 D78874 Hs.8944 procollagen C-endopeptidase enhancer 2
450912 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco
432306 Y18207 Hs.303090 protein phosphatase 1, regulatory (inhib
419938 AU076772 Hs.1279 complement component 1, r subcomponent
415165 AW887604 Hs.78065 complement component 7
414176 BE140638 Hs.75794 endothelial differentiation, lysophospha
452114 M226B7 Hs.8236 ESTs
406850 AJ624300 Hs.172928 collagen, type I, alpha 1
428411 AW291464 Hs.10338 ESTs
412802 U41518 Hs.74602 aquaporin 1 (channel-forming integral pr
406849 AA454809 Hs.172928 collagen, type I, alpha 1
453064 R40334 Hs.89463 potassium large conductance calcium-acti
429500 X78565 Hs.289114 hexabrachion (tenascin C, cytotactin)
433336 AF017986 Hs.31386 secreted frizzled-related protein 2
421814 L12350 Hs.108623 thrombospondin 2
427373 AB007972 Hs.130760 myosin phosphatase, target subunit 2
414290 AI568801 Hs.71721 ESTs
415274 AF001548 Hs.78344 myosin, heavy polypeptide 11, smooth mus
417130 AW276858 Hs.81256 S100 calcium-binding protein A4 (calcium
416784 AA334592 Hs.79914 lumican
453355 AW295374 Hs.31412 Homo sapiens cDNA FLJ11422 fis, clone HE
408221 AA912183 Hs.47447 ESTs
430223 NM_002514 Hs.235935 nephroblastoma overexpressed gene
416585 X54162 Hs.79386 leiomodin 1 (smooth muscle)
422737 M26939 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl
426406 AI742501 Hs.169756 complement component 1, s subcomponent
448429 D17408 Hs.21223 calponin 1, basic, smooth muscle
413624 BE177019 Hs.75445 SPARC-like 1 (mast9, hevin)



3.58

3.64

3.68

3.70

3.71

3.73

3.73

3.75

3.77

3.80 

3.83 

3.86 

3.89 

3.90 

3.95 

3.95 

3.97 

4.01 

4.03 

4.07 

4.09 

4.17 

4.25 

4.26 

4.29 

4.34 

4.37 

4.37 

4.41 

4.43 

4.43 

4.48 

4.51 

4.54 

4.57 

4.57 

4.68 

4.73

4.76 

4.98 

5.13 

5.14 

5.20 

5.50 

5.65 

5.87 

6.47 

6.65 

7.94 

9.90 

9.91 



TABLE 70B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey CAT Number Accession



412636 1438J 



M77830 NM 004415 AF139 065 BG681115BG740377 BI712964 BG000656 AA1 28470 BW38324 H27408 BE931630 BE167165 AVTW0K7 
Sl3jK 

RPIMlS I RF149266 BE940187 BI060445 BW60444 BF350983 BE720095 BE720069 BE715154 BE0825B4 BE082576 BE004047 AA857316 

mV^k^S^^Si BG223489 BE819009 BF381 184 BE715956 R58704 AA85221 2 AW365566 B1090358 BFO^ BEB19046 
^S™27 BE0^7 BES BE819^ 

AvSa E^43M19 BE182166 AW365175 AW847688 BE818280 AW177933 BF673679 AW178000 BE082526 BF476866 BIW^ BRS^ 

bIS BF839562 BE184933 BF842254 BE69B470 BE931048 BF999889 BF368816 BE184924 BE159646 BE7146^BE1 84948 ^85845 
AA 31128 AA099891 W9488 C04715 BF096124 BE865341 AW799304 AL603116 BE149760 BE705967 BE705966 BE705968 Am48723 

AW997139 BE865474 BE185187 BE156621 BE715089 BE713297 BE713298 BE179915 AW799309 BF872345 BFQ88676 BE70593S • AW52599 
BG005 in aranOK BE715155 BF752396 BF093817 BF831 190 BF752409 BE006561 BG959922 BF094833 BF094748 BP09G83 
lv^n723P ^^9 AW377700 BF349467 AI190590 A1554403 AI392926 AU158477 B1467252 AU159919 AT760816 BF082516 

AtS Bro82491 AWK1347 A5HK96 BE939862 AA088866 D12062 AA056527 AA7B2109 W19287 W02156 AW150038 AA0227M TW181 

AA128330 BG68142S BE706078 R20904 BG680059 BG676647 BF764409 AA026654 

^^^S OM3TO N84767 AA131077 H3014S BE7 14290 A158S869 A1568892 A191 5596 AW1 05614 AB87258 AB3CT BE926W 
BG950164 BE713810 AW365151 F 
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454042 30254J 



440820 3091J 



424651 



46029.5 



BG740832 BG881087 BG698430 AA455100 T87267 BE696209 BE696210 B1089483 BE006273 BE872225 AW391912 BE925515 BG677012 
BG741970 AA026480 BE705999 BG677157 BE0O9O90 BG681378 BE712291 BG961498 BG578984 BI040941 AA337270 AW384371 AW847442 
B1058659 BE813665 W95048 W25458 AW177785 AA025851 BE931733 BF154837 BG949393 BE714441 AW996245 BE71 1801 A1284090 
BE064323 BE719390 BE940148 BG991212 BF375714 BF349522 BG996267 T48793 BI013292 BE001925 AW365156 AW365154 AW505553 
BF763109 BE931637 BE167181 BE713879 BF354008 BF678726 H90899 AW3S5145 W38382 AI498487 

AJ420458 AI018523 AA708686 BF949633 AL1 19553 BF945960 AI081305 AA041432 AI921013 A1684910 AI654847 AW874199 A1206120 
AW241428 R43035 T66767 AW103715 W28478 BF953052 H4592B BF807568 AW903943 BE170143 BI040435 BF931989 BI800000 AV722350 
W27787 H45331 BI549761 R53955 B1549855 BG991583 BI491075 AW020049 AW129293 H45263 AA41O309 AA340613 R42410 AA707199 
AI431587 BE858679 AW292267 A1421678 AAQ41 195 BE466753 AI243913 A1358894 AW1 37298 AI36846B N64350 AA779107 AW025969 R49056 
AA347011 R55722 AW771106 F04969 238381 F01659 H17396 BI493714 AI8801 03 AW77 1447 Al 202561 AA768851 AI494436 BF856114 H22570 
AW954381 BG007409 BM314056 AA465642 T30861 T331 1 1 Z42834 C04542 BF948152 BF944325 BF858895 AA935284 AI267360 N64249 
WB7500 F07962 AA322394 BI489987 BE644965 BM3137B2 AA910364 A1809246 AA836750 BF115228 AA829730 N27413 BM141766 AI742325 
AA455261 AA938708 AI420241 A1130039 BF222341 BF941301 AA771B07 Z41034 BF447457 BF447426 AW515347 T03874 F02360 F02302 
N34898 AI678586 AA807824 AA948556 AW204638 BM142045 BF446775 AV648364 A1801368 AA971739 AI017351 AA760722 AI460007 A1458383 
AI694152 AA226536 BE467282 N47808 BE348825 BE830581 BE830583 N53009N59351 N41056W67501 AA016246 H43293F21282R59121 
R56280 

BG435302 BM0836S7 AA904035 AA488889 W76175 AA761874 H28767 AA910081 AA837086 AI521825 BG986378 AI478562 AA743152 AA746092 
H88B63 BG98B375 AA635544 AM93206 AA669979 BE245127 BG986430 BG9B6529 BF665873 BG0301 57 BG622575 AA766495 



TABLE 70C 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
404277 1834458 Minus 91665-91946



TABLE 71 A: 774 GENES UP-REGULATED IN EWING SARCOMA COMPARED TO NORMAL ADULT TISSUES 

Table 71A lists 774 genes up-regulated in Ewing Sarcoma compared to normal adult tissues. These were selected from 35371 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" Ewing Sarcoma to "average" normal adult tissues was greater than or equal to 2.1. The "average" Ewing Sarcoma level was set to the 75th percentile
amongst numerous Ewing sarcomas. The "average" normal adult tissue level was set to the 85th percentile amongst various non-malignant tissues. In order to remove gene-specific
background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the
ratio was evaluated.
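
The selection rule described above can be sketched for a single probeset as follows. This is only an illustration under stated assumptions: the function names (percentile, expression_ratio, is_up_regulated) are hypothetical, the percentile uses linear interpolation between ranks because the text does not say which convention was used, and treating a non-positive denominator as "no ratio" is likewise an assumption.

```python
from typing import Optional, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (assumed convention)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def expression_ratio(tumor: Sequence[float],
                     normal: Sequence[float]) -> Optional[float]:
    """Background-corrected ratio for one probeset.

    tumor  : expression levels across the Ewing sarcoma samples
    normal : expression levels across the non-malignant tissues
    """
    # 10th percentile of the normal tissues serves as the background level.
    background = percentile(normal, 10)
    numerator = percentile(tumor, 75) - background
    denominator = percentile(normal, 85) - background
    if denominator <= 0:
        # Assumption: the ratio is treated as undefined in this case.
        return None
    return numerator / denominator


def is_up_regulated(tumor: Sequence[float],
                    normal: Sequence[float],
                    threshold: float = 2.1) -> bool:
    """True when the ratio meets the threshold stated for Table 71A."""
    ratio = expression_ratio(tumor, normal)
    return ratio is not None and ratio >= threshold
```

For example, with hypothetical tumor levels 100 to 500 in steps of 100 and normal levels 10 to 110 in steps of 10, the 75th percentile is 400, the 85th percentile is 95 and the background is 20, so the ratio is 380/75, which passes the 2.1 cut-off.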

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

R1: 75th percentile of Ewing sarcoma to 85th percentile of body map


Pkey    ExAccn     UnigeneID  Unigene Title                             R1
104659  AW969769   Hs.105201  ESTs                                      70.3
101447  M21305                gb:Human alpha satellite and satellite 3  64.7
105782  H09748     Hs.57987   B-cell CLL/lymphoma 11B (zinc finger pro  50.5
115881  NM_005756  Hs.184942  G protein-coupled receptor 64             48.8
121792  AW969726   Hs.98381   ESTs, Weakly similar to serine protease   46.7
101104  AW862258   Hs.169266  neuropeptide Y receptor Y1                46.6
110278  AF061573   Hs.19492   protocadherin 8                           46.1
126645  AA316181   Hs.61635   six transmembrane epithelial antigen of   41.4
116752  AU008583   Hs.91622   neuronal pentraxin receptor               40.9
119717  AA918317   Hs.57987   B-cell CLL/lymphoma 11B (zinc finger pro  39.6
104691  U29690     Hs.37744   Homo sapiens beta-1 adrenergic receptor   36.9
110728  AA737106   Hs.32250   ESTs, Moderately similar to 178885 serin  35.5
121362  AF050147   Hs.97932   chondromodulin 1 precursor                34.7
131291  NM_004350  Hs.170019  runt-related transcription factor 3       33.0
101063  D54745     Hs.80247   cholecystokinin                           31.7
121619  AA528339   Hs.178062  ESTs, Weakly similar to phosphatidylseri  28.7
122651  AW975398   Hs.293836  ESTs                                      28.0
100299  D49493     Hs^171     growth differentiation factor 10          26.5
129977  NM_000399  Hs.1395    early growth response 2 (Krox-20 (Drosop  26.2
123619  AA6029&4              gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens   26.1
124006  AI147155   Hs.279727  ESTs; homologue of PEM-3 [Ciona savignyi  23.1
116301  AW969706   Hs.293332  ESTs                                      22.6
121231  AA814948   Hs.96343   ESTs, Weakly similar to ALUC_HUMAN !!!!   22.3
106533  AL134708   Hs.145998  ESTs                                      22.3
109166  AA219691   Hs.73625   RAB6 interacting, kinesin-like (rabkines  21.3
131313  R96290     Hs.75874   ribosomal protein L44                     20.8
116790  AW161357              microtubule-associated protein tau        18.7
105316  AI671245   Hs.24835   hypothetical protein FLJ14594             18.2
102123  NM_001809  Hs.1594    centromere protein A (17kD)               17.8
126218  AL049801   Hs.13649   Novel human gene mapping to chomosome 13  17.8
119791  AA554907   Hs.58291   ESTs                                      16.7
113003  AW292315   Hs.7215    ESTs                                      16.3
102836  U94320     Hs.158330  neuropeptide Y receptor Y5                16.3
126799  AW753865   Hs.74376   olfactomedin related ER localized protei  16.3
105298  BE387790   Hs.26369   hypothetical protein FLJ20287             15.5
107160  AA314490   Hs.27669   KIAA1563 protein                          14.8
115313  AA808001   Hs.184411  albumin                                   14.6
123308  C14187     Hs.103538  ESTs                                      U2
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5 




126077 
100698 
106821 
110288 
106498 
112134 
115265 
128639 
125698 
105200 
119456 
135155 
106614 
131941 
132968 
132725 
111353 
123289 
104968 
123532 
104173 
125556 
127240 
123049 
132315 
134321 
128609 
131945 
110730 
119186 
115909 
121916 
131402 
101879 
133181 
131307 
113849 
103446 
105091 
106146 
127003 
115414 
117829 
126098 
121910 
130625 
129755 
100335 
111109 
120217 
112891 
131601 
126262 
113903 
105253 
102581 
105172 
100380 
129229 
102725 
130298 
132294 
118644 
106575 
133142 
119499 
100248 
114837 
107098 
123960 
105260 
115147 
134570 
133063 
105730 
126426 
107254 
134367 
119082 
125400 
129413 
116766 
129075 



M7B772 Hs.210836 ESTs

M884407 Hs.211595 protein tyrosine phosphatase, non-recept
NM_005594 Hs.28298 adaptor-related protein complex 4, beta
H40665 Hs.31564 ESTs
AI221919 hypothetical protein FLJ10582

R41823 Hs.7413 ESTs; calsyntenin-2
AA334274 Hs.18368 DKFZP564B0769 protein
AW582962 Hs.102897 CGI-47 protein
AF078847 Hs.191356 general transcription factor IIH, polype
AA328102 Hs.24641 cytoskeleton associated protein 2
AA248897 Hs.48784 ESTs

AI207958 Hs.166556 Homo sapiens, Similar to TEA domain fami
AA648459 Hs.335951 hypothetical protein AF301222
BE252983 Hs.35086 ubiquitin specific protease 1
AF234532 Hs.61638 myosin X
NM_006276 Hs.184167 splicing factor, arginine/serine-rich 7
W20090 Hs.6616 ESTs
AA495904 Hs.103316 ESTs
AI249502 Hs.29669 ESTs

AA608733 gb:ae56f08.s1 Stratagene lung carcinoma

AA084273 Hs.76561 ESTs, Weakly similar to S47072 finger pr
AB033064 Hs.236463 KIAA1236 protein
AJ005683 Hs.86998 nuclear factor of activated T-cells 5, t
BE047680 Hs.211869 dickkopf (Xenopus laevis) homolog 2
AF091086 Hs.44563 hypothetical protein
BE538082 Hs.8172 ESTs, Moderately similar to A46010 X-lin
NM_003616 Hs.102456 survival of motor neuron protein interac
NM_002916 Hs.35120 replication factor C (activator 1) 4 (37
N67655 Hs.26411 ESTs
AI979147 Hs.101265 hypothetical protein FLJ22593
AW872527 Hs.59761 ESTs, Weakly similar to DAP1_HUMAN DEATH
AW117207 Hs.98523 ESTs
Y08890 Hs.113503 karyopherin (importin) beta 3
AA176374 Hs.243886 nuclear autoantigenic sperm protein (his
X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos
NM_000025 Hs.2549 adrenergic, beta-3-, receptor
AA457211 Hs.8858 bromodomain adjacent to zinc finger doma
X98834 Hs.79971 sal (Drosophila)-like 2
AA148859 Hs.179909 hypothetical protein FLJ22995
W07288 Hs.118392 ESTs, Weakly similar to LNHUER IgE Fc re
AW816515 Hs.173540 ATPase, Class V, type 10D
AA662240 Hs.283099 AF15q14 protein
BE160952 Hs.247117 ESTs, Moderately similar to ALUF_HUMAN !
M79088 gb:EST01236 Subtracted Hippocampus, Stra

AI204500 Hs.96978 hypothetical protein MGC10764
AF176012 Hs.260720 J domain containing protein 1
R42216 Hs.7759 Homo sapiens clone 24538 mRNA sequence
AW247529 Hs.6793 platelet-activating factor acetylhydrola
AI940675 Hs.20914 hypothetical protein FLJ23056
AA862257 Hs.66035 ESTs

T03927 Hs.293147 ESTs, Moderately similar to A46010 X-lin
NM_007115 Hs^9352 tumor necrosis factor, alpha-induced pro
AI085153 Hs.143764 ESTs, Weakly similar to unknown [H.sapie
AI368034 Hs.36679 ESTs, Weakly similar to A46010 X-linked
AW997484 Hs.5003 KIAA0456 protein
AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2
AA187554 Hs.300496 mitochondrial solute carrier
D82343 Hs.74376 neuroblastoma (nerve tissue) protein
AF013758 Hs.109643 polyadenylate binding protein-interactin
AB026187 Hs.159156 protocadherin 11
AI347487 Hs.132781 class 1 cytokine receptor
AB023191 Hs.44131 KIAA0974 protein
AA443241 ribosomal protein L44

AW970602 Hs.105421 ESTs

AW952412 Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex-
AI918906 Hs.55080 ESTs
NM_015156 Hs.78398 KIAA0071 protein
BE244930 Hs.166895 ESTs
Hs.27688 ESTs

Hs.287733 hypothetical protein FLJ23189
Hs.14600 ESTs

Hs.38399 hypothetical protein MGC2454
Hs.172280 SWI/SNF related, matrix associated, acti
Hs.30212 thyroid receptor interacting protein 15
Hs.5364 DKFZP564I052 protein

gb2n27h06x1 Stratagene neuroepithelium
Homo sapiens mRNA; cDNA DKFZp761l1912 (f
phosphoribosylglycinamide formyltransfer
cytochrome P450 retinoid metabolizing pr
DKFZP586O0824 protein
DKFZP564O092 protein
ESTs



AW082862 
AAB94638 
AA745781 
U66615 
AI654133 
AW377314 
AA125984 
Z45998 
AA339449 
AF252297 
AL110151 
AW377610 
AI608657 
BE250162 



Hs.22543 
Hs.82285 
Hs.91546 
Hs.128797 
Hs.11123 
Hs.95097 
Hs.83765 



14.0 

13.9 

13.9 

13.8 
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9.4 
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9.2 

9.1 

9.0 

9.0 

9.0 
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8.8 
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8.6 

8.6 
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8.5 
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&2 

8.2

8.2

8.1

8.1 

8.1 

8.0 

8.0 

7.9 

7.9 

7.9 

7.9 
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7.8 

7.8 

7.8 

7.8 

7.7 

7.7 

7.7 

7.7 

7.6 
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105143 
132438 
118036 
131170 
104548 
131495 
132543 
124040 
127695 
132559 
101050 
111823 
116401 
105127 
128478 
110456 
118846 
120934 
128132 
127431 
106157 
117B52 
106213 
116013 
120147 
118267 
132581 
120325 
133276 
127742 
127664 
108778 
121553 
107059 
133658 
115197 
112268 
103346 
126967 
118499 
105301 
113617 
127968 
134719 
110748 
115844 
121B85 
102398 
130882 
132256 
106383 
135186 
105510 
109261 
114357 
128501 
109747 
128361 
115773 
104689 
113575 
121830 
119298 
109841 
115622 
122969 
109872 
114208 
113494 
127684 
129300 
127489 
133479 
105909 
101255 
134676 
130895 
126395 
130723 
107230 
102745 



A136B835 
AW363687 
AW71862 
NMJ)14253 
R39398 
AA812434 
BE568452 
U23752 
AA714731 
AF119848 
AU077324 
R35252 
AWB93940 
AA045648 
AA7082Q5 
K52348 
AW299598 
AA226198 
AA225632 
AW77195B 
W37943 
AW877787 
N45018 
AI674126 
A1917116 
N34905 
AK000631 
AA195651 
AW978439 
AW293496 
AA806164 
AF133123 
AA41248B 
BE614410 
AA319146 
R18556 
W39609 
X87613 
AA205976 
N67274 
AW352357 
AI869372 
AA830201 
M852985 
AW581992 
AI373062 
AA934883 
U42359 
AA497044 
A107B645 
AA447453 
U73328 
Z42047 



R41677 
AL133572 
AI223001 
AW172570 
AW445044 
AA420450 
AW13816B 
AW394055 
NM.001241 
H01052 



Hs.24808 
Hs.82916 
Hs.196008 

Hs.91559 

Hs.344037 

HsJ2964 

Hs31457 

Hs.270863 

Hs.1832 

Hs.130558 

Hs.59698 

Hs.301957 

Hs.100343 

Hs.36636 

Hs.50895 



Hs.175437 

Hs.34892 

Hs.136102 

Hs.8769 

Hs.94031 

Hs.44653 
Hs.52256 

Hs.69504 

Hs.180138 

Hs.116502 

Hs.90847 

Hs.48820 

Hs.23044 

Hs.75426 

Hs.22003 
Hs.5464 

Hs.50141 

Hs.7457 

Hs.17207 

Hs.124347 

Hs.89232 

Hs.301434 

Hs.33293B 

Hs.98467 

Hs.20887 

Hs.431 

Hs.27860 

Hs.172648 

Hs.283978 

Hs.61779 

Hs.6107 

Hs.199009 

Hs^2969 

Hs.14600 

Hs38207 

Hs^92911 

Hs.15671 

Hs.98427 

Hs.155478 
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AW821252 

R65841 

AL049466 

T91451 

AA668631 

W94197 

AA650250 

W01556 

AA195191 

BE385B64 

W28051 

AA641767 

AI468004 

BE247676 

A1034467 

AW753865 

AA121514 

F12310 



Hs.208414 
Hs.104336 
Hs.28653 
Hs.7859 
Hs.86538 
Hs.1 59971 
Hs.1 10165 
r*L272076. 
Rs.238797 

Hs.149894 
Hs.87819 
Hs.21015 
Hs.278956 
Hs.18442 
Hs.34650 
Hs.74376 
Hs.70832 



ESTs, Weakly similar to 138022 hypotheti 7.6
chaperonin containing TCP1, subunit 6A ( 7.6
Homo sapiens cDNA FLJ11723 fis, clone HE 7.6
odz (odd Oz/ten-m, Drosophila) homolog 1 7.5
ESTs 7.5
SMC2 (structural maintenance of chromoso 7.5
protein regulator of cytokinesis 1 7.4
SRY (sex determining region Y)-box 11 7.4
ESTs, Weakly similar to heterogeneous ri 7.3
hypothetical protein PRO1580 7.3
neuropeptide Y 7.2
ESTs, Weakly similar to 2109260A B cell 7.2
ESTs 7.1
nudix (nucleoside diphosphate linked moi 7.1
ESTs 7.1
ESTs 7.1
homeo box 04 7.1
gb:nc26a07.s1 NCI_CGAP_Pr1 Homo sapiens 7.1
gb:nc08a07.s1 NCI_CGAP_Pr1 Homo sapiens 7.0
ESTs, Moderately similar to PC4259 ferri 7.0
KIAA1323 protein 6.9
KIAA0853 protein 6.9
hypothetical protein DKFZp761J17121 6.9
ESTs 6.9
hemoglobin, beta 6.8
Homo sapiens cDNA: FLJ22669 fis, clone H 6.8
hypothetical protein FLJ20624 6.8
AP-2 beta transcription factor 6.8
ESTs 6.8
ESTs 6.7
ESTs 6.7
general transcription factor IIIC, polyp 6.6
TATA box binding protein (TBP)-associate 6.6
RAD51 (S. cerevisiae) homolog (E coli Re 6.6
secretogranin II (chromogranin C) 6.6
ESTs 6.5
solute carrier family 6 (neurotransmitte 6.5
thyroid hormone receptor coactivating pr 6.4
gb:zq48a10.r1 Stratagene hNT neuron (937 6.4
EST W
MAGE1 protein 6.3
Homo sapiens cDNA FLJ11922 fis, clone HE 6.3
ESTs 6.3
chromobox homolog 5 (Drosophila HP1 alph 6.3
KIAA1387 protein 6.3
hypothetical protein MGC5370 6.3
ESTs, Highly similar to AF257737 1 dKa 6.2
gb:Human N33 protein form 1 (N33) gene, 6.2
hypothetical protein FLJ10392 6.2
murine leukemia viral (bmi-1) oncogene h 6.2
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 6.2
distal-less homeobox 4 6.2
Homo sapiens PRO2751 mRNA, complete cds 6.2
Homo sapiens cDNA FLJ13591 fis, clone PL 6.1
Homo sapiens cDNA FLJ14839 fis, clone OV 6.1
protein containing CXXC domain 2 6.1
ESTs, Weakly similar to Z141_HUMAN ZINC 6.1
ESTs 6.1
Human DNA sequence from clone RP4-530I15 6.0
Rakophilm 6.0
ESTs, Weakly similar to KBF3_HUMAN NUCLE 6.0
ESTs, Weakly similar to 138022 hypotheti 5.9
cyclin T2 5.9
gb:yj32h01.s1 Soares placenta Nb2HP Homo 5.9
Homo sapiens mRNA; cDNA DKFZp564D0472 (f 5.8
hypothetical protein 5.8
ESTs 5.8
ESTs 5.8
ESTs 5.8
KIAA0379 protein 5.8
ribosomal protein L26 homolog 5.8
ESTs 5.8
ESTs, Moderately similar to 138022 hypot 5.8
hypothetical protein FLJ20729 5.8
mitochondrial translational initiation f 5.8
Homo sapiens, clone MGC:2492, mRNA, comp 5.7
hypothetical protein DKFZp554L0864 simil 5.7
hypothetical protein FLJ12929 5.7
E-1 enzyme 5.7
ESTs 5.7
olfactomedin related ER localized protei 5.6
ESTs 5.6
gb:HSC38D041 normalized infant brain cDN 5.6
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1 17357 N24829 gb:yx98h12.s1 Soares melanocyte 2NbHM Ho 

120827 AA382525 Hs.132967 Human EST clone 122887 mariner transposo
123494 AW179019 Hs.112110 mitochondrial ribosomal protein L42
100438 AA013051 Hs.91417 topoisomerase (DNA) II binding protein

126165 AI741816 Hs.125897 ESTs

125934 AA193325 Hs.32646 hypothetical protein FLJ21901
124370 AI243499 Hs.170652 ESTs

106135 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr
106668 R49390 Hs.254129 KIAA1678 . . .

AA316181 Hs.61635 six transmembrane epithelial antigen of
Z43371 Hs.7012 ESTs

BE042915 Hs.22572 Homo sapiens cDNA FLJ13675 fis, clone PL
AA610620 Hs.181165 major histocompatibility complex, class
BE280074 Hs.23960 cyclin B1
T06701 Hs.12268 ESTs
AK001269 Hs.30738 hypothetical protein FLJ10407
NM_004454 Hs.43697 ets variant gene 5 (ets-related molecule
T79153 Hs.48589 zinc finger protein 228

Hs.25320 Homo sapiens clone 25142 mRNA sequence 
Hs.325422 ESTs 

gb:yz29d09j1 Soares multiple_sclerosis_
Homo sapiens, clone IMAGE:4298026, mRNA,
ESTs, Moderately similar to PC4259 ferri
splicing factor proline/glutamine rich (
hypothetical protein FLJ20041
KIAA1451 protein
replication protein A3 (14kD)
ESTs, Weakly similar to 138022 hypotheti
ESTs

Homo sapiens mRNA; cDNA DKFZp761G02121 (
SMA5

putative ribonuclease III
ESTs
ESTs
EST

regulator of nonsense transcripts 2; DKF
ESTs

ribosomal protein L24
dpy-30-like protein
ESTs

ESTs, Highly similar to unnamed protein
ESTs, Weakly similar to 138022 hypotheti
ESTs
ESTs

zinc finger protein 184 (Kruppel-like)
cyclic AMP phosphoprotein, 19kD
ESTs

gb:yp86a10.s1 Soares fetal liver spleen
ESTs, Weakly similar to 04HUD1 debrisoqu
Homo sapiens mRNA; cDNA DKFZp586N2424 (f
ESTs, Moderately similar to MEG1_MOUSE M
RU2S
EST

suppressor of var1 (S.cerevisiae) 3-like
ribosomal protein L44
ESTs, Weakly similar to 178885 serine/th
hypothetical protein FLJ13340
ESTs

Homo sapiens cDNA FLJ22049 fis, clone H
Integral membrane protein 2A
gb:zn09g08.r1 Stratagene hNT neuron (937
gb:yr77g01 j1 Soares fetal liver spleen
D123 gene product
frizzled-related protein
KIAA0863 protein

follicular lymphoma variant translocatio
Homo sapiens mRNA; cDNA DKFZp547D086 (fr
ESTs

hypothetical protein FLJ21634
ESTs
ESTs

ribosomal protein L44
Hs.11556 Homo sapiens cDNA FLJ12566 fis, clone NT
Hs.107260 hypothetical protein DKFZp586H0623
Hs.9585 ESTs

Hs.57652 cadherin, EGF LAG seven-pass G-type rece
Hs.179833 ESTs, Moderately similar to ALU5_HUMAN A
Hs.247324 mitochondrial ribosomal protein S14
Hs.102314 ESTs
Hs.203933 ESTs
Hs.17268 ESTs

Hs.274534 calcitonin-related polypeptide, beta
Hs.44833 ESTs 



132967 
109606 
115816 
125186 
131185 
111227 
105516 
103409 
116008 
106394 
115121 
125464 
115596 
119040 
128040 
110151 
105539 
130567 
111077 
120830 
106516 
103316 
116129 
131253 
116680 
123949 
105511 
125B47 
108301 
110799 
104899 
125972 
107869 
121309 
125321 
102627 
104446 
126020 
116B14 
130622 
110818 
127765 
108768 
120484 
114518 
129012 
124973 
124648 
126904 
131526 
130637 
126769 
126086 
100169 
130262 
109260 
103120 
111099 
126142 



105848 
127987 
129706 



126629 
111348 
100448 
127542 
132452 
123778 
126521 
110343 
102983 
127207 



Z42993 
AI634549 
N71807 
AW298597 
R02394 
AW500486 
H18835 
AB040884 
AA383092 
N41367 
AI568170 
AL137311 
X83301 
AF189011 
R71802 
AW902848 
AA621665 
AB037829 
AW161B85 
AA069728 
AI089660 
AA054726 
AI927475 
AW975998 
AA293834 
T86652 
AL021918 
AF084555 
H79863 
H50834 
A1582291 
AL157503 
AA971146 
AF181721 
AA253170 
AW163267 
R81936 
A1476066 
AA249086 
AA948033 
A1005169 
AA356764 
AA083456 
H75681 
AL037228 
D63216 
AW978515 
BE410731 
R15337 
H86261 
AW805510 
AW954064 
AI022103 
AA443241 
N30436 
AL095739 
AA034922 
AF234887 
AA703684 
AW973521 
AYV352149 
AI475110 
AW136703 
X02404 
AA377165 



Hs.61884 
Hs.269436 
Hs.1 80610 
Hs.31608 
Hs.109694 
Hs.1608 
Hs.173002 
Hs.96886 
Hs.234074 
Hs.324728 
Hs.49163 
Hs.24853 
Hs.273829 
HS.20B957 
Hs.3852 
Hs.269745 
Hs.184582 
Hs.323401 
Hs.285574 
Hs.35406 
Hs.58595 
HSS7312 
Hs.178294 
Hs.158174 
Hs.7351 
Hs.1 14243 
Hs.77899 
Hs.16846 
Hs.27552 
Hs.129074 
Hs.61345 
Hs.96473 
Hs.106469 

Hs.102243 
Hs.125034 
Hs.130853 
Hs.28274 
Hs.17109 



Hs.82043 

Hs.163684 

Hs.131915 

Hs.74050 

Hs^1958 

Hs.40568 

Hs.97056 

Hs.24951 

Hs.124511 
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113974 AW9B9756 Hs.34145 ESTs. Weakly similar to B49647 GTP-bindi 

133098 AW629223 Hs.64794 zinc finger protein 183 (RING finger, C3 

109920 H05840 Hs.111323 ESTs

103487 AA743803 Hs.172108 nucleoporin 88kD

125353 AB033043 Hs.149377 hypothetical protein DKFZp761L0424

100893 BE245294 Hs.180789 S164 protein

Hs.93597 cyclin-dependent kinase 5, regulatory su
paraneoplastic antigen MA2
ESTs, Moderately similar to ALU7_HUMAN A
hypothetical protein DKFZp434l0428
ESTs, Weakly similar to T33468 hypotheti
KIAA0547 gene product
ESTs
ESTs

matrix metalloproteinase 9 (gelatinase B
hypothetical protein PP1044
ESTs

Homo sapiens cDNA FLJ14209 fis, clone NT
AA452054 Hs.119338 ESTs
122792 AW188551 Hs.99519 hypothetical protein FLJ14007

ESTs, Weakly similar to T17279 hypotheti
ESTs
EST
ESTs

Hs.301763 KIAA0554 protein
Hs.74122 caspase 4, apoptosis-related cysteine pr
Hs.334842 tubulin alpha 1
ESTs

gb:Homo sapiens clone RB4217 mRNA seque
Homo sapiens mRNA; cDNA DKFZp434M082 (fr
chromosome 20 open reading frame 1 
ribosomal protein L10 



135047 AL134197 

111148 AB020690 

107681 BE379594 

108480 AL133092 

132742 AA025480 

104912 AA813192 

110223 H19836 

113047 AI571940 

101031 J05070 

113002 BE243513 

105304 AW134924 

112386 AA831785 



112651 
111346 

123596 AA421130 
125447 AI582222 



HS.77B2 
Hs.49135 
Hs.68055 
Hs.292812 
Hs.200596 
Hs.31697 
Hs.7549 
Hs.151738 
Hs.7212 
Hs.190325 
Hs.171914 
Hs.119338 
Hs.99519 
Hs.33410 



AA595802 

AW970976 Hs.293653 
Hs.112640
Hs.128686 



106178 AL049935 

102250 AI249361

105868 AA378780

127496 AI031650

127315 AF116622 

110193 AI004874 

105012 AF098158 

129445 W52452 

114721 061939 

120922 AA481003 

115167 AA749209 
102407 



Hs.310764 
Hs.9329 
Hs.29797 
Hs.103822 ESTs 
Hs.97128 ESTs 
Hs.43728 



AW602154 Hs.82143 
AW875398 Hs.6451 



hypothetical protein 
E74-like factor 2 (ets domain transcript
PRO0659 protein 
XAGE-1 protein 



106111 

123829 AF251237 Hs.112208

103616 NM_002647 Hs.32971

100269 NM_001949 Hs.1189

112728 R91913 Hs.272104 

135098 AW274526 Hs.277721 

106034 AW952005 Hs.14928 

133571 BE515037 Hs.177556 

106246 AL036917 Hs.288821 

125724 AL360190 Hs.318501

132206 AA425204 Hs.334721

130227 BE397151 Hs.153003

129650 AF109298 Hs.118258

130382 NM_003450 Hs.155204

106073 AL157441 Hs.17834 

105403 AI473827 Hs.31793 

118138 AA374756 Hs.93560 

134126 NM_003747 Hs.131814

115041 AA252457 Hs.86543 

106012 AI240665 

116732 AW152225 Hs.165909 

130832 AW838008 Hs^0082 

125960 AI754693 Hs.145968 

133916 AL039185 Hs.77558 

106232 AB037828 Hs.15370 

129228 U40714 Hs.239307 

109929 AA773187 Hs.294027

123729 AL039779 Hs.278672 

101266 136645 Hs.73964 

132389 AA310393 Hs.190044

124320 H95749 Hs.102342 EST

103023 AW500470 Hs.117950 multifunctional polypeptide similar to S
116133 AW449597 Hs.313652 EST, Weakly similar to I38022 hypothetic
129001 AA443323 
119271 AI061118 



E2F transcription factor 3
ESTs, Moderately similar to ALU1_HUMAN A
membrane component, chromosome 17, surfa
hypothetical protein FLJ12903
melanoma antigen, family D, 1
KIAA1638 protein

Homo sapiens mRNA full length insert cDN
hypothetical protein FLJ13391
serine/threonine kinase 16
prostate cancer associated protein 1
zinc finger protein 174
downstream neighbor of SON
ESTs

Homo sapiens mRNA for KIAA1771 protein,
tankyrase, TRF1-interacting ankyrin-rela
ESTs, Moderately similar to T00256 hypot
ESTs

ESTs, Weakly similar to I38022 hypotheti
zinc finger protein 3 (A8-51)
ESTs

thyroid hormone receptor interactor 7
KIAA1407 protein
tyrosyl-tRNA synthetase
ESTs 

membrane component, chromosome 11, surfa 
EphA4 
ESTs 



114767 AI859865 

126107 H75477 

115333 BE241812 

100571 L14561 

134104 L35253 

120150 BE005771 

134470 X54942

129948 AI537162 



106882 AA149537 Hs.26994 

126199 AW088276 Hs.125829 ESTs

112727 T91029 Hs.15069 ESTs

118872 AI039201 Hs.283316 ESTs



Hs.107812 BPOZ protein
Hs.65328 Fanconi anemia, complementation group F
minichromosome maintenance deficient (S.
ESTs

protein tyrosine phosphatase, non-recept
ATPase, Ca++ transporting, plasma membra
mitogen-activated protein kinase 14
hypothetical protein FLJ22490
CDC28 protein kinase 2
ESTs

hypothetical protein FLJ20477



Hs.154443 

Hs.93361 

Hs.87860 

Hs.78546 

Hs.79107 

Hs.153746

Hs.83758
ESTs
115399


AF151534
118129


N57493 




gb:yy54c08.s1 Soares_multiple_sclerosis_


126522 


W31912 


Hs.21168 


gb:zc76d03.s1 Pancreatic Islet Homo sapi


131965 


W79283 


Hs.35962 


ESTs 


102507 


U52154 


Hs.193044 


potassium inwardly-rectifying channel, s 


126029 


AA704253 


Hs.169359 


ESTs 


107168 


W57578 


Hs.237955 


RAB7, member RAS oncogene family 


105808 


AI133161 


Hs.286131 


CGH01 protein 


134087 


U51166 


Hs.173824 


thymine-DNA glycosylase 


133195 


AI434760 


Hs.279949 


KIAA1007 protein


112996 


BE276112 


Hs.7165 


zinc finger protein 259 


129428 


AA256906 


Hs.111364 


ESTs, Weakly similar to ubiquitous TPR m 


118967 


AI668670 


Hs.216756 


ESTs 


127335 


AA830210 


Hs.214263 


ESTs, Moderately similar to ALU1_HUMAN A



4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0

126965 AI470^ Hs.139336 ATP-binding cassette, sub-family C (CFTR 4.0

4.0
4.0
4.0

4.0
111387 mum" Hsi85724 ESTs 4.0

4.0
4.0
4.0

102935 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept 4.0

4.0
4.0
4.0

4.0

131965 W79283 Hs.35962 ESTs 3.9

3.9
3.9
3.9

3.9

134087 U51166 Hs.173824 thymine-DNA glycosylase 3.9

3.9
3.9
3.9
3.9

127335 AA830210 Hs.214263 ESTs, Moderately similar to ALU1_HUMAN A 3.9

106636 AW958037 Hs.286 ribosomal protein L4 3.9 

133529 W45623 Hs.74571 ADP-ribosylation factor 1 3.9 

102266 U29725 Hs.3080 mitogen-activated protein kinase 7 3.9 

125905 AI678638 Hs.6456 chaperonin containing TCP1, subunit 2 (b 3.9

103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos 3.9

131971 BE567100 Hs.154938 hypothetical protein MDS025 3.9

129703 BE388665 Hs.179999 Homo sapiens, clone IMAGE:3457003, mRNA 3.9

128895 AW467000 Hs.106985 ESTs 3.9

110924 AW058463 Hs.12940 zinc-fingers and homeoboxes 1 3.9

112921 R91095 Hs.4276 KIAA1701 protein 3.9

120820 AA347417 Hs.96869 EST 3.8

105459 AA789081 Hs.4029 glioma-amplified sequence-41 3.8

114253 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger pro 3.8

105476 AL117352 Hs.77196 Human DNA sequence from clone RP5-876B10 3.8

122682 AA984531 Hs.159293 ESTs 3.8

130580 M32388 Hs.334370 uncharacterized hypothalamus protein HBE 3.8

130174 M29551 Hs.151531 protein phosphatase 3 (formerly 2B), cat 3.8

115577 AA393167 Hs.41294 ESTs 3.8

129785 H19006 Hs.184780 ESTs 3.8

126127 N95428 gb:zb80d09.s1 Soares_senescent_fibroblas 3.8

109793 F13088 Hs.8040 heparan sulfate (glucosamine) 3-O-sulfot 3.8

103175 X69089 Hs.79227 myomesin (M-protein) 2 (165kD) 3.8

100154 H60720 Hs.81892 KIAA0101 gene product 3.8

106140 AB006624 Hs.14912 KIAA0286 protein 3.8

129052 BE275031 Hs.158210 hypothetical protein MGC2555 3.8

103319 X83492 Hs.82359 tumor necrosis factor receptor superfami 3.8

106319 W22335 Hs.7392 hypothetical protein MGC3199 3.8

102391 AA296874 Hs.77494 deoxyguanosine kinase 3.8

127262 AA828125 gb:od71a09.s1 NCI_CGAP_Ov2 Homo sapiens 3.7

126872 AW450979 gb:UI-H-BI3^a-a-12-0-Ul5l NCI_CGAP_Su 3.7

107315 AA316241 Hs.90691 nucleophosmin/nucleoplasmin 3 3.7

113119 T47910 gb^18b1ls1 Stratagene fetal spleen (9 3.7

126962 R12014 HA20976 ESTs 3.7

116203 AW137166 Hs^7306 ESTs 3.7

123377 AW969183 Hs.271297 ESTs 3.7

122798 AW366286 Hs.145696 splicing factor (CCI^) 3.7

107780 AA018927 Hs.269213 ESTs 3.7

132733 AB020631 Hs.123654 PCF11p homolog 3.7

120849 AA687322 Hs.192843 leucine zipper protein FKSG14 3.7

131163 AA099524 Hs.23754 ESTs 3.7

126708 AW962593 Hs.135260 ESTs 3.7

117417 AI241733 Hs.43871 ESTs 3.7

106272 AW959731 Hs.323099 ESTs 3.7

110834 AW273860 Hs^759 ESTs 3.7

123663 AL039482 Hs.142517 Homo sapiens mRNA; cDNA DKFZp434P0810 (f 3.7

124250 AA350256 EST, Weakly similar to 2109260A B cell g 3.7

127038 AA233014 Hs.105965 ESTs 3.7

107711 W96141 Hs.220667 ESTs 3.7
130850
119126 
125466 
105150 
103163 
104495 
107599 
113577 
102681
109411 
127563 
131112 
114118 
103937 
125174 
104799 
128952 
135191 
124367 
113560 
119232 
113988 
115173 



127256 
123419 
102025 
100661 
132339 
115691 
134853 
134075 
128468 
127229 
108451 
133421 
135332 
132520 
125562 
126996 
125558 
117265 
130215 
109482 
133726 
132317 
117613 
100944 
105226 
125032 
123720 
128846 
116443 
128770 
106918 
131244 
128765 
111223 
104857 
105395 
133582 
128527 
105367 
103392 
101086 
133423 
117714 



114096 
117147 
133347 
132456 
133765 
109734 
132786 
106685 
105593 
131381 
108687 
113115 
107234 
123110 
131019 



AB040922 
R45175 
R08234 
AA631977 
AU077018 
AW975687 
AW654072 
AI300699 
Y08890 
RS8881 
AB035898 
K15302 
F01598 
AA934063 
W51835 
AA029703 
AL043463 
X16866 
AI683183 
T91015 
AI655226 
W87536 
BE612940 
AA699949 
AI738610
T66087 
U04045 
BE623001 
D80030 
AW190215 
BE268326 
NM_012201
T23625 
AA316181 
AA079195 
AF134160 
AW393883 
AA257992 
AI494372
BE161065 
R59305 
AA451966 
BE301883 
AI002238 
AI803188
BE262438 
AW029507 
IA7518 
R58958 
T74884 
AA609734 
AA730767 
AW962196 
AB015982 
AA076049 
AI638429
AF073310
AA852773
AI920902
AI580880 
BE391579 
AA504583 
AA504747 
X94563 
AA382524 
T84084 
N45226 
AL133117 
AF060219 
AW901347 
BE257758 
AB011084 
M62194 
AI927212 
BE083422 
AU076617 
AA279341 
M92642 
BE544475 
AI141426 
AA972447 
AA466256 
W28614 



Hs.20237 DKFZP566C134 protein
Hs.117183 ESTs
Hs.180461 ESTs
Hs.155995 KIAA0643 protein
Hs.3235 keratin 4
Hs^92979 ESTs
Hs.60136 ESTs
Hs.111334 PRO0470 protein
Hs.113503 karyopherin (importin) beta 3
Hs.109655 sex comb on midleg (Drosophila)-like 1
Hs.150587 kinesin-like protein 2
Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f
Hs.175930 ESTs

Hs.13836 ESTs, Weakly similar to I38022 hypotheti
Hs.231082 EST

gb:ze95h0as1 Soares_fetal_heart_NbHH19W
Hs.6755 RaP2 interacting protein 8
Hs.333497 cytochrome P450, subfamily IID (debrisoq
Hs.99343 distal-less homeo box 5
ESTs

Hs.117659 ESTs, Weakly similar to T46481 hypotheti
Hs.36473 ESTs, Weakly similar to JC5238 galactosy
Hs.88252 ESTs
Hs.191385 ESTs

Hs.267967 ESTs, Moderately similar to ALU8_HUMAN A
Hs.112482 Homo sapiens unknown mRNA sequence
Hs.78934 mutS (E. coli) homolog 2 (colon cancer,
Hs.132746 Homo sapiens ribosomal protein L39 mRNA,
Hs.45127 chondroitin sulfate proteoglycan 5 (new
Hs.62348 hypothetical protein FLJ11753
Hs.90280 5-aminoimidazole-4-carboxamide ribonucle
Hs.78979 Golgi apparatus protein 1
Hs.150580 putative translation initiation factor
Hs.61635 six transmembrane epithelial antigen of

gb:zm92h12.s1 Stratagene ovarian cancer
Hs.7327 claudin 1
Hs.98968 hypothetical protein FLJ23058
Hs.50651 Janus kinase 1 (a protein tyrosine kinas
Hs.98968 hypothetical protein FLJ23058
Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2
gb:yh16c10.r1 Soares infant brain 1NIB H
RAB9-like protein
glioblastoma amplified sequence
splicing factor, arginine/serine-rich 11



Hs.152707 
Hs.11482
Hs.252716 
Hs.44592 
Hs.161102 

Hs.26608 

Hs.112755 

Hs.285753 

Hs.339808 

Hs.143460 

Hs.274415 

Hs.24763 

Hs.143648 

Hs.334838 

Hs.19058 

Hs.268149 

Hs.75087 

Hs.101047 

Hs.136102 

Hs.250959 

Hs.196008 

Hs.46495 

Hs.81376 

Hs.27007 

Hs.38592 

Hs.71475 

Hs.48924 

Hs.75929 

Hs.3734 

Hs.56851 

Hs.16251 

Hs.174151 

Hs.26208 

Hs.54347 

Hs.8705 

Hs.288833 

Hs.193510 



beta-1,4 mannosyltransferase
ESTs
mucin 6, gastric

hypothetical protein MGC15880
gb:7c5Bd02.s1 Stratagene liver (937224)
EST

SCG10-like-protein
IBP protein 32
protein kinase C, nu

Homo sapiens cDNA FLJ10229 fis, clone HE
RAN binding protein 1
insulin receptor substrate 2
KIAA1866 protein

ESTs, Moderately similar to S65657 alpha
putative methyl transferase
Fas-activated serine/threonine kinase
transcription factor 3 (E2A immunoglobul
KIAA0853 protein

gb:H.sapiens dbi/acbp gene exon 1 & 1



Homo sapiens cDNA FLJ11723 fis, clone HE
EST

Homo sapiens mRNA; cDNA DKFZp586L1121 (f
chromosome condensation 1-like
hypothetical protein FLJ23342
acid cluster protein 33
KIAA0512 gene product ALEX2
cadherin 11, type 2, OB-cadherin (osteob
ESTs

hypothetical protein MGC2668
cleavage and polyadenylation specific fa
aldehyde oxidase 1
collagen, type XVI, alpha 1
ESTs
ESTs

Homo sapiens mRNA; cDNA DKFZp434K087 (fr
EST

chorionic somatomammotropin hormone 1 (p
ABOJ/755
134267


AJ1 74596 




106591 


A A A A14CA 

AA4431o4 


|_|_ O7OC0 


105169 


BE245294 


Hs.180789


120120 


OCC/ITOCT 

Bt5472o7 


Hs.59791 


12663c 


AA649257 


Hs.bbyyo 


128531 


H03721 


nS.WzJo 


lUtrobU 




Ue TOT/ft 


108859 


AL121500 




112276 


R53442 


Hs.26038 


125693 


H23989 


Hs.169743 


113474 


R50752 


Hs.23856 


132994 


AA112748 


Hs.279905 


124691 


R05835 


Hs.110153 


112511 


AW970420 




132985 


AL045579 


Hs.62113 


125743 


H17151 


Hs.7416 


133363 


AI866286 


Hs.71962 


107908 


AF087999 




105312 


BE613348 


Hs.211579 


110256 


H63947 


Hs.237955 



113649 



Hs.26370 
Hs.16400 



132073 AA295052 Hs.38516 
107090 AW809208 Hs.183297 
105463 AA825974 Hs.32646 
109592 AI198059 
N94768 
AA248884 
116439 AA251594 
129535 AA397972 
113283 T66813 
H04150 
N54968 



128992 
133160 

134076 AF086215 

128301 U90552 

127728 AW404061 

126516 R95672 



127506 T61039 

104769 AA025887 

126666 AA648886 

130453 U80735 



107131 

130422 AW160614 Hs.180034 



Hs.108502 hypothetical protein FLJ20150
DKFZP564G2022 protein
DKFZP564C186 protein
phosphoinositide-3-kinase, regulatory su
ESTs, Weakly similar to RMS1_HUMAN REGUL
Homo sapiens clone IMAGE:23371 , mRNA seq
ESTs

ankylosis, progressive (mouse) homolog
ESTs, Weakly similar to T00351 hypotheti
GTP-binding protein Rho7
hypothetical protein
hypothetical protein KIAA1335
RAE1 (RNA export 1, S.pombe) homolog
hypothetical protein FLJ13433
S164 protein

hypothetical protein MGC13183
ESTs

ribosomal protein S15a

Homo sapiens mRNA; cDNA DKFZp564O0122 (f
ESTs

ESTs, Weakly similar to I38022 hypotheti
Homo sapiens clone 25121 neuronal olfact
hypothetical protein MGC5297
clone HQ0310 PRO0310p1
ESTs

dynactin 2 (p50)
KIAA0717 protein

gb:ym37a05.r1 Soares infant brain 1NIB H
ESTs, Weakly similar to B36298 proline-r
ESTs

melanoma cell adhesion molecule
RAB7, member RAS oncogene family
Homo sapiens, clone MGC:15887, mRNA, com
DKFZP566F2124 protein
hypothetical protein FLJ21901
ESTs

ESTs, Weakly similar to KIAA1435 protein
gb:k3517.seq.F Human fetal heart, Lambda
PIBF1 gene product
chimerin (chimaerin) 1
EST
ESTs

hypothetical protein MGC11061
gb:Homo sapiens full length insert cDNA
butyrophilin, subfamily 3, member A1
protein kinase C beta 1
chemokine binding protein 2
ribosomal protein L10a
hypothetical protein MGC11266
ESTs

PAX transcription activation domain inte
hypothetical protein RG083M05.2
cleavage stimulation factor, 3' pre-RNA,



Hs.43913 
Hs.112603 
Hs.12947 
Hs.107708 
Hs.66309 
HsJ8980 
Hs.284283 



Hs.117572 
Hs.76067 
Hs.293943 
Hs.151999 
Hs.173854 
AW961605 Hs.21145 



121292 


AA401807 




gb:zv65f1U1 Soares_total_fetus_Nb2HF8_


123284 


AA488988 


Hs.293796 


ESTs 


130734 


AW137091 


Hs.18624 


KIAA1052 protein


105400 


AF198620 


Hs.10283 


RNA binding motif protein 8A 


105014 


AA121123 


Hs.2692fi7 


ESTs, Weakly similar to AF161361 1 HSPC0 


114988 


AA251089 




gb:zs04t05.s1 NCI_CGAP_GCB1 Homo sapiens


132867 


AF226667 


Hs.58553 


CTP synthase II


124169 


BE079334 


Hs.271630 


ESTs 


114652 


AI521936 


Hs.107149 


novel protein similar to archaeal, yeast 


113876 


AT799751 


Hs.5635 


ESTs 


111520 


AJ985369 


Hs.301134 


ESTs 


121748 


BE536911 


Hs.234545 


hypothetical protein NUF2R 


106834 


AL044182 


Hs.28070 


KIAA0753 gene product 


128869 


AA768242 


Hs.80618 


hypothetical protein 


129619 


AA209534 


Hs.284243 


tetraspan NET-6 protein 


126770 


AI292320


Hs.81361 


heterogeneous nuclear ribonucleoprotein


116734 


AW900992 


Hs.93796


DKFZP586D2223 protein 


100253 


D38024 


Hs.157425 


double homeobox, 2 


130999 


AA326683 


Hs.21992 


likely ortholog of mouse variant polyade


114092 


H81213 


Hs.14825 


ESTs, Weakly similar to KIAA1503 protein


129284 


AA318224 


Hs.296141 


ESTs 


119127 


AA708035 


Hs.12248 


ESTs 


123910 


AA621262 


Hs.179923 


ESTs, Weakly similar to S65657 alpha-1 C- 


132664 


AI740461 


Hs.54542 


ESTs 


118397 


BE139479 


Hs.161492 


ESTs 


107003 


AW138437 


Hs.24790 


KIAA1573 protein


126735 


M69113 


Hs.226795 


glutathione S-transferase pi 


130847 


AI672483 


Hs.20220 


lipase protein 


101186 


AA020956 


Hs.179881 


core-binding factor, beta subunit 


113626 


T94318 


Hs.17359 


ESTs, Moderately similar to RL44_HUMAN 6
121782
119863 
106103 
123808
133761 
110561 
115239 
117457 
128662 
129673 
105154 
126722 
109966 
129315 
109517 
105252 
101754 
110384
134118 
134869 
100780 
125728 
129794 
129056 
107129 
103038 
130553 
125394 
132305 
131136 
102983 
117639 
127076 
126153 
132676 
104946 
122110 
127705 
109442 
100474 
115536 
119651 
127211 
134964 
105551 
119750 
125590 
104590 
110724 
116429 
133915 
128721 
128538 
129179 
108793 
108807
133461 
132571 
134937 
125756 
112369 
127002 
128179 
117121 



AW452957 
AA081218 
BE620779 
AA620552 
AF041430 
AA379597 
BE251328 
N29682 
BE250742 
D38552 
AA307279 
N65148 
K09103 



NM_014563 Hs.174038
AI631874 Hs.155140



Hs.9728 

Hs.239489 

Hs.268798 

Hs.182877 

Hs.90421 

Hs.293441 

Hs.57987 

Hs.108336 
Hs.4756 

Hs.252587 

Hs.173772 

Hs.44344 

Hs.23413 

Hs.118638 

Hs.44833 

Hs.146162 

Hs.40730 

Hs.261038 

Hs.73848 

Hs.301240 

Hs.86999 



AB039670 
S70114 
H45282 
BE336680 
AL157518 
BE561958 
AW954565 
AF161399 
AI769958 
AC004770 
AA926960 
AF062649 

BE178502 

AI806090

AB033099 

BE387202 

AA377165 

AI422951 

K85692 

N92589

AW242407 

AI123000 

AJ003322 

AW296134 

NM_000699 Hs.300280
AK001468 Hs.62180






128403 
127930 
114250 
108828 
105225 
117997 
104558 
124777 
106035 
127521 
120215 
119403 
105024 
123485 
109416 
132490 
115348 
117297 



Hs.63348 Homo sapiens, clone MGC:15203, mRNA, com
Hs.58608 Homo sapiens cDNA FLJ14205 fis, clone NT
Hs.12094 mitochondrial ribosomal protein L30

gb:ae58g11.s1 Stratagene lung carcinoma
Hs.75922 brain protein 13
Hs.5199 HSPC150 protein similar to ubiquitin-con
HsJ3291 hypothetical protein FLJ10881
Hs.44071 ESTs, Weakly similar to ALUSJtUMAN ALU S
Hs.106673 eukaryotic translation initiation factor
Hs.1191 KIAA0073 protein
Hs.35947 methyl-CpG binding domain protein 4
Hs.11125 HSPC033 protein
Hs.30897 EST

spondyloepiphyseal dysplasia, late
casein kinase 2, alpha 1 polypeptide
ALEX1 protein

TIA1 cytotoxic granule-associated RNA-bi
ESTs

KIAA0116 protein
PRO2463 protein

immunoglobulin heavy constant mu
B-cell CLL/lymphoma 11B (zinc finger pro
hypothetical protein FLJ13433
ESTs, Weakly similar to ALUELHUMAN B\
flap structure-specific endonuclease 1
CDC28 protein kinase 1
pituitary tumor-transforming 1
ESTs, Weakly similar to I78885 serine/th
hypothetical protein FLJ20534
KIAA1273 protein

non-metastatic cells 1, protein (NM23A)
ESTs
ESTs
ESTs

ESTs, Weakly similar to I38022 hypotheti
carcinoembryonic antigen-related cell ad
melanocortin 1 receptor (alpha melanocyt
gb:AJ003322 Selected chromosome 21 cDNA
ESTs, Weakly similar to S65657 alpha-1 C-
amylase, alpha 2A; pancreatic
anillin (Drosophila Scraps homolog), act
small inducible cytokine subfamily E, me
hypothetical protein FLJ22004
hippocalcin-like protein 4
ribonuclease HI, large subunit
ESTs

Homo sapiens, clone IMAGE:3840937, mRNA,
nuclear receptor subfamily 1, group I, m
Homo sapiens cDNA FLJ13471 fis, clone PL
putative nucleotide binding protein, est
Homo sapiens cDNA FLJ14967 fis, clone TH
phosphoprotein associated with GEMs
ESTs
ESTs
ESTs

hypothetical protein FLJ20644
cytochrome P450, subfamily IIA (phenobar
suppressor of G2 allele of SKP1, S. cere
Hs.171939 ESTs

Hs.289721 growth arrest specific transcript 5
Hs.4243 hypothetical protein FLJ12650
Hs.24979 hypothetical protein DKFZp761P1010
Hs.127116 ESTs
Hs.321386 EST

Hs.112227 membrane-associated nucleic acid binding
Hs.295362 Homo sapiens cDNA FLJ14459 fis, clone HE
Hs.123304 ESTs
Hs.13297 ESTs
Hs.273344 DKFZP564O0463 protein

gb:zn57d02.s1 Stratagene muscle 937209 H
Hs.47420 EST

Hs.88959 hypothetical protein MGC4816
Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU S
Hs.5245 hypothetical protein FLJ20643
AW297206 Hs.164018 ESTs
AF109219 Hs.108787 phosphatidylinositol glycan, class N
AL117554 Hs.119908 nucleolar protein NOP5/NOP58
AA126311 Hs.9879 ESTs

AI308876 Hs.103849 hypothetical protein OKFZpTC D112
BE268388 Hs.86945 ESTs, Weakly similar to A46010 X-linked
NM_001290 Hs.4980 LIM domain binding 2
AA281562 Hs.766 ESTs

AW779829 gb:hn88a05.x1 NCI_CGAP_Kid11 Homo sapien




Hs.333513 
Hs.108812 
Hs.272891 
Hs.25292 
Hs.94812 
Hs.143375 

Hs.30799

Hs.279923 

Hs.77554 

Hs.266175 

Hs.101189 

Hs.109154 

Hs.71139 

Hs.49376 



AW663858 
AA305520 
AI803516 
AW005822 
AI538880 
R23858 
AW373062 
AW016783 
AF191018 
AA815092 
AW403911 
R44214 
AW969025 
AA129395 
AI652236

NM_000762 Hs.334345
AW674699 Hs.5169
AI251449
BE174587
AW966243
AL353940
AW293689
H95044 
AF255303 
AJ908005 
AA809672 
AI914699 
AK001693 
AA211777 
N52090 
R56678 
R41933 



3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 



3.2 

3.2 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 

3.1

3.1 
3.1 
3.1 
3.1
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1
112501 AA972447 Hs.288833 Homo sapiens mRNA; cDNA DKFZp434K087 (fr 3.1

107532 AA443473 Hs.173684 Homo sapiens mRNA; cDNA DKFZp762G207 (fr 3.1

130753 AA205223 Hs.189 phosphodiesterase 4C, cAMP-specific (dun 3.1

132425 N87549 Hs.125287 zinc finger protein ZNF140-like protein 3.1

115188 AK000219 Hs.88387 hypothetical protein FLJ20212 3.1

129707 AW572317 Hs.12082 Homo sapiens mRNA; cDNA DKFZp566L203 (fr 3.1

106474 BE383668 Hs.42484 hypothetical protein FLJ10618 3.1

106472 AI207162 Hs.3815 stathmin-like-protein RB3 3.0

102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 3.0

117602 N35020 Hs.44685 C3HC4-type zinc finger protein 3.0

105522 W19789 Hs.336635 Homo sapiens, clone IMAGE:4179482, mRNA 3.0

133650 D84294 Hs.118174 tetratricopeptide repeat domain 3 3.0

134624 AF035119 Hs.8700 deleted in liver cancer 1 3.0

112435 NM_013255 Hs.288791 muskelin 1, intracellular mediator conta 3.0

134654 AK001741 Hs.8739 hypothetical protein FLJ10879 3.0

106651 AA460421 Hs.30875 ESTs 3.0

109597 AA989362 Hs.293780 ESTs 3.0

132342 AW162758 Hs.341729 ESTs, Weakly similar to ALU5_HUMAN ALU S 3.0

106057 BE614474 Hs.289074 F-box only protein 22 3.0

106378 AA824298 Hs.21331 hypothetical protein FLJ10036 3.0

128660 AA011597 Hs.177398 ESTs 3.0

115467 AI366784 Hs.48820 TATA box binding protein (TBP)-associate 3.0

128604 AI879099 Hs.102397 GIOT-3 for gonadotropin inducible transc 3.0

125219 AI804331 Hs.99423 ATP-dependent RNA helicase 3.0

126698 AI221147 Hs.145088 ESTs, Weakly similar to T15936 hypotheti 3.0

133966 BE280478 Hs.182695 hypothetical protein MGC3243 3.0

119155 R61715 Hs.310598 ESTs, Moderately similar to ALU1_HUMAN A 3.0

131586 AA460352 Hs.26966 KIAA1171 protein 3.0

100237 D30715 Human PAP (pancreatitis-associated prote 3.0

105515 T24968 Hs.23038 HSPC071 protein 3.0

123073 AA485061 Hs.105652 ESTs 3.0

111375 H56499 Hs.252692 ESTs, Weakly similar to I38022 hypotheti 3.0

130724 AK001507 Hs.44143 Homo sapiens clone FLB6914 PRO1821 mRNA, 3.0

129928 AJ338993 Hs.134535 ESTs 3.0

118922 AW206193 hypothetical protein DKFZp761B2423 3.0

131902 AA180145 Hs.34348 Homo sapiens mRNA; cDNA DKFZp434P0235 (f 3.0

125165 W45350 gb:zc81h08.s1 Pancreatic Islet Homo sapi 3.0

122219 AA436002 Hs.183161 ESTs 3.0

132195 BE018717 Hs.42124 ESTs 3.0

102298 AA382169 Hs.54483 N-myc (and STAT) interactor 3.0

103286 D38616 Hs.54941 phosphorylase kinase, alpha 2 (liver) 3.0

130168 AK001389 Hs.15144 hypothetical protein DKFZp564O043 3.0

126997 AI377150 Hs.150914 ESTs 3.0

128902 AA036637 Hs.107052 ESTs 3.0

117186 H98988 Hs.42612 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.0

126096 F08208 Hs.283844 similar to rat tricarboxylate carrier-l 3.0

106711 BE390125 Hs.143187 hypothetical protein 3.0

114046 BE018658 Hs.141003 Homo sapiens cDNA: FLJ21691 fis, clone C 3.0

135029 H58818 hydroxysteroid (17-beta) dehydrogenase 7 3.0

112003 AW978731 Hs.301824 hypothetical protein PRO1331 3.0

106735 R77698 Hs.337778 ESTs 3.0

126628 N49776 Hs.170994 hypothetical protein MGC10946 3.0

133350 AI499220 Hs.71573 hypothetical protein FLJ10074 3.0

110312 BE256986 Hs.11896 hypothetical protein FLJ12089 3.0

125659 T57693 Hs.87929 Homo sapiens cDNA FLJ13707 fis, clone PL 3.0

129601 AB032964 Hs.115726 KIAA1138 protein 3.0

123423 AA598484 gb:ae38f04.s1 Gessler Wilms tumor Homo s 3.0

128695 NM_003478 Hs.101299 cullin 5 3.0

123470 AW303285 Human DNA sequence from clone RP11-110H4 3.0

109252 BE440157 Hs.85944 ESTs 3.0

101445 M21259 gb:Human Alu repeats in the region 5' to 3.0

102126 AW950870 Hs.78961 protein phosphatase 1, regulatory (inhib 3.0

116475 AA733050 Hs.334612 small nuclear ribonucleoprotein polypept 3.0

106573 AA223447 Hs.12835 A kinase (PRKA) anchor protein 7 3.0

103106 W27172 Hs.1857 phosphodiesterase 6G, cGMP-specific, rod 3.0

130755 BE293520 Hs.18910 prostate cancer overexpressed gene 1 3.0

112246 R51321 Hs.25780 Homo sapiens cDNA FLJ12252 fis, clone MA 3.0

126887 H20832 gb:ym48d03.s1 Soares infant brain 1NIB H 3.0

117960 AA310417 Hs.47044 ESTs 3.0

133626 AW836130 Hs.75277 hypothetical protein FLJ13910 3.0

113179 BE622021 Hs.152571 ESTs, Highly similar to IGF-II mRNA-bind 3.0

109968 H09232 Hs.26484 HIRA-interacting protein 3 3.0

125770 AA143045 Hs.81665 v-kit Hardy-Zuckerman 4 feline sarcoma v 2.1
TABLE 71B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number

Accession: Genbank accession numbers
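
As an illustration of the three-column layout described in this legend, the following minimal Python sketch parses one row into its Pkey, CAT number, and accession list. It assumes whitespace-separated fields with the accessions last. The names ClusterRow and parse_cluster_row, and the sample row itself, are hypothetical placeholders and are not values taken from the table.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    """One row of a Pkey / CAT number / Accession table (illustrative only)."""
    pkey: int              # Eos probeset identifier
    cat_number: str        # gene cluster number, kept as text (e.g. "12345.1")
    accessions: List[str]  # Genbank accession numbers belonging to the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row; assumes the accessions come last."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accessions: {line!r}")
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


# Hypothetical sample row, made up for illustration only.
row = parse_cluster_row("100001 12345.1 AA000001 AA000002")
print(row.pkey, row.cat_number, row.accessions)
```

The CAT number is kept as a string because the values in the table carry a version-like suffix after the point, which a float conversion could distort.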

Pkey CAT Number Accession
108451
107908 

123619 
131495 



101445 
100944 
117265 

117297 
108859 
125165 
116790 



13766.27 
46987,1 

371681.1 
142008J 



1650,-5 
25682_2 
10421.1 

647718.1 
137143 1 
1852047.1 
19B64J 



103038 



15024.1 



126086 
126098 
125464 
126127 
125558 
126426 



1606216J 

1629789.1 

168460.1 

1205826.1 

1703083.1 

110687.1 



118644 81501J 



127262 
127315 
103898 
126769 
120147 
128080 
104590 



127496 
126872 

126887 
128132 
126967 
120325 
112511 



231725.1 

37938J 

187213.-3 

119008.1 

386607.2 

1540039.1 

44964.1 



340470.1 
142696.1 

1572189.1 

177108J 

169750.1 

166688.1 

17406.2 



A1586662 AW389079 AA602099 AW134567 AW517843 AI682674 AM74874 AA374167 

AA81^34 a1S2 0 AA766261 AI769894 N63376 AA214392 AA135B33 AW605017 AA135955 AW450072 H04812 AA446459 
AW43^ 

AI566634 AW971760 AA211796 AA430O89 
M21259 

AW1^^^ N80848 AA699329AJ824676 R26624 R49653 

AW779829 N22494AI248120 _ n ^ enni 

AL121500 AW291763 AI732432 AA129708 AA133309 AI733750 AI225224 

15!JM^ 4 Am7^062 A1928938 AW1 61097 AW161 167 BE314465 AA351715 F07096 AA179034 F08510 F00653 AJ936671 

H85S09 AM170M AA339355 AW950556 D51397 AA213981 BE548002 AI056359 AA001560 AW9521 13 AA317769 A1857477 

S H85610 A!749CT4^5B2 A^3145 AA019127 AA687444 T06745 A1699293 H29532 AA214029 AA223656 NM.016834 
X^7 

H 7335 R37659 R42802 R46242 R60936 R59731 H28993 AA479907 R44570 AI890696 AA308884 AA507078 R41274 AI36S507 
T16348 AI560453 F03259 F04722T16312 AA016081 AW073061 BE314824 W28930 R44098 R51045 
AA9M9M nSa926959 W76521 W24270 W21526 AA037172 BE267636 H831 86 AA469909 N86396 AA001348 BE535736 AAQ81745 
^43X525 W757^^^^ 

S f 3 2k9738? a^»753AA17^8 NM 001826 X54941 BE314366AA908783A1719075BE270172BE269819AA889955 
SoK^ H98460 N31428BE255916 H03265AI857576AA776920 

A^94628 ^Aira3892W95096 AI034317 AA398727 AI085031 N95210 AI459432 AI041437 AA932I24 AA627684 AA935829 

aSa~ 

AMM6mmM35 i AA00i979 R97424 AI129015 N24127 M157451 AA235549 AA459292 M037114 AA129785 AI49421 1 

A^Sh77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 M701624 N7417 mUBI 

W^(S^BE261919AA769633AA480310 AA507454AA9105B6AI203723 AW104725 W25611 W25071 T88980H03513 

T775MR99156 W95095R97470 M70^ 

H75681 H70975 

M79088N88221 

N71B07AA203399 

N95428 W24040 AW751366 HB1987 

A^SaA1271 89 AA065075 AA070377 AA1 00017 AA079891 AA1 13255 AA0751 68 AA082764 AA083380 N84829 AA084752 

A1^6M \Sl669 AW769278 R82398 AA131925 R21776 R79031 A1129553 N70340 AI276116 AA709381 Aj^ffflJ 

otaqS A^8675 R16780R26523 K40161 W93671 R99823 R77956 H61601 R987B5 R0973B H78317 H53537 H81056 T53780 
KSSaS RM»7Fra88 W4C71 H66993 T48748 AI702300 R30775 H68996 Al0149y AA369082 
S^^A^^^^C\^m^mC\m\ C18029M7(^13A!168432R673MAJ1^3 
m^mSmSSSfSsm» R75872 W04151 AI220859 AI090290 AA366730 R94434 H81153 H70950AA3577TO 
hrim^^cS C17044D63217C18791 D78839 D63290 D78838 N91085 H5407O 

^2^8784^6323 R09737 C17492 D7B811 R67494 D78775 AA368244H12650 D79043AW957062R82759 C19002 H01715 

AA828125 AA834883 AA330555 

AF1 16622 A1114507 AA640834 AA377999 

AA248884 

AA083456 AA127026 AA084934 AA120972 AA08S317 
AI917116AA633698AI580389 Z39483 



AA75759BX56196AA902959 AI334784A1860794AA010207 AW890091 AW513771 AI951391 A1337671 T52499 AA890205 
/y611201AA910812AI220713AW149306Al758412AA045713R79750 M76096 

A^S^S^^^SI AAS84358 AA492073 BE168945 AAB09054 AW238038 BE011212BE011359 

BE01 1367 BE01136B BE011362 BE011215 BE011365 BE011363 

H20832Z40184 

AA225632 AI820S70 A1820952 AA226472 AI732140 A1732059 AA226307 AA225500 
AA205976AA205930 



S752W7^A624^^ 
£!E^^RW273AV^ 



A1767136 A1268928 AA0W761 AA329290 AI70O593 AA460766 BE222710 BE550651 



R11148 AI979278 AW235B19 AW874095 



AW136492
106012 95214.1



127705 966283.2 
127728 3398.5 
129012 22280.4 



10649S 



120934 
115197 



245223.1 



177521.1 
42406.1 



129706 81501.1 



131019 223488.3 

131170 8113.1 
123470 8785^29 



123532 
125032 
123808 
102398 
118129 
109841 
118922 



113119 
104799 
113560 
129794 



genbank_AA608733

genbank_T74884

genbank_AA620552

entrez_U42359

genbank_N57493

genbank_H01052

220560.1



genbank_T47910
genbank_AA029703
genbank_T91015
"15.1 



124250 


314220.1 


101447 


entrez_M21305


117357 


genbank_N24829


103392 


entrez_X94563


135029 


H58818 at 


105225 


genbank_AA211777


121292 


genbank_AA401807 


105909 


82840.1 


100237 


entrez_D30715


114988 


genbank_AA251089


123423 


genbank_AA598484 



i™ nK «T<;ifi81 N7746B H51833 AA147247 R75732 C18450 R73999 A1095755 T49904 H03B6a AA411580 R33395 AA410586 

I^l^TSreO W3786OTT488m7»06ro5632 H03612 AA9096M N50695 H02580 H12839 N58781 M742532 A1360919 
raMOTRW™raOTAW27AM1l62l R2S671 R36105H1245t H03869 H51263 AA035486 R2S109 

R2S110M147933 
AJ003322AJ003324 

SSuM H20702 H46S15 H40200 H50046 H23647 H50439 H18383 H21846 H23849 H40182 R21924 H01290 H01283 

KSoA^9 hS3i741 R220M AA043911 AA1S6838 078832 R36616 R66039 AM92481 AA088608 
ttWwSS NM483 A»rei54AI086754 AA595787 H81051 H01187 AI057251 T96992 H595B4 T47016 R31800 

aJ»^S«719967 wSw780 AW964077 AW1 66028 BE540193 N94800 AA452368 N99604 AB41345 AW298800 AA724961 
uSSn ^74m7AJW6MO aS 08 263053937 DS2496 AA974487 AW043854 N50483 Z39997 AI492961 W361S26 
TOoKl4UM551 AI47S19I667 AI341984 N92658T32870R52664 N50428 AW089291 A1934175AI423737 D60665 
AA226198 AA226S13 AA383773 



^784 ^5 MOOT AI457612 R63254 R28645 R27616 H01310 R78329 R76046 R76055 AA369734 N57914 H94864 
^^^^^a^n^\m6-!6mmA^0S3 R6725OHBO24O R32578 W86279 R80248 R23734 H0O977 R21732 



™6MR82345 M653499 ATOOT17R63207 ^3888 H54071 H66993T48748AI702300R30775HS1996A1014957AW69082 
So^TAmoS420^WM245^ M768458 C19062AB93674C18911 C18029M708613AI168432R673MW16Mra 

SS H40199 R33887 R02529 H94918 R00900 T87029 AA131924 R99891 AA004267 D78764 D78757 H61600 C1B615 
S X S3S37 C17492 078811 R67494 D78775 AA368244 H12650 D79043 AW957062 R82759 C19002 H01715 
wS ^^743S^043M3W95 AW572472 AW190386 AI129278 AI913081 AW473549 AA830713 AI982871 AI636647 

AI886032 A18239^^23WS AW19813S A1287510 AI565910 AA76577S A1866019AI263697 AI3SS825 R42668 AA894603 
AW10556S A1824555 AW339175 020479 



A A^37^W79760N48674A1375997R45432 059344 A1203107 F07491 R35360 R25094 A1913631 AI498402 T61382 AI016320 



AA362576 AI289927 AA382592 
AA608733 
T74884 
AA620552 
U42359 
N57493 
H01052 



AVV2M193 AW137594 AI953685 A1919143 AJ424371 AW007698 AW025681 AI954289 R40438 AW070364- U679653- «679081 



A1693374AI802007/S 
T47910 
AA029703 
T91015 

T^^^tHm^^T^M^Z^^ A1745507 A1807602 H41152 AA065244 AI242569 AI091032 AI251849 

AW499865R945B4R21283 
AA350256 H6B126 
M21305 
N24829 
X94563 
H58818 
AA211777 
AA401807 

AA401739 N22905 N70378 
D30715
AA251089 
AA598484 



M195191M190578AI632307M195227 M7 



Table 72A shows the Seq ID No, Pkey, ExAccn, UnigeneID, and Unigene Title for all of the sequences in Table 73.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

Seq ID No: Seq ID number correlate


Pkey    ExAccn      UnigeneID     Unigene Title                             Seq ID No
103080  AUUf lloi   He            cyclin D1 (PRAD1: parathyroid a           B1 & B2
410102  AW248508    He 97Q797     ESTs; homologue of PEM-3 [Ciona savignyi  B3 & B4
101104  AW862258    n5. 1 09409   neuropeptide Y receptor Y1                B5 & B6
447761  AF061573    Hs.19492      protocadherin 8                           B7 & B8
428183  AW969726    He QR3A1      ESTs, Weakly similar to serine protease   B9 & B10
439221  AA737106    He ¥)1*>(\    ESTs, Moderately similar to 178885 serin  B11 & B12
121619  AA528339    nS/wouoi      ESTs, Weakly similar to phosphatidylseri  B13, B14, & B15
104659  AW969769    HS.lUO£Ul     ESTs                                      B16
105782  H09748      14- C7Q07     B-cell CLL/lymphoma 11B (zinc finger pro  B17 & B18
129977  NM_000399   ns.ioyo       early growth response 2 (Krox-20 (Drosop  B19 & B20
100299  D49493      U* 0171       growth differentiation factor 10          B21 & B22
116301  AW969706                  ESTs                                      B23 & B24
106533  AL134708    Hs.145998     ESTs                                      B25-B27
131313  R96290      Hs.75874      ribosomal protein L44                     B28 & B29
105316  AI671245    Hs.24835      hypothetical protein FLJ14594             B30 & B31
113003  AW292315    Hs.7215       ESTs                                      B32
102836  U94320      Hs.ibooJU     neuropeptide Y receptor Y5                B33 & B34
102745  AW753865    HS.74J/0      olfactomedin related ER localized protei  B35-B40
123308  C14187      Hs.157208     ESTs                                      B41 & B42
120147  AI917116                  hemoglobin, beta                          B43
123049  BE047680    |l_ <n 4 gen  dickkopf (Xenopus laevis) homolog 2       B44 & B45
119082  AF252297    u. en cap.    cytochrome P450 retinoid metabolizing pr  B46 & B47
105301  AW352357    Hs.7457       MAGE1 protein                             B48 & B49
128478  AA708205    nS.lUUoAJ     ESTs                                      B50-B53
106111              Hs.6451       PRO0659 protein                           B54 & B55
131307  NM_000025   Hs.2549       adrenergic, beta-3-, receptor             B56 & B57
120830  AI568170    Hs.96886      ESTs                                      B58 & B59
127664  AA806164    Hs.116502     ESTs                                      B60
102725  AB026187    Hs.159156     protocadherin 11                          B61 & B62
132520  AA257992    Hs.50651      Janus kinase 1 (a protein tyrosine kinas  B63 & B64
130637  AA356764    Hs.17109      integral membrane protein 2A              B65 & B66
117602  N35020      Hs.44685      C3HC4-like zinc finger protein            B67 & B68
128797  NM_002975   Hs.105927     stem cell growth factor; lymphocyte secr  B69 & B70
129703  BE388665    Hs.179999     Homo sapiens, clone IMAGE:3457003, mRNA   B71 & B72
125770  AA143045    Hs.81665      v-kit Hardy-Zuckerman 4 feline sarcoma v  B73 & B74
414761  AU077228    Hs.77256      enhancer of zeste (Drosophila) homolog 2  B75 & B76
420462  AF050147    Hs.97932      chondromodulin I precursor                B77 & B78
410268  AA316181    Hs.61635      six transmembrane epithelial antigen of   B79 & B80
104691  U29690      Hs.37744      Homo sapiens beta-1 adrenergic receptor   B81 & B82
416836  D54745      Hs.80247      cholecystokinin                           B83 & B84



Table 72B 

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers



Pkey CAT Number 
131307 3138J 
131313 93372J 



116301 
103080 



52669.2 
17092.1 



N^00X]25 X72861 X70811 M29932 X70812 S53291 AW015148AW581776 AI276134 

R9629 S^1 C1^7 M357588 AA367557 H89632 C17954 AA568860 A1752983 AA699451 H04260 A11281 18 AW193364 N94503 
aS1t^AA^^^B^7384AA747591 R22855A1032539 A1151343AA148534 H63941 T^AA^W^6 

R77664 T52101 AW953745 AW953739 D79107 AA029105 AW953738 AA456487 R67000 AA156623 AA368336 H63662 

AU&77B1 AAM2 9 ^^^ 9 X59798 M64349 NM_001758 AA226805 M73554 BE409154 AA1 60096 BE384352 AA160820 BE382880 
BE261^4 AA113821 SmSeOTJ BE390020 AA100B54 AA127152 AW794066 AW357101 AW367093 U47703 A1347077 

mSSoS^^ R81309 AA100801 AA227161 ^J^^^ L ^ 

i AAB5221B AA158286 N20142 AA622148 AA864225 AA576367 AW182124 T89175 A1758455 AA780573 N71757 R81200 AB59596 
A1674613 AA642544 AW503909 AA1 28851 W39350 N4O420 AA113072 BE168116 AI620604 AI298125 BE075272 N40078 BE07510S I BK80779 
S^r^BK^ BE075108BEra0955BE0B9655BEM1115A^ 
W1 46884 BE0751 54 AW992247 A1186525 AI752230 AW263140 W03329 N26056 AA948080AA1 13073 H99284AA227101 AA631077 

K16^M H^Sw VTO7^ AA425549 W31090 AA807411 BE173280 BE000178T09020 W23852 AA062709 BE167894 AA076515 SSSq,. 
BpSS^ H20622 W44436 W67604W46412AVmi113AI700™ 

ISEKh mmmSQmsmm2SS\ AI087288 R76299AW604781 N55320 AI912334 AA403248 AW159156r«4970 
AA599485 A1276505 AI953979 AA563710 H25674 H51747 AA425389 AA516104 AI095335 T77237 AA151695 T92084 A16BS037 j AJ624162 
)S 1 l^OuS^^ A1311390AA622062 A1055890A1660881 AI366117M«^0 A^728 8 

™6B564AA9^ 
A\SlW280632T^^ 

ISoS 



AVOTZ354 A1094358 R37210 AA948510 AA226909 BE172527 A1086652 BE408324 AW29284B A1768962 BE540703 BE409478 AA931692 






WO 03/042661 



PCT/US02/36810 



132520 45737.1 



117602 10565_-4 
102725 1158ZJ 
102745 13186 J






119082 34131J 



102836 
125770 






105316 



128478 
105782 
120830 
128797 



121619 
123049 



25380J 
16581.2 



120147 386607.2 

113003 420161.1

104659 82594.1 

104691 41308J 



105301 27795.5 



181763J 



127664 394361.1 
106111 4671.1 



370722.2 
246361.1 
244476J 
20410.1 



106533 305354J 



284072.1 
30801.1 



129703 64196J 




100299 2801J 
129977 2039.1



123308 322009.1 
101104 26442.2 
130637 7040.1



AA2S7992 AA317029 AA362097 D17042 H48100 AW838107 AW838106 AW838110 AW838231 AW610241 AW838442 AA045061 AW384991 
BE000988 AA131806 BE180577 AW838269 AL039831 AI754380 C06051 AA131737 AW838239 A1767465 AI480134 AA610312 AW838190 
AA379252 AWB38282 AA484027 A1423413 AI685064 BE328307 AI241857 AW838281 AW8382B4 AI671267 A1814928 AA828367 A1476305 
AI270180 AA622362 AI076497 AI635919 A1444994 AI194028 AI669159 AI928131 AA448853 AI350143 AW079289 AW467B07 AA480442 
AA522935 BE180570 AW3800B7 AW3801 09 AI081015 AI690818 AI5B9485 AI698510 AA642019 AA714366 A1580430 AA985527 AI740475 
C21398 AA257993 AI302393 A1689018 AW770194 AW753750 AI079164 BE550338 AA559851 T16108 AI864822 AI932827 AA045095 
AA045O62 AI954225 AA768569 AA709308 AW958363 
N35020 

AB026187 NM 014522 U79247 F13304 AA224524 F10902 R39431 

AW753865 F07644 F11280 F06355 F08136 U79299 F07459 F08750 F12419 F12842 F06488 F0B585 F13403 F05921 F05512 F11683 F05416 
F12841 F12810F05418 F12850 R24551 F06276 H11B03H10237 H11542 H22894 R59563 H19351 H18722F11237 F08507 H23123 R61595 
F07796 F06201 F12289 F07107 T78113 F08734 F13344 F05760 F11784 F08780 F05835 R20588 F07739 F1 1497 R17410 H17414 R20440 
T66090 R25292 T66236 T78766 T79908 R25286 R60071 C14761 AW905192 AA331914 AW965291 H09000 F05212 F08313 F05825 H06399 
H15135 H1 1378 R66424 R61541 AA235405 AI205041 R595S4 H23124 H1B638 R37359 A1571275 R41780 T66174 T66159 R54102 W07657 
H10794 R52337 R42890 H14354 R40952T17391 H1206B R56797 R51000 R60011 R37617 R39434 H29286T16403 H09129 R49114 H15161 
H11729 R39160 H10468 H24454 H11432 R54433 H08768 H24042 H09511 H17281 H15292 R37849 AI991965 R52815 R42850T17390T16438 
T17388 T16285 R48963 R50782 R44247 R46729 R5955B R40696 R43575 R44420 T17403 R37754 R46637 R51039 R40513 T23785 F10433 
T16350 H10191 R43688T23543 H08591 R44351 R37575 R49508 F04379 F04056 F05067 F02010 F09158 F10036 F08900 F02559 F09914 
F09434 F05034 H17415 F02644 H10650 F10451 H14589 F10444 T16440 R44517 R46643 F03993 F10443 F02170 F09343 F10414 F04794 
F11122 F04997 F03894 AW005932 F02767 F10941 F01777 R45260 F03386 R45261 T65009 F10990 F08942 F01673 H23253 F04533 F02082 
F01669 F03689 H09568 R40763 H06400 R50958 N80570 AI082296 AI198226 T23763 AA814486 AW015823 AI886272 H06514 AA515969 
F10049 AI984267 AI886279 AA558516 F02071 AI783843 AI804884 F01468 AA730438 AF035301 R3B389 R41440 R59468 R14437 T89036 
R43560 H10821 R52850 R56722 R14559 T09299 H10467 R25829 F13546 R56796 R51 151 D82657 T32952 T66257 H06569 F12430 T74326 
H08489 R34430 R13083 H23364 R21 134T77098 AW957157 H24150 Hi 1403 R52336 F12830T77173 T74281 T77663 T74736 H10649 H12067 
R20092 T74735 R54530 R17552 R59557 R54529 H17280 T75171 H14694 T74975 H15291 R18574 R54157 R13446 H15221 R35120 R35031 
R21277 R19632 R35820 R50899 R13923 R36771 R26485 R33061 R20100 H24426 H09184 H15958 R19253 H15196 K09510 R36032 R19691 
H14402H14885T17068 

AF252297 N2788B F13415 R34552 R14377 AA323962 AW611514 AW614097 AW1 95487 AI026040 AI0976B3 AA012834 AW969885 AA988198 
R49069 AW873673 R37774 F08840 AA484668 AW873664 AA170805 AA1 70825 AA585275 AA585280 AA012833 
U94320 NM 006174 U66275 

AA143045 AW271802 AI017093 AA450183 W93694 A1885044 AI890068 AW264522 AW073559 AI355325 AI051302 AI567686 A!1 43554 
AI457891 AW468784 A1567693 AW884460 AI424892 AA401 298 AW244098 AI122935 N24824 AA552932 AAB731 64 AI308810 N26098 H81 852 
N20798 AW263947 AI696267 AA449422 AW877967 
AI917116 AA63369B AI580389 Z39483 

AW292315AI090408 AW517041 AI92869S A1092060 AA71 6741 N98235T23629 

AW969769 AA484137 AI291362 N64753 AL133784 AA007153 BE046923 D59990 C15166 D59991 D60194 AA648108 N76160 D61027 D60630 
U29690 AFZ72890 AA708166 AA010748 R23839 H59572 R23383 A1625747 AA01 1176 AW665324 H02674 H59573 AA757931 H12502 R66851 
AA702310AA748431 AA749352 C01127 AI003679 T96352 

AW352357 AA852419 T80568 AA332510 AW149659 AA341609 AA809550 AI183690 AA730875 T31719 AW955361 AA779709 T05461 
AA564806 AI142536 AI553857 D53384 D53747 D54101 D55112 AW071935 AI082447 AI290911 AW168872 AA199743 AI091646 AI246702 
AC71662 AW1 17296 AJ032138 AI741489 AI53B705 AA233541 AI682685 AW136268 AI689641 AA599916 AI174443 AI432175 AI131328 
AW136277 AA852418 AA243035 T03902 C15651 D52660 H08108 AA844525 AJ420648 AI282445 AW068986 AW497637 AA779042 AA298564 
BE262936 

AI671245 D81062 A1085879 AI193252 AI870535 N23132 H97661 H18098 AA552173 AA233924 AA503379 H19618 D60732 F04762 H1 1343 
AW590283 

AA806164 AA678564 AW978234 AW023951 AA651859 A1699785 AA913686 AW974569 A1681287 

AW875398 A1064861 NM.014138 AF090943 BE071887 BE547232 BE622599 AI345700 AA284315 AA147700 AW376340 BE220170 BE348475 
AA025037 AA057284 AA025038 T16193 AW130196 AW594516 AI989991 AI139091 AW675747 AA830435 AI924977 AI670778 A1051887 
AA912574 AW1 30213 AI828335 AI570587 AI332760 AI3131 15 AA804653 AA477920 AA147645 A!91 6912 AW768839 AA715098 AA292723 
AA732421 T331 22 AI631 560 A1698688 AI1 85386 R43B29 AA421638 AA981400 C00503 AA604025 AA554026 AA553B85 BE172244 R24O40 
AA708205 H15963 R20851 L13829 A1934608 Ai950163 H92B93 AI267243 N66614 H28983 R42031 H06198 AM68628 H92779 L13828 T34896 
H09748 R61554 AA350215 R40551 

A1568170 M731212AW294848AW466878AA568908AI281804A1923533AA347885A1718854M 

NM 002975AF020044AB009244BE620464AAS41683AW000991 AW089850 AW664612 A1636576 A1627654 AI885823 A1864548 Al 5541 22 
AW99682 AI57O905 AI376008 AI871364 AI074230 AT383719 AA905O16 AA977446 AA844732 T78922 AW244007 A1924747 AI9B8230 AI824575 
AI699224AI569490AW379856AW292314AI016874AW292037 

AL134708 AW205610 AW665919 AW594227 AI917093 AI498361 AA635141 AA4537B6 D54178 D53224 A1804341 D531 1 1 M788S5 AL1 34707 
AA453903 AI278054 AI334339 AI380496 AW235939 D60373 060993 081 246 

BEM7680 ^1TOMr^014421 AB033208 AB035181 W55979 AI093105 AIB63392 AA715452 AA300480 AW022313 AA648103 AA720544 
D61856 AA481794 AA737112 AW316551 AA714937 AW974378 AI370632 AI184289 AI333962 AA714945 

BE388665 AA740614 AI394542 AA484451 AW070675 AAB77654 AI360981 AI475799 BE349560 AL037351 N29437 AI246453 AW083444 
AA844441 AI150871 AM40477 AI889061 AA682422 AA6B0053 AA740212 AIB58728 AA490589 AW243692 A1249450 AJ304448 AA534421 
AB00317 AI273337 AW027729 A1979132 C06120 AI860916 AW591923 AI803174 AI559264 BE250500 AW675522 AA883284 T71033 AI346838 
AI066754 A1197913 AA931979 AA427599 AI304730 AI285159 AI273325 AI038450 A1033760 N74972 AA136517 A1304446 AI262652 AA401348 
AW514617AI361795AW514389 N30307 N93439 AA1 90775 BE349902 AJ3421 14 AI1 86831 AA039628 H74242 AI914825 AI955761 AI343851 
AA693856 T71096 AI498539 AA778995 AA084278 AI27467B AI022673 AW339621 AA704929 AW089735 AA041 252 AA161337 AI185333 
AW189397 AI160990 AI082392 AI935848 Al 144404 A1141633 AA447987 AA464241 AI342103 AW0261 18 AAB34025 AW664459 AA603777 
AA630752 AA115450 N49944 AW166071 AW769795 AA991877 AAB73609 AVW83441 H99130AA910538 AA485199 AW083358 AA151767 
AJ161193 N54945 AA180193 AW23477 A1188387 A1608747 N58900 A1038449 AA136852 AA565913 AA180502 H61741 AA701 125 Ah 27678 
AA278541 R76523 AA666187 H85083 F24755 R77729 AA135137 BE250395 R76239 AW794625 AW794404 H61957 AA308197 AW996031 
AA345524 N52726 AA354360 AA179785 N57375 R16447 AA 160645 W21481 „„„„ „„ no , 

D49493 NM 004962 D49492 AA948479 AA993843 AA017558 AA992567 AA912897 AI185363 AA018254 N93253 AA297544 T191B7 R53287 
AA069337 AA297489 N62906 AW023929 AA01 8253 AA017615 W251 10 R52085 R52633 W16549 H88916 H891 34 AA069338 
NM 000399 AA018140 AF1 39463 J04076 AW390694 AW361756 AL079830 AI1905B7 AA731059 AA742743 AI635915 AA878209 AI262612 
AI2B4201 AA446300 AA715016 AA446027 AI274101 AA593330 AA593338 AW015388 AI681724 AI347984 AL045275 AA970069 AA0181 88 
AW014396 AW769445 AI588B69 AW338296 A1801548 AI367702 AI888410 
C14187 AA496211 DB0456 AA484051 080455 AA528446 

AW862258 L07615 

AA356764NM 004867 AF038953W16828 R50284AI929589R01594AA010474W37917 N64212N78175R57511 AA329322 W77834 
AA169869 AA285240 AA356670 T93146 AA325779 AA357064 AA328151 AA306939 AA306801 N31 260 AA306202 AA312530 BE24CT 03 
AA354371 N42259 AL021786 H50965 H38038 AI655965 AJ566732 AI953952 AW972835 AI077300 AJ948482 AI760966 AA648335 W721 97 
N32B15AA768073 AW338181 AW001742 AI81 9896 A101 5666 N5B011 AA677584 Al 1 83750 AW1 05514 W37875 AW301 679 AA775257 

410102 117657.1
410268 11B97.1



414761 14860.1 




416836 16233.1 

420462 19379J 


428183 287965.1 

439221 46993.1 

447761 7355.1



AA730357 AI128690 AA630149 AW241515 A1147003 AI183499 N90923 AI968582 AA354145 AI147297 BE550502 AI085777 AI709087 
AA773947 AA469192 A1199289 AI127085 H89232 N30719 A1028165 AW029469 AI709314 AA780834 AA778057 HB9125 A1654762 T93061 
N59622 A1360625 AA71 9982 AI248242 AA233925 AW26251 2 AW438666 AA01 0378 AA484010 AW438790 N22875AA483718AI185407 
AA736720 AA664587 AA648672 AA503406 AA5081 23 AA503140 H46638 N7581 2 AA730938 060546 AA483039 AA507705 AA480448 
AI768654 AA658955 AA730803 AA330312AW021127 BE172031 AW961052 AA339070AA648172 BE172892AA174175AA492012AA491590 
AW118733 AA484130 M730783 AA730510 D62577 D62B28 A1469294 AI801343 N53447 AA740226 AI831 983 AA51 6146 D79799 AI758212 
AA484040 H38039 AI038772 A1039410 AA52B387 AA730576 AA527231 M7481 1 1 D62946 U38439 AA721708 H22130 AA559229 H51620 
AA283355 AA730970 R01595 N72450 AA469123 AA579389 AA480477 AA283306 AA347822 AA328013 

AW24850B A1278454 A1369343 AW291235 AI343946 AI347189 AA970152 AI279668 A1971 894 AA664359 A1191039 AA084598 AA081 641 
AA316181 BE439545AC005053 AF186249AW386101 AC004969AA730199AA032221 AI6B5139 AI167942 AA809228 AI184070 A1394674 
AW969977 AA032279 AW079284 AA513174 AA88831 2 AI453179 AA4B3363 AA528432 AA57951 1 

AU077228 U61 145 AL044256 AA206185 AL044975 NM.O04456 X95653 AW890905 AA332806 AW965453 BE299226 U52965 BE275009 
AA428252 AA769212 AA312778 AA810901 AA46131 5 A1969286 AI953668 AW296248 AI291422 AA704747 Z44716 BE541235 AW608806 
AA588583AW403788AW408386 AA053521 T87571 AA581 163 BE1 83241 T97906 R891 49 AI8081 96 BE61 B521 A1970278AI652680AW130143 
AI970272 AW197254 BE550449 A1628403 AA010143 AW005107 AA251815 AA805836 N53459 AA055309 AA643678 AI345616 AI345618 
AI635728 AI871286 AI018437 AI262514 H75287 AI033155 AI203388 AA205802 A1475265 AW5941 86 Al 696059 A1333312 AW264710 AA876764 
AA864892 AA173156 AI292066 T79837 W93995 AA506427 AA815135 AI376342 AAD1Q144 A1310421 AI223350 AA970576 N33926 T89364 
AI090913 N55098 AA1 01673 AA626665 AA251840 N6B857 AI890053 H05930 AI474199 AI436166 AA628810 K64452 A1205538 AA055310 
BE244495A1365179T99536A1203088 Z40549 AW139858 AA599627 R91847 AA483681 AW068982 H51746AA091201 H13030AA459732 

DM^A^77^9 R So07M R71188 H87563 R80701 AW955319 AA376289 R82442 H46452 R22481 R64323 AW043690 L00354 AW005571 
H19813 AI745597 AI769202 AI205982 AW205841 R64324 AI201975 H46993 C18067 R71189 N29729 AI799838 A1091209 H87455 RB2443 
AW970995 AW072136 N40841 H13148 AI219052 N30191 T28O07 AI817428AA503602 

AF050147 NM 007015 AB006000 AB005999 BE21 81 14 AI703147 AI985891 BE222034 BE549774 BE327618 AI651459 BE549987 AI657194 
BE222414 BE046984 AW470357 AW339338 C02526 AW590220 AI298082 AW296998 A1333135 AI867816 AA730514 BE348513 AA405500 
AW1 36179 AA327834 BE327350 BE327366 

AW969726 AA424026 A1266636 AI271571 AA423968 AI266634 AA702780 AA452037 AI424866 AA742931 AA483351 

AA737106 N35765 N35779 AF086045 N30152 AW374028 A147B237 AA923562 AI382236 AA846668 AI627389 AI371709 H98083 H95983 

H26576 H95984 AA745021 AA639180 H26575 

AF061573 NM.002590 AA780138 AW135620 A11978B4 AA483348 R05468 AW969724 H29216 
Table 73: Sequences

Seq ID NO: B1 DNA sequence
Nucleic Acid Accession #: NM_053056.1
Coding sequence: 210..1097

1 11 21 31 41 51

| | | | | |

CACACX3GACT ACAGGGGAGT TTTGTTGAAG TTGCAAAGTC CTGGAGCCTC CAGAGGGCTG 60 
TCGGCGCAGT AGCAGCGAGC AGCAGAGTCC GCACGCTCCG GCGAGGGGCA GAAGAGCGCG 120 
AGGGAGCGCG GGGCAGCAGA AGCGAGAGCC GAGCGCGGAC CCAGCCAGGA CCCACAGCCC 180
TCCCCAGCTG CCCAGGAAGA GCCCCAGCCA TGGAACACCA GCTCCTGTGC TGCGAAGTGG 240 
AAACCATCCG CCGCGCGTAC CCCGATGCCA ACCTCCTCAA CGACCGGGTG CTGCGGGCCA 300 
TGCTGAAGGC GGAGGAGACC TGCGCGCCCT CGGTGTCCTA CTTCAAATGT GTGCAGAAGG 360 
AGGTCCTGCC GTCCATGCGG AAGATCGTCG CCACCTGGAT GCTGGAGGTC TGCGAGGAAC 420 
AGAAGTGCGA GGAGGAGGTC TTCCCGCTGG CCATGAACTA CCTGGACCGC TTCCTGTCGC 480
TGGAGCCCGT GAAAAAGAGC CGCCTGCAGC TGCTGGGGGC CACTTGCATG TTCGTGGCCT 540 
CTAAGATGAA GGAGACCATC CCCCTGACGG CCGAGAAGCT GTGCATCTAC ACCGACAACT 600 
CCATCCGGCC CGAGGAGCTG CTGCAAATGG AGCTGCTCCT GGTGAACAAG CTCAAGTGGA 660 
ACCTGGCCGC AATGACCCCG CACGATTTCA TTGAACACTT CCTCTCCAAA ATGCCAGAGG 720 
CGGAGGAGAA CAAACAGATC ATCCGCAAAC ACGCGCAGAC CTTCGTTGCC CTCTGTGCCA 780
CAGATGTGAA GTTCATTTCC AATCCGCCCT CCATGGTGGC AGCGGGGAGC GTGGTGGCCG 840 
CAGTGCAAGG CCTGAACCTG AGGAGCCCCA ACAACTTCCT GTCCTACTAC CGCCTCACAC 900 
GCTTCCTCTC CAGAGTGATC AAGTGTGACC CAGACTGCCT CCGGGCCTGC CAGGAGCAGA 960 
TCGAAGCCCT GCTGGAGTCA AGCCTGCGCC AGGCCCAGCA GAACATGGAC CCCAAGGCCG 1020 
CCGAGGAGGA GGAAGAGGAG GAGGAGGAGG TGGACCTGGC TTGCACACCC ACCGACGTGC 1080
GGGACGTGGA CATCTGAGGG CGCCAGGCAG GCGGGCGCCA CCGCCACCCG CAGCGAGGGC 1140 
GGAGCCGGCC CCAGGTGCTC CCCTGACAGT CCCTCCTCTC CGGAGCATTT TGATACCAGA 1200 
AGGGAAAGCT TCATTCTCCT TGTTGTTGGT TGTTTTTTCC TTTGCTCTTT CCCCCTTCCA 1260 
TCTCTGACTT AAGCAAAAGA AAAAGATTAC CCAAAAACTG TCTTTAAAAG AGAGAGAGAG 1320 
AAAAAAAAAA TAGTATTTGC ATAACCCTGA GCGGTGGGGG AGGAGGGTTG TGCTACAGAT 1380
GATAGAGGAT TTTATACCCC AATAATCAAC TCGTTTTTAT ATTAATGTAC TTGTTTCTCT 1440 
GTTGTAAGAA TAGGCATTAA CACAAAGGAG GCGTCTCGGG AGAGGATTAG GTTCCATCCT 1500 
TTACGTGTTT AAAAAAAAGC ATAAAAACAT TTTAAAAACA TAGAAAAATT CAGCAAACCA 1560 
TTTTTAAAGT AGAAGAGGGT TTTAGGTAGA AAAACATATT CTTGTGCTTT TCCTGATAAA 1620 
GCACAGCTGT AGTGGGGTTC TAGGCATCTC TGTACTTTGC TTGCTCATAT GCATGTAGTC 1680
ACTTTATAAG TCATTGTATG TTATTATATT CCGTAGGTAG ATGTGTAACC TCTTCACCTT 1740 
ATTCATGGCT GAAGTCACCT CTTGGTTACA GTAGCGTAGC GTGGCCGTGT GCATGTCCTT 1800 
TGCGCCTGTG ACCACCACCC CAACAAACCA TCCAGTGACA AACCATCCAG TGGAGGTTTG 1860 
TCGGGCACCA GCCAGCGTAG CAGGGTCGGG AAAGGCCACC TGTCCCACTC CTACGATACG 1920 
CTACTATAAA GAGAAGACGA AATAGTGACA TAATATATTC TATTTTTATA CTCTTCCTAT 1980
TTTTGTAGTG ACCTGTTTAT GAGATGCTGG TTTTCTACCC AACGGCCCTG CAGCCAGCTC 2040 
ACGTCCAGGT TCAACCCACA GCTACTTGGT TTGTGTTCTT CTTCATATTC TAAAACCATT 2100 
CCATTTCCAA GCACTTTCAG TCCAATAGGT GTAGGAAATA GCGCTGTTTT TGTTGTGTGT 2160 
GCAGGGAGGG CAGTTTTCTA ATGGAATGGT TTGGGAATAT CCATGTACTT GTTTGCAAGC 2220 
AGGACTTTGA GGCAAGTGTG GGCCACTGTG GTGGCAGTGG AGGTGGGGTG TTTGGGAGGC 2280
TGCGTGCCAG TCAAGAAGAA AAAGGTTTGC ATTCTCACAT TGCCAGGATG ATAAGTTCCT 2340
TTCCTTTTCT TTAAAGAAGT TGAAGTTTAG GAATCCTTTG GTGCCAACTG GTGTTTGAAA 2400 
GTAGGGACCT CAGAGGTTTA CCTAGAGAAC AGGTGGTTTT TAAGGGTTAT CTTAGATGTT 2460 
TCACACCGGA AGGTTTTTAA ACACTAAAAT ATATAATTTA TAGTTAAGGC TAAAAAGTAT 2520 
ATTTATTGCA GAGGATGTTC ATAAGGCCAG TATGATTTAT AAATGCAATC TCCCCTTGAT 2580
TTAAACACAC AGATACACAC ACACACACAC ACACACACAC AAACCTTCTG CCTTTGATGT 2640 
TACAGATTTA ATACAGTTTA TTTTTAAAGA TAGATCCTTT TATAGGTGAG AAAAAAACAA 2700 
TCTGGAAGAA AAAAACCACA CAAAGACATT GATTCAGCCT GTTTGGCGTT TCCCAGAGTC 2760 
ATCTGATTGG ACAGGCATGG GTGCAAGGAA AATTAGGGTA CTCAACCTAA GTTCGGTTCC 2820 
GATGAATTCT TATCCCCTGC CCCTTCCTTT AAAAAACTTA GTGACAAAAT AGACAATTTG 2880
CACATCTTGG CTATGTAATT CTTGTAATTT TTATTTAGGA AGTGTTGAAG GGAGGTGGCA 2940
AGAGTGTGGA GGCTGACGTG TGAGGGAGGA CAGGCGGGAG GAGGTGTGAG GAGGAGGCTC 3000 
CCGAGGGGAA GGGGCGGTGC CCACACCGGG GACAGGCCGC AGCTCCATTT TCTTATTGCG 3060 
CTGCTACCGT TGACTTCCAG GCACGGTTTG GAAATATTCA CATCGCTTCT GTGTATCTCT 3120 
TTCACATTGT TTGCTGCTAT TGGAGGATCA GTTTTTTGTT TTACAATGTC ATATACTGCC 3180
ATGTACTAGT TTTAGTTTTC TCTTAGAACA TTGTATTACA GATGCCTTTT TTGTAGTTTT 3240 
TTTTTTTTTT ATGTGATCAA TTTTGACTTA ATGTGATTAC TGCTCTATTC CAAAAAGGTT 3300
GCTGTTTCAC AATACCTCAT GCTTCACTTA GCCATGGTGG ACCCAGCGGG CAGGTTCTGC 3360 
CTGCTTTGGC GGGCAGACAC GCGGGOGCGA TCCCACACAG GCTGGCGGGG GCCGGCCCCG 3420 
AGGCCGCGTG OGTGAGAACC GCGCCGGTGT CCCCAGAGAC CAGGCTGTGT CCCTCTTCTC 3480
TTCCCTGCGC CTGTGATGCT GGGCACTTCA TCTGATOGGG GGCGTAGCAT CATAGTAGTT 3540 
TTTACAGCTG TGTTATTCTT TGCGTGTAGC TATGGAAGTT GCATAATTAT TATTATTATT 3600 
ATTATAACAA GTGTGTCTTA CGTGCCACCA CGGCGTTGTA CCTGTAGGAC TCTCATTCGG 3660
GATGATTGGA ATAGCTTCTG GAATTTGTTC AAGTTTTGGG TATGTTTAAT CTGTTATGTA 3720 
CTAGTGTTCT GTTTGTTATT GTTTTGTTAA TTACACCATA ATGCTAATTT AAAGAGACTC 3780
CAAATCTCAA TGAAGCCAGC TCACAGTGCT GTGTGCCCCG GTCACCTAGC AAGCTGCCGA 3840 
ACCAAAAGAA TTTGCACCCC GCTGCGGGCC CACGTGGTTG GGGCCCTGCC CTGGCAGGGT 3900 
CATCCTGTGC TCGGAGGCCA TCTCGGGCAC AGGCCCACCC CGCCCCACCC CTCCAGAACA 3960 
CGGCTCACGC TTACCTCAAC CATCCTGGCT GCGGCGTCTG TCTGAACCAC GCGGGGGCCT 4020 
TGAGGGACGC TTTGTCTGTC GTGATGGGGC AAGGGCACAA GTCCTGGATG TTGTGTGTAT 4080
CGAGAGGCCA AAGGCTGGTG GCAAGTGCAC GGGGCACAGC GGAGTCTGTC CTGTGACGCG 4140 
CAAGTCTGAG GGTCTGGGCG GCGGGOGGCT GGGTCTGTGC ATTTCTGGTT GCACCGCGGC 4200 
GCTTCCCAGC ACCAACATGT AACOGGCATG TTTCCAGCAG AAGACAAAAA GACAAACATG 4260 
AAAGTCTAGA AATAAAACTG GTAAAACCCC AAAAAAAAAA AAAAAA

Seq ID NO: B2 Protein sequence 
Protein Accession #: NP_444284.1 



1 11 21 31 41 51 
MEHQLLCCEV ETIRRAYPDA NLLNDRVLRA MLKAEETCAP SVSYFKCVQK EVLPSMRKIV

ATWMLEVCEE QKCEEEVFPL AMNYLDRFLS LEPVKKSRLQ LLGATCMFVA SKMKETIPLT 
AEKLCIYTDN SIRPEELLQM ELLLVNKLKW NLAAMTPHDF IEHFLSKMPE AEENKQIIRK
HAQTFVALCA TDVKFISNPP SMVAAGSVVA AVQGLNLRSP NNFLSYYRLT RFLSRVIKCD
PDCLRACQEQ IEALLESSLR QAQQNMDPKA AEEEEEEEEE VDLACTPTDV RDVDI 
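
The "Coding sequence" field of each DNA entry gives 1-based, inclusive nucleotide positions, and the paired protein entry is the translation of that interval. The following minimal Python sketch illustrates the relationship for the first ten codons of Seq ID NO: B1 (coding sequence 210..1097). The function name `translate_cds`, the constant names, and the use of "X" for unreadable codons are illustrative assumptions and are not part of the listing; the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate dna[start..end] (1-based, inclusive) up to the first stop codon."""
    # Remove the blanks that separate the 10-base blocks in the listing.
    cds = "".join(dna.split()).upper()[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Nucleotides 181-240 of Seq ID NO: B1, copied from the listing.
FRAGMENT_181_240 = (
    "TCCCCAGCTG CCCAGGAAGA GCCCCAGCCA TGGAACACCA GCTCCTGTGC TGCGAAGTGG"
)

# Position 210 of the full sequence is position 30 of this fragment.
print(translate_cds(FRAGMENT_181_240, 30, 59))  # MEHQLLCCEV
```

The same call with the full sequence and the listed interval (for example `translate_cds(full_b1_sequence, 210, 1097)`, where `full_b1_sequence` is a hypothetical variable holding the complete entry) would be expected to reproduce the corresponding protein entry, provided the nucleotide text is free of transcription errors.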

Seq ID NO: B3 DNA sequence 

Nucleic Acid Accession #: XM_044166

Coding sequence: 1..1576

1 11 21 31 41 51 

CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 
TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 
CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 
GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 
GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 
CCGAATATTC TCCTAAGCCC CGTGCCCCAT GCCTCCGGGG CCCACTCCAC TGGGCCCACC 
CTGGACCTGT TTTCCACTAA AGCCTTTTGG AAAGCGGTGA TTTGAGGGGC AAGGTGCTTA 
GAGATACTCG CTCGCTGGGG AAGGGGGGAG GGAGGCAGTG GTGGCTGGAG GGTGCGCCAC 
TTTCAGAGCC TCTGGTCACC CTGTCCTGGA AAGATTGGGA GGGGGCCAGA CTGAAAATTT 
TACTAGAGTT ACAACTCTGA TACCTCAACA CACCCTTAAA TCTGGAAGCA GCTAAGAGAA 
ACTTTTGTTT TGCCAGAGGT GGCCACTAAG GCATTCTGAC GCCCTCTGCC CACCTCCCCC 
GCTGTGTGTC ACTCCACCCC TTCTTCCGAG GAGGGGGTGG GTAAAAGGGA GAGGGAGAAT 
TACCACCTGT ATCTAGAGGT GCTCTTTGCA ATCCCTAAGC CCTCTGGTCC TGACCTCCGA 
CCTCCCAGCT CTGTCTTGTT CCTTGTCTTT GTCTTTCTTC CCTTCCCCCT GCCCCTGCCC 
CTACCAGCCC AGCTTTGGGG ACACCATCCT TCTGGGGAGA AGTAGGGGGA GGAATATTTG 
GATGGTCCCT CCATTCCTCT TCAGGCATCT GGAGGCCCTC TCCCCCACTC CTCCAAAGAA 
ACATCTCAAA TTATTGATGG AATGTATCCC CATTCTCAGT GAAAATGTGA GGAGGGGACT 
AATACTGGGG TAAAGGGTCA AACCCCCACC TTCATCACTA TGGGCATTAT ATTTAGGGAG 
TAGTTCTTGG GCTGGATTTT CTGGTTGTGG AAGTGGGGGC GCCAGAGTAG TGTGTCTGCT 
ATTTAAAGGA GCAGGAAAGG GCGTGAGGCA GGAGGAGAGA CTGGTGGAGG GAAGAGCTGC 
TCCTCCCATG CAGTGCCCGA CTCCCTGCAC CCCTCTCAAC CTGACCTGAA CCTTTATTGA 
ATCCTTATTA GCTTGAATCC TTATTAGCTT GAATCCTCCA TGCAAATCAT GGAGTCTGTG 
TCCCACCTGA TGTGGTTGAG GAGAAGCCAG GTCTTCAAAG AGGGGTCAGC CTGGGGCAAA 
GCAGGACTGG GGGGAGGTGG GCAGCAGGGC CTATTCTGAG AATCACATAT TGTTACAGGC 
CTTGCACCCC CTTTGCTGCT TCCCTCCTGC TCATTTGGGG CTGCCACCAG CTCTCCACCC 
TCCTGGTTCC GCTGGCCGGG CCAAGAGAGG ATGGAGGGAT GGGAGTCCCA GGAGATCCTT 
GTAAATAGTG GGGTGGGACT GTTCTGAGTG ATCACCCGAG CACTTAAAGC TCCAGAGTCC 
CATTCTTCCT GGATGGAGCA GGTGGAGGTG CAGAGGGGAT TTCCTCCTCT CCTTCCTCCT 
GTCGAGAATT AACACCTCTC CACAGCCTTC CCCTCCAGAA CACCAGCCAG GGAGGGGTCG 
GGAAGGAGGT CACAGCCAAG AAAACTGCCC TGTGACGACT TCCCTCCTTC CCGCCTATGT 
GAGCCATCCT GAGATGTCTG TACAATAGAA ACCAAACCAA ATGGGCACCC TCGGTTGCCG 
GGGGGCAGGT GGGGAGGGGG GTGGGAAGAA GGGATGTCTG TCTGTCGTCC CCCTCCCCCT 
CTCCACTCTT TACCCACAAA GGCAGAAGAC TGTTACACTA GGGGGCTCAG CAAATTCAAT 
CCCACCCTTA CCAATTGAGC CAAACCTAGA AACAAACACA AAACACGAAT AGTGAGAGAC 
AAAATAGAGG AGAGAAAGAG AGCATGAGAG GGAGCGAGAC AGGCGACCAA CACAGAGGAG 
AGAAAACAAA AATAGGAAAA AAAAAAAAAA AAAGCAGTTC TTTATAATTT AATATTCTAT 
TTTAATAAAG GCGTTTATTA CCATATAAAT GTAGCAAAGA ACCTGGGCTA ATATGAA 
Seq ID NO: B4 Protein sequence
Protein Accession #: XP_044166.2

1 11 21 31 41 51
| | | | | |
FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 180
FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRVVG 240
LVVGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREEIET HIAVRTGKIL 300
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL 360
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480
VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIFS

Seq ID NO: B5 DNA sequence 

Nucleic Acid Accession #: NM_000909.1

Coding sequence: 209..1363

1 11 21 31 41 51

| | | | | |

CATTCCCACC CTTCCTTCTT TAATAAGCAG GAGCGAAAAA GACAAATTCC AAAGAGGATT 60 

GTTCAGTTCA AGGGAATGAA GAATTCAGAA TAATTTTGGT AAATGGATTC CAATATCGGG 120 

AATAAGAATA AGCTGAACAG TTGACCTGCT TTGAAGAAAC ATACTGTCCA TTTGTCTAAA 180 

ATAATCTATA ACAACCAAAC CAATCAAAAT GAATTCAACA TTATTTTCCC AGGTTGAAAA 240 

TCATTCAGTC CACTCTAATT TCTCAGAGAA GAATGCCCAG CTTCTGGCTT TTGAAAATGA 300 

TGATTGTCAT CTGCCCTTGG CCATGATATT TACCTTAGCT CTTGCTTATG GAGCTGTGAT 360

CATTCTTGGT GTCTCTGGAA ACCTGGCCTT GATCATAATC ATCTTGAAAC AAAAGGAGAT 420 

GAGAAATGTT ACCAACATCC TGATTGTGAA CCTTTCCTTC TCAGACTTGC TTGTTGCCAT 480 

CATGTGTCTC CCCTTTACAT TTGTCTACAC ATTAATGGAC CACTGGGTCT TTGGTGAGGC 540 

GATGTGTAAG TTGAATCCTT TTGTGCAATG TGTTTCAATC ACTGTGTCCA TTTTCTCTCT 600 

GGTTCTCATT GCTGTGGAAC GACATCAGCT GATAATCAAC CCTCGAGGGT GGAGACCAAA 660 

TAATAGACAT GCTTATGTAG GTATTGCTGT GATTTGGGTC CTTGCTGTGG CTTCTTCTTT 720 

GCCTTTCCTG ATCTACCAAG TAATGACTGA TGAGCCGTTC CAAAATGTAA CACTTGATGC 780 

GTACAAAGAC AAATACGTGT GCTTTGATCA ATTTCCATCG GACTCTCATA GGTTGTCTTA 840 

TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT TGTTTTATAT TTATTTGCTA 900 

CTTCAAGATA TATATACGCC TAAAAAGGAG AAACAACATG ATGGACAAGA TGAGAGACAA 960 

TAAGTACAGG TCCAGTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTGTGGTAGC 1020 

ATTTGCAGTC TGCTGGCTCC CTCTTACCAT CTTTAACACT GTGTTTGATT GGAATCATCA 1080 

GATCATTGCT ACCTGCAACC ACAATCTGTT ATTCCTGCTC TGCCACCTCA CAGCAATGAT 1140 

ATCCACTTGT GTCAACCCCA TATTTTATGG GTTCCTGAAC AAAAACTTCC AGAGAGACTT 1200 

GCAGTTCTTC TTCAACTTTT GTGATTTCCG GTCTCGGGAT GATGATTATG AAACAATAGC 1260 

CATGTCCACG ATGCACACAG ATGTTTCCAA AACTTCTTTG AAGCAAGCAA GCCCAGTCGC 1320 

ATTTAAAAAA ATCAACAACA ATGATGATAA TGAAAAAATC TGAAACTACT TATAGCCTAT 1380 

GGTCCCGGAT GACATCTGTT TAAAAACAAG CACAACCTGC AACATACTTT GATTACCTGT 1440

TCTCCCAAGG AATGGGGTTG AAATCATTTG AAAATGACTA AGATTTTCTT GTCTTGCTTT 1500 

TTACTGCTTT TGTTGTAGTT GTCATAATTA CATTTGGAAC AAAAGGTGTG GGCTTTGGGG 1560 

TCTTCTGGAA ATAGTTTTGA CCAGACATCT TTGAAGTGCT TTTTGTGAAT TTATGCATAT 1620 

AATATAAAGA CTTTTATACT GTACTTATTG GAATGAAATT TCTTTAAAGT ATTACGATGC 1680 

GCTGACTTCA GAAGTACCTG CCATCCAATA CGGTCATTAG ATTGGGTCAT CTTGATTAGA 1740 

TTAGATTAGA TTAGATTGTC AACAGATTGG GCCATCCTTA CTTTATGATA GGCATCATTT 1800 

TAGTGTGTTA CAATAGTAAC AGTATGCAAA AGCAGCATTC AGGAGCCGAA AGATAGTCTT 1860 

GAAGTCATTC AGAAGTGGTT TGAGGTTTCT GTTTTTTGGT GGTTTTTGTT TGTTTTTTTT 1920 

TTTTTTCACC TTAAGGGAGG CTTTCATTTC CTCCCGACTG ATTGTCACTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTGGGCACCC 2040 

ACAGGAATGA AGAGAGAAAG CAGCTCCCCA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAGCATTTT AGAGTAATTA ATTTAATAAA GTAAATTAGT ATTGCTGCAA ATAGCTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAGAGAT TTTCCATTTT TTTTACAGAC TGTTCAGTGT 2220 

TTGTCAAGCT TCTGGTCTAA TATGTACTCG AAAGACTTTC CGCTTACAAT TTGTAGAAAC 2280 

ACAAATATCG TTTTCCATAC AGCAGTGCCT ATATAGTGAC TGATTTTAAC TTTCAATGTC 2340 

CATCTTTCAA AGGAAGTAAC ACCAAGGTAC AATGTTAAAG GAATATTCAC TTTACCTAGC 2400 

AGGGAAAAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

GTGTGACTTG TGGCGTCTTA TAAATAATGC ACTGTAAAGA TTACTGAATA GTTGTGTCAT 2520 

GTTAATGTGC CTAATTTCAT GTATCTTGTA ATCATGATTG AGCCTCAGAA TCATTTGGAG 2580 

AAACTATATT TTAAAGAACA AGACATACTT CAATGTATTA TACAGATAAA GTATTACATG 2640 

TGTTTGATTT TAAAAGGGCG GACATTTTAT TAAAATCAAT ATTGTTTTTG CTTTTTCTGA 2700 
GGAGTCTCTT TCAGTTTCAT TTTTTCTCAT CCCATGACTT CCCTCCGATG GT 

Seq ID NO: B6 Protein sequence 
Protein Accession #: NP_000900.1 

1 11 21 31 41 51 

MNSTLFSQVE NHSVHSNFSE KNAQLLAFEN DDCHLPLAMI FTLALAYGAV IILGVSGNLA 60 
LIIIILKQKE MRNVTNILIV NLSFSDLLVA IMCLPFTFVY TLMDHWVFGE AMCKLNPFVQ 120 
CVSITVSIFS LVLIAVERHQ LIINPRGHRP NNRHAYVGIA VIWVLAVASS LPFLIYQVMT 180 
DEPFQNVTLD AYKDKYVCFD QFPSDSHRLS YTTLLLVLQY FGPLCFIFIC YFKIYIRLKR 240 
RNNMMDKMRD NKYRSSETKR INIMLLSIVV AFAVCWLPLT IFNTVFDWNH QIIATCNHNL 300 
LFLLCHLTAM ISTCVNPIFY GFLNKNFQRD LQFFPNFCDF RSRDDDYETI AMSTMHTDVS 360 
KTSLKQASPV AFKKINNNDD NEKI 

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Seq ID NO: B7 DNA sequence 

Nucleic Acid Accession #: NM_002590.2 

Coding sequence: 204..3416 

1 11 21 31 41 51 

GCTTCTGTAA GAGACGGAGA GGCGCAGAGT GAGGGCGGGT CCGCGCGTCC TCAGAGCCCG 60 
CTGGAGGCTC GGAGCTGCTA CCCGCAGACT TCTCCCGCAC AGGGCTCGCA AAGAGCGTGA 120 
TTCCGAGAGC CTGAGACTGA CGCCCGACCT GGAAACCAGA GAAGACTTCC TTAGCCTTTC 180 
GGATCGCACT TGAGGCTGGA GGCATGAGTC CTGTGAGGCG TTGGGGCAGC CCCTGCCTTT 240 
TCCCCTTGCA GCTCTTCAGC CTCTGCTGGG TGCTCTCAGT GGCCCAGAGC AAAACAGTCC 300 
GATACAGCAC CTTCGAGGAG GATGCCCCCG GCACGGTCAT CGGGACCCTG GCCGAGGACC 360 
TGCATATGAA AGTATCGGGT GACACAAGCT TCCGCCTGAT GAAGCAATTC AACAGCTCTC 420 
TGCTCCGGGT GCGCGAAGGC GACGGGCAGC TGACCGTCGG GGACGCCGGC CTGGACCGCG 480 
AGCGGCTGTG TGGCCAGGCC CCGCAGTGCG TGCTGGCCTT CGATGTGGTC AGCTTCTCGC 540 
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AGGAGCAGTT CCGGCTGGTG CACGTGGAGG TAGAGGTGAG GGACGTCAAC GACCACGCGC 600 
CGCGCTTCCC CAGGGCCCAG ATCCCGGTAG AGGTGTCCGA GGGTGCGGCA GTGGGCACGC 660 
GCATCCCCTT GGAGGTGCCG GTGGACGAGG ACGTGGGCGC CAACGGGCTG CAGACCGTGC 720 
GCCTGGCCGA GCCGCACAGC CCCTTTCGCG TGGAGCTGCA GACGCGAGCG GACGGCGCTC 780 
AGTGCGCAGA CCTGGTGCTG CTGCAGGAGC TGGACCGCGA GAGCCAGGCC GCCTACAGCC 840 
TGGAGCTGGT GGCCCAGGAC GGCGGCCGCC CGCCGOGCTC CGCCACGGCT GCCCTCAGCG 900 
TGCGCGTCCT GGATGCGAAT GACCACAGCC CGGCCTTCCC GCAGGGCGCC GTGGCCGAAG 960 
TGGAGCTGGC GGAAGACGCG CCCGTGGGCT CCCTGCTTCT CGACCTGGAC GCAGCCGACC 1020 
CCGACGAGGG ACCTAACGGC GACGTGGTGT TCGCATTTGG CGCCCGCACC COGCCGGAGG 1080 
CGCGCCGCCT CTTTCGGCTT GACCCGCGAT CAGGCCGCCT CACCCTGGCC GGGCCCGTGG 1140 
ACTACGAGCG TCAGGACACC TACGAGCTGG ACGTGCGGGC GCAGGACCGC GGACCCGGGC 1200 
CCCGCGCTGC CACCTGCAAG GTCATCGTGC GCATCCGAGA CGTCAATGAC AACGCACCCG 1260 
ACATCGCCAT CACCCCGCTG GCCGCCCCAG GCGCX3CCGGC AACCTCACCC TTCGCCGCTG 1320 
COGCCGCCGC CGCTGCACTC GGGGGAGCGG ACGCTAGCTC GCCGGCGGGA GCCGGGACGC 1380 
CGGAGGCTGG TGCCACTTCG CTGGTGCOGG AGGGGGCGGC GCGCGAGAGC CTGGTGGCCC 1440 
TGGTCAGCAC CTCGGACAGG GACTCGGGCG CCAACGGGCA AGTGCGCTGC GCCCTCTATG 1500 
GGCACGAGCA CTTCCGGCTG CAGCCGGCCT ACGCGGGCAG CTACCTGGTG GTGACGGCGG 1560 
CGTCGCTGGA CCGCGAACGC ATCGCCGAGT ACAACTTGAC GCTGGTGGCC GAGGATCGCG 1620 
GCGCGCCCCC GCTGCGCACA GTGCGGCCCT ACACGGTGCG TGTGGGCGAC GAGAACGACA 1680 
ACGCGCCGCT CTTCACGCGG CCGGTCTATG AGGTGTCGGT GCGCGAGAAC AACCCGCCAG 1740 
GCGCCTACCT GGCCACGGTG GCCGCCCGCG ACCGGGACCT GGGCCGCAAC GGCCAGGTCA 1800 
CCTACCGGCT GCTGGAGGCC GAGGTGGGCC GCGCCGGGGG CGCCGTGTCC ACTTATGTCT 1860 
CGGTGGACCC AGCTACCGGA GCCATCTACG CGCTGCGCAG CTTCGACTAT GAGACGCTGC 1920 
GCCAACTCGA CGTTCGCATC CAAGCTAGCG ACGGCGGCTC CCCTCAGCTT TCCAGCAGCG 1980 
CCCTAGTGCA AGTGCGCGTG CTGGACCAGA ACGACCATGC GCCAGTCCTG GTGCACCCGG 2040 
CGCCAGCCAA TGGCTCCCTA GAAGTGGCGG TGCCTGGGCG CACCGCAAAG GACACGGTTG 2100 
TGGCCCGTGT GCAGGCCCGG GATGCAGACG AGGGAGCCAA CGGGGAGCTG GCGTTCGAGC 2160 
TGCAGCAGCA GGAGCCGCGC GAAGCCTTCG CCATCGGCCG CCGCACGGGG GAGATACTGC 2220 
TCACCGGOGA CCTCTCGCAG GAGCCACCCG GTCGCGTGTT CAGGGCGCTC CTGGTCATAT 2280 
CCGACGGCGG CCGTCCCCCG CTCACCACCA CCGCAACTGT CAGCTTCGTG GTAACAGCAG 2340 
GGGGCGGGCG TGGGCCGGCT GCGCCTGCCA GTGCAGGAAG CCCGGAGCGT TCCCGCCCGC 2400 
CTGGCTCTCG GCTCGGGGTG TCCGGGTCGG TGCTGCAATG GGACACGCCG CTGATCGTCA 2460 
TCATCGTGCT GGCCGGGAGC TGCACGCTGC TGCTGGCCGC CATCATCGCC ATCGCCACCA 2520 
CCTGCAACCG CCGCAAGAAG GAGGTGCGCA AAGGGGGGGC CCTCCGGGAA GAGCGGCCCG 2580 
GGGCGGCGGG CGGCGGAGCC TCGGCTCCCG GCTCCCCGGA GGAGGCCGCC CGGGGAGCCG 2640 
GGCCCAGGCC CAACATGTTC GACGTGCTCA CCTTCCCTGG CACCGGCAAA GCGCCCTTTG 2700 
GCAGCCCCGC GGCGGACGCG CCTCCGCCTG CGGTCGCCGC GGCCGAAGTG CCGGGCTCAG 2760 
AGGGCGGCAG CGCCACTGGG GAAAGCGCCT GTCACTTCGA GGGGCAGCAG CGGCTCCGCG 2820 
GCGCGCACGC CGAGCCCTAC GGTGCCTCCC CGGGTTTTGG AAAGGAGCCG GCGCCCCCTG 2880 
TGGCGGTGTG GAAAGGACAC TCCTTCAACA CCATTTCTGG CAGAGAAGCA GAGAAGTTCA 2940 
GCGGCAAAGA CAGCGGTAAA GGGGACAGTG ATTTCAACGA CAGCGATTCC GACATCAGCG 3000 
GGGACGCTCT GAAAAAGGAT CTCATCAACC ACATGCAGAG TGGACTGTGG GCGTGCACCG 3060 
CTGAGTGTAA GATCCTGGGC CACTCTGACC GCTGCTGGAG CCCATCCTGC AGCGGGCCCA 3120 
ACGCACATCC ATCGCCTCAC CCACCAGCCC AGATGTCAAC CTTCTGTAAG AGCACGTCAC 3180 
TGCCTCGGGA TCCTCTGCGC AGGGACAATT ACTACCAGGC CCAGCTGCCC AAGACAGTGG 3240 
GGCTGCAGAG CGTCTATGAG AAAGTACTGC ACAGAGACTA TGACAGGACA GTCACTCTGC 3300 
TCTCCCCTCC CCGTCCAGGG AGGCTCCCAG ACCTGCAGGA GATTGGAGTA CCCCTCTACC 3360 
AGTCCCCTCC TGGCAGGTAC CTGTCCCCGA AGAAGGGAGC CAATGAAAAT GTGTAATCCC 3420 
ATGCTGCATG TCTTCACACA TATACAGGTC ACTCCGAGAA GCCCCTAAGT TATTGACCGG 3480 
TTTCAGTGTT GTATATATAA ATATGCAAGA TGTGCCTTAC AATGAAGTTG TTGGAAGCTA 3540 
TTTCCAATCA CATTGGTACT GTTTGGTATT TGCAAACAAA AATGTAGTTA ATGTAATTTT 3600 
TATGAAATGT GTGCAGTATT TAATTTTTCT TATGCTATTG ACTTTGATTT CAATTGCGGC 3660 
TTGCCATTTT CTTAGTGTTT TTAACCTGTA CATTGTGTAA TGTAATGTTT GTATATAATG 3720 
AAATTTTGTT ATATTTTTAT ATAATAAAAG CTAAAGTGGA AGTTATTGCC AAAGGAACTG 3780 
TCTGTAAGAC AAAAAACAAA ACATGTTGGA ATTACTTAAT TGAAATTTAT CTTTCACCTG 3840 
AAACAACCTA GTGTTTGAGA AATTTTGCCT TGCCAAGTAT AACTTGTATA TCTTGACTCT 3900 
GTGGTAGATT TCAAGTTCAA TGTTATTTAA TTACATTTGG TTTTCCGTAA ACCGTGTCAC 3960 
TTATAAGCAC AGTAATAAAA GATTTGTCAT GTGTTTGAAG AAAAAAAAAA AAAAAAAAAA 4020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



Seq ID NO: B8 Protein sequence 
Protein Accession #: NP_002581.2 

1 11 21 31 41 51 

MSPVRRWGSP CLFPLQLFSL CWVLSVAQSK TVRYSTFEED APGTVIGTLA EDLHMKVSGD 60 

TSFRLMKQFN SSLLRVREGD GQLTVGDAGL DRERLCGQAP QCVLAFDVVS FSQEQFRLVH 120 

VEVEVRDVND HAPRPPRAQI PVEVSEGAAV GTRIPLEVPV DEDVGANGLQ TVRLAEPHSP 180 

FRVELQTRAD GAQCADLVLL QELDRESQAA YSLELVAQDG GRPPRSATAA LSVRVLDAND 240 

HSPAFPQGAV AEVELAEDAP VGSLLLDLDA ADPDEGPNGD VVFAFGARTP PEARRLFRLD 300 

PRSGRLTLAG PVDYERQDTY ELDVRAQDRG PGPRAATCKV IVRIRDVNDN APDIAITPLA 360 

APGAPATSPF AAAAAAAALG GADASSPAGA GTPEAGATSL VPEGAARESL VALVSTSDRD 420 

SGANGQVRCA LYGHEHFRLQ PAYAGSYLVV TAASLDRERI AEYNLTLVAE DRGAPPLRTV 480 

RPYTVRVGDE NDNAPLPTRP VYEVSVRENN PPGAYLATVA ARDRDLGRNG QVTYRLLEAE 540 

VGRAGGAVST YVSVDPATGA IYALRSFDYE TLRQLDVRIQ ASDGGSPQLS SSALVQVRVL 600 

DQNDHAPVLV HPAPAKGSLE VAVPGRTAKD TVVARVQARD ADEGANGELA PELQQQEPRE 660 

AFAIGRRTGE ILLTGDLSQE PPGRVFRALL VISDGGRPPL TTTATVSFVV TAGGGRGPAA 720 

PASAGSPERS RPPGSRLGVS GSVLQWDTPL IVIIVLAGSC TLLLAAIIAI ATTCNRRKKE 780 

VRKGGALREE RPGAAGGGAS APGSPEEAAR GAGPRPNMFD VLTFPGTGKA PPGSPAADAP 840 

PPAVAAAEVP GSEGGSATGE SACHFEGQQR LRGAHAEPYG ASPGFGKEPA PPVAVWKGHS 900 

FNTISGREAE KFSGKDSGKG DSDPNDSDSD ISGDALKKDL INHMQSGLWA CTAECKILGH 960 

SDRCWSPSCS GPNAHPSPHP PAQMSTFCKS TSLPRDPLRR DNYYQAQLPK TVGLQSVYEK 1020 
VLHRDYDRTV TLLSPPRPGR LPDLQEIGVP LYQSPPGRYL SPKKGANENV 

Seq ID NO: B9 DNA sequence 

Nucleic Acid Accession #: AL121939.12 

Coding sequence: 185..1426 

1 11 21 31 41 51 

AGOGGAGTST CAGTGGCGAG CTCCGGGTGC TGTGGCCCGG CCTTGGCGGG GCGGCCTCCG 60 
GCTCAGGCTG GCTGAGAGGC TCCCAGCTGC AGCGTCCCCG CCCGCCTCCT CGGGAGCTCT 120 
GATCTCAGCT GACAGTGCCC TCGGGGACCA AACAAGCCTG GCAGGACAAA ATTAGAAGAT 180 
CAAAATGGAA AATATGCTGC TTTGGTTGAT ATTTTTCACC CCTGGGTGGA CCCTCATTGA 240 
TGGATCTGAA ATGGAATGGG ATTTTATGTG GCACTTGAGA AAGGTACCCC GGATTGTCAG 300 
TGAAAGGACT TTCCATCTCA CCAGCCCCGC ATTTGAGGCA GATGCTAAGA TGATGGTAAA 360 
TACAGTGTGT GGCATOGAAT GCCAGAAAGA ACTCCCAACT CCCAGCCTTT CTGAATTGGA 420 
GGATTATCTT TCCTATGAGA CTGTCTTTGA GAATGGCACC CGAACCTTAA CCAGGGTGAA 480 
AGTTCAAGAT TTGGTTCTTG AGCCGACTCA AAATATCACC ACAAAGGGAG TATCTGTTAG 540 
GAGAAAGAGA CAGGTGTATG GCACCGACAG CAGGTTCAGC ATCTTGGACA AAAGGTTCTT 600 
AACCAATTTC CCTTTCAGCA CAGCTGTGAA GCTTTCCACG GGCTGTAGTG GCATTCTCAT 660 
TTCCCCTCAG CATGTTCTAA CTGCTGCCCA CTGTGTTCAT GATGGAAAGG ACTATGTCAA 720 
AGGGAGTAAA AAGCTAAGGG TAGGGTTGTT GAAGATGAGG AATAAAAGTG GAGGCAAGAA 780 
ACGTCGAGGT TCTAAGAGGA GCAGGAGAGA AGCTAGTGGT GGTGACCAAA GAGAGGGTAC 840 
CAGAGAGCAT CTGCAGGAGA GAGCGAAGGG TGGGAGAAGA AGAAAAAAAT CTGGCCGGGG 900 
TCAGAGGATT GCCGAAGGGA GGCCTTCCTT TCAGTGGACC OGGGTCAAGA ATACCCACAT 960 
TCCGAAGGGC TGGGCACGAG GAGGCATGGG GGACGCTACC TTGGACTATG ACTATGCTCT 1020 
TCTGGAGCTG AAGCGTGCTC ACAAAAAGAA ATACATGGAA CTTGGAATCA GCCCAACGAT 1080 
CAAGAAAATG CCTGGTGGAA TGATCCACTT CTCAGGATTT GATAACGATA GGGCTGATCA 1140 
GTTGGTCTAT CGGTTTTGCA GTGTGTCYGA CGAATCCAAT GATCTCCTTT ACCAATAYTG 1200 
CGATGCTGAG TCGGGCTCCA CCGGTTCGGG GGTCTATCTG CGTCTGAAAG ATCCAGACAA 1260 
AAAGAATTGG AAGCGCAAAA TCATTGCGGT CTACTCAGGG CACCAGTGGG TGGATGTCCA 1320 
CGGGGTTCAG AAGGACTACA ACGTTGCTGT TCGCATCACT CCCCTAAAAT ACGCCCAGAT 1380 
TTGCCTCTGG ATTCACGGGA ACGATGCCAA TTGTGCTTAC GGCTAACAGA GACCTGAAAC 1440 
AGGGCGGTGT ATCATCTAAA TCACAGAGAA AACCAGCTCT GCTTACCGTA GTGAGATCAC 1500 
TTCATAGGTT ATGCCTGGAC TTGAACTCTG TCAATAGCAT TTCWACATTT TTCAAAATCA 1560 
GGAGATTTTC GTCCATTTAA AAAATGTATA GGTGCAGATA TTGAAACTAG GTGGGCACTT 1620 
CAATGCCAAG TATATACTCT TCTTTACATG GTGATGAGTT TCATTTGTAG AAAAATTTTG 1680 
TTGCCTTCTT AAAAATTAGA CACACTTTAA ACCTTCAAAC AGGTATTATA AATAACATGT 1740 
GACTCCTTAA TGGACTTATT CTCAGGGTCC TACTCTAAGA AGAATCTAAT AGGATGCTGG 1800 
TTGTGTATTA AATGTGAAAT YGCATAGATA AAGGTAGATG GTAAAGCAAT TAGTATCAGA 1860 
ATAGAGACAG AAAGTTACAA CACAGTTTGT ACTACTCTGA GATGGAYCCA TTCAGCTCAT 1920 
GCCCTCAATG TTTATATTGT GTTATCTGTT GGGTCTGGGA CATTTAGTTT AGTTTTTTTG 1980 
AAGAATTACA AATCAGAAGA AAAAGCAAGC ATTATAAACA AAACTAATAA CTGTTTTACT 2040 
GCTTTAAGAA ATAACAATTA CAATGTGTAT TATTTAAAAA TGGGAGAAAT AGTTTGTTCT 2100 
ATGAAATAAA CCTAGTTTAG AAATAGGGAA GCTGAGACAT TTTAAGATCT CAAGTTTTTA 2160 
TTTAACTAAT ACTCAAAATA TGGACTTTTC ATGTATGCAT AGGGAAGACA CTTCACAAAT 2220 
TATGAATGAT CATGTGTTGA AAGCCACATT ATTTTATGCT ATACATTCTA TGTATGAGGT 2280 
GCTACATTTT TAGGACAAAG AATTCTGTAA TCTTTTTCAA GAAAGAGTCT TTTTCTCCTT 2340 
GACAAAATCC AGCTTTTGTA TGAGGACTAT AGGGTGAATT CTCTGATTAG TAATTTTAGA 2400 
TATGTCCTTT CCTAAAAATG AATAAAATTT ATGAATATGA CTTAAAAAAA AAARWCGACG 2460 
CGGCCGCGAA TTTAGTAGTA GTAGTCGACC CGGGAATTCC GGACCGGTAC CTGCAGGGGT 2520 
ACCAGC 



Seq ID NO: B10 Protein sequence 
Protein Accession #: CAC35071.1 

1 11 21 31 41 51 

MENMLLWLIF PTPGWTLIDG SEMEWDFMWH LRKVPRIVSE RTFHLTSPAF EADAKMMVNT 60 
VCGIECQKEL PTPSLSELED YLSYETVFEN GTRTLTRVKV QDLVLEPTQN ITTKGVSVRR 120 
KRQVYGTDSR PSILDKRFLT NFPFSTAVKL STGCSGILIS PQHVLTAAHC VHDGKDYVKG 180 
SKKLRVGLLK MRNKSGGKKR RGSKRSRREA SGGDQREGTR EHLRERAKGG RRRKKSGRGQ 240 
RIAEGRPSFQ HTRVKNTHIP KGWARGGMGD ATLDYDYALL ELKRAHKKKY MELGISPTIK 300 
KMPGGMIHFS GFDNDRADQL VYRFCSVSDE SNDLLYQYCD AESGSTGSGV YLRLKDPDKK 360 
NWKRKIIAVY SGHQWVDVHG VQKDYNVAVR ITPLKYAQIC LWIHGNDANC AYG 



Seq ID NO: B11 DNA sequence 

Nucleic Acid Accession #: NM_002035.1 

Coding sequence: 108..1106 

1 11 21 31 41 51 

AGGCGCCCGC CCGCCGCGCG TGATTCTCGC CTCGCCGCAG CCCAGCCCTG CGCGCCTTGC 60 
CCGGCGGCCC CCGCCCGGCC GCTCCGGGCC CCTGGCCCCG CGGAGCGATG CTGCTGCTGG 120 
CTGCCGCCTT CCTCGTGGCC TTCGTGCTGC TGCTGTACAT GGTGTCTCCG CTCATCAGCC 180 
CCAAGCCCCT CGCCCTGCCC GGGGCGCATG TGGTGGTTAC AGGAGGTTCC AGTGGCATCG 240 
GGAAGTGCAT TGCTATCGAG TGCTATAAAC AAGGAGCTTT TATAACTCTG GTTGCACGAA 300 
ATGAGGATAA GCTGCTGCAG GCAAAGAAAG AAATTGAAAT GCACTCTATT AATGACAAAC 360 
AGGTGGTGCT TTGCATATCA GTTGATGTAT CTCAAGACTA TAACCAAGTA GAGAATGTCA 420 
TAAAACAAGC ACAGGAGAAA CTGGGTCCAG TGGACATGCT GGTAAATTGT GCAGGAATGG 480 
CAGTGTCAGG AAAATTTGAA GATCTTGAAG TTAGTACCTT TGAAAGGTTA ATGAGCATCA 540 
ATTACCTGGG CAGCGTGTAC CCCAGCCGGG CCGTGATCAC CACCATGAAG GAGCGCCGGG 600 
TGGGCAGGAT CGTGTTTGTG TCCTCCCAGG CAGGACAGTT GGGATTATTC GGTTTCACAG 660 
CCTACTCTGC ATCCAAGTTT GCCATAAGGG GATTGGCAGA AGCTTTGCAG ATGGAGGTGA 720 
AGCCATATAA TGTCTACATC ACAGTTGCTT ACCCACCAGA CACAGACACA CCTGGCTTTG 780 
CCGAAGAAAA CAGAACAAAG CCTTTGGAGA CTCGACTTAT TTCAGAGACC ACATCTGTGT 840 
GCAAACCAGA ACAGGTGGCC AAACAAATTG TTAAAGATGC CATACAAGGA AATTTCAACA 900 
GTTCCCTTGG CTCAGATGGG TACATGCTCT CGGCCCTGAC CTGTGGGATG GCTCCAGTAA 960 
CTTCTATTAC TGAGGGGCTC CAGCAGGTGG TCACCATGGG CCTTTTCCGC ACTATTGCTT 1020 
TGTTTTACCT TGGAAGTTTT GACAGCATAG TTCGTCGCTG CATGATGCAG AGAGAAAAAT 1080 
CTGAAAATGC AGACAAAACT GCCTAATCTT CTTACCCCTT GGAAGAAGAC TGTTTCCAAA 1140 
TAATTTGAAC AGCTTGCTGC TAAATGGGAC CCAATTTTTG GCCTATAGAC ACTTATGTAT 1200 
TGTTTTCGAA TACGTCAGAT TGGACCAGTG CTCTTCAGGA ATGTGGCTGC AAGCAAGGGG 1260 
CTAGAAGTTC ACCTCCTGAC AGTATTATTA ATACTATGCA AATATGGAAT AGGAGACCAT 1320 
TTGATTTTCT AGGCTTTGTG GTAGAGAGGT GAAGGTATGA GAATTAATAG CGTGTGAACA 1380 
AAGTAAAGAA CAGGATTCCA GAATGATCAT TAAATTTGTT TCTATTTATT CTTTTTTGCC 1440 
CCCCTAGAGA TTAAGTCCAG AAATGTACTT TCTGGCACAT AAAGAAATCT TGAGGACTTT 1500 
GTTTAAACCT TCCATAAAAA AACAATTTTC GGTTTCTCGG GTTCTCTCTC TCTGTCTCTC 1560 
TGTCTCTCTG TCTCTCTGTC TCTCTGTCTC TCTCTCTCTC TCTCTTTCTT TCTTTGTGTA 1620 
TTTTATTCAA GATGAGTTGG ACCCATTGCC AGTGAGTCTG AATGTCACTG ACAGCCCTGT 1680 
GTTGTGCTCA GGACTCACTC TGCTGCTGGT GGAAACTCAT GGCTTCTCTC TCTCTTTGAT 1740 
CCCATAAAGC TACGAGGGGG ACGGGAGAGG GCAGTGCAAT GGGAAGTAAA GAGATATTTT 1800 
CCAGTAGGAA AAGCAATGCT TTCTTGTCTT TAGACTCAAA TGCTTAGGGA ACGTTTCATT 1860 
TCTCATTCAT GGGGAAAGGC AGCCTCCTTA AATGTTTTCT GAAGAGCGGT AAAATCTAGA 1920 
AGCTTAAGAA TTTACAGTTC CTTCAATAAC CATGATGACC TGAAGTTCAC CTATCCCATT 1980 
TTAGCATCTA CTTGTTTTTC CCATCTCTTC CTTTCCAATT TTGCTTATAC TGCTGTAATA 2040 
TTTTTGTAAA AAAAAAAAAA AAGGAAAAAA AAGACCAGCT AAAATTTTCG ACTTGACTTT 2100 
TTAACTTAAC TCATGAATTA ATTAAAGCAA ATGAAAAAAT TAAAAAGTGT GACTTTTTCT 2160 
CGGAGCATAT ATGTAGCTTT TAGGAAAGGC TGATGATGGT ATAAAGTTTG CTCATTAAGA 2220 
AAAAAAGACA AGGCTGATTT TGAAGAGAGT TGCTTTTGAA ATAAAATGAT CA 



Seq ID NO: B12 Protein sequence 
Protein Accession #: NP_002026.1 

1 11 21 31 41 51 

MLLLAAAFLV AFVLLLYMVS PLISPKPLAL PGAHVVVTGG SSGIGKCIAI ECYKQGAFIT 
LVARNEDKLL QAKKEIEMHS INDKQVVLCI SVDVSQDYNQ VENVIKQAQE KLGPVDMLVN 
CAGMAVSGKF EDLEVSTFER LMSINYLGSV YPSRAVITTM KERRVGRIVF VSSQAGQLGL 
FGFTAYSASK FAIRGLAEAL QMEVKPYNVY ITVAYPPDTD TPGFAEENRT KPLETRLISE 
TTSVCKPEQV AKQIVKDAIQ GNFNSSLGSD GYMLSALTCG MAPVTSITEG LQQVVTMGLF 
RTIALFYLGS FDSIVRRCMM QREKSENADK TA 



Seq ID NO: B13 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

CTTGGGATGC ATTATATATT GCATTATATT TGCCGGTAAA ACTCGGTTAA TTTTTAAAAT 60 
CGGCAAAATA TTGGTGCCTT TCCCGAAATT TGGTCCCGGG CTCCCTTATA GGATAATTGG 120 
TTTGGATTGG TTAAGTCCAA TTATTAAATG CTGCGGTTTC AAATTCCCAG CTGGAAGGAC 180 
CACCCATTTA AAAACTTCAG AAGGCAGGAT CCTGCTCAAT TTATAAGGCT TTGGAAAATT 240 
CCAGGCATTG GTTTGACATA TTTCCAGAGC TCAAATCTGC AGTGTTCCAC ATGCACATAC 300 
AAGATCCAGA GTCTCATGTT AAAATCACTT ACATACCCAG AAAGACCACC ACTTTGCAGG 360 
TATAATATTG TACTTAAAGA CAGAGAGGAA GTGTTTCTTA ATCCAAACAC ATGTACACCA 420 
AAGAACACAT AAGATGCCTT CTTCCATCAA ATGCACTTGC TTGTGAATTA ATGGACTTGT 480 
AAATGAAACA ATGCAATCAG TCTTTTATAA TGCACTGTTC AATTTGAGAT TCAAGTATTT 540 
CTATTTCTTG GAAAAAATTT TAAGAATCAA AAATAAAGAA AATAAAAAGT GCATACAGTT 600 
AAACATTCCA AAAAAAAAAA AA 


Seq ID NO: B14 DNA sequence 

Nucleic Acid Accession #: XM_086767.1 

Coding sequence: 276.. 611 

1 11 21 31 41 51 

CTTGTTCATG GCATCTTTAG AAACAAACTG CAATTTTATT TCATTTCCTT GTCGTTCATA 60 

CAAAGATTAC AAGACTAGCT TATGTGTGGA CTGTGACTGT TTTAAGGAAA AATCATGTCC 120 

TCGGCTGGGT TATCAAGCCA AGCTATTTAA AGGTGTTTTA AAAGAAAGGA TGGAAGGAAG 180 

ACCTCTTAGG ACCACTGTGT TTTTGGATAC AAGTGGTACA TATCCATTCT GTAGCCTATT 240 

ATTTTGTTCT CAGTATAATT GTTCCAGATA AAACTATGAT GGATGGCTCG TTTTCATTTA 300 

AATTATTAAA TCAGCTTGGA ATGATTGAAG AGCCAAGGCT TTATGAAAAG AACAAACCAT 360 

TTTATAAACT TCAAGAAGTC AAGATTCTTG CTCAATTTTA TAATGACTTT GTAAATATTT 420 

CAAGCATTGG TTTGACATAT TTCCAGAGCT CAAATCTGCA GTGTTCCACA TGCACATACA 480 

AGATCCAGAG ACTCATGTTA AAATCACTTA CATACCCAGA AAGACCACCA CTTTGCAGGT 540 

ATAATATTGT ACTTAAAGAC AGAGAGGAAG TGTTTCTTAA TCCAAACACA TGTACACCAA 600 

AGAACACATA AGATGCCTTC TTCCATCAAA TGCACTTGCT TGTGAATTAA TGGACTTGTA 660 

AATGAAACAA TGCAATCAGT CTTTTATAAT GCACTGTTCA ATTTGAGATT CAAGTATTTC 720 

TATTTCTTGG AAAAAATTTT AAGAATCAAA AATAAAGAAA ATAAAAAATG CATACAGTTA 780 
AACATTCC 

Seq ID NO: B15 Protein sequence 
Protein Accession #: XP_086767.1 

1 11 21 31 41 51 

MMDGSPSFKL LNQLGMIEEP RLYEKNKPFY KLQEVKILAQ FYNDFVNISS IGLTYFQSSN 60 
LQCSTCTYKI QRLMLKSLTY PERPPLCRYN IVLKDREEVF LNPNTCTPKN T 



Seq ID NO: B16 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

TTTTTTTTTT TTTTTTTTTT TTAAATGCAA AGGAGCAACA AAATTTATTG ACTGAATTAA 60 
ACACAACAGT AAAATGGCAG TGTTGTAATT TCATTTTCAG ATGTTTGAAT GGAACAAGAA 120 
AAGTGCTATT AGCCCAAGCT TCTTACATTC ATTAAAAGAG TGACTATCAA AAACAGCAAC 180 
ATGCACAATG GTACATATGC ACAAAATGGA ATTATATCAA CAAATATACA AAATACCCAA 240 
AATAAAATAT TTACAGGTTT AAAAATATAA ACATTGATTC CTCTATCCCA TTAAACCATT 300 
GGAGTGGAGA AAGGAGGAAA GACCCTATTG CTATTTAGAA TCCTTTTTAA AACAAGTTTT 360 
TAAAACATAG AATTAGTTCT AGGAGACAAT TTTTGATGTT TTTCAGGGGT TTAACATTCT 420 
ATTATAAAAA TAACATCTAT AAACCTACTA ACAATTTTCC TCCTGTGCAC AAAAATAATA 480 
CTGCCAAAAC CTGTCCTCAA AGACATGCCT GACTTTCAGG AAAGCTAATT ATGGAAATGG 540 
AGTTTCTCGT TTGGGTTATC TTTGTTACTA TTTTCAAATA ACCAGCAACT CCCTATATTA 600 
CACTGAGATA CTTTATATAA ATAACGTGGG CGAAACCTGA AGTTCACAAT GAGCCTGCTA 660 
GGTAGCTGGT GTCAAGTACA AATGATAGGA ATTGACTTTG CCAACAACAT CAAAAGCATT 720 
TTCCCTGATA TTCCTGATAG ACCTACCACT ATCAGATCCT CCATATTCAA TAAGATTTAT 780 
CCTGGAAGCA ATGAAAATGT TAAATATTAC TTTGCTAGAG TTTCTCCTCC TTTATTTAGA 840 
AATAAATGTG TAGTGGGGAC CAGTGGTTGT AATGTAGATA CTTGAGAAGT TTCATTGATT 900 
CCTTCAGACC CATGCCG 



Seq ID NO: B17 DNA sequence 

Nucleic Acid Accession #: NM_022898.1 

Coding sequence: 268..2739 

1 11 21 31 41 51 

I I | I I I 

TTTTCTTGCT TTTCTTCCCT TTTTTTTCTT TTTGCAAACA AAACAAAAAA CAGCATAGAA 
GAAAGAGCAA AATAAAGAAG AAGAAGAGGA GGAAGAGAGG GAAAGAGAGG AAGGGAAAAA 
AAACAGCAAC CCGGGCAGAG GAGGAGGTGC GGCGGCGGCG GCGGCGGCGG CAGCGGCGGC 
AGCGGCGCGG CGGCGGCTCG GACCCCCTCC CCCGGCTCCC CCCATCAGTG CAGCTCTCCG 
GGCGATGCCA GAATAGATGC CGGGGCAATG TCCCGCCGCA AACAGGGCAA CCCGCAGCAC 
TTGTCCCAGA GGGAGCTCAT CACCCCAGAG GCTGACCATG TGGAGGCCGC CATCCTCGAA 
GAAGACGAGG GTCTGGAGAT AGAGGAGCCA AGTGGCCTGG GGCTGATGGT GGGTGGCCCC 
GACCCTGACC TGCTCACCTG TGGCCAGTGT CAAATGAACT TCCCCTTGGG GGACATCCTG 
GTTTTTATAG AGCACAAAAG GAAGCAGTGT GGCGGCAGCT TGGGTGCCTG CTATGACAAG 
GCCCTGGACA AGGACAGCCC GCCACCCTCC TCACGCTCCG AGCTCAGGAA AGTGTCCGAG 
CCGGTGGAGA TCGGGATCCA AGTCACCCCC GACGAAGATG ACCACCTGCT CTCACCCACG 
AAAGGCATCT GTCCCAAGCA GGAGAACATT GCAGGTAAAG ATGAGCCTTC CAGCTACATT 
TGCACAACAT GCAAGCAGCC CTTCAACAGC GCGTGGTTCC TGCTGCAGCA CGCGCAGAAC 
ACGCACGGCT TCCGCATCTA CCTGGAGCCC GGGCCGGCCA GCAGCTCGCT CACGCCGCGG 
CTCACCATCC CGCCGCOGCT CGGGCOGGAG GCCGTGGCGC AGTCCCCGCT CATGAATTTC 
CTGGGCGACA GCAACCCCTT CAACCTGCTG CGCATGACGG GCCCCATCCT GCGGGACCAC 
CCGGGCTTCG GCGAGGGCCG CCTGCCGGGC ACGCCGCCTC TCTTCAGTCC CCCGCCGCGC 
CACCACCTGG ACCCGCACCG CCTCAGTGCC GAGGAGATGG GGCTCGTCGC CCAGCACCCC 
AGTGCCTTCG ACCGAGTCAT GCGCCTGAAC CCCATGGCCA TCGACTCGCC CGCCATGGAC 
TTCTCGCGGC GGCTCCGCGA GCTGGCGGGC AACAGCTCCA CGCCGCCGCC CGTGTCCCCG 
GGCCGCGGCA ACCCTATGCA CCGGCTCCTG AACCCCTTCC AGCCCAGCCC CAAGTCCCCG 
TTCCTGAGCA CGCCGCCGCT GCCGCCCATG CCCCCTGGCG GCACGCCGCC CCCGCAGCCG 
CCAGCCAAGA GCAAGTCGTG CGAGTTCTGC GGCAAGACCT TCAAGTTCCA GAGCAATCTC 
ATCGTGCACC GGCGCAGTCA CACGGGCGAG AAGCCCTACA AGTGCCAGCT GTGCGACCAC 
GCGTGCTCGC AGGCCAGCAA GCTCAAGCGC CACATGAAGA CGCACATGCA CAAGGCCGGC 
TCGCTGGCCG GCCGCTCCGA CGACGGGCTC TCGGCCGCCA GCTCCCCCGA GCCCGGCACC 
AGCGAGCTGG CGGGCGAGGG CCTCAAGGCG GCCGACGGTG ACTTCCGCCA CCACGAGAGC 
GACCCGTCGC TGGGCCACGA GCCGGAGGAG GAGGACGAGG AGGAGGAGGA GGAGGAGGAG 
GAGCTGCTAC TGGAGAACGA GAGCCGGCCC GAGTCGAGCT TCAGCATGGA CTCGGAGCTG 
AGCCGCAACC GCGAGAACGG CGGTGGTGGG GTGCCCGGGG TCCCGGGCGC GGGGGGCGGC 
GCGGCCAAGG CGCTGGCTGA OGAGAAGGCG CTGGTGCTGG GCAAGGTCAT GGAGAACGTG 
GGCCTAGGCG CACTGCCGCA GTACGGCGAG CTCCTGGCCG ACAAGCAGAA GCGCGGCGCC 
TTCCTGAAGC GTGCGGCGGG CGGCGGGGAC GCGGGCGACG AOGACGACGC GGGCGGCTGC 
GGGGACGCGG GCGCGGGCGG CGCGGTCAAC GGGCGCGGGG GCGGCTTCGC GCCAGGCACC 
GAGCCCTTCC CCGGGCTCTT CCCGCGCAAG CCCGCGCCGC TGCCCAGCCC CGGGCTCAAC 
AGCGCCGCCA AGCGCATCAA GGTGGAGAAG GACCTGGAGC TGCCGCCCGC CGCGCTCATC 
CCGTCCGAGA ACGTGTACTC GCAGTGGCTG GTGGGCTAOG CGGCGTCGCG GCACTTCATG 
AAGGACCCCT TOCTGGGCTT CACGGACGCA CGACAGTCGC CCTTCGCCAC GTCGTCCGAG 
CACTCGTCCG AGAACGGCAG CCTGCGCTTC TCCACGCCGC CCGGGGACCT GCTGGACGGC 
GGCCTCTCGG GCCGCAGCGG CACGGCCAGC GGAGGCAGCA CCCCGCACCT GGGCGGCCCG 
GGCCCCGGGC GGCCCAGCTC CAAGGAGGGC CGCCGCAGCG ACACGTGCGA GTACTGCGGC 
AAGGTGTTCA AGAACTGCAG CAACTTGACG GTGCACCGGC GGAGCCACAC CGGCGAGCGG 
CCTTACAAGT GCGAGCTGTG CAACTACGCG TGCGCGCAGA GCAGCAAGCT CACGCGCCAC 
ATGAAGACGC ACGGGCAGAT CGGCAAGGAG GTGTACCGCT GCGACATCTG CCAGATGCCC 
TTCAGCGTCT ACAGCACCCT GGAGAAACAC ATGAAAAAGT GGCACGGCGA GCACTTGCTG 
ACTAACGACG TCAAAATCGA GCAGGCCGAG AGGAGCTAAG CGCGCGGGCC CCGGOGCCCC 
GCACCTGTAC AGTGGAACCG TTGCCAACCG AGAGAATGCT GACCTGACTT GCCTCCGTGT 
CACCGCCACC CCGCACCCCG CGTGTCCCCG GGGCCCAGGG GAGGCGGCAC TCCAACCTAA 
CCTGTGTCTG CGAAGTCCTA TGGAAACCCG AGGGTTGATT AAGGCAGTAC AAATTGTGGA 
GCCTTTTAAC TGTGCAATAA TTTCTGTATT TATTGGGTTT TGTAATTTTT TTGGCATGTG 
CAGGTACTTT TTATTATTAT TTTTTCTGTT TGAATTCCTT TAAGAGATTT TGTTGGGTAT 
CCATCCCTTC TTTGTTTTTT TTTTAACCCG GTAGTAGCCT GAGCAATGAC TCGCAAGCAA 
TGTTAGAGGG GAAGCATATC TTTTAAATTA TAATTTGGGG GGAGGGGTGG TGCTGCTTTT 
TTGAAATTTA AGCTAAGCAT GTGTAATTTC TTGTGAAGAA GCCAACACTC AAATGACTTT 
TAAAGTTGTT TACTTTTTCA TTCCTTCCTT TTTTTTGTCC TGAAATAAAA AGTGGCATGC 
AGTTTTTTTT TTAATTATTT TTTAATTTTT TTTTTGGTTT TTGTTTTTGG GGTGGGGGGT 
GTGGATGTAC AGCGGATAAC AATCTTTCAA GTCGTAGCAC TTTGTTTCAG AACTGGAATG 

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GAGATGTAGC ACTCATGTCG TCCCGAGTCA AGCGGCCTTT TCTGTGTTGA TTTCGGCTTT 
CATATTACAT AAGGGAAACC TTGAGTGGTG GTGCTGGGGG AGGCACCCCA CAGACTCAGC 
GCCGCCAGAG ATAGGGTTTT TGGAGGGCTC CTCTGGGAAA TGGCCCGACA S^TTCTCAG 
GTTGTGCATG ACCAGCAGAT ACTATCCTGT TGGTGTGCCC TGGGGTGCCA TCGCTGCTAT 
TCGCTGTAGA TTAGGCTACA TAAAATGGGC TGAGGGTACC TTTTTGGGGA GATWGGTGG 
CCTGCAGTGA CACAGAAAGG AAGAAACTAG CGGTGTTCTT TTAGGCGTTT TCTGGCTTGA 
CGGCTTCTCT CTTTTTTTAA ATCACCCCCA CCACATAAAT CTCAAATCCT ATGTTGCTAC 
AAGGGGTCAT CCATCATTTC CCAAGCAGAC GAATGCCCTA ATTAATTGAA GTTAGTGTTC 
TCTCATTTAA TGCACACTGA TGATATTGTA GGGATGGGTG GGGTGGGGAT CTTGCAAATT 
TCTATTCTCT TTTACTGAAA AAGCAGGGGA TGAGTTCCAT CAGAAGGTGC CCAGCGCTAC 
TTCCCAGGTT TTTATTTTTT TTTTCCTATC TCATTAGGTT GGAAGGTACT AAATATTGAA 
CTGTTAAGAT TAGACATTTG AATTCTGTTG ACCCGCACTT TAAAGCTTTT GTTTGCATTT 
AAATTAAATG GCTTCTAAAC AAGAAATTGC AGCATATTCT TCTCTTTGGC CCAGAGGTGG 
GTTAAACTGT AAGGGACAGC TGAGATTGAG TGTCAGTATT GCTAAGCGTG GCATTCACAA 
TACTGGCACT ATAAAGAACA AAATAAAATA ATAATTTATA GGACAGTTTT TCTACTGCCA 
TTCAATTTGA TGTGAGTGCC TTGAAAACTG ATCTTCCTAT TTGAGTCTCT TGAGACAAAT 
GCAAAACTTT TTTTTTGAAA TGAAAAGACT TTTTAAAAAA GTAAAACAAG AAAAGTACAT 
TCTTTAGAAA CTAACAAAGC CACATTTACT TTAAGTAAAA AAAAAAAAAA TTCTGGTTGA 
AGATAGAGGA TATGAAATGC CATAAGACCC AATCAAATGA AGAAATAAAC CCAGCACAAC 
CTTGGACATC CATTAGCTGA ATTATCCTCA GCCCCTTTTG TTTTTGGGAC AACGCTGCTT 
AGATATGGAG TGGAGGTGAT TTACTGCTGA ATTAAAACTC AAGTGACACA AGTTACAAGT 
TGATATCGTT GAATCAAAAG CAAAACAAAA ACAATTCAGG AACAACGGCT AATTTTTTCT 
AAAGTTAAAT TTAGTGCACT CTGTCTTAAA AATACGTTTA CAGTATTGGG TACATACAAG 
GGTAAAAAAA AAATTGTGTG TATGTGTGTT GGAGCGATCT T TTTTTT TCA AAGTTTGCTT 
AATAGGTTAT ACAAAAATGC CACAGTGGCC GCGTGTATAT TGTTTTCTTT TGGTGACGGG 
GTTTTAGTAT ATATTATATA TATTAAAATT TCTTGATTAC TGTAAAAGTG GACCAGTATT 
TGTAATAATC GAGAATGCCT GGGCATTTTA CAAAACAAGA AAAAAAATAC CCTTTTCTTT 
TCCTTGAAAA TGTTGCAGTA AAATTTAAAT GGTGGGTCTA TAAATTTGTT CTTGTTACAG 
TAACTGTAAA GTCGGAGTTT TAGTAAATTT TTTTCTGCCT TGGGTGTTGA ATTTTTATTT 
CAAAAAAAAT GTATAGAAAC TTGTATTTGG GGATTCAAAG GGGATTGCTA CACCATGTAG 
AAAAAGTATG TAGAAAAAAA GTGCTTAATA TTGTTATTGC TTTGCAGAAA AAAAAAAAAT 
CACATTTCTG ACCTGTACTT ATTTTTCTCT TCCCGCCTCC CTCTGGAATG GATATATTGG 
TTGGTTCATA TGATGTAGGC ACTTGCTGTA TTTTTACTGG AGCTCGTAAT TTTTTAACTG 
TAAGCTTGTC CTTTTAAAGG GATTTAATGT ACCTTTTTGT TAGTGAATTT GGAAATAAAA 
AGAAAAAAAA AACAAAAACA AACAGGCTGC CATAATATAT TTTTTTAATT TGGCAGGATA 
AAATATTGCA AAAAAAACAC ATTTGTATGT TAAGTCCTAT TGTACAGGAG AAAAAGGGTT 
GTTTGACAAC CTTTGAGAAA AAGAAACAAA AGGAAGTAGT TAAATGCTTT GGTTCACAAA 
TCATTTAGTT GTATATATTT TTTGTOGGAA TTGGCCTACA CAGAGAACCG TTCGTGTTGG 
GCTTCTCTCT GAACGCCCCG AACCTTGCAT CAAGGCTCCT TGGTGTGGCC ACAGCAGACC 
AGATGGGAAA TTATTTGTGT TGAGTGGAAA AAAATCAGTT TTTGTAAAGA TGTCAGTAAC 
ATTCCACATC GTCCTCCCTT TCTCTAAGAG GCCATCTCTA AGATGTCAGA TGTAGAGGAG 
AGAGAGCGAG AGAACATCTT CCTTCTCTAC CATCACTCCT GTGGCGGTCA CCACCACCAC 
CTCTCCCGCC CTTACCAGCA GAAAGCAATG CAAACTGAGC TGCTTTAGTC CTTGAGAAAT 
TGTGAAACAA ACACAAATAT CATAAAAGGA GCTGGTGATT CAGCTGGGTC CAGGTGAAGT 
GACCTGCTGT TGAGACCGGT ACAAATTGGA TTTCAGGAAG GAGACTCCAT CACAGCCAGG 
ACCTTTCGTG CCATGGAGAG TGTTGGCCTC TTGTCTTTCT TCCCTGCTTT GCTGCTTTGC 
TCTCTGAAAC CTACATTCCG TCAGTTTCCG AATGCGAGGG CCTGGGATGA ATTTGGTGCC 
TTTCCATATC TCGTTCTCTC TCCTTCCCCT GCGTTTCCTC TCCATCCTTC ATCCTCCATT 
GGTCCTTTTT TTTTCTTTCA TTTTTTATTT AATTTCTTTT CTTCCTGTCT GTTCCTCCCC 
TAATCCTCTA TTTTATTTTT ATTTTTTGTA AAGCCAAGTA GCTTTAAGAT AAAGTGGTGG 
TCTTTTGGAT GAGGGAATAA TGCATTTTTA AATAAAATAC CAATATCAGG AAGCCATTTT 
TTATTTCAGG AAATGTAAGA AACCATTATT TCAGGTTATG AAAGTATAAC CAAGCATCCT 
TTTGGGCAAT TCCTTACCAA ATGCAGAAGC TTTTCTGTTC GATGCACTCT TTCCTCCTTG 
CCACTTACCT TTGCAAAGTT AAAAAAAAGG GGGGAGGGAA TGGGAGAGAA AGCTGAGATT 
TCAGTTTCCT ACTGCAGTTT CCTACCTGCA GATCCAGGGG CTGCTGTTGC CTTTGGATGC 
CCCACTGAGG TCCTAGAGTG CCTCCAGGGT GGTCTTCCTG TAGTCATAAC AGCTAGCCAG 
TGCTCACCAG CTTACCAGAT TGCCAGGACT AAGCCATCCC AAAGCACAAG CATTGTGTGT 
CTCTGTGACT GCAGAGAAGA GAGAATTTTG CTTCTGTTTT GTGTTTAAAA AACCAACACG 
GAAGCAGATG ATCCCGAGAG AGAGGCCTCT AGCATGGGTG ACCCAGCCGA CCTCAGGCOG 
GTTTCCGCAC TGCCACAACT TTGTTCAAAG TTGCCCCCAA TTGGAACCTG CCACTTGGCA 
TTAGAGGGTC TTTCATGGGG AGAGAAGGAG ACTGAATTAC TCTAAGCAAA ATGTGAAAAG 
TAAGGAAATC AGCCTTTCAT CCCGGTCCTA AGTAACCGTC AGCCGAAGGT CTCGTGGAAC 
ACAGGCAAAC CCGTGATTTT GGTGCTCCTT GTAACTCAGC CCTGCAAAGC AAAGTCCCAT 
TGATTTAAGT TGTTTGCATT TGTACTGGCA AGGCAAAATA TTTTTATTAC CTTTTCTATT 
ACTTATTGTA TGAGCTTTTG TTGTTTACTT GGAGGTTTTG TCTTTTACTA CAAGTTTGGA 
ACTATTTATT ATTGCTTGGT ATTTGTGCTC TGTTTAAGAA ACAGGCACTT TTTTTTATTA 
TGGATAAAAT GTTGAGATGA CAGGAGGTCA TTTCAATATG GCTTAGTAAA ATATTTATTG 
TTCCTTTATT CTCTGTACAA GATTTTGGGC CTCTTTTTTT CCTTAATGTC ACAATGTTGA 
GTTCAGCATG TGTCTGCCAT TTCATTTGTA CGCTTGTTCA AAACCAAGTT TGTTCTGGTT 
TCAAGTTATA AAAATAAATT GGACATTTAA CTTGATCTCC AAA 
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Protein Accession ft: NP_075049.l 



1 11 21 31 41 51 

MSRRKQGNPQ HLSQRELITP EADHVEAAIL EEDEGLEIEE PSGLGLMVGG PDPDLLTCGQ 60 

CQMNFPLGDI LVFIEHKRKQ CGGSLGACYD KALDKDSPPP SSRSELRKVS EPVEIGIQVT 120 

PDEDDHLLSP TKGICPKQEN IAGKDEPSSY ICTTCKQPFM SAWFLLQHAQ NTHGFRIYLE 180 

PGPASSSLTP RLTIPPPLGP EAVAQSPIWN FLGDSNPFNL LRMTGPILRD HPGFGEGRLP 240 

GTPPLPSPPP RKHLDPHRLS AEEMGLVAQH PSAFDRVMRL NPMAIDSPAM DPSRKLRELA 300 

GNSSTPPPVS PGRGNPMHRL LNPFQPSPKS PFLSTPPLPP MPPGGTPPPQ PPAKSKSCEF 360 

CGKTFKFQSN LIVHRRSHTG EKPYKCQLCD HACSQASKLK RHMKTHMHKA GSLAGRSDDG 420 

LSAASSPEPG TSELAGEGLK AADGDFRHHS SDPSLGHEPE EEDEEEEEEE EELLLENESR 480 
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PESSFSMDSE LSRNRENGGG GVPGVPGAGG GAAKALADEK ALVLGKVMEN VGLGALPQYG S40 

ELLADKQKRG AFLKRAAGGG DAGDDDDAGG CGDAGAGGAV NGRGGGFAPG TEPFPGLFPR 600 

KPAPLPSPGL NSAAKRIKVE KDLELPPAAL IPSENVYSQW LVGYAASRHF MKDPFLGFTD 660 

ARQSPFATSS EHSSENGSLR FSTPPGDLLD GGLSGRSGTA SGGSTPHLGG PGPGRPSSKE 720 

GRRSDTCEYC GKVFKNCSNL TVHRRSHTGE RPYKCELCKY ACAQSSKLTR HMKTHGQIGK 780 
EVYRCDICQM PFSVYSTLEK HMKKWHGEHL LTNDVKIEQA ERS 

Seq ID NO: B19 DNA sequence 

Nucleic Acid Accession #: NM_000399.2 

Coding sequence: 339.. 1769 

1 11 21 31 41 51 

| | I I I I 

TAACTGAGCG AGGAGCAATT GATTAATAGC TCGGCGAGGG GACTCACTGA CTGTTATAAT 60 

AACACTACAC CAGCAACTCC TGGCTTCCCA GCAGCCGGAA CACAGACAGG AGAGAGTCAG 120 

TGGCAAATAG ACATTTTTCT TATTTCTTAA AAAACAGCAA CTTGTTTGCT ACTTTTATTT 180 

CTGTTGATTT TTTTTTCTTG GTGTGTGTGG TGGTTGTTTT TAAGTGTGGA GGGCAAAAGG 240 

AGATACCATC CCAGGCTCAG TCCAACCCCT CTCCAAAACG GCTTTTCTGA CACTCCAGGT 300 

AGCGAGGGAG TTGGGTCTCC AGGTTGTGCG AGGAGCAAAT GATGACCGCC AAGGCCGTAG 360 

ACAAAATCCC AGTAACTCTC AGTGGTTTTG TGCACCAGCT GTCTGACAAC ATCTACCCGG 420 

TGGAGGACCT CGCCGCCACG TCGGTGACCA TCTTTCCCAA TGCCGAACTG GGAGGCCCCT 480 

TTGACCAGAT GAACGGAGTG GCCGGAGATG GCATGATCAA CATTGACATG ACTGGAGAGA 540 

AGAGGTCGTT GGATCTCCCA TATCCCAGCA GCTTTGCTCC CGTCTCTGCA CCTAGAAACC 600 

AGACCTTCAC TTACATGGGC AAGTTCTCCA TTGACCCTCA GTACCCTGGT GCCAGCTGCT 660 

ACCCAGAAGG CATAATCAAT ATTGTGAGTG CAGGCATCTT GCAAGGGGTC ACTTCCCCAG 720 

CTTCAACCAC AGCCTCATCC AGCGTCACCT CTGCCTCCCC CAACCCACTG GCCACAGGAC 780 

CCCTGGGTGT GTGCACCATG TCCCAGACCC AGCCTGACCT GGACCACCTG TACTCTCCGC 840 

CACCGCCTCC TCCTCCTTAT TCTGGCTGTG CAGGAGACCT CTACCAGGAC CCTTCTGCGT 900 

TCCTGTCAGC AGCCACCACC TCCACCTCTT CCTCTCTGGC CTACCCACCA CCTCCTTCCT 960 

ATCCATCCCC CAAGCCAGCC ACGGACCCAG GTCTCTTCCC AATGATCCCA GACTATCCTG 1020 

GATTCTTTCC ATCTCAGTGC CAGAGAGACC TACATGGTAC AGCTGGCCCA GACCGTAAGC 1080 

CCTTTCCCTG CCCACTGGAC ACCCTGCGGG TGCCCCCTCC ACTCACTCCA CTCTCTACAA 1140 

TCCGTAACTT TACCCTGGGG GGCCCCAGTG CTGGGGTGAC CGGACCAGGG GCCAGTGGAG 1200 

GCAGCGAGGG ACCCCGGCTG CCTGGTAGCA GCTCAGCAGC AGCAGCAGCC GCCGCCGCCG 1260 

CCGCCTATAA CCCACACCAC CTGCCACTGC GGCCCATTCT GAGGCCTCGC AAGTACCCCA 1320 

ACAGACCCAG CAAGACGCCG GTGCACGAGA GGCCCTACCC GTGCCCAGCA GAAGGCTGCG 1380 

ACCGGCGGTT CTCCCGCTCT GACGAGCTGA CACGGCACAT CCGAATCCAC ACTGGGCATA 1440 

AGCCCTTCCA GTGTCGGATC TGCATGCGCA ACTTCAGCCG CAGTGACCAC CTCACCACCC 1500 

ATATCCGCAC CCACACCGGT GAGAAGCCCT TCGCCTGTGA CTACTGTGGC CGAAAGTTTG 1560 

CCCGGAGTGA TGAGAGGAAG CGCCACACCA AGATCCACCT GAGACAGAAA GAGCGGAAAA 1620 

GCAGTGCCCC CTCTGCATCG GTGCCAGCCC CCTCTACAGC CTCCTGCTCT GGGGGGGTGC 1680 

AGCCTGGGGG TACCCTGTGC AGCAGTAACA GCAGCAGTCT TGGCGGAGGG CCGCTCGCCC 1740 

CTTGCTCCTC TCGGACCCGG ACACCTTGAG ATGAGACTCA GGCTGATACA CCAGCTCCCA 1800 

AAGGTCCCGG AGGCCCTTTG TCCACTGGAG CTGCACAACA AACACTACCA CCCTTTCCTG 1860 

TCCCTCTCTC CCTTTGTTGG GCAAAGGGCT TTGGTGGAGC TAGCACTGCC CCCTTTCCAC 1920 

CTAGAAGCAG GTTCTTCCTA AAACTTAGCC CATTCTAGTC TCTCTTAGGT GAGTTGACTA 1980 

TCAACCCAAG GCAAAGGGGA GGCTCAGAAG GAGGTGGTGT GGGGATCCCC TGGCCAAGAG 2040 

GGCTGAGGTC TGACCCTGCT TTAAAGGGTT GTTTGACTAG GTTTTGCTAC CCCACTTCCC 2100 

CTTATTTTGA CCCATCACAG GTTTTTGACC CTGGATGTCA GAGTTGATCT AAGACGTTTT 2160 

CTACAATAGG TTGGGAGATG CTGATCCCTT CAAGTGGGGA CAGCAAAAAG ACAAGCAAAA 2220 

CTGATGTGCA CTTTATGGCT TGGGACTGAT TTGGGGGACA TTGTACAGTG AGTGAAGTAT 2280 

AGCCTTTATG CCACACTCTG TGGCCCTAAA ATGGTGAATC AGAGCATATC TAGTTGTCTC 2340 

AACCCTTGAA GCAATATGTA TTATATACTC AGAGAACAGA AGTGCAATGT GATGGGAGGA 2400 

ACGTAGCAAT ATCTGCTCCT TTTCGAGTTG TTTGAGAAAT GTAGGCTATT TTTTCAGTGT 2460 

ATATCCACTC AGATTTTGTG TATTTTTGAT GTACCCACAC TGTTCTCTAA ATTCTGAATC 2520 

TTTGGGAAAA AATGTAAAGC ATTTATGATC TCAGAGGTTA ACTTATTTAA GGGGGATGTA 2580 

CATATTCTCT GAAACTAGGA TGCATGCAAT TGTGTTGGAA GTGTCCTTGG TCGCCTTGTG 2640 

TGATGTAGAC AAATGTTACA AGGCTGCATG TAAATGGGTT GCCTTATTAT GGAGAAAAAA 2700 

ATCACTCCCT GAGTTTAGTA TGGCTGTATA TTTATGCCTA TTAATATTTG GAATTTTTTT 2760 

TAGAAAGTAT ATTTTTGTAT GCTTTGTTTT GTGACTTAAA AGTGTTACCT TTGTAGTCAA 2820 

ATTTCAGATA AGAATGTACA TAATGTTACC GGAGCTGATT TGTTTGGTCA TTAGCTCTTA 2880 

ATAGTTGTGA AAAAATAAAT CTATTCTAAC GCAAAACCAC TAACTGAAGT TCAGATATAA 2940 
TGGATGGTTT GTGACTATAG TGTAAATAAA TACTTTTCAA CAAT 

Seq ID NO: B20 Protein sequence 
Protein Accession #: NP_000390.2 

1 11 21 31 41 51 

| | | | | | 

MMTAKAVDKI PVTLSGFVHQ LSDNIYPVED IAATSVTIFP NAELGGPFDQ MNGVAGDGMI 60 
NIDMTGEKRS LDLPYPSSFA PVSAPRNQTF TYMGKFSIDP QYPGASCYPE GIINIVSAGI 120 
LQGVTSPAST TASSSVTSAS PNPLATGPLG VCTMSQTQPD LDHLYSPPPP PPPYSGCAGD 180 
LYQDPSAFLS AATTSTSSSL AYPPPPSYPS PKPATDPGLF PMIPDYPGFF PSQCQRDLHG 240 
TAGPDRKPFP CPLDTLRVPP PLTPLSTIRN FTLGGPSAGV TGPGASGGSE GPRLPGSSSA 300 
AAAAAAAAAY NPHHLPLRPI LRPRKYPNRP SKTPVHERPY PCPAEGCDRR FSRSDELTRH 360 
IRIHTGHKPF QCRICMRNFS RSDHLTTHIR THTGEKPFAC DYCGRKFARS DERKRHTKIH 420 
LRQKERKSSA PSASVPAPST ASCSGGVQPG GTLCSSNSSS LGGGPLAPCS SRTRTP 

Seq ID NO: B21 DNA sequence 

Nucleic Acid Accession #: NM_004962.2 

Coding sequence: 457..1893 

1 11 21 31 41 51 
| | | | | | 

CACACACGGG CGCACGCACA CGGCAGCCGG GCCAGGGACG ACCCTGTCAG CTGCAGCCCC 60 
AGAGGTCCGG GGCGCGCAGC CGGGTCCCCT CGAGGGCGCA GCCGGCCGCC CCGCCCCGCC 120 
CCTCGAAGCA GCCGGGCCGG GCGCGCAGTG GGCTACAAAC TTTCGCAGCG CGAGTCCGCC 180 
AAGGCAGCGC GCCGACTCGG GCTCGGCTCG GCTCTGCGCT GCTCCGGACG GCTGTGACCG 240 
CTGGCCGGGG GCTCGGGCCG CCGGTACCCA CGGACCGCGC GCCCGGGTGC CTGCTCCGCT 300 
AAGCCCCTCG CCCCGCGCGG ACCTCGGTAT CCAGOGCCCT GCTGCCCGGG CTCTCCCCGC 360 
GCGCCCTACT GCCGCGAGGT CAGTCCGCAG CCTCCGGTGC GCCAGCGCTC GCCTTCCTCC 420 
TCCTGGACTT CGGCCCTTTG CCGCCCTCAC CACGCCATGG CTCATGTCCC CGCTCGGACC 480 
AGCCCGGGAC CCGGGCCCCA GCTGCTGCTG CTGCTGCTGC CGTTGTTTCT GCTGTTGCTC 540 
CGGGATGTGG CCGGCAGCCA CAGGGCCCCC GCCTGGTCCG CACTGCCCGC GGCCGCCGAC 600 
GGCCTGCAGG GGGACAGGGA TCTCCAGCGG CACCCTGGGG ACGCGGCOGC CACGTTGGGC 660 
CCCAGCGCCC AGGACATGGT CGCTGTCCAC ATGCACAGGC TCTATGAGAA GTACAGCCGG 720 
CAGGGCGCGC GGCCGGGAGG GGGCAACACG GTCCGCAGCT TCAGGGCCAG GCTGGAAGTG 780 
GTCGACCAGA AGGCCGTGTA TTTCTTCAAC CTGACTTCCA TGCAAGACTC GGAAATGATC 840 
CTTACGGCCA CTTTCCACTT CTACTCAGAG CCGCCTCGGT GGCCTCGAGC GCTCGAGGTG 900 
CTATGCAAGC CGCGGGCCAA GAACGCTTCA GGCCGCCCGC TGCCCCTGGG CCCGCCCACA 960 
CGCCAGCACC TGCTCTTCCG CAGCCTCTCQ CAGAACACGG CCACACAGGG GCTACTCCGC 1020 
GGGGCCATGG CCCTGGCGCC CCCACCGCGC GGCCTGTGGC AGGCCAAGGA CATCTCCCCC 1080 
ATCGTCAAGG CGGCCCGCCG GGATGGCGAG CTGCTCCTCT CGGCCCAGCT GGATTCTGAG 1140 
GAGAGGGACC CGGGGGTGCC CCGGCCCAGC CCCTATGCGC CCTACATCCT AGTCTATGCC 1200 
AACGATCTGG CCATCTCGGA GCCCAACAGC GTGGCAGTGA CGCTGCAGAG ATACGACCCC 1260 
TTCCCTGCCG GAGACCCCGA GCCCCGCGCA GCCCCCAACA ACTCAGCGGA CCCCCGCGTG 1320 
CGCCGAGCCG CGCAGGCCAC TGGGCCCCTC CAGGACAACG AGCTGCCGGG GCTGGATGAG 1380 
AGGCCGCCGC GCGCCCACGC ACAGCACTTC CACAAGCACC AGCTGTGGCC CAGCCCCTTC 1440 
CGGGCGCTGA AACCCCGGCC AGGGCGCAAA GACCGCAGGA AGAAGGGCCA GGAGGTGTTC 1500 
ATGGCCGCCT CGCAGGTGCT GGACTTTGAC GAGAAGACGA TGCAGAAAGC CCGGAGGAAG 1560 
CAGTGGGATG AGCCGAGGGT GTGCTCCCGG AGGTACCTGA AGGTGGACTT CGCAGACATC 1620 
GGCTGGAATG AATGGATAAT CTCACCGAAA TCTTTTGATG CCTACTACTG CGCGGGAGCA 1680 
TGTGAGTTCC CCATGCCTAA GATCGTTCGT CCATCCAACC ATGCCACCAT CCAGAGCATT 1740 
GTCAGGGCTG TGGGCATCAT CCCTGGCATC CCAGAGCCCT GCTGTGTTCC CGATAAGATG 1800 
AACTCCCTTG GGGTCCTCTT CCTGGATGAG AATCGGAATG TGGTTCTGAA GGTGTACCCC 1860 
AACATGTCCG TGGACACCTG TGCCTGCCGG TGAGACCACT CCAGGGTGGA AAGAAGCCAC 1920 
GCCCAGCAGA GCTGCCTTCT CGGAGCCTTC TGCAACCAGG ACTTGTGGTG CAGCTGCAGA 1980 
CACAGAGCAC AGCTCATGGG CAACATCACT GGGGCCCAGA GAGAGCTGTC CGCCAGTGCA 2040 
TCATTAGGGG GTCTTTCATT GCTAGTGACT AGCCCCTTAA ATGCCAGCCT GAGTACCTGA 2100 
AGGAATCTGG GAATTAGCCC TGGCCTGAAA GTGGCCCATC ATTCATACCC ACTGTTCTGA 2160 
AGGCTTGAAA ACAAAACATA TCCACAACAT TGGCTTGATG TGATCATCAT CTCATAACTG 2220 
AGCAAGAAGA CTATGCAAAT CTTAGGGCGC TCGCTCCCTG CACACGGAAA GAACTCTGTT 2280 
TAAATGCTCA GTTCAGAACA CTTTGGGCCA CATAGTGATT TTGGAAAACA GGATAATCGT 2340 
GGTGTAAATG AGTGTTTCCT TTCAAAGTCC ACTGCAGAGC TTTTATCCAT ATGGTATGCA 2400 
CATGTAGCCA ATATTGGTTT CTTTTTCTTA ATATATATAT TTTATTTTAA AACAACAAAA 2460 
AGGGAGGGCG TTGACACCAT TCCCCACAGA GATAGTCATG CTGAGTGTGG GTTGTTTAAA 2520 
CATGCATATT GAAATAACAC ATATAGTAAC GTGGGAATAC TAAAAAATAA CCAAGATTTT 2580 
ATATTTTTGT AAATTATACT TTCTATACTG TAGATTGTGT ATGTTATGTG TTTTTATGGA 2640 
AAGCTAATAA ATTAAAGGTA CAGTGGTATC TTGA 



Seq ID NO: B22 Protein sequence 
Protein Accession #: NP_004953.1 

1 11 21 31 41 51 
| | | | | | 

MAHVPARTSP GPGPQLLLLL LPLFLLLLRD VAGSHRAPAW SALPAAADGL QGDRDLQRHP 60 
GDAAATLGPS AQDMVAVHMH RLYEKYSRQG ARPGGGNTVR SFRARLEVVD QKAVYFFNLT 120 
SMQDSEMILT ATFHFYSEPP RWPRALEVLC KPRAKNASGR PLPLGPPTRQ HLLFRSLSQN 180 
TATQGLLRGA MALAPPPRGL WQAKDISPIV KAARRDGELL LSAQLDSEER DPGVPRPSPY 240 
APYILVYAND LAISEPNSVA VTLQRYDPFP AGDPEPRAAP NNSADPRVRR AAQATGPLQD 300 
NELPGLDERP PRAHAQHFHK HQLWPSPFRA LKPRPGRKDR RKKGQEVFMA ASQVLDFDEK 360 
TMQKARRKQW DEPRVCSRRY LKVDFADIGW NEWIISPKSF DAYYCAGACE FPMPKIVRPS 420 
NHATIQSIVR AVGIIPGIPE PCCVPDKMNS LGVLFLDENR NVVLKVYPNM SVDTCACR 



Seq ID NO: B23 DNA sequence 
Nucleic Acid Accession #: AK026322. 

1 11 21 31 41 51 

| | | | | | 

ATTCTTTAAA TACTTAAACA TGGTTATAAC AGCTGTGTTT TAAGCTCCTT GTCTGTTAAT 60 

TCCATCATCT GTCATTTTTG CATCTGTTTT TATTGACTAA CTTTCTTCCT CATCATGGTC 120 

ACATATCCTT GCTTCTTCCC ATATTTAGTA AAGTTTAGTT GGATGCTGGA TTGTTTTAAT 180 

TTAGCATTTT GGAGTGACTG CTTTTGCACT TGCCTTTAAA AGTATTGGAC TTCGTTTTGA 240 

TAGTTGAATT ACTTGCAAAT CAGCTTGATC ATTCTGAGAC TTGTTTTTAA CTTTTGTCGA 300 

AAAGGCTATG CTACTTCAAG TATAATAAAA CCTAGTTTTA GTTTTATCCT ATAACTAAGG 360 

CATGTGTTCA ATTTGAATGT CTCCAACCCT GTGTGAACTC TGAAAATTGT TCAGCTCCCA 420 

ATTTCGCAGT AATTTTTTTT GTTCAGCCTT GCAGTCTCAT CCTACTCAAG TGTGGCTCTG 480 

TATCCAACAA CAGTCTTGGA GATCTCATGA AGATTTCTGA AACTTTTGCT CTGCAGGATA 540 

CCTTCCTTTG TGGTTATGTG CCCTGAAAAC TCCAGCCTCC GTGTCTTCAA TTCAGTGAAA 600 

CTACTACTCT GCTTGGATTC CCCTCTTATA CAATGGTCTA CCAAGTGACT GCAAACAGAA 660 

ATCTAAGGGA ATTTCAGGAG TCACCTCATT TATTTTCCTT TTTTTCAGGG ATTATAGTAG 720 

TACTATACCA CCTGTCAGCC AATGTGTGAG AATGGTGGTT TCATACATTT TCTCCAATTC 780 

TTTCTTGTTA CAGGAGCCTC CATTACTTTT CAAACAGTAA TGCAGTTGCA GTTGCCTCTC 840 

CTTTTCTTCA TTATGTGTTT ATCTCTGGCA GTTTGAGCCA AGAGAGGGCA CGGAGAAGTA 900 

CAATGACTAG AGAGCACTTC TGTTGAGGCT CATTCATTGA CCCCTACCCC AGTGCTTTAT 960 

GAATGTGTGC TGCAGATGTC ATACAGCATC ACAGCTTCTT CTCTAATTTA TGAGCCATAA 1020 

, iT . i , 1 „ rrrrn , XGTATTTTTA TTGTATTACC TGCTTGGTTG CAAGAGGATG ATGAGGAGGA 1080 

CTACTATAGG ATAAATTTGT TTTTATAGAG CAATTTCTCA TGGGTCACGA GGGATAGTAC 1140 

WO 03/042661 



TCCATTTTCC TCACTGATAT CGGCTTGATC ATCTGAGTCA GTGGGCTCAT TGTGGAAACA 1200 
TATGGAATCC TTTATGCTTT TTCTCAGGCT GCTTCTGTTA CATGAAATAA AGCCAGAGTT 1260 
GATTGTGAAA AAAAAAAAAA AAAA 



Seq ID NO: B24 DNA sequence 

Nucleic Acid Accession #: cat cluster 

1 11 21 31 41 51 

| | | | | | 

TGTATACATT CCTTTCAAAT AAAGACCTTG AGAAAACAGC AGAGCCAAGT GAAGATCACC 
TAAAGAACTT TGTGGCTAAT TTATACTTCA TGTAGTAGCA GTGGGTACTG GGCAGGGTCT 
CTTTCCCACT CTGATGATTT GTGCTCTTAT TTTTCCTAGA TTTACCTCAT CTAGGGCATA 
TTCTTTTCCC TCTTCCTCTT TACCTTTCCT GGTCCTGATC CCTCTGTACT CAGTTCCCCT 
AAATTATTGG ACTACAAACT AATATACTAG AAAAGCATAC ACTTATTTTA TTTGAATGCA 
GAAATGCTAT CTATCAGTAT ATATACATAA GAATGTATAT TACAGTATAT CTATATATAC 
TTAACACTGT AACTTTCAGT ATTCCCCAGT TAGCGTACCT AACTCTCCTG TGGGTTATGT 
TAATTCTTAT TAGACTACTA GAGAAAAACC AACTGGCAGT TTGCTAAGCA TATCTACTGG 
TGTTGTTTCT GCGCCCTCTT TTGGCTAATT GATGTAATTA TACTGGCTCT AAAGATTTAC 
TGCCCCATAA GTAAATAGTA TAGCCACATT CTGAACATAT CAAAAGTACA AACTTAGGAG 
GAGTGTATGT ACAAAAATGT AAAATTTTAT GAAAATGAAC ATGTTTTTAT GATGTTATTT 
CTAGTTCATA AGAATGTGAT GACTGCTTTG CTTCATTTAT GTACGTTCCC ATTATATTCT 
TGCTGTCAAT CAATCACAAA TTTATATCAG ATTAGGATAA ACTAAGCCAT TTTATGTATT 
TTATTTTAAA CCTTATTTTG GCAGAGTAAT TCCTTAGAAT TGGAAAAGCT GTTACTTTGA 
AATTACCAAT TTATTACAAA ACATAGAAAT GTATTGTAGC TACAAAGACA ACCAAGCATT 
TTCTGTGTTT TAATGAATAT CTAAAAAACT ACATTTAGTT TATTTTACTC AGTTTTGAAA 
TGATTTTTTT ACTGGCTCTA TTGCCTTAAA ATAACTAAGA GATTAATGAT TCTTTGTATA 
ATTTTCCTTT TCTTTGTTCT TTTTTTTACA TTTCGCAGAG TTATATCTAT AGTTTTAGTA 
ACAATTTCTT ATGTATTCTG GATAACTGAA AACAACTAAA GGTGTTGGGC ATTAGAAAAT 
AATTGTGAGC AGTAAGATTA CTGATGTAAT ATGTATGTTG GACTGAAGTA TTTCTTTATA 
AACATTCTAT TTGATTTTAA GCAAAATGTA TGTTAAAGCA TGTTTTTACA TCAGTAAAGT 
CATTTGTCGA CCTTCTGGAA ATGAAAGGTT TTTACCTAGA TACTGTAAGT TACACCTCCT 
TAACAATCAT ATTTGTCATT GTTGTTTTCT GCAAACAAAA ATGTTTATGG GCTTCATGTA 
GGCTTAAGAT TGTAGGCAAA AATGGACTGA GTTCAGGACC CTTCAAGCAG TAGGCATTCA 
GTTACAGAGC AGTTGGTACT TTGTAACCCA GACTTACAGT TTAAAAATAT CAAGTTAGCT 
GATGTTTCAT TATAATAAAA ATACTATTTT GCTTAAGAGT TGTATTACAA ATATTTGTGC 
TTAACATTAG AAATAGCTGT TTTAAATTGT AGTTAACATA TTAACTTTTT CAGAAAAAAA 
GCATGGTTTA TTTTTAATAA TGAAATAGAG AACATAATAC GTAATGTTCA GTATAACAGC 
TGAGTTAAAA CATCTGCCAG GATTAACATC AGTGCGTTTT TGCCAATGCA TAGAGGCATT 
TTTCTCTAAG TATGATGGCT AATGATAACT ATTCTTTGTT ACACATTCAA GTCACTCCCA 
TACAAGTAAC TAGTGGGTGA TATGTTTCAC TCCAAAGGGT GTATTAATTC TGAATGCTAA 
TCATGAAGAC TTAAGTTAGG ACAACACTTC AAACCAGGAA GTGTGAACTG ATTTA 

Seq ID NO: B25 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..1144 




GGAGGAGGCG 
CCCGGGCTGC 
GCCGGGGCCG 
AGCAGCGGCG 
CGGGCCGGAG 
GCCGAGGGCT 
GGCGGCGGCG 
AGCGGCAGCG 
AAGGCGCTGC 
GCGCTGGACG 
TCCAAGATCG 
GAGGAGATGC 
CTGCCCAGCT 
TACGAGCAGG 
GAGAAGTGCG 
CCTT 



GGATGCACCT 
GCGCCTCCAC 
TGTCCCTGGC 
GCTTCGAGCC 
GCGGCGGCAG 
TAGTAGGTTC 
CCCTTTGCCT 
GCGAGCAGAG 
TAGCCGACCC 
GCTCCAATGC 
GCGGCGGGGG 
GCGGCAGCAG 
GGCTTAACAT 
AGCTGCGCGC 
CCACGCTGCT 
GGCGCCTAGT 
CGGCGGCTGC 
CAGCCGGCTA 
CCCTGTTTAA 



21 
I 

CGGTGCAGCG 
CTCCAAGCGC 
GCCGCCGCCT 
GGCTGACCCC 
CGCGGGAAGT 
AGCCGGCGTT 
CAAGTACGGC 
CCCCGACGAC 
GCGGGCCTCC 
CCACCTCCAC 
TAGCAGCAGC 
CAGCAGCAGC 
CAATGCCCGA 
GGTGATCCCC 
GCTCGCCAAG 
CGCCTACCTC 
AGCGGCAGCA 
CCCGTTCAGC 
CAGCGTCTCC 



31 
I 

GCCGCCGGCG 
TTGGAAGCGG 
GGGGAACGCC 
GAGGGGGCAG 
GGCGGCGGCG 
GGGGGCGACC 
GAAAGCGCGA 
GACAGCGACG 
CCGGGAGCGG 
GGCGGCGCCA 
GGTAGCAGTG 
AGCAGCAGCA 
GAGCGCCGGC 
TACGCGCACA 
AACTACATCC 
AACCAGGGCC 
GCTGCTGCCC 
GCCGGACTGC 
TCCAGCCTCT 



41 
I 

AGGACGACCT 
CTTTCCGCTC 
CGGOGTCCTC 
GGCTGCTGTT 
GCGGCGGGGG 
CTAGCCTAAG 
GCCGGGGCTC 
GTCGCTGCGA 
GAGGTGGTGG 
GCGTCCCCCC 
GCGGCGGTGG 
GCAAGAAATC 
GGATGCACGA 
GCCCCTCGGT 
TCATGCAGGC 
AGGCCATCTC 
TGCACCCGGC 
CCCOGGCTGC 
GCAAACAGTG 



51 
I 

CTTCCTGCAC 
CACGCCCCCG 
CTCCTCGTCG 
GCCGCCGCCT 
GGTGGGTGTC 
CAGCCTGCCG 
GGTGGCCGAG 
GCTCGTGCTG 
CGCGAAGGCA 
GGGGGGCCTG 
CGGTAGCGGT 
CAAAGAGCAA 
CCTGAACGAC 
GCGAAAGCTC 
GCAGGCCCTG 
GGCTGCCTCC 
GCTCGGCGCC 
CTCCTGCCCG 
CACGGAGAAG 



Seq ID NO: B26 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 
| | | | | | 

MERGMHLGAA AAGEDDLFLH KSLSASTSKR LEAAPRSTPP GMDLSLAPPP RERPASSSSS 60 
PLGCFEPADP EGAGLLLPPP GGGGGGSAGS GGGGGGGVGV PGLLVGSAGV GGDPSLSSLP 120 
AGAALCLKYG ESASRGSVAE SSGGEQSPDD DSDGRCELVL RAGVADPRAS PGAGGGGAKA 180 
AEGCSNAHLH GGASVPPGGL GGGGGGGSSS GSSGGGGGSG SGSGGSSSSS SSSSKKSKEQ 240 
KALRLNINAR ERRRMHDLND ALDELRAVIP YAHSPSVRKL SKIATLLLAK NYILMQAQAL 300 
EEMRRLVAYL NQGQAISAAS LPSSAAAAAA AAALHPALGA YEQAAGYPFS AGLPPAASCP 360 
EKCALFNSVS SSLCKQCTEK P 



Seq ID NO: B27 DNA sequence 

Nucleic Acid Accession #: cat cluster 



PCT/US02/36810 



1 11 21 31 41 51 

| | | | | | 

TTTTTTTTTT TTTTTTTTAA ATGTTTGATC AAAATATTTT AATAAAGATT CTTTCTGACA 
TAGATACACA TACAAATGGT CGTACATAGC TGTCATAGTC TGATTGACCT ATTTAATATA 
TATATCATTC TTTACACATC CAAAACCCGC CAACAGATCC ATCACAGCTC CCAACTCACC 
ATCCAACCTG ACAAACTGAA TTTGTATTAT CTGCAAGGAG TGGAAAATAG CAGGACTCCA 
TTTTTAAAAA AGATTTTCTT GATTTTCATA GGGCGGAAAG GCAGTCAAAC AGCCATGCAA 
AACTAAAACT GAAAGCTCAC TTTGGGTAAA TAGCTTCTTG TTCTTCCTTA GTTTTCTTTC 
TTTTTAAATT TTATTTTTTT AGAAAAATAA CAAAGGATTT CACACCATAG GCAAATCAAA 
CCAGTCTTTT AACTTAAATA ATTCTCCACA GTTAAAATAA CATATATGTA CATATATATT 
AAAAGCAATT AAATTAGACC TTTTAAAAAT GCACAGCACA GCCTGGAAAA ATATTTGCTT 
AGCATGTTCT TACGTATAGC AACTATTGCT GTGATGTTTT TCCCTTTTGG AATGTAAAGG 
AGTCCCCTTT CAAAAAAGAG ATCAATTCAT TCATCAATTA AGAATACACC TTTCCTGTAA 
TTTTTGGACT GAAGCAATTT ATTAAAGCTC AATTTAAATA CAGGGATGAT GCAACTGAAA 
ATATCCAGGT GACCTTTCAT AAACCTAAGC AGCTCAGATA CATCAATATT TCTCTTCATA 
CTTGTTGGCA AATAAACCTT TAAACACTTG GCACACAGCA TAAGTAATCT ATAAAACAAT 
TTTAGAGGCA TTAAAAAAAT CTGCACATAA GACCCATGAC TTTAACACAT GATAAATACT 
GTTGATGTGG AAGGGTCATT GAAGAATAAC AAATAAATAC CATGAATTGT TAATACATCA 
TTGCAGAGTA GAAAGTAACA AGGTGCACAT AAATATTTTT AAATGCAATT CTTTCAGCCA 
CAGTCAGTTT TTTTATATCA CTCTCGCCAA AACTTTGAGC ATTTTCACAG GATTAAAGTT 
CAGAGACAAT AAAAAATACA AGTCTTTCAT AGTAACATGT TCTCTCTCTC TCGCTCTGGG 
TTTTATTTCC CCCTCACTTA AGTCAACATT TCAAGTTTTT CTCCTGGCTC AGAATCAAAA 
TTTATTTTCA AGTGCCCTTT CTGATTTGTC TGAATGAATA TTCCATCTCT CATGCTACCG 
ATCCGCT 

Seq ID NO: B28 DNA sequence 
Nucleic Acid Accession #: NM_002581.2 
Coding sequence: 368..5251 

1 11 21 31 41 51 

| | | | | | 

GGGGGAGGGA ATTCAGCGGA TCAGTCTTAA GAGGAGCTTT TTTTTGGAGC GAGAAATCAT 60 

ATAAAATAAA ATGAAATAAA ACAAGGAGGA AGGCAACCAG CTGTTAGGGG GAAAATAAGG 120 

CAGATAAAGG AGCGGGGAGA GAAATTAATT GCCAACCAGG AGGAGTTGGG CTGTATTTTT 180 

CAAAGGTGGG GAGAGTGGAG CACACACCTT GAGGAGGAAA GCGAGAAAGA AAAGAAAAAA 240 

GCAAGTGAAG GGGGGCTCGC CCAAGAAGGG TGAAGAAGCG AAGAAAGTCG AGGCGCCGAG 300 

GCTCCCAAAG CTGGCAGCTC CGGGTGGCGG TGCAGGGGCG AAGGGGGGGG CGGGGGGAAC 360 

GTCGGACATG CGGCTCTGGA GTTGGGTGCT GCACCTGGGG CTGCTGAGCG CCGCGCTGGG 420 

CTGCGGGCTG GCCGAGCGTC CCCGCCGGGC CCGGAGAGAC CCGCGGGCCG GCCGACCCCC 480 

GCGCCCCGCC GCCGGCCCGG CCACCTGCGC CACCCGCGGC CCGCGGCCGC CGCGCCTCGC 540 

CGCCGCCGCC GCCGCCGCCG GGCGTGCCTG GGAAGCCGTG CGCGTCCCCC GGCGGCGGCA 600 

GCAGCGGGAG GCGAGGGGCG CCACCGAGGA GCCGAGCCCG CCGAGCCGGG CGCTCTATTT 660 

CAGCGGGCGA GGCGAGCAGC TGCGAGTCCT CCGGGCCGAC CTCGAGCTGC CCCGGGACGC 720 

GTTCACGCTG CAAGTGTGGC TGCGAGCGGA GGGGGGCCAG AGGTCTCCGG CAGTGATCAC 780 

AGGGCTGTAT GACAAATGTT CTTATATCTC ACGTGACCGA GGATGGGTCG TGGGCATTCA 840 

CACCATCAGT GACCAAGACA ACAAAGACCC ACGCTACTTT TTCTCCTTGA AGACAGACCG 900 

AGCCCGGCAA GTGACCACCA TCAATGCCCA CCGCAGCTAC CTCCCAGGCC AGTGGGTATA 960 

CCTAGCTGCC ACCTATGATG GGCAGTTCAT GAAGCTCTAT GTGAATGGTG CCCAGGTGGC 1020 

CACCTCTGGG GAACAAGTGG GTGGCATATT CAGCCCACTG ACCCAGAAGT GCAAAGTGCT 1080 

CATGTTAGGG GGCAGTGCCC TGAATCACAA CTACCGGGGC TACATCGAGC ACTTCAGTCT 1140 

GTGGAAGGTG GCCAGGACTC AGCGGGAGAT ACTGTCTGAC ATGGAAACCC ATGGCGCCCA 1200 

CACTGCTCTA CCTCAGCTCC TCCTCCAGGA GAACTGGGAC AATGTGAAGC ATGCCTGGTC 1260 

CCCCATGAAG GATGGCAGCA GCCCCAAAGT GGAATTCAGC AATGCCCACG GCTTTCTGCT 1320 

GGACACGAGT CTGGAGCCTC CTCTGTGCGG ACAGACATTG TGTGACAACA CAGAGGTCAT 1380 

TGCCAGCTAC AATCAGCTCT CAAGTTTCCG CCAGCCCAAG GTGGTGCGCT ACCGGGTGGT 1440 

CAACCTCTAT GAAGATGATC ATAAGAACCC GACGGTGACG CGCGAGCAGG TGGACTTCCA 1500 

GCACCATCAG CTGGCTGAGG CCTTCAAGCA ATACAACATC TCCTGGGAGC TGGACGTGCT 1560 

GGAGGTGAGC AACTCCTCCC TTCGCCGCCG CCTCATCCTG GCCAACTGTG ACATCAGCAA 1620 

GATTGGGGAT GAGAACTGTG ACCCCGAGTG CAACCACACG CTGACGGGCC ACGACGGCGG 1680 

GGATTGCCGC CACCTGCGCC ACCCTGCCTT CGTGAAGAAG CAGCACAACG GGGTGTGTGA 1740 

CATGGACTGC AACTATGAAC GGTTCAACTT TGATGGTGGA GAGTGCTGTG ACCCTGAAAT 1800 

CACCAATGTC ACTCAGACTT GCTTTGACCC CGACTCTCCA CACAGAGCCT ACTTGGATGT 1860 

TAATGAGCTG AAGAACATTC TTAAATTGGA TGGATCAACA CATCTCAATA TTTTCTTTGC 1920 

AAAATCCTCA GAGGAGGAGT TGGCAGGAGT AGCAACTTGG CCATGGGACA AGGAGGCCCT 1980 

GATGCACTTA GGTGGCATTG TCTTGAACCC ATCTTTCTAT GGCATGCCTG GGCACACCCA 2040 

CACCATGATC CATGAGATTG GTCACAGCCT GGGCCTCTAT CACGTCTTCC GAGGCATCTC 2100 

AGAAATCCAG TCCTGCAGTG ACCCCTGCAT GGAGACAGAG CCCTCCTTCG AGACTGGAGA 2160 

CCTCTGCAAT GATACCAACC CAGCCCCTAA ACACAAGTCC TGTGGTGACC CAGGGCCAGG 2220 

AAATGACACC TGTGGCTTTC ATAGCTTCTT CAACACTCCT TACAACAACT TCATGAGCTA 2280 

TGCAGATGAC GACTGTACGG ACTCCTTCAC GCCCAATCAA GTCGCCAGAA TGCACTGTTA 2340 

CCTGGACCTG GTCTACCAGG GCTGGCAGCC CTCCAGGAAA CCAGCGCCTG TTGCCCTCGC 2400 

CCCCCAAGTT CTGGGCCACA CAACGGACTC TGTGACACTG GAGTGGTTCC CACCTATAGA 2460 

TGGCCATTTC TTTGAAAGAG AATTGGGATC AGCATGTCAT CTTTGCCTGG AAGGGAGAAT 2520 

CCTGGTGCAG TATGCTTCCA ACGCTTCCTC CCCAATGCCC TGCAGCCCAT CAGGACACTG 2580 

GAGCCCTCGT GAAGCAGAAG GTCATCCTGA TGTTGAACAG CCCTGTAAGT CCAGTGTCCG 2640 

CACCTGGAGC CCAAATTCAG CTGTCAACCC ACACACGGTT CCTCCAGCCT GCCCTGAGCC 2700 
TCAAGGCTGC TACCTCGAGC TGGAGTTCCT CTACCCCTTG GTCCCTGAGT CTCTGACCAT 2760 
TTGGGTGACC TTTGTCTCCA CTGACTGGGA CTCTAGTGGA GCTGTCAATG ACATCAAACT 2820 
GTTGGCTGTC AGTGGGAAGA ACATCTCCCT GGGTCCTCAG AATGTCTTCT GTGATGTCCC 2880 
ACTGACCATC AGACTCTGGG ACGTGGGCGA GGAGGTGTAT GGCATCCAAA TCTACACGCT 2940 
GGATGAGCAC CTGGAGATCG ATGCTGCCAT GTTGACCTCC ACTGCAGACA CCCCACTCTG 3000 
TCTACAGTGT AAGCCCCTGA AGTATAAGGT GGTCCGGGAC CCTCCTCTCC AGATGGATGT 3060 
GGCCTCCATC CTACATCTCA ATAGGAAATT CGTAGACATG GATCTAAATC TTGGCAGTGT 3120 
GTACCAGTAT TGGGTCATAA CTATTTCAGG AACTGAAGAG AGTGAGCCAT CACCTGCTGT 3180 
CACATACATC CATGGACGTG GGTACTGTGG CGATGGCATT ATACAAAAAG ACCAAGGTGA 3240 
ACAATGCGAC GACATGAATA AGATCAATGG TGATGGCTGC TCCCTTTTCT GCCGACAAGA 3300 
AGTCTCCTTC AATTGTATTG ATGAACCCAG CCGGTGCTAT TTCCATGATG GTGATGGGGT 3360 
ATGTGAGGAG TTTGAACAAA AAACCAGCAT TAAGGACTGT GGTGTCTACA CGCCCCAGGG 3420 
ATTCCTGGAT CAGTGGGCAT CCAATGCTTC AGTATCTCAT CAAGACCAGC AATGCCCAGG 3480 
CTGGGTCATC ATCGGACAGC CAGCAGCATC CCAGGTGTGT CGAACCAAGG TGATAGATCT 3540 
CAGTGAAGGC ATTTCCCAGC ATGCCTGGTA CCCTTGCACC ATCAGCTACC CATATTCCCA 3600 
GCTGGCTCAG ACCACTTTTT GGCTCCGGGC GTATTTTTCT CAACCAATGG TTGCCGCAGC 3660 
TGTCATTGTC CACCTGGTGA CGGATGGGAC ATATTATGGG GACCAAAAGC AGGAGACCAT 3720 
CAGCGTGCAG CTGCTTGATA CCAAAGATCA GAGCCACGAT CTAGGCCTCC ATGTCCTGAG 3780 
CTGCAGGAAC AATCCCCTGA TTATCCCTGT GGTCCATGAC CTCAGCCAGC CCTTCTACCA 3840 
CAGCCAGGCG GTACGTGTGA GCTTCAGTTC GCCCCTGGTC GCCATCTCGG GGGTGGCCCT 3900 
CCGTTCCTTC GACAACTTTG ACCCCGTCAC CCTGAGCAGC TGCCAGAGAG GGGAGACCTA 3960 
CAGCCCTGCC GAGCAGAGCT GCGTGCACTT CGCATGTGAG AAAACTGACT GTCCAGAGCT 4020 
GGCTGTGGAG AATGCTTCTC TCAATTGCTC CAGCAGCGAC CGCTACCACG GTGCCCAGTG 4080 
TACTGTGAGC TGCCGGACAG GCTACGTGCT CCAGATACGG CGGGATGATG AGCTGATCAA 4140 
GAGCCAGACG GGACCCAGCG TCACAGTGAC CTGTACAGAG GGCAAGTGGA ATAAGCAGGT 4200 
GGCCTGTGAG CCAGTCGACT GCAGCATCCC AGATCACCAT CAAGTCTATG CTGCCTCCTT 4260 
CTCCTGCCCT GAGGGCACCA CCTTTGGCAG TCAATGTTCC TTCCAGTGCC GTCACCCTGC 4320 
ACAATTGAAA GGCAACAACA GCCTCCTGAC CTGCATGGAG GATGGGCTGT GGTCCTTCCC 4380 
AGAGGCCCTG TGTGAGCTCA TGTGCCTCGC TCCACCCCCT GTGCCCAATG CAGACCTCCA 4440 
GACCGCCCGG TGCCGAGAGA ATAAGCACAA GGTGGGCTCC TTCTGCAAAT ACAAATGCAA 4500 
GCCTGGATAC CATGTGCCTG GATCCTCTCG GAAGTCAAAG AAACGGGCCT TCAAGACTCA 4560 
GTGTACCCAG GATGGCAGCT GGCAGGAGGG AGCTTGTGTT CCTGTGACCT GTGACCCACC 4620 
TCCACCAAAA TTCCATGGGC TCTACCAGTG TACTAATGGC TTCCAGTTCA ACAGTGAGTG 4680 
TAGGATCAAG TGTGAAGACA GTGATGCCTC CCAGGGACTT GGGAGCAATG TCATTCATTG 4740 
CCGGAAAGAT GGCACCTGGA ACGGCTCCTT CCATGTCTGC CAGGAGATGC AAGGCCAGTG 4800 
CTCGGTTCCA AACGAGCTCA ACAGCAACCT CAAACTGCAG TGCCCTGATG GCTATGCCAT 4860 
AGGGTCGGAG TGTGCCACCT CGTGCCTGGA CCACAACAGC GAGTCCATCA TCCTGCCAAT 4920 
GAACGTGACC GTGCGTGACA TCCCCCACTG GCTGAACCCC ACACGGGTAG AGAGAGTTGT 4980 
CTGCACTGCT GGTCTCAAGT GGTATCCTCA CCCTGCTCTG ATTCACTGTG TCAAAGGCTG 5040 
TGAGCCCTTC ATGGGAGACA ATTATTGTGA TGCCATCAAC AACCGAGCCT TTTGCAACTA 5100 
TGACGGTGGG GATTGCTGCA CCTCCACAGT GAAGACCAAA AAGGTCACCC CATTCCCTAT 5160 
GTCCTGTGAC CTACAAGGTG ACTGTGCTTG TCGGGACCCC CAGGCCCAAG AACACAGCCG 5220 
GAAAGACCTC CGGGGATACA GCCATGGCTA AGGAAGGACA AGAAGTTGTC AAAGAATTCC 5280 
CAACGCCAGG ACCCACATCC CTTTGGTATT GATTTCACAG TCAGCTGCTC AACGGAATGG 5340 
GCTCTCCACA CCAGGGATCC TTAGCACCCA ACCGGTCTGC CTTTAATTTT ACCCAGGAAG 5400 
GACTCACATT GGGGCGAATG AACCAAGTTT CGCCATGCTG GATGATGAAA TGGATTCCCA 5460 
TCCCAAAGTC TGAGATGGAT TGCATATACA GTGTGCAGTC CCAGAGCCTC CTAAAATTCT 5520 
AGCCATTTGT CACACAACCA CAGCAAAAAA AAA 

Seq ID NO: B29 Protein sequence 
Protein Accession #: NP_002572.1 

1 11 21 31 41 51 
| | | | | | 

MRLWSWVLHL GLLSAALGCG LAERPRRARR DPRAGRPPRP AAGPATCATR GPRPPRLAAA 60 
AAAAGRAWEA VRVPRRRQQR EARGATEEPS PPSRALYFSG RGEQLRVLRA DLELPRDAFT 120 
LQVWLRAEGG QRSPAVITGL YDKCSYISRD RGWVVGIHTI SDQDNKDPRY FFSLKTDRAR 180 
QVTTINAHRS YLPGQWVYLA ATYDGQFMKL YVNGAQVATS GEQVGGIFSP LTQKCKVLML 240 
GGSALNHNYR GYIEHFSLWK VARTQREILS DMETHGAHTA LPQLLLQENW DNVKHAWSPM 300 
KDGSSPKVEF SNAHGFLLDT SLEPPLCGQT LCDNTEVIAS YNQLSSFRQP KVVRYRVVNL 360 
YEDDHKNPTV TREQVDFQHH QLAEAFKQYN ISWELDVLEV SNSSLRRRLI LANCDISKIG 420 
DENCDPECNH TLTGHDGGDC RHLRHPAFVK KQHNGVCDMD CNYERFNFDG GECCDPEITN 480 
VTQTCFDPDS PHRAYLDVNE LKNILKLDGS THLNIFFAKS SEEELAGVAT WPWDKEALMH 540 
LGGIVLNPSF YGMPGHTHTM IHEIGHSLGL YHVFRGISEI QSCSDPCMET EPSFETGDLC 600 
NDTNPAPKHK SCGDPGPGND TCGFHSFFNT PYNNFMSYAD DDCTDSFTPN QVARMHCYLD 660 
LVYQGWQPSR KPAPVALAPQ VLGHTTDSVT LEWFPPIDGH FFERELGSAC HLCLEGRILV 720 
QYASNASSPM PCSPSGHWSP REAEGHPDVE QPCKSSVRTW SPNSAVNPHT VPPACPEPQG 780 
CYLELEFLYP LVPESLTIWV TFVSTDWDSS GAVNDIKLLA VSGKNISLGP QNVFCDVPLT 840 
IRLWDVGEEV YGIQIYTLDE HLEIDAAMLT STADTPLCLQ CKPLKYKVVR DPPLQMDVAS 900 
ILHLNRKFVD MDLNLGSVYQ YWVITISGTE ESEPSPAVTY IHGRGYCGDG IIQKDQGEQC 960 
DDMNKINGDG CSLFCRQEVS FNCIDEPSRC YFHDGDGVCE EFEQKTSIKD CGVYTPQGFL 1020 
DQWASNASVS HQDQQCPGWV IIGQPAASQV CRTKVIDLSE GISQHAWYPC TISYPYSQLA 1080 
QTTFWLRAYF SQPMVAAAVI VHLVTDGTYY GDQKQETISV QLLDTKDQSH DLGLHVLSCR 1140 
NNPLIIPVVH DLSQPFYHSQ AVRVSFSSPL VAISGVALRS FDNFDPVTLS SCQRGETYSP 1200 
AEQSCVHFAC EKTDCPELAV ENASLNCSSS DRYHGAQCTV SCRTGYVLQI RRDDELIKSQ 1260 
TGPSVTVTCT EGKWNKQVAC EPVDCSIPDH HQVYAASFSC PEGTTFGSQC SFQCRHPAQL 1320 
KGNNSLLTCM EDGLWSFPEA LCELMCLAPP PVPNADLQTA RCRENKHKVG SFCKYKCKPG 1380 
YHVPGSSRKS KKRAFKTQCT QDGSWQEGAC VPVTCDPPPP KFHGLYQCTN GFQFNSECRI 1440 
KCEDSDASQG LGSNVIHCRK DGTWNGSFHV CQEMQGQCSV PNELNSNLKL QCPDGYAIGS 1500 
ECATSCLDHN SESIILPMNV TVRDIPHWLN PTRVERVVCT AGLKWYPHPA LIHCVKGCEP 1560 
FMGDNYCDAI NNRAFCNYDG GDCCTSTVKT KKVTPFPMSC DLQGDCACRD PQAQEHSRKD 1620 
LRGYSHG 



Seq ID NO: B30 DNA sequence 

Nucleic Acid Accession #: NM_032808.1 

Coding sequence: 61..1008 

1 11 21 31 41 51 

| | | | | | 

CTAGTCTATC TCCGCTTCCT CAACCTCTCC TACAACCCCA TCAGCACCAT TGAGGGCTCC 60 
ATGTTGCATG AGCTGCTCCG GCTGCAGGAG ATCCAGCTGG TGGGCGGGCA GCTGGCCGTG 120 

GTGGAGCCCT ATGCCTTCCG CGGCCTCAAC TACCTGCGCG TGCTCAATGT CTCTGGCAAC 180 
CAGCTGACCA CACTGGAGGA ATTAGTCTTC CACTCGGTGG GCAACCTGGA GACACTCATC 240 
CTGGACTCCA ACCCGCTGGC CTGCGACTGT CGGCTCCTGT GGGTGTTCCG GCGCCGCTGG 300 
CGGCTCAACT TCAACCGGCA GCAGCCCACG TGCGCCACGC CCGAGTTTGT CCAGGGCAAG 360 
GAGTTCAAGG ACTTCCCTGA TGTGCTACTG CCCAACTACT TCACCTGCCG CCGCGCCCGC 420 
ATCCGGGACC GCAAGGCCCA GCAGGTGTTT GTGGACGAGG GCCACACGGT GCAGTTTGTG 480 
TGCCGGGCCG ATGGCGACCC GCCGCCCGCC ATCCTCTGGC TCTCACCCCG AAAGCACCTG 540 
GTCTCAGCCA AGAGCAATGG GCGGCTCACA GTCTTCCCTG ATGGCACGCT GGAGGTGCGC 600 
TACGCCCAGG TACAGGACAA CGGCACGTAC CTGTGCATCG CGGCCAACGC GGGCGGCAAC 660 
GACTCCATGC CCGCCCACCT GCATGTGCGC AGCTACTCGC CCGACTGGCC CCATCAGCCC 720 
AACAAGACCT TCGCTTTCAT CTCCAACCAG CCGGGCGAGG GAGAGGCCAA CAGCACCCGC 780 
GCCACTGTGC CTTTCCCCTT CGACATCAAG ACCCTCATCA TCGCCACCAC CATGGGCTTC 840 
ATCTCTTTCC TGGGCGTCGT CCTCTTCTGC CTGGTGCTGC TGTTTCTCTG GAGCCGGGGC 900 
AAGGGCAACA CAAAGCACAA CATCGAGATC GAGTATGTGC CCCGAAAGTC GGACGCAGGC 960 
ATCAGCTCCG CCGACGCGCC CCGCAAGTTC AACATGAAGA TGATATGAGG CCGGGGCGGG 1020 
GGGCAGGGAC CCCOGGGCGG CCGGGCAGGG GAAGGGGCCT GGCCGCCACC TGCTCACTCT 1080 
CCAGTCCTTC CCACCTCCTC CCTACCCTTC TACACACGTT CTCTTTCTCC CTCCCGCCTC 1140 
AGCCCAGACC TGGGGACCCC ACCTACACAG CACCTGCCCT CCTTCTACCA GGACCTCAGA 1200 
CGAACCGACA CGCGGCAGAG TCAATAATTC GGGCATTGAC AGACTGGAGT TGAAAGCCGA 1260 
TTGGGTTTCA ATAATTATGG ATTTTTATGA AATAAAAAAG TTACGAACTT TCTCTGTAAC 1320 
TATTTCCTAT AGCTAGTCGG AATGCAAACT AAACTTGAAA TAATAAAAAG AGAAAAAAAC 1380 
TCCAACTCAG TTTCTTGTTT TTCTCTTCCT TTTGACGTCC TGATTGCTCC AGGGCCCTCT 1440 
CTTCCCCAGT GGGGAGGGAT CACTCAGGAA CCTCCTCCTC tTCTTCCTCC TTTCTCTTCT 1500 
CTGCCCACCC CGCCCCAGGC ACCATCAGGA AACAGGAAAG GAGGTTCCAG CCCCACCCAC 1560 
CCGGGCTGGC TTTTTGCAGG GCGCAGGTGG GCAGGCTAGG GGGCAGGCCT GGGCCCAGCT 1620 
CCTGTCTGCT GGGCTGCCAG GCGGCACCAC AGGGGACAGG TCTGCCGATG GGGGTGGGAG 1680 
GCTGAGACTC TGGACAGAGG CTGGGGTCCT TGCAAGGGGT GGGAGCCTGG CTTGGGTGTG 1740 
CCAGGGGCTG GAGGTGGGGC CCACCCCAGC CCTGGGGGAC AGCACAGTCA GTGGAGAGAG 1800 
GTGTGGCCTC AAGCAGGTCA CTGGCCTCTC CTCTGGTCCC AGCTCTGCTG CTCACTTGCT 1860 
GGAACATTAC CCCCTGCCCT GCCTACCTCA TGGGCCTCAG TCTCCACATC TGTACAAATG 1920 
TGTATGTGAA ACACTTTGTA ACCTGTAAAG CAGGGCTGTT GTGAGGAATT GATGAGATGA 1980 
CGCTGTGCAC ACGTG 

Seq ID NO: B31 Protein sequence 
Protein Accession #: NP_116197.1 

1 11 21 31 41 51 

| | | | | | 

MLHELLRLQE IQLVGGQLAV VEPYAFRGLN YLRVLNVSGN QLTTLEELVF HSVGNLETLI 
LDSNPLACDC RLLWVFRRRW RLNFNRQQPT CATPEFVQGK EFKDFPDVLL PNYFTCRRAR 
IRDRKAQQVF VDEGHTVQFV CRADGDPPPA ILWLSPRKHL VSAKSNGRLT VFPDGTLEVR 
YAQVQDNGTY LCIAANAGGN DSMPAHLHVR SYSPDWPHQP NKTFAFISNQ PGEGEANSTR 
ATVPFPFDIK TLIIATTMGF ISFLGVVLFC LVLLFLWSRG KGNTKHNIEI EYVPRKSDAG 
ISSADAPRKF NMKMI 

Seq ID NO: B32 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 
| | | | | | 

CAGCACATAC AAGAAACATA CAGTGTACCT CAAAAGGGGC CCTTGAAATG TCATCAAAGG 60 
GTAATAACCT AGTGAGTGAG TTGTGATGTC ATCTGGAACA TAGGAAATGG GGCTCTTAGC 120 
GTATTCGGTA CGAAGGAAGC CAGGCTGGTC CTGGCAGGAA GTAAATGATA ATCTTTGGGA 180 
AACCAGGACC CTGCCTCCCA GCCCAGAGGT GGAGGAGGGC GGTCAGGGTG GGGTCTACAG 240 
TGGCACAGCA CTGACAAAGG TAGAGGGAAA TGTAATAGCA CATCTACGCT GCAGTCTGGT 300 
GAAAGTGGCC GGGGTGGTCC TTGGAAAACA GTTGGGCTGT TCTTGGCAGG AATTAGTGAC 360 
AGCCTTTCCG TCACGGGCAG GGACGCCTTG ATTTAAAAAA AATAAATAAA TAAAACGTCT 420 
GGGTATAGAA A 

Seq ID NO: B33 DNA sequence 

Nucleic Acid Accession #: NM_006174.1 

Coding sequence: 71..1408 

1 11 21 31 41 51 
| | | | | | 

GAAAGGCTAT CGGTAACAAC TGACCTGCCA CAAAGTTAGA AGAAAGGATT GATTCAAGAA 60 
AGACTATAAT ATGGATTTAG AGCTCGACGA GTATTATAAC AAGACACTTG CCACAGAGAA 120 
TAATACTGCT GCCACTCGGA ATTCTGATTT CCCAGTCTGG GATGACTATA AAAGCAGTGT 180 
AGATGACTTA CAGTATTTTC TGATTGGGCT CTATACATTT GTAAGTCTTC TTGGCTTTAT 240 
GGGGAATCTA CTTATTTTAA TGGCTCTCAT GAAAAAGCGT AATCAGAAGA CTACGGTAAA 300 
CTTCCTCATA GGCAATCTGG CCTTTTCTGA TATCTTGGTT GTGCTGTTTT GCTCACCTTT 360 
CACACTGACG TCTGTCTTGC TGGATCAGTG GATGTTTGGC AAAGTCATGT GCCATATTAT 420 
GCCTTTTCTT CAATGTGTGT CAGTTTTGGT TTCAACTTTA ATTTTAATAT CAATTGCCAT 480 
TGTCAGGTAT CATATGATAA AACATCCCAT ATCTAATAAT TTAACAGCAA ACCATGGCTA 540 
CTTTCTGATA GCTACTGTCT GGACACTAGG TTTTGCCATC TGTTCTCCCC TTCCAGTGTT 600 
TCACAGTCTT GTGGAACTTC AAGAAACATT TGGTTCAGCA TTGCTGAGCA GCAGGTATTT 660 
ATGTGTTGAG TCATGGCCAT CTGATTCATA CAGAATTGCC TTTACTATCT CTTTATTGCT 720 
AGTTCAGTAT ATTCTGCCCT TAGTTTGTCT TACTGTAAGT CATACAAGTG TCTGCAGAAG 780 
TATAAGCTGT GGATTGTCCA ACAAAGAAAA CAGACTTGAA GAAAATGAGA TGATCAACTT 840 
AACTCTTCAT CCATCCAAAA AGAGTGGGCC TCAGGTGAAA CTCTCTGGCA GCCATAAATG 900 
GAGTTATTCA TTCATCAAAA AACACAGAAG AAGATATAGC AAGAAGACAG CATGTGTGTT 960 
ACCTGCTCCA GAAAGACCTT CTCAAGAGAA CCACTCCAGA ATACTTCCAG AAAACTTTGG 1020 
CTCTGTAAGA AGTCAGCTCT CTTCATCCAG TAAGTTCATA CCAGGGGTCC CCACTTGCTT 1080 
TGAGATAAAA CCTGAAGAAA ATTCAGATGT TCATGAATTG AGAGTAAAAC GTTCTGTTAC 1140 
AAGAATAAAA AAGAGATCTC GAAGTGTTTT CTACAGACTG ACCATACTGA TATTAGTATT 1200 
TGCTGTTAGT TGGATGCCAC TACACCTTTT CCATGTGGTA ACTGATTTTA ATGACAATCT 1260 
TATTTCAAAT AGGCATTTCA AGTTGGTGTA TTGCATTTGT CATTTGTTGG GCATGATGTC 1320 
CTGTTGTCTT AATCCAATTC TATATGGGTT TCTTAATAAT GGGATTAAAG CTGATTTAGT 1380 
GTCCCTTATA CACTGTCTTC ATATGTAATA ATTCTCACTG TTT 

Seq ID NO: B34 Protein sequence 
Protein Accession #: NP_006165.1 

1 11 21 31 41 51 

| | | | | | 

MDLELDEYYN KTLATENNTA ATRNSDFPVW DDYKSSVDDL QYFLIGLYTF VSLLGFMGNL 60 
LILMALMKKR NQKTTVNFLI GNLAFSDILV VLFCSPFTLT SVLLDQWMFG KVMCHIMPFL 120 
QCVSVLVSTL ILISIAIVRY HMIKHPISNN LTANHGYFLI ATVWTLGFAI CSPLPVFHSL 180 
VELQETFGSA LLSSRYLCVE SWPSDSYRIA FTISLLLVQY ILPLVCLTVS HTSVCRSISC 240 
GLSNKENRLE ENEMINLTLH PSKKSGPQVK LSGSHKWSYS FIKKHRRRYS KKTACVLPAP 300 
ERPSQENHSR ILPENFGSVR SQLSSSSKFI PGVPTCFEIK PEENSDVHEL RVKRSVTRIK 360 
KRSRSVFYRL TILILVFAVS WMPLHLFHVV TDFNDNLISN RHFKLVYCIC HLLGMMSCCL 420 
NPILYGFLNN GIKADLVSLI HCLHM 



Seq ID NO: B35 DNA sequence 
Nucleic Acid Accession #: NM_014279.1 
Coding sequence: 286..1689 

1 11 21 31 41 51 

| | | | | | 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60 
CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 
CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 
GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240 
TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 
AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 
ATGGGCACTG AACTCACTCA AGTGCTGCCC ACCAACCCTG AGGAGAGCTG GCAGGTGTAC 420 
AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTCGCCCC ACAGCAGACC 480 
ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA GGTGCAGAAC 540 
ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTACGTGGAG 600 
AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 660 
AAGCAACACC TGGCCAGGCA GTTTAAGGCG ATAAAAGCGA AAATGGATGA ACTTAGGCCT 720 
TTGATACCTG TGTTGGAAGA GTACAAGGCC GATGCCAAAT TGGTATTGCA GTTTAAAGAG 780 
GAGGTCCAGA ATCTGACGTC AGTGCTTAAC GAGCTGCAAG AGGAAATTGG CGCCTATGAC 840 
TACGATGAAC TTCAGAGCAG AGTGTCCAAT CTTGAAGAAA GGCTCCGTGC ATGCATGCAA 900 
AAACTAGCTT GCGGGAAGTT GACGGGCATC AGTGACCCCG TGACTGTCAA GACCTCCGGC 960 
TCGAGGTTCG GATCCTGGAT GACAGACCCT CTCGCCCCTG AAGGCGATAA CCGGGTGTGG 1020 
TACATGGACG GCTATCACAA CAACCGCTTC GTACGTGAGT ACAAGTCCAT GGTTGACTTC 1080 
ATGAACACGG ACAATTTCAC CTCCCACCGT CTCCCCCACC CCTGGTCGGG CACGGGGCAG 1140 
GTGGTCTACA ACGGTTCTAT CTACTTCAAC AAGTTCCAGA GCCACATCAT CATCAGGTTT 1200 
GACCTGAAGA CAGAGACCAT CCTCAAGACC CGCAGCCTGG ACTATGCCGG TTACAACAAC 1260 
ATGTACCACT ACGCCTGGGG TGGCCACTCG GACATCGACC TCATGGTGGA CGAGAGCGGG 1320 
CTGTGGGCCG TGTACGCCAC CAACCAGAAC GCTGGCAACA TCGTGGTCAG TAGGCTGGAC 1380 
CCCGTGTCCC TGCAGACCCT GCAGACCTGG AACACGAGCT ACCCCAAGCG CAGCGCCGGG 1440 
GAGGCCTTCA TCATCTGCGG CACGCTGTAC GTCACCAACG GCTACTCAGG GGGTACCAAG 1500 
GTCCACTATG CATACCAGAC CAATGCCTCC ACCTATGAAT ACATCGACAT CCCATTCCAG 1560 
AACAAATACT CCCACATCTC CATGCTGGAC TACAACCCCA AGGACCGGGC CCTGTATGCC 1620 
TGGAACAACG GCCACCAGAT CCTCTACAAC GTGACCCTCT TCCACGTCAT CCGCTCCGAC 1680 
GAGTTGTAGC TCCCTCCTCC TGGAAGCCAA GGGCCCACGT CCTCACCACA AAGGGACTCC 1740 
TGTGAAACTG CTGCCAAAAA GATACCAATA ACACTAACAA TACCGATCTT GAAAAATCAT 1800 
CAGCAGTGCG GATTCTGACA TCGAGGGATG GCATTACCTC CGTGTTTCTC CCTTTCGAGC 1860 
CGGCGGGCCA CAGACGTCGG AAGAAACTCC CGTATTTGCA GCTGGAACTG CAGCCCACGG 1920 
CGCCCCGGTT TTCCTCCCCG CCCTGTCCCT CTCTGGTCAA ACAACATACT AAAGAGGCGA 1980 
GGCAATGACT GTTGGCCAGT TCTCACCGGG GAAAAACCCA CTGTTAGGAT GGCATGAACA 2040 
TTTCCTTAGA TCGTGGTCAG CTCCGAGGAA TGTGGCGTCC AGGCTCTTTG AGAGCCATGG 2100 
GCTGCACCCG GCCGTAGGCT AGTGTAACTC GCATCCCATT GCAGTGCCGT TTCTTGACTG 2160 
TGTTGCTGTC TCTTAGATTA ACCGTGCTGA GGCTCCACAT AGCTCCTGGA CCTGTGTCTA 2220 
GTACATACTG AAGCGATGGT CAGAGTGTGT AGAGTGAAGT TGCTGTGCCC ACATTGTTTG 2280 
AACTCGCGTA CCCCGTAGAT ACATTGTGCA ACGTTCTTCT GTTATTCCCT TGAGGTGGTA 2340 
ACTTCGTATG TTCAGTTTAT GCGATGATTG TTGTAAATGC AATGCCGTAG TTTGGATTAA 2400 
TAAGTGGATG GTTTTTGTTT CTAAAAAGAA AAAAAAAATC AGTGTTCACC CTTATAGAGA 2460 
CATAGTCAAG TTCATGTTGA TAATAATCAA AGGAATTACT CTCTTCTTGT TAAATTAGCT 2520 
AAATCATGTA ACCGCAGATA GGAAGGGCTC ACCTGGGGAA ACTCTGGTTT CCGATGGGAC 2580 
AGGAAAGTCA TACGGGCAAC AGTATGCGGA AAGTAOGTTT TTTTAAGTAA AAAACAAAGG 2640 
CAAACTTTGT ACTATCCAGT TATCTAAGGA ACAATAAAAA CATTAGGAGA AAAAAAAAAA 2700 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 



Seq ID NO: B36 Protein sequence 
Protein Accession 8: NP_055094.1 

1 11 21 31 41 51 

| | | | | |

MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQVLPTNPEE SWQVYSSAQD SEGRCICTVV 60
APQQTMCSRD ARTKQLRQLL EKVQNMSQSI EVLDRRTQRD LQYVEKMENQ MKGLESKFKQ 120 
VEESHKQHLA RQFKAIKAKM DELRPLIPVL EEYKADAKLV LQFKEEVQNL TSVLNELQEE 180 
IGAYDYDELQ SRVSNLEERL RACMQKLACG KLTGISDPVT VKTSGSRFGS WMTDPLAPEG 240
DNRVWYMDGY HNNRFVREYK SMVDFMNTDN FTSHRLPHPW SGTGQVVYNG SIYFNKFQSH 300
IIIRFDLKTE TILKTRSLDY AGYNNMYHYA WGGHSDIDLM VDESGLWAVY ATNQNAGNIV 360






WO 03/042661 



VSRLDPVSLQ TLQTWNTSYP KRSAGEAFII CGTLYVTNGY SGGTKVHYAY QTNASTYEYI 420 
DIPFQNKYSH ISMLDYNPKD RALYAWNNGH QILYNVTLFH VIRSDEL 



Seq ID NO: B37 DNA sequence 

Nucleic Acid Accession #: NM_006334.1

Coding sequence: 286..693

1 11 21 31 41 51 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240 

TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AGTGCTGCCC ACCAACCCTG AGGAGAGCTG GCAGGTGTAC 420 

AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTCGCCCC ACAGCAGACC 480 

ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA GGTGCAGAAC 540 

ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTACGTGGAG 600 

AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 660 

AAGCAACACC TGGCCAGGCA GTTTAAGGGC TAACTTAAAA GAGTTTTTTC AATGCTGCAG 720

TGACTGAAGA AGCAGTCCAC TCCCATGTAA CCATGAAAGA GAGCCAGAGA GCTTTTTGCA 780 

CCATGCATTT TTACTATTAT TTTCCAATAC TTAGCACCAT TTCACTAAGG AACCTTGAAT 840 

ACAACCAGGA TCCTCCTTTG CATGCGACTG TAGCTGCATT TCATGAATAG TTTGAACCCT 900 

TGTCAATGCA TTTTTTGAAA AAGAAAGAAA AAAAAAACTT CGTGTATGTG ACTCAAAGCA 960 
TGTAACCTTA AGATGTTGCA TTCTAAACTG ACAATAAAGA CCTTTCCCC 



Seq ID NO: B38 Protein sequence 
Protein Accession #: NP_006325.1

1 11 21 31 41 51 

| | | | | |

MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQVLPTNPEE SWQVYSSAQD SEGRCICTVV 60
APQQTMCSRD ARTKQLRQLL EKVQNMSQSI EVLDRRTQRD LQYVEKMENQ MKGLESKFKQ 120 
VEESHKQHLA RQFKG



Seq ID NO: B39 DNA sequence 

Nucleic Acid Accession #: NM_058199.1 

Coding sequence: 286..795

1 11 21 31 41 51 

| | | | | |

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240 

TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AAATAAAAGA GAAAACAAAG CAGAGAAGAT GGGAGGOCCA 420 

GAGAGCGAGA GGAAGACCAC AGGAGAGAAG ACACTGAACG AGCTTCCCTT GTTTTGCCTG 480 

GAAGCCCACG CTGGCTCCCT GGCTCTGCCC AGGATGTGCA GTCCAAATCC CAATCCAGCA 540 

GTGGGGTTAT GTCGTCCCGC TTACCCTCAG AGCCCTTCTC CTGGTGCTGC CCAGACGATC 600 

AGCCAGTCCC TCCTGGAGAG GTTCTGCATG GCCTCTAGGA GAGAAGTTTT CTTGGCCCCA 660 

GGAAGGCCTG GTGGAGGGTG GTGGTTGTGC ACTGTTGCTG GACAGATGCA TTCATTCATG 720 

TGCACACACA CACACACACA TGCACACACA GGGGAGCAGA TACCTGCAGA GAAGAGCCAA 780

CCAGGTCCTG ATTAGTGGCA AGCTGCCCCA CAAAGGGCTA TGCCTGTGTC TTATTGAGAC 840

ACCTTGGCAA AGAGATGGCT GATTCTGGGT GGTCCTGGAC ATGGCCGCAC CCAAGGGCCC 900 

TCCAAGCCTT AATGGCACCC TGAAGCCTCC ATGCCCAGGC CAAAAGATGC TTTTCCTCCC 960 
TAAAAAAAAA AAAAAAAAAA 



Seq ID NO: B40 Protein sequence 
Protein Accession #: NP_478106.1 

1 11 21 31 41 51

| | | | | |

MPGRWRWQRD MHPARKLLSL LFLILMGTEL TQNKRENKAE KMGGPESERK TTGEKTLNEL 60 
PLFCLEAHAG SLALPRMCSP NPNPAVGLCR PAYPQSPSPG AAQTISQSLL ERFCMASRRE 120
VFLAPGRPGG GWWLCTVAGQ MHSFMCTHTH THAHTGEQIP AEKSQPGPD 



Seq ID NO: B41 DNA sequence 

Nucleic Acid Accession #: AY038071.1

Coding sequence: 1..1686 

1 11 21 31 41 51

ATGAGCAATC AGTACCAGGA GGAGGGCTGC TCCGAGAGGC CCGAGTGCAA AAGTAAATCT 60 

CCAACTTTGC TCTCCTCCTA CTGCATCGAC AGCATCCTGG GCCGGAGGAG CCCGTGCAAA 120 

ATGCGGTTGC TGGGAGCCGC GCAGAGCTTG CCTGCTCCGC TGACCAGCCG CGCCGACCCG 180 

GAAAAGGCCG TGCAAGGCTC CCCTAAGAGC AGCAGCGCCC CGTTCGAGGC CGAGCTGCAC 240 

CTGCCGCCCA AGCTGCGGCG CCTGTACGGC CCGGGCGGGG GCCGCCTCCT TCAGGGTGCG 300 

GCAGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCAGCGG CCGCCACGGC CACGGCGGGT 360 

CCACGCGGGG AGGCCCCTCC GCCGCCACCG CCAACCGCGC GGCCCGGGGA ACGGCCGGAC 420 

GGCGCAGGGG CCGCCGCGGC AGCCGCGGCC GCGGCCGCCG CGGCCTGGGA CACGCTCAAG 480 

ATCAGCCAGG CGCCGCAGGT GAGCATCAGC CGCAGCAAGT CGTACCGCGA GAACGGGGCG 540 






PCT/US02/36810

CCCTTCGTGC CGCCGCCGCC CGCGCTGGAC GAGCTGGGCG GCCCGGGGGG CGTCACGCAC 600
CCGGAGGAGC GCCTCGGCGT GGCCGGCGGC CCGGGCAGCG CCCCGGCTGC GGGTGGTGGC 660
ACCGGCACCG AGGACGACGA GGAGGAGCTG CTGGAGGACG AAGAAGATGA GGACGAGGAA 720
GAGGAACTGC TGGAGGACGA CGAGGAGGAG CTGCTGGAGG ACGACGCCCG CGCGCTGCTC 780
AAGGAGCCCC GGCGCTGTCC TGTGGCCGCC ACTGGCGCCG TGGCCGCAGC AGCTGCCGCT 840
GCAGTGGCCA CAGAGGGCGG GGAGCTGTCA CCCAAGGAGG AGCTGCTGCT GCACCCGGAA 900
GACGCTGAGG GCAAGGACGG CGAGGACAGC GTGTGCCTCT CTGCGGGCAG CGACTCGGAG 960
GAGGGGCTGC TGAAACGCAA ACAGAGGCGC TACCGCACCA CGTTCACCAG CTACCAGCTG 1020
GAGGAACTGG AGCGGGCCTT CCAGAAGACG CACTACCCGG ACGTCTTCAC CAGGGAGGAA 1080
CTGGCCATGA GGCTGGACTT GACCGAGGCC CGAGTCCAGG TCTGGTTCCA GAACCGTCGG 1140
GCCAAGTGGC GCAAGCGGGA GAAGGCAGGC GCGCAGACCC ACCCCCCTGG GCTGCCCTTC 1200
CCGGGGCCGC TCTCCGCCAC CCACCCGCTC AGCCCCTACC TGGACGCCAG CCCCTTCCCT 1260
CCGCACCACC CGGCGCTCGA CTCCGCTTGG ACTGCOGCTG CCGCCGCCGC CGCCGCCGCC 1320
TTCCCGAGCC TACCTCCGCC TCCGGGCTCG GCCAGCCTGC CGCCCAGCGG GGCGCCGCTG 1380
GGCCTGAGCA CTTTCCTCGG AGCGGCAGTG TTCCGACACC CAGCTTTCAT CAGCCCGGCA 1440
TTCGGCAGGC TCTTTTCCAC AATGGCCCCC CTGACCAGCG CGTCGACCGC GGCCGCGCTC 1500
CTGAGACAGC CCACACCCGC CGTGGAGGGC GCAGTGGCAT CGGGCGCCCT GGCCGACCCG 1560
GCCACGGCGG CCGCAGACAG ACGCGCCTCT AGCATAGCCG CGCTGAGGCT CAAGGCCAAG 1620
GAGCACGCGG CGCAGCTCAC GCAGCTCAAC ATCCTGCCGG GCACCAGCAC GGGCAAGGAG 1680
GTGTGC



Seq ID NO: B42 Protein sequence
Protein Accession #: AAK93901.1



1 11 21 31 41 51

| | | | | |

MSNQYQEEGC SERPECKSKS PTLLSSYCID SILGRRSPCK MRLLGAAQSL PAPLTSRADP 60
EKAVQGSPKS SSAPFEAELH LPPKLRRLYG PGGGRLLQGA AAAAAAAAAA AAAAATATAG 120
PRGEAPPPPP PTARPGERPD GAGAAAAAAA AAAAAWDTLK ISQAPQVSIS RSKSYRENGA 180
PFVPPPPALD ELGGPGGVTH PEERLGVAGG PGSAPAAGGG TGTEDDEEEL LEDEEDEDEE 240
EELLEDDEEE LLEDDARALL KEPRRCPVAA TGAVAAAAAA AVATEGGELS PKEELLLHPE 300
DAEGKDGEDS VCLSAGSDSE EGLLKRKQRR YRTTFTSYQL EELERAFQKT HYPDVFTREE 360
LAMRLDLTEA RVQVWFQNRR AKWRKREKAG AQTHPPGLPF PGPLSATHPL SPYLDASPFP 420
PHHPALDSAW TAAAAAAAAA FPSLPPPPGS ASLPPSGAPL GLSTFLGAAV FRHPAFISPA 480
FGRLFSTMAP LTSASTAAAL LRQPTPAVEG AVASGALADP ATAAADRRAS SIAALRLKAK 540
EHAAQLTQLN ILPGTSTGKE VC



Seq ID NO: B43 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51

| | | | | |

CGCCTGCCCC CCCGTTGCTT CTCTCTCTTC CCTTCTGCCT GCGTGCTCGC CCGTCTTTGG 60
TTCCGCCGGT TCCGTGCGCC CTCTCTGGGC TTCGGCTTCA CCCTCCGTCT GGCTGTCGCT 120
CTGCTCTCCT CCGCTCTCAT GCCCCTTTCA GCTCCTACTC TGGTGTCGTC CCCCTCCCTC 180
GCTCGTTTCT CTTGTTCCTC TCCTTCCCGG TTCGTGTTCT GCTTCTTCGT GTTCCCTCTC 240
GTCTGTGTTT GTGGGTCGTC CTCTGCGTCC TGCCTCTTTC TCTTGCGTTT CCGCGCGCTT 300
CCTGTTTTGG TCCCCCTGGT TCTCGCGCGG TTCTGTGTGC GCGGGCCGCC CCGTTCTTTC 360
TTGGTGCTTT GCTCCTGCTC TCTGTCTGGT GTTCTCCGCC GTCTGTGTCT TTCTCCTTTT 420
ATCTTCTGTT TTCTCTTTTT CTTGTTTCTT CTTGTTTTTT TTTGTCTTTG TTTTCTCTTA 480
GCTGAGGTTG GGGAGAGATA ACGCTGTAAA CTTTTATTTT TCAGGAAATC TGGAAACCTA 540
CAGTCTCCAT GCCTGCTCAG CCAAGAAGGA GCTCACTGTG GGCACCAGAG ACAGGGACCC 600
AATGTGGAGA CCTGTGAGCC TGTGTCCGGC CCTGAACTCT CAAGCACAGG GCAGGCTTCC 660
TGAGCATTGA AGAGAATATG TGGGAGAACA AAACAGAAAC TGAAAGAATA TGCAAGGTGT 720
CTTTCTTGGA TGTTATTCCA TGATAGATAG TAGGGGCAGG AGTGAGAGAG GCTGACTAGG 780
TCTGGACATG GAGGCTGGAA GAGTCAGGGT GTGATTCGGA GAGGCGCATG AGAAGGAAGG 840
TGGATTTTAA GGCTGGAAAT CTGAGGGTCA GTGGTCCAAG TCACTCAGAG ACAGAATCAC 900
AGCATAGCCC TTGCTGATGG CAA



Seq ID NO: B44 DNA sequence 

Nucleic Acid Accession #: NM_014421.1 

Coding sequence: 718.. 1497 



1 11 21 31 41 51

| | | | | |

CCACGCGTCC GGCGGGAGCC CGCGGCGAGC GTAGCGCAAG TCCGCTCCCT AGGCATCGCT 60
GCGCTGGCAG CGATTCGCTG TCTCTTGTGA GTCAGGGGAC AACGCTTCGG GGCAACTGTG 120
AGTGCGCGTG TGGGGGACCT CGATTCTCTT CAGATCTCGA GGATTCGGTC CGGGGACGTC 180
TCCTGATCCC CTACTAAAGC GCCTGCTAAC TTTGAAAAGG AGCACTGTGT CCTGCAAAGT 240
TTGACACATA AAGGATAGGA AAAGAGAGGA GAGAAAAGCA ACTGAGTTGA AGGAGAAGGA 300
GCTGATGCGG GCCTCCTGAT CAATTAAGAG GAGAGTTAAA CCGCCGAGAT CCCGGCGGGA 360
CCAAGGAGGT GCGGGGCAAG AAGGAACGGA AGCGGTGCGA TCCACAGGGC TGGGTTTTCT 420
TGCACCTTGG GTCACGCCTC CTTGGCGAGA AAGCGCCTCG CATTTGATTG CTTCCAGTTA 480
TTGCAGAACT TCCTGTCCTG GTGGAGAAGC GGGTCTCGCT TGGGTTCCGC TAATTTCTGT 540
CCTGAGGCGT GAGACTGAGT TCATAGGGTC CTGGGTCCCC GAACCAGGAA GGGTTGAGGG 600
AACACAATCT GCAAGCCCCC GCGACCCAAG TGAGGGGCCC CGTGTTGGGG TCCTCCCTCC 660
CTTTGCATTC CCACCCCTCC GGGCTTTGCG TCTTCCTGGG GACCCCCTCG CCGGGAGATG 720
GCCGCGTTGA TGCGGAGCAA GGATTCGTCC TGCTGCCTGC TCCTACTGGC CGCGGTGCTG 780
ATGGTGGAGA GCTCACAGAT CGGCAGTTCG CGGGCCAAAC TCAACTCCAT CAAGTCCTCT 840
CTGGGCGGGG AGACGCCTGG TCAGGCCGCC AATCGATCTG CGGGCATGTA CCAAGGACTG 900
GCATTCGGCG GCAGTAAGAA GGGCAAAAAC CTGGGGCAGG CCTACCCTTG TAGCAGTGAT 960
AAGGAGTGTG AAGTTGGGAG GTATTGCCAC AGTCCCCACC AAGGATCATC GGCCTGCATG 1020
GTGTGTCGGA GAAAAAAGAA GCGCTGCCAC CGAGATGGCA TGTGCTGCCC CAGTACCCGC 1080
TGCAATAATG GCATCTGTAT CCCAGTTACT GAAAGCATCT TAACCCCTCA CATCCCGGCT 1140






CTGGATGGTA CTCGGCACAG AGATCGAAAC CACGGTCATT ACTCAAACCA TGACTTGGGA 1200
TGGCAGAATC TAGGAAGACC ACACACTAAG ATGTCACATA TAAAAGGGCA TGAAGGAGAC 1260
CCCTGCCTAC GATCATCAGA CTGCATTGAA GGGTTTTGCT GTGCTCGTCA TTTCTGGACC 1320
AAAATCTGCA AACCAGTGCT CCATCAGGGG GAAGTCTGTA CCAAACAACG CAAGAAGGGT 1380
TCTCATGGGC TGGAAATTTT CCAGCGTTGC GACTGTGCGA AGGGCCTGTC TTGCAAAGTA 1440
TGGAAAGATG CCACCTACTC CTCCAAAGCC AGACTCCATG TGTGTCAGAA AATTTGATCA 1500
CCATTGAGGA ACATCATCAA TTGCAGACTG TGAAGTTGTG TATTTAATGC ATTATAGCAT 1560
GGTGGAAAAT AAGGTTCAGA TGCAGAAGAA TGGCTAAAAT AAGAAACGTG ATAAGAATAT 1620
AGATGATCAC AAAAAGGGAG AAAGAAAACA TGAACTGAAT AGATTAGAAT GGGTGACAAA 1680
TGCAGTGCAG CCAGTGTTTC CATTATGCAA CTTGTCTATG TAAATAATGT ACACATTTGT 1740
GGAAAATGCT ATTATTAAGA GAACAAGCAC ACAGTGGAAA TTACTGATGA GTAGCATGTG 1800
ACTTTCCAAG AGTTTAGGTT GTGCTGGAGG AGAGGTTTCC TTCAGATTGC TGATTGCTTA 1860
TACAAATAAC CTACATGCCA GATTTCTATT CAACGTTAGA GTTTAACAAA ATACTCCTAG 1920
AATAACTTGT TATACAATAG GTTCTAAAAA TAAAATTGCT AAACAAGAAA TGAAAACATG 1980
GAGCATTGTT AATTTACAAC AGAAAATTAC CTTTTGATTT GTAACACTAC TTCTGCTGTT 2040
CAATCAAGAG TCTTGGTAGA TAAGAAAAAA ATCAGTCAAT ATTTCCAAAT AATTGCAAAA 2100
TAATGGCCAG TTGTTTAGGA AGGCCTTTAG GAAGACAAAT AAATAACAAA CAAACAGCCA 2160
CAAATACTTT TTTTTCAAAA TTTTAGTTTT ACCTGTAATT AATAAGAACT GATACAAGAC 2220
AAAAACAGTT CCTTCAGATT CTACGGAATG ACAGTATATC TCTCTTTATC CTATGTGATT 2280
CCTGCTCTGA ATGCATTATA TTTTCCAAAG TATACCCATA AATTGTGACT AGTAAAATAC 2340
TTACACAGAG CAGAATTTTC ACAGATGGCA AAAAAATTTA AAGATGTCCA ATATATGTGG 2400
GAAAAGAGCT AACAGAGAGA TCATTATTTC TTAAAGATTG GCCATAACCT GTATTTTGAT 2460
AGAATTAGAT TGGTAAATAC ATGTATTCAT ACATACTCTG TGGTAATAGA GACTTGAGCT 2520
GGATCTGTAC TGCACTGGAG TAAGCAAGAA AATTGGGAAA ACTTTTTCGT TTGTTCAGGT 2580
TTTGGCAACA CATAGATCAT ATGTCTGAGG CACAAGTTGG CTGTTCATCT TTGAAACCAG 2640
GGGATGCACA GTCTAAATGA ATATCTGCAT GGGATTTGCT ATCATAATNT TTCCTATGCN 2700
GNTGAATTCN GTGTGAGGTC CTGTGTCCGT CCTATCCTCA AATTATTTAT TTTATAGTGC 2760
TGAGATCCTC AAATAATCTC AATTTCGGAG GTTTCACAAA ATGGACTCCT GAAGTAGACA 2820
GAGTAGTGAG GTTTCATTGC CCTCTATAAG CTTCTGACTA GCCAATGGCA TCATCCAATT 2880
TTCTTCCCAA ACCTCTGCAG CATCTGCTTT ATTGCCAAAG GGCTAGTTTC GGTTTTCTGC 2940
CAGCCATTGC GGTTAAAAAA TATAAGTAGG ATAACTTGTA AAACCTGCAT ATTGCTAATC 3000
TATAGACACC ACAGTTTCTA AATTCTTTGA AACCACTTTA CTACTTTTTT TAAACTTAAC 3060
TCAGTTCTAA ATACTTTGTC TGGAGCACAA AACAATAAAA GGTTATCTTA TAGTTGTGAC 3120
TTTAAACTTT TGTAGACCAC AATTCACTTT TTAGTTTTCT TTTACTTAAA TCCCATCTGC 3180
AGTCTCAAAT TTAAGTTCTC CCAGTAGAGA TTGAGTTTGA GCCTGTATAT CTATTAAAAA 3240
TTTCAACTTC CCACATATAT TTACTAAGAT GATTAAGACT TACATTTTCT GCACAGGTCT 3300
GCAAAAACAA AAATTATAAA CTAGTCCATC CAAGAACCAA AGTTTGTATA AACAGGTTGC 3360
TATAAGCTTG GTGAAATGAA AATGGAACAT TTCAATCAAA CATTTCCTAT ATAACAATTA 3420
TTATATTTAC AATTTGGTTT CTGCAATATT TTTCTTATGT CCACCCTTTT AAAAATTATT 3480
ATTTGAAGTA ATTTATTTAC AGGAAATGTT AATGAGATGT ATTTTCTTAT AGAGATATTT 3540
CTTACAGAAA GCTTTGTAGC AGAATATATT TGCAGCTATT GACTTTGTAA TTTAGGAAAA 3600
ATGTATAATA AGATAAAATC TATTAAATTT TTCTCCTCTA AAAACTGAAA AAAAAAAAAA 3660
AAAAAAAAAA A

Seq ID NO: B45 Protein sequence
Protein Accession #: NP_055236.1

1 11 21 31 41 51 

| | | | | |

MAALMRSKDS SCCLLLLAAV LMVESSQIGS SRAKLNSIKS SLGGETPGQA ANRSAGMYQG 60
LAFGGSKKGK NLGQAYPCSS DKECEVGRYC HSPHQGSSAC MVCRRKKKRC HRDGMCCPST 120
RCNNGICIPV TESILTPHIP ALDGTRHRDR NHGHYSNHDL GWQNLGRPHT KMSHIKGHEG 180
DPCLRSSDCI EGFCCARHFW TKICKPVLHQ GEVCTKQRKK GSHGLEIFQR CDCAKGLSCK 240
VWKDATYSSK ARLHVCQKI



Seq ID NO: B46 DNA sequence

Nucleic Acid Accession #: NM_019885.1

Coding sequence: 1..1539 



1 11 21 31 41 51

| | | | | |

ATGCTCTTTG AGGGCTTGGA TCTGGTGTCG GCGCTGGCCA CCCTCGCCGC GTGCCTGGTG 60
TCCGTGACGC TGCTGCTGGC CGTGTCGCAG CAGCTGTGGC AGCTGCGCTG GGCCGCCACT 120
CGCGACAAGA GCTGCAAGCT GCCCATCCCC AAGGGATCCA TGGGCTTCCC GCTCATCGGA 180
GAGACCGGCC ACTGGCTGCT GCAGGGTTCT GGCTTCCAGT CGTCGCGGAG GGAGAAGTAT 240
GGCAACGTGT TCAAGACGCA TTTGTTGGGG CGGCCGCTGA TACGCGTGAC CGGCGCGGAG 300
AACGTGCGCA AGATCCTCAT GGGCGAGCAC CACCTCGTGA GCACCGAGTG GCCTCGCAGC 360
ACCCGCATGT TGCTGGGCCC CAACACGGTG TCCAATTCCA TTGGCGACAT CCACCGCAAC 420
AAGCGCAAGG TCTTCTCCAA GATCTTCAGC CACGAGGCCC TGGAGAGTTA CCTGCCCAAG 480
ATCCAGCTGG TGATCCAGGA CACACTGCGC GCCTGGAGCA GCCACCCOGA GGCCATCAAC 540
GTGTACCAGG AGGCGCAGAA GCTGACCTTC CGCATGGCCA TCCGGGTGCT GCTGGGCTTC 600
AGCATCCCTG AGGAGGACCT TGGGCACCTC TTTGAGGTCT ACCAGCAGTT TGTGGACAAT 660
GTCTTCTCCC TGCCTGTCGA CCTGCCCTTC AGTGGCTACC GGGGGGGCAT TCAGGCTCGG 720
CAGATCCTGC AGAAGGGGCT GGAGAAGGCC ATCCGGGAGA AGCTGCAGTG CACACAGGGC 780
AAGGACTACT TGGACGCCCT GGACCTCCTC ATTGAGAGCA GCAAGGAGCA CGGGAAGGAG 840
ATGACCATGC AGGAGCTGAA GGACGGGACC CTGGAGCTGA TCTTTGCGGC CTATGCCACC 900
ACGGCCAGCG CCAGCACCTC ACTCATCATG CAGCTGCTGA AGCACCCCAC TGTGCTGGAG 960
AAGCTGCGGG ATGAGCTGCG GGCTCATGGC ATCCTGCACA GTGGCGGCTG CCCCTGCGAG 1020
GGCACACTGC GCCTGGACAC GCTCAGTGGG CTGCGCTACC TGGACTGCGT CATCAAGGAG 1080
GTCATGCGCC TGTTCACGCC CATTTCCGGC GGCTACCGCA CTGTGCTGCA GACCTTCGAG 1140
CTTGATGGTT TCCAGATCCC CAAAGGCTGG AGTGTCATGT ATAGCATCCG GGACACCCAT 1200
GACACAGCGC CCGTGTTCAA AGACGTGAAC GTGTTCGACC CCGATCGCTT CAGCCAGGCG 1260
CGGAGCGAGG ACAAGGATGG CCGCTTCCAT TACCTCCCGT TCGGTGGCGG TGTCCGGACC 1320
TGCCTGGGCA AGCACCTGGC CAAGCTGTTC CTGAAGGTGC TGGCGGTGGA GCTGGCTAGC 1380






ACCAGCCGCT TTGAGCTGGC TACACGGACC TTCCCCCGCA TCACCTTGGT CCCCGTCCTG 1440 
CACCCCGTGG ATGGCCTCAG CGTCAAGTTC TTTGGCCTGG ACTCCAACCA GAACGAGATC 1500 
CTGCCGGAGA CGGAGGCCAT GCTGAGCGCC ACAGTCTAAC CCAAGACCCA CCCGCCTCAG 1560 
CCCAGCCCAG GCAGCGGGGT GGTGGTTGTG GGAGGTAGAA ACCTGTGTGT GGGAGGGGGC 1620 
CGGAACGGGG AGGGCGAGTG GCCCCCATAC TTGCCCTCCC TTGCTCCCCC TCCCTGGCAA 1680
ACCCTACCCA AAGCCAGTGG GCCCCATTCC TAGGGCTGGG CTCCCCTTCT GGCTCCAGCT 1740 
TCCCTCCAGC CACTCCCCAT TTACCATCAG CTCAGCCCCT GGGAAGGGOG TGGCAGGGGC 1800
TCTGCATGCC CGTGACAGTG TTAGGTGTCA GCGCGTGCTA CAGTGTTTTT GTGATGTTCT 1860 
GAACTGCTCC CTTCCCTCCG TTCCTTTCGG ACCCTTTTAG CTGGGGTTGG GGGACGGGAA 1920 
GAGCCGTGCC CCCTTGGGCG CACTCTTCAG CGTCTCCTCC TCCTGCGCCC CCACTGCGTC 1980
TGCCCAGGAA CAGCATCCTG GGTAGCAGAA CAGGAGTCAA CCTTGGCGGG GCGGGGGCTG 2040 
CGTCCAACCT GGAGATTGCC CTTCCCTATG CCAOGGTTCC CACCCTCCCT CACCAGTTTG 2100 
GACAATTTGA AATTACCTAT TGCTGCTACT TGTTCTGTCC TCTGACCTTG GGGCAAAGGA 2160 
GCCCCAGGCC CTGTCTCCCC AGCATCCTCC CTGGTGGCCC TGGGCAGGTG CACTGACACC 2220 
CCCACCTTCC CATCCCCTGC TGAACCAGGC CCTGTTACAC ACAGCCGCCT AAGGCCCGCG 2280
GCTCATGTGC TGCCCGCCCC CATATTTATT CACTGATAGA GAATCTTGGG GATGCTGGGG 2340 
TCTGGAGTGA ACATCTCCTC CCCTTCATGC CCTAGCCTGT GTTCTAGCTG TCCTGGCGAG 2400 
ACTTCTGTGA GTGAAGAGGA AGGGGTCTCT GGTCAAACCC AGCCCCCAGG GCCTAGGGTT 2460 
GAAAGCCTTC CCCGGCTCCG GGCATTATTT GGGTTTAATC TCGGAGCCTC ACTCCTGGAC 2520 
TGAAGTCCGG TGCCTCTGCC TTATCCCTGG TGGAGATGGA ATGTGGCCCA TTGCCTCCTC 2580
CCTCTCCTGT CAAAAACCCT GATCAGGTAG ATTTGGAGGC GGCCACGATT TCCTGTTTGG 2640 
CCCCTGTTCA CCCCAGTGCA CTGGCCCTGA CTCCAGGCGT GAGTATGGGG AAGGATACGG 2700 
GTTCTTCTGA CGGGGAGCAA GGGCCTCCGT CTTCCCTTCC TTAACTCTCC CCCTTTGCCC 2760 
TCCGCCCTGA AAAAGGTGTC CTTGAAGTCC CTTCCACCTC TATGCCACTG TCTGCTTAGC 2820 
CCAGCTCAGG GGTGGGGAAG AGGCGAAAGC GTGGGGGAGG TGAGCGCAGC GGCAGTTCTG 2880
CCTCGGAGCT GATTGCAGGG CCCTGTGTGG TCTCCGGACA GCTGCGGGAA GGCTGCCGCA 2940 
GCTGAAGCTG AAGAGGCGGC TACGTGCGGT TTGTCAGGGG GATTGGGTTG AAAACTGGCC 3000 
AGTCGGGATG ACTGGGTGAA AGAGGAGTAG CTCCTGCCAC TGGCGTTTTG AGTGTTGGCA 3060 
ATTTGGGATG CCTCCTGGGG AAGGTTTCCG GGCGTTTGGT GAGTCTCTAG ATTTTTCCTT 3120 
GCTTTCTGTG TTTATTGGTT TTTGATGTTG TAAAAGCAAT GAATCCCCTT TACAAGAAAA 3180
TCGAAAACAC AGAAGAATGA AGGACATGCC AGTCCCCGAT CGCTGCTGTG AGCACCTCAG 3240 
TGGCTCCCTC AGACCAGATC CCGTAGGCAG CCCCACAGAC CGACCCTGAC CCCACTCACA 3300 
GCCACCCTGA AGATAGACTA TAGGAACGGG CCCATACCAC ACAGACTGCT CTCCAATCCC 3360 
TGAGTCTCAG ATGTTTCATT TATTTCCTAC TTTTCCACTA CTAAAAAACA GTGTGGAATA 3420 
GACATTATTG GCAAAATTGC TCATCCCTAA TCCTGAAAAA CAGGCCAGAA TGGGTAAAGA 3480
CTTGTCAAAG CTTGCAACAT AGCTACATGG TGCACCCGGA CCTGTACCCC CTCCCCCCAA 3540 
CACAAAACCA GTGTCTGGGA GGTTCATTTT CCTTTAAACT GATCCAGCTG GCCCTGAACC 3600 
AATTGTTTTT GACTGAGTAT CTAGGAGAGC AGTAAGTGGA ACTTCAGACA AGCCCACTGG 3660 
GTCTGGTCCA GGTGAGGGGC AGGGGGCATG GGGCTGGGAG GTCTCAGGGG CCTTCCCTGG 3720 
GGGTGGCCAG CCTGGTAGGG GGCAGAGAAG GAAAAGCTGA GGGGGGTCCC TGTGAGGGAG 3780
GAAAGAAGGA TCATTTGCCC CGCTGGGTCT CAAAGGCAGT GAGAAGAGAG CTGAAGAAAG 3840 
CTCTGGCTGG CTGACAGGAT CCCTGTGTTG TAATTGGTCC CTCCTTTCAG CTCTCTAGTG 3900 
AGATGCCCGT GTCTGTGCGT GTGCGTGTGT GTTTCATACA GCTAGCATTA GATGGGTGAT 3960 
GTTTCTTACT TATCATCCCT AACTATTGCA ACTTGACCTT AAAAAGACAA AACCCCACAA 4020 
AACTCTTCCT GCCACGGGCT TGCAGATTGA AGCACTTTCG ATGTTGGGOG CTGGCGTTTG 4080
TGTTCTGGGC ACCACCGTGA CCCTGCCCAG ATGGCTATAA TATTATTTTA TACACAAACC 4140 
TTTTTTTTCA TAAATGTTAT AATTTTGTGT CTGTCTTTAT AAACTATTAT AAGTACTATT 4200 
TTTGTTATAA TTCAAAATAG ATATTTAGTA TAAAGTTTTT GCTGTTAAAT ATTTGTTATT 4260 
TAGTAAAATA TGAATTTTGC TCTATTGTAA ACATGGTTCA AAATATTAAT ATGTTTTTAT 4320 
CACAGTCGTT TTAATATTGA AAAAGCACTT GTGTGTTTTG TTTTGATATG AAACTGGTAC 4380
CGTGTGAGTG TTTTTGCTGT CGTGGTTTTA ATCTGTATAT AATATTCCAT GTTGCATATT 4440 
AAAAA 



Seq ID NO: B47 Protein sequence 
Protein Accession #: NP_063938.1

1 11 21 31 41 51 

| | | | | |

MLFEGLDLVS ALATLAACLV SVTLLLAVSQ QLWQLRWAAT RDKSCKLPIP KGSMGFPLIG 60 

ETGHWLLQGS GFQSSRREKY GNVFKTHLLG RPLIRVTGAE NVRKILMGEH HLVSTEWPRS 120

TRMLLGPNTV SNSIGDIHRH KRKVFSKIFS HEALESYLPK IQLVIQDTLR AWSSHPEAIN 180

VYQEAQKLTF RMAIRVLLGF SIPEEDLGHL FEVYQQFVDN VFSLPVDLPF SGYRRGIQAR 240

QILQKGLEKA IREKLQCTQG KDYLDALDLL IESSKEHGKE MTMQELKDGT LELIFAAYAT 300 

TASASTSLIM QLLKHPTVLE KLRDELRAHG ILHSGGCPCE GTLRLDTLSG LRYLDCVIKE 360

VMRLFTPISG GYRTVLQTFE LDGFQIPKGW SVMYSIRDTH DTAPVFKDVN VFDPDRFSQA 420

RSEDKDGRFH YLPFGGGVRT CLGKHLAKLF LKVLAVELAS TSRFELATRT FPRITLVPVL 480
HPVDGLSVKF FGLDSNQNEI LPETEAMLSA TV 

Seq ID NO: B48 DNA sequence
Nucleic Acid Accession #: AB040527.1; AL136582.1
Coding sequence: 94..2319

1 11 21 31 41 51 

GCGGCTGCGG GGTTCCTGGT GCTGAGGACG GACGCCATTG GAGTTCCCGA GAAGCATGGC 60

TGAGGGAAGC TTCAGCGTGC AATCGGAAAG CTACAGTGTT GAAGACATGG ATGAGGGTAG 120 

CGACGAAGTC GGGGAGGAAG AGATGGTTGA AGGCAACGAC TATGAAGAAT TCGGTGCGTT 180 

TGGTGGCTAT GGCACCCTCA CCAGCTTTGA CATCCATATC CTCAGAGCCT TCGGAAGCTT 240 

GGGTCCAGGC CTTCGCATCT TATCGAATGA GCCCTGGGAA CTGGAAAACC CTGTGCTGGC 300 

CCAGACCCTG GTGGAGGCAT TGCAGCTGGA TCCGGAAACA CTTGCCAATG AGACGGCCGC 360

CCGTGCTGCC AACGTAGCCC GCGCCGCCGC CTCCAACCGT GCGGCTCGGG CCGCTGCCGC 420 

CGCTGCCCGT ACCGCCTTCA GTCAGGTGGT CGCTAGCCAC CGGGTGGCCA CGCCGCAGGT 480 

CTCAGGAGAG GATACCCAGC CCACGACCTA CGCCGCCGAG GCTCAGGGGC CCACCCCTGA 540 

GCCACCCCTT GCTTCTCCGC AGACCTCCCA GATGTTAGTC ACCAGTAAGA TGGCTGCCCC 600 




CGAGGCTCCG GCAACCTCCG CACAGTCCCA GACAGGCTCC CCGGCCCAGG AGGCTGCTAC 660 

TGAGGGCCCT AGTAGCGCCT GTGCTTTCTC TCAGGCTCCG TGTGCCAGGG AGGTGGACGC 720 

CAACCGGCCC AGCACAGCCT TCCTGGGCCA GAATGATGTC TTCGATTTCA CTCAGCCGGC 780 

AGGTGTCAGT GGCATGGCCT TCCCGCGCCC CAAGAGACCT GCCCCAGCCC AAGAGGCTGC 840 
CACAGAGGGC CCCAGTGCTG CCTCTGGTGT GCCCCAGACG GGACCTGGCA GGGAGGTGGC 900 
AGCCACCCGG CCCAAGACCA CCAAGTCGGG GAAGGCGCTG GCCAAGACTC GGTGGGTGGA 960 

GCCTCAGAAT GTTGTGGCAG CAGCTGCTGC CAAGGCCAAG ATGGCCACGA GCATCCCTGA 1020 

GCCGGAGGGT GCAGCTGCTG CCACTGCTCA GCACAGTGCT GAGCCCTGGG CCAGGATGGG 1080 

AGGCAAGAGG ACCAAGAAGT CCAAGCACCT GGATGATGAG TATGAGAGCA GCGAGGAGGA 1140 

GAGAGAGACT CCCGCGGTCC CACCCACCTG GAGAGCATCA CAGCCCTCAT TGACGGTGCG 1200 

GGCTCAGTTG GCCCCTCGGC CCCCGATGGC CCCGAGGTCC CAGATACCCT CAAGGCACGT 1260 

ACTGTGCCTG CCCCCCCGCA ACGTGACCCT TCTGCAGGAG AGGGCAAATA AGTTGGTGAA 1320 

ATACCTGATG ATTAAGGACT ACAAGAAGAT CCCCATCAAG CGCGCAGACA TGCTGAAGGA 1380 

TGTCATCAGA GAATATGATG AACATTTCCC TGAGATCATT GAACGAGCAA CGTACACCCT 1440 

GGAAAAGAAG TTTGGGATCC ACCTGAAGGA GATCGACAAG GAAGAACACC TGTATATTCT 1500 

TGTCTGCACA CGGGACTCCT CAGCTCGCCT CCTTGGAAAA ACCAAGGACA CTCCCAGGCT 1560 

GAGTCTCCTC TTGGTGATTC TGGGCGTCAT CTTCATGAAT GGCAACCGTG CCAGCGAGGC 1620 

TGTCCTCTGG GAGGCACTAC GCAAGATGGG ACTGCGCCCT GGGGTGAGGC ACCCATTCCT 1680 

CGGCGATCTG AGGAAGCTCA TCACAGATGA CTTTGTGAAG CAGAAGTACC TGGAATACAA 1740 

GAAGATCCCC AACAGCAACC CACCTGAGTA TGAATTCCTC TGGGGCCTGC GAGCCCGCCA 1800 

TGAGACCAGC AAGATGAGGG TCCTGAGATT CATCGCCCAG AATCAGAACC GAGACCCCCG 1860 

GGAATGGAAG GCTCATTTCT TGGAGGCTGT GGATGATGCT TTCAAGACAA TGGATGTGGA 1920 

TATGGCCGAG GAACATGCCA GGGCCCAGAT GAGGGCCCAG ATGAATATCG GGGATGAAGC 1980 

GCTGATTGGA CGGTGGAGCT GGGATGACAT ACAAGTCGAG CTCCTGACCT GGGATGAGGA 2040 

CGGAGATTTT GGCGATGCCT GGGCCAGGAT CCCCTTTGCT TTCTGGGCCA GATACCATCA 2100 

GTACATTCTG AATAGCAACC GTGCCAACAG GAGGGCCACG TGGAGAGCTG GCGTCAGCAG 2160 

TGGCACCAAT GGAGGGGCCA GCACCAGCGT CCTAGATGGC CCCAGCACCA GCTCCACCAT 2220 

CCGGACCAGA AATGCTGCCA GAGCTGGCGC CAGCTTCTTC TCCTGGATCC AGCACCGTTG 2280 

ACGAACTGCA GCGATCTTAC TGGCCAAGCC AGAGCGCCTC CTCTCAGATT CCTTCTCGAC 2340 

ACAGCACCCT AGGCGGCTTC TTCCTGTCAG TCGGAGGTGG CATGCAAGAT GAAGCTCTCT 2400 
TTGCTCTTCC TGCTTTCATT TTGTGCTTTT CCTTGTGTTT TCATGTTTTG GGTATCAGTG 2460 
TTACATTAAA GTTGCAAAAT TAAA 

Seq ID NO: B49 Protein sequence 
Protein Accession #: BAB33378.1; CAB66517.1

1 11 21 31 41 51

| | | | | |

MAEGSFSVQS ESYSVEDMDE GSDEVGEEEM VEGNDYEEFG AFGGYGTLTS FDIHILRAFG 60
SLGPGLRILS NEPWELENPV LAQTLVEALQ LDPETLANET AARAANVARA AASNRAARAA 120
AAAARTAFSQ VVASHRVATP QVSGEDTQPT TYAAEAQGPT PEPPLASPQT SQMLVTSKMA 180
APEAPATSAQ SQTGSPAQEA ATEGPSSACA FSQAPCAREV DANRPSTAFL GQNDVFDFTQ 240
PAGVSGMAFP RPKRPAPAQE AATEGPSAAS GVPQTGPGRE VAATRPKTTK SGKALAKTRW 300
VEPQNVVAAA AAKAKMATSI PEPEGAAAAT AQHSAEPWAR MGGKRTKKSK HLDDEYESSE 360
EERETPAVPP TWRASQPSLT VRAQLAPRPP MAPRSQIPSR HVLCLPPRNV TLLQERANKL 420
VKYLMIKDYK KIPIKRADML KDVIREYDEH FPEIIERATY TLEKKFGIHL KEIDKEEHLY 480
ILVCTRDSSA RLLGKTKDTP RLSLLLVILG VIFMNGNRAS EAVLWEALRK MGLRPGVRHP 540
FLGDLRKLIT DDFVKQKYLE YKKIPNSNPP EYEFLWGLRA RHETSKMRVL RFIAQNQNRD 600
PREWKAHFLE AVDDAFKTMD VDMAEEHARA QMRAQMNIGD EALIGRWSWD DIQVELLTWD 660
EDGDFGDAWA RIPFAFWARY HQYILNSNRA NRRATWRAGV SSGTNGGAST SVLDGPSTSS 720
TIRTRNAARA GASFFSWIQH R



Seq ID NO: B50 DNA sequence 

Nucleic Acid Accession #: XM_084965.1 

Coding sequence: 356..2014

1 11 21 31 41 51

| | | | | |

GACCTAGCGT GTGCTCAGCT CTGGACAAGA CATGGATGTA GGCAGACATG CTTCCTGCCT 60 

GCTGGGCTTA GTGCACCAAT GCTAAGGACA GACATGAAGG GCTTGGGAAA GGCATGAAGA 120 

TGCAGGGAGT CTAGGAGAGA GGATAAGAAA AACAAGCATT TTGTGGTGGG TTGAATTGCG 180 

TTGTCGTTTT TAGAGGCATT TCGTGATCCA TGGTTGCGCT CCCGTGAAGA GCGAGTCTAG 240 

AGAAGGAATC CGAAGCGGCG GOGGCGAGCG CGGCTCAGGT TGATTTAGAA TACGGGTGAC 300 

AGTGGCCTGG GGCGAGCCCA CTGCTGACGA AAGCGGCTTA TCCCGCGCGG TTTCCATGGA 360 

GACGAGCCGG AGCCGCGGCG GCGGCGGGGC TGTCAGCGAG CGCGGCGGAG CTGGCGCGTC 420 

CGTGGGGGTC TGCAGGAGGA AGGCGGAGGC CGGGGCCGGG ACCGGGACCC TCGCGGCAGA 480 

CATGGACTTG CATTGTGACT GTGCCGCCGA AACGCCGGCC GCCGAGCCGC CGTCGGGGAA 540 

GATTAATAAA GCTGCCTTCA AATTATTCAA GAAGAGGAAA TCGGGTGGCA CCATGCCCAG 600 

CATTTTTGGG GTCAAAAACA AAGGGGACGG GAAAAGCTCG GGTCCGACGG GGCTGGTGAG 660 

GAGCAGGACC CACGACGGAC TTGCCGAGGT GCTGGTGCTG GAGAGCGGCA GGAAGGAGGA 720 

GCCGCGCGGC GGGGGCGACA GCGGCGGGGG CGGCGGGGGG CGGCCGAACC CGGGGCCCCC 780

CAGAGCCGCA GGGCCCGGCG GGGGCTCCCT CGCCAGCAGC TCGGTGGCCA AGTCGCACAG 840 

CTTCTTCTCG CTGCTGAAGA AGAACGGGCG CTCGGAAAAC GGCAAGGGAG AGCCTGTGGA 900 

CGCGAGCAAG GCCGGCGGCA AACAAAAGCG GGGGCTGCGG GGGCTGTTCA GCGGCATGCG 960 

CTGGCACAGG AAAGACAAGC GGGCCAAGGC GGAGGCCGCG GAGGGGCGCG CGCCCGGGGG 1020 

CGGCTTGATC CTACCCGGCT CGCTCACCGC CAGCCTGGAG TGCGTCAAGG AGGAGACGCC 1080 

CAGAGCCGCG CGCGAGCCGG AGGAGCCCAG CCAGGACGCC CCGCGAGACC CAGCAGGCTG 1140 

TGGAGATATT ATTGCAGACC AAGAGGAAGA GGCAGGTCCC AGCTGTGACA AGCATGTCCC 1200 

CGGGCCAGGC AAGCCGGCTC TGTCTAAAAA GAACCCCGGC GTGGTGGCCT ACCAAGGAGG 1260 

CGGGGAAGAG ATGGCCAGCC CGGACGAGGT GGACGACACC TATCTACAGG AGTTCTGGGA 1320 

CATGCTCTCC CAGACCGAGG AGCAGGGACC CGAGCCCCAG GAGGGCGCGG CTAAGGTGGC 1380 

AGCTGCGCTG GAAACCAAGG TGGTGCCCGA GACCCCCAAA GACACCAGGT GTGTGGAAGC 1440 

GGCCAAGGAC GCGTCCTCGG TCAAGCGCAG GAGGCTCAAC CGGATTCCCA TCGAGCCCCA 1500 

TCCTAAGGAG GAGCCCAAGC ACCCGGAGAA GGAGCAGCAG GAAGGCGTCC CCAACAGCGA 1560 

CGAGGGCTAC TGGGACTCCA CCACGCCAGG CCCAGAGGAA GACAGCTCGA GCAGCGGGAA 1620
GAAGGCGGGC ATCCCCCGGG ATAGCTACAG CGGGGACGCG CTCTATGATC TCTATGCTGA 1680
CCCGGACGGA AGTCCAGCAA CCCTTCCTGG AGGGAAGGAC AACGAGGAGA CGTCCTCCCT 1740
GTCCCGGTTA AAGCCCGTAT CTCCAGGCAC CATCACCTGT CCACTGCGAA CACCAGGCAG 1800
CTTGCTGAAG GACTCTAAGA TCCCTATTAG CATCAAGCAC CTGACCAACC TTCCATCTAG 1860
CCATCCCGTG GTGCACCAGC AACCCTCCAG GAGTGAGATG CCCAGAACAA AAATCCCGGT 1920
TTCCAAAGTG CTGGTCCGCA GAGTCAGCAA CCGGGGCTTG GCTGGGACCA CCATCAGAGC 1980
AACGGCCTGC CACGACAGTG CCAAAAAGTT GTGAGGTCTT CCAGGCCAAG GTGGATGGGC 2040
CCCATGCCAA GGAATACAAC TTTTCCCTGG AAACCACTAA AGTAAGTTTT GCTTTTCCTA 2100
AAGAAAGTCT TTTAGGACAC CACCCGTCCC CCGCCCTGCT CCAGAGCGTG GACCGAGGAG 2160
GTCTTTGTGC CCTGAGCAGG GACCGGATAA CACCAGAAAG AGGGATGCTA CACGGGGGTT 2220
TCTCCTCTCA AGATAAGTCC CTGAGAATTA TTTTCAAGCA CTTTTTTCTT TTTTACCTTT 2280
AAGTTTTTCT TCCTTTTGCT TTAATATACT GAACACTTGG AAGTCACCTT TACTTGCCTT 2340
TGCAGAAAAC AGAACTTAGC CAAACCTAAG TAAGAGTCAT GCCTGGATAT TGGGATAAGC 2400
CAGTGTCTAG AGGCCTGAAG GAACCGCTGA AGAACCAGAG GAGATCTCCT CTTCCAAGAC 2460
ACGTTTCCTT CCTTCCCTTC TTTCCTCTTT CCTTTCCAGT T



Seq ID NO: B51 Protein sequence 
Protein Accession #: XP_084965.1



1 11 21 31 41 51

| | | | | |

METSRSRGGG GAVSERGGAG ASVGVCRRKA EAGAGTGTLA ADMDLHCDCA AETPAAEPPS 60
GKINKAAFKL FKKRKSGGTM PSIFGVKNKG DGKSSGPTGL VRSRTHDGLA EVLVLESGRK 120
EEPRGGGDSG GGGGGRPNPG PPRAAGPGGG SLASSSVAKS HSFFSLLKKN GRSENGKGEP 180
VDASKAGGKQ KRGLRGLFSG MRWHRKDKRA KAEAAEGRAP GGGLILPGSL TASLECVKEE 240
TPRAAREPEE PSQDAPRDPA GCGDIIADQE EEAGPSCDKH VPGPGKPALS KKNPGVVAYQ 300
GGGEEMASPD EVDDTYLQEF WDMLSQTEEQ GPEPQEGAAK VAAALETKVV PETPKDTRCV 360
EAAKDASSVK RRRLNRIPIE PHPKEEPKHP EKEQQEGVPN SDEGYWDSTT PGPEEDSSSS 420
GKKAGIPRDS YSGDALYDLY ADPDGSPATL PGGKDNEETS SLSRLKPVSP GTITCPLRTP 480
GSLLKDSKIP ISIKHLTNLP SSHPVVHQQP SRSEMPRTKI PVSKVLVRRV SNRGLAGTTI 540
RATACHDSAK KL



Seq ID NO: B52 DNA sequence
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..2016



1 11 21 31 41 51 

I I I I I I 

ATGGAGACGA GCCGGAGCCG CGGCGGCGGC GGGGCTGTCA GCGAGCGCGG CGGAGCTGGC 60
GCGTCCGTGG GGGTCTGCAG GAGGAAGGCG GAGGCCGGGG CCGGGACCGG GACCCTCGCG 120
GCAGACATGG ACTTGCATTG TGACTGTGCC GCCGAAACGC CGGCCGCCGA GCCGCCGTCG 180
GGGAAGATTA ATAAAGCTGC CTTCAAATTA TTCAAGAAGA GGAAATCGGG TGGCACCATG 240
CCCAGCATTT TTGGGGTCAA AAACAAAGGG GACGGGAAAA GCTCGGGTCC GACGGGGCTG 300
GTGAGGAGCA GGACCCACGA CGGACTTGCC GAGGTGCTGG TGCTGGAGAG CGGCAGGAAG 360
GAGGAGCCGC GCGGCGGGGG CGACAGCGGC GGGGGCGGCG GGGGGCGGCC GAACCCGGGG 420
CCCCCCAGAG CCGCAGGGCC CGGCGGGGGC TCCCTCGCCA GCAGCTCGGT GGCCAAGTCG 480
CACAGCTTCT TCTCGCTGCT GAAGAAGAAC GGGCGCTCGG AAAACGGCAA GGGAGAGCCT 540
GTGGACGCGA GCAAGGCCGG CGGCAAACAA AAGCGGGGGC TGCGGGGGCT GTTCAGCGGC 600
ATGCGCTGGC ACAGGAAAGA CAAGCGGGCC AAGGCGGAGG CCGCGGAGGG GCGCGCGCCC 660
GGGGGCGGCT TGATCCTACC CGGCTCGCTC ACCGCCAGCC TGGAGTGCGT CAAGGAGGAG 720
ACGCCCAGAG CCGCGCGCGA GCCGGAGGAG CCCAGCCAGG ACGCCCCGCG AGACCCAGCA 780
GGTGAGCCCG CAGGGGGAGA GGAGGTGCCC GCCCCCGCCG ACCGCGCCCC AGCGCGGAGC 840
TGCCGAGAGG CAGAGGGCCT CGCGCACCCC GGCGACACCG GCGCCCGGGG AGAGGACGCC 900
GCGGGGCATC GGCGCGCCGA GCCGGGGCCC GGGGAGGTCC GCACGGCAGA GGACGCTTCC 960
AGGACGGGGG CCGTTCCCGT AAAGACGGTC CCCCTTGTCG ACTCCGAAGG CGGCAGCGGC 1020
GGGGCGCCOG CCGCCCCAGA CCCTGCCTCT GTCGATCCAC CCTCAGACCC GTCGGCAGAT 1080
CGTATTTGTT TGATGTTTTC TGACGTGACT TCACTGAAAA GCTTTGACTC TCTTACAGGC 1140
TGTGGAGATA TTATTGCAGA CCAAGAGGAA GAGGCAGGTC CCAGCTGTGA CAAGCATGTC 1200
CCCGGGCCAG GCAAGCCGGC TCTGTCTAAA AAGAACCCCG GCGTGGTGGC CTACCAAGGA 1260
GGGGGGGAAG AGATGGCCAG CCCGGACGAG GTGGACGACA CCTATCTACA GGAGTTCTGG 1320
GACATGCTCT CCCAGACCGA GGAGCAGGGA CCCGAGCCCC AGGAGGGCGC GGCTAAGGTG 1380
GCAGCTGCGC TGGAAACCAA GGTGGTGCCC GAGACCCCCA AAGACACCAG GTGTGTGGAA 1440
GCGGCCAAGG ACGCGTCCTC GGTCAAGCGC AGGAGGCTCA ACCGGATTCC CATCGAGCCC 1500
CATCCTAAGG AGGAGCCCAA GCACCCGGAG AAGGAGCAGC AGGAAGGCGT CCCCAACAGC 1560
GACGAGGGCT ACTGGGACTC CACCACGCCA GGCCCAGAGG AAGACAGCTC GAGCAGCGGG 1620
AAGAAGGCGG GCATCCCCCG GGATAGCTAC AGCGGGGACG CGCTCTATGA TCTCTATGCT 1680
GACCCGGACG GAAGTCCAGC AACCCTTCCT GGAGGGAAGG ACAACGAGGA GACGTCCTCC 1740
CTGTCCCGGT TAAAGCCCGT ATCTCCAGGC ACCATCACCT GTCCACTGCG AACACCAGGC 1800
AGCTTGCTGA AGGACTCTAA GATCCCTATT AGCATCAAGC ACCTGACCAA CCTTCCATCT 1860
AGCCATCCCG TGGTGCACCA GCAACCCTCC AGGAGTGAGA TGCCCAGAAC AAAAATCCCG 1920
GTTTCCAAAG TGCTGGTCCG CAGAGTCAGC AACCGGGGCT TGGCTGGGAC CACCATCAGA 1980
GCAACGGCCT GCCACGACAG TGCCAAAAAG TTGTGA

Seq ID NO: B53 Protein sequence
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

| | | | | |

METSRSRGGG GAVSERGGAG ASVGVCRRKA EAGAGTGTLA ADMDLHCDCA AETPAAEPPS 60
GKINKAAFKL FKKRKSGGTM PSIFGVKNKG DGKSSGPTGL VRSRTHDGLA EVLVLESGRK 120
EEPRGGGDSG GGGGGRPNPG PPRAAGPGGG SLASSSVAKS HSFFSLLKKN GRSENGKGEP 180
VDASKAGGKQ KRGLRGLFSG MRWHRKDKRA KAEAAEGRAP GGGLILPGSL TASLECVKEE 240
TPRAAREPEE PSQDAPRDPA GEPAGGEEVP APADRAPARS CREAEGLAHP GDTGARGEDA 300
AGHRRAEPGP GEVRTAEDAS RTGAVPVKTV PLVDSEGGSG RAPAAPDPAS VDPPSDPSAD 360
RICLMFSDVT SLKSFDSLTG CGDIIADQEE EAGPSCDKHV PGPGKPALSK KNPGVVAYQG 420
GGEEMASPDE VDDTYLQEFW DMLSQTEEQG PEPQEGAAKV AAALETKVVP ETPKDTRCVE 480
AAKDASSVKR RRLNRIPIEP HPKEEPKHPE KEQQEGVPNS DEGYWDSTTP GPEEDSSSSG 540
KKAGIPRDSY SGDALYDLYA DPDGSPATLP GGKDNEETSS LSRLKPVSPG TITCPLRTPG 600
SLLKDSKIPI SIKHLTNLPS SHPVVHQQPS RSEMPRTKIP VSKVLVRRVS NRGLAGTTIR 660
ATACHDSAKK L



Seq ID NO: B54 DNA sequence 

Nucleic Acid Accession #: NM_014138.1

Coding sequence: 60..854



1 11 21 31 41 51

| | | | | |

CTGCAGAGAC TTCCCAGGAA GGTCCAGCGC CCTCTCAGCC TTGGTACTCA GAACAGCCGA 60
TGATGGGCCT CAGTAACCTG AGCCCCGGTC CTGGCCCCAG CCAGGCCGTG CCTCTCCCAG 120
AGGGGCTGCT CCGCCAGCGG TACAGAGAGG AGAAGACCCT GGAAGAGCGG CGGTGGGAGA 180
GGCTGGAGTT CCTTCAGAGG AAGAAAGCAT TCCTGCGGCA TGTGAGGAGG AGACACCGCG 240
ATCACATGGC CCCCTATGCT GTTGGGAGGG AAGCCAGAAT CTCCCCATTA GGTGACAGAA 300
GTCAGAATCG ATTCCGATGT GAATGTCGAT ACTGCCAGAG CCACAGGCCG AATCTTTCTG 360
GGATCCCTGG GGAGAGTAAC AGGGCCCCAC ATCCCTCCTC CTGGGAGACG CTGGTGCAGG 420
GCCTCAGTGG CTTGACTCTC AGCCTAGGCA CCAACCAGCC CGGGCCTCTG CCTGAAGCGG 480
CACTCCAGCC ACAGGAGACA GAGGAGAAGC GCCAGCGAGA GAGGCAGCAG GAGAGCAAAA 540
TAATGTTTCA GAGGCTGCTC AAGCAGTGGT TAGAGGAAAA CTGAGACGTG CACCCCCATG 600
GGATGGAGAC CCGAAGGGAC TCAGACGGAG CCGCCGTGTT GGCAGCGCCT GGGTGTGGGC 660
CCATTTTGGG GACGAAACAG CAAGCTGTGG TCGGATGAGT GCCAGGACCT GTGTACCGGG 720
ACACGTGGGA GTCCTCCCAG CATGATGCTT GACTGACCCG AGGAAGGTCC TCATGTTTCG 780
TGCCTGTCAT TCTCGGATGG CTGTGAGGCA TTCCTTGGCA AGGGACGCTG CGTACCAGCG 840
GTCCTCACCG CATCTCACAT GGCTCCTGTG ATGCATGTTG TCGCTTTCCC ACCCGGGATC 900
TCCATCTCTC TTCCCTTCCT GCTGTCAGTA AGAGATCACA TGTCTGTGTA GTGTGAATGC 960
CTTGTCGCTG TCCTGTGCTT TTGCACCATT GAGTTGACTG CCTCTGAGAA GCAGCACTAG 1020
GCCTGTTGAA ATGCAATGTG CTGCCCTGAG ATCCAGTTTC AAGAATGGGC AGGTAAACGC 1080
AGTGTGGGAA AGGAATGTGG AATGAGAACT TGGTGGTTCA CCGCTGTACT ATTTGTGTAA 1140
ATGTTTACGT ATGTGATAAG CTACATGTAT GTAAATGTTG CAATACCCCT AACAGTCGAG 1200
TAGTAGTCTC CCTTACAGGA ATTTTTGACG GGGTTCCTCA TCATCAATAC CAAATAAATA 1260
TATGTAGGAA TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1320
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1380
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA



Seq ID NO: B55 Protein sequence 
Protein Accession #: NP_054857.1 



1 11 21 31 41 51

| | | | | |

MMGLSNLSPG PGPSQAVPLP EGLLRQRYRE EKTLEERRWE RLEFLQRKKA FLRHVRRRHR 60
DHMAPYAVGR EARISPLGDR SQNRFRCECR YCQSHRPNLS GIPGESNRAP HPSSWETLVQ 120
GLSGLTLSLG TNQPGPLPEA ALQPQETEEK RQRERQQESK IMFQRLLKQW LEEN



Seq ID NO: B56 DNA sequence 

Nucleic Acid Accession #: NM_000025.1 

Coding sequence: 198..1424 

1 11 21 31 41 51 

GCTACTCCTC CCCCAAGAGC GGTGGCACCG AGGGAGTTGG GGTGGGGGGA GGCTGAGCGC 
TCTGGCTGGG ACAGCTAGAG AAGATGGCCC AGGCTGGGGA AGTOGCTCTC ATGCCXTGCT 
GTCCCCTCCC CTGAGCCAGG TGATTTGGGA GACCCCCTCC TTCCTTCTTT CCCTACCGCC 
CCACGCGCGA CCCGGGGATG GCTCCGTGGC CTCACGAGAA CAGCTCTCTT GCCCCATGGC 
CGGACCTCCC CACCCTGGCG CCCAATACCG CCAACACCAG TGGGCTGCCA GGGGTTCCGT 
GGGAGGCGGC CCTAGCCGGG GCCCTGCTGG CGCTGGCGGT GCTGGCCACC GTGGGAGGCA 
ACCTGCTGGT CATCGTGGCC ATCGCCTGGA CTCCGAGACT CCAGACCATG ACCAACGTGT 
TCGTGACTTC GCTGGCCGCA GCCGACCTGG TGATGGGACT CCTGGTGGTG CCGCCGGCGG 
CCACCTTGGC GCTGACTGGC CACTGGCCGT TGGGCGCCAC TGGCTGCGAG CTGTGGACCT 
CGGTGGACGT GCTGTGTGTG ACCGCCAGCA TCGAAACCCT GTGCGCCCTG GCCGTGGACC 
GCTACCTGGC TGTGACCAAC CCGCTGCGTT ACGGCGCACT GGTCACCAAG CGCTGCGCCC 
GGACAGCTGT GGTCCTGGTG TGGGTCGTGT CGGCCGCGGT GTCGTTTGCG CCCATCATGA 
GCCAGTGGTG GCGCGTAGGG GCCGACGCCG AGGCGCAGCG CTGCCACTCC AACCCGCGCT 
GCTGTGCCTT CGCCTCCAAC ATGCCCTACG TGCTGCTGTC CTCCTCCGTC TCCTTCTACC 
TTCCTCTTCT CGTGATGCTC TTCGTCTACG CGCGGGTTTT CGTGGTGGCT ACGCGCCAGC 
TGCGCTTGCT GCGCGGGGAG CTGGGCCGCT TTCCGCCCGA GGAGTCTCCG CCGGCGCCGT 
CGCGCTCTCT GGCCCCGGCC CCGGTGGGGA CGTGCGCTCC GCCCGAAGGG GTGCCCGCCT 
GCGGCCGGCG GCCCGCGCGC CTCCTGCCTC TCCGGGAACA CCGGGCCCTG TGCACCTTGG 
GTCTCATCAT GGGCACCTTC ACTCTCTGCT GGTTGCCCTT CTTTCTGGCC AACGTGCTGC 
GCGCCCTGGG GGGCCCCTCT CTAGTCCCGG GCCCGGCTTT CCTTGCCCTG AACTGGCTAG 
GTTATGCCAA TTCTGCCTTC AACCCGCTCA TCTACTGCCG CAGCCCGGAC TTTCGCAGCG 
CCTTCCGCCG TCTTCTGTGC CGCTGCGGCC GTCGCCTGCC TCCGGAGCCC TGCGCCGCCG 
CCCGCCCGGC CCTCTTCCCC TCGGGCGTTC CTGCGGCCCG GAGCAGCCCA GCGCAGCCCA 
GGCTTTGCCA ACGGCTCGAC GGGGCTTCTT GGGGAGTTTC TTAGGCCTGA AGGACAAGAA 
GCAACAACTC TGTTGATCAG AACCTGTGGA AAACCTCTGG CCTCTGTTCA GAATGAGTCC 
CATGGGATTC CCCGGCTGTG ACACTCTACC CTCCAGAACC TGACGACTGG GCCATGTGAC 
CCAAGGAGGG ATCCTTACCA AGTGGGTTTT CACCATCCTC TTGCTCTCTG TCTGAGAGAT 
GTTTTCTAAA CCCCAGCCTT GAACTTCACT CCTCCCTCAG TGGTAGTGTC CAGGTGCCGT 
GGAGCAGCAG GCTGGCTTTG GTAGGGGCAC CCATCACCCG GCTTGCCTGT GCAGTCAGTG 
AGTGCTTAGG GCAAAGAGAG CTCCCCTGGT TCCATTCCTT CTGCCACCCA AACCCTGATG 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
900 
1020 
1620 

AGACCTTAGT GTTCTCCAGG CTCTGTGGCC CAGGCTGAGA GCAGCAGGGT AGAAAAGACC 1860
AAGATTTGGG GTTTTATCTC TGGTTCCCTT ATTACTGCTC TCAAGCAGTG GCCTCTCTCA 1920
CTTTAGCCAT GGAATGGCTC CGATCTACCT CACAGCAGTG TCAGAAGGAC TTCGCCAGGG 1980
TTTTGGGAGC TCCAGGGTTC ATAAGAAGGT GAACCATTAG AACAGATCCC TTCTTTTCCT 2040
TTTGCAATCA GATAAATAAA TATCACTGAA TGCAGTTCAT CCTCGGCCCA CTTTCCCTCC 2100
GTTTGTTTTC TTTTCATAAT CCACTTACTC CCTTCCCTTC TACTCTGCGC TGGCTTTTGA 2160
CAGAGGCAGT AAATTAGGCC TAATCCTCAC TCTTTTCTTC CTAATCTTCA TCAAACAAAA 2220
AATGAAAAGT CTGTCTGGAC GAAGGGGAGT GAGCTTGAGC CTTTGATATC TTGCTCCCCC 2280
ACCCTTCCTG AAACTCTTGA AATCCAGTTG CCATTGAGTA GCAAAGCCAC GCTCCCCACA 2340
GGACTTGGAC AGAGGGCCCA CAGGGGGATG GGCTGGCTGT GGCCAGGTTT AGGGCAGGGG 2400
GCATTTGTCC CCTCCATGCT ATAATCCAGT GGTGCCTTAC ATGGTGTGTG TGTGTGTGTG 2460
TGCGTGTGTG TGTGTGTGTG TGTGTCTGGA GGCACAGGCA CAAAGCATTG CTTGGGTTGG 2520
TCAAATGTCT TGTGTCATAA ATATATTCTG ATGTTTCCCA GCCTTTCCAC AACCTCTACC 2580
TTCCCACTCA CCTTCCCCAG CTACAAAAAT CTGTATTATC CTCTTAAAGT AAAACTGGAG 2640
TTAC

Seq ID NO: B57 Protein sequence 
Protein Accession #: NP_000016.1 

1 11 21 31 41 51

| | | | | |

MAPWPHENSS LAPWPDLPTL APNTANTSGL PGVPWEAALA GALLALAVLA TVGGNLLVIV 60
AIAWTPRLQT MTNVFVTSLA AADLVMGLLV VPPAATLALT GHWPLGATGC ELWTSVDVLC 120
VTASIETLCA LAVDRYLAVT NPLRYGALVT KRCARTAVVL VWVVSAAVSF APIMSQWWRV 180
GADAEAQRCH SNPRCCAFAS NMPYVLLSSS VSFYLPLLVM LFVYARVFVV ATRQLRLLRG 240
ELGRFPPEES PPAPSRSLAP APVGTCAPPE GVPACGRRPA RLLPLREHRA LCTLGLIMGT 300
FTLCWLPFFL ANVLRALGGP SLVPGPAFLA LNWLGYANSA FNPLIYCRSP DFRSAFRRLL 360
CRCGRRLPPE PCAAARPALF PSGVPAARSS PAQPRLCQRL DGASWGVS



Seq ID NO: B58 DNA sequence 

Nucleic Acid Accession #: NM_032553.1 

Coding sequence: 37..1038 



1 11 21 31 41 51

| | | | | |

CACCATTAGG CAAAGATAGT TTCTCTAGAG AGAATCATGC CTGCTAATTA CACGTGTACC 60
AGGCCAGATG GAGACAATAC AGATTTTCGA TACTTTATTT ATGCAGTGAC ATACACTGTC 120
ATTCTTGTGC CAGGTCTCAT AGGGAATATA TTAGCCCTGT GGGTATTCTA TGGTTATATG 180
AAAGAAACAA AACGAGCTGT GATATTTATG ATAAACTTAG CCATTGCTGA CTTACTACAA 240
GTTCTTTCCT TGCCACTGAG GATCTTCTAC TACTTGAATC ATGACTGGCC ATTTGGGCCT 300
GGTCTCTGCA TGTTCTGTTT CTACCTGAAG TATGTCAACA TGTATGCAAG CATCTACTTC 360
TTGGTCTGCA TCAGTGTGCG ACGATTTTGG TTTCTCATGT ACCCCTTTCG CTTCCATGAC 420
TGCAAACAGA AATATGACCT GTACATCAGC ATTGCTGGCT GGCTGATCAT CTGCCTTGCC 480
TGTGTACTCT TTCCACTCCT CAGAACCAGT GATGATACCT CTGGCAATAG GACCAAATGC 540
TTTGTGGATC TTCCTACCAG GAATGTCAAC CTGGCCCAGT CCGTTGTTAT GATGACCATT 600
GGCGAGTTGA TTGGGTTTGT AACTCCGCTT CTGATTGTCC TATATTGTAC CTGGAAGACG 660
GTTTTATCAC TGCAAGATAA ATATCCCATG GCCCAAGATC TTGGAGAGAA ACAGAAAGCC 720
TTGAAGATGA TTCTAACCTG TGCAGGGGTA TTCCTAATTT GCTTTGCACC TTATCATTTC 780
AGTTTTCCTT TAGATTTCCT GGTGAAGTCC AATGAAATTA AAAGCTGCCT AGCCAGAAGG 840
GTGATTCTAA TATTTCATTC TGTGGCATTG TGTCTTGCTA GTCTGAATTC ATGTCTTGAC 900
CCAGTCATAT ACTACTTTTC CACTAATGAG TTCCGAAGAC GGCTTTCAAG ACAAGATTTG 960
CATGACAGCA TCCAACTCCA TGCAAAATCC TTTGTGAGTA ACCATACAGC TTCCACCATG 1020
ACACCTGAAT TATGCTAAAA CAAAAAACCA AACTGAATGT GACCTGAAAT GCAAGTACAT 1080
CAGAACATAT CTGCAATACC CAAGCCACAG GGAAGAACTT GCAAAACAAC ACAGCTTTTC 1140
AGTTCTGCTC TATCTTACTG CTATGGGGAA TTCACTTCTT CAAAGCAGGA CCTATTTGGA 1200
GCATTACGAT CCACGATTAT TGATGTTGAC ATGTCCATGT AGTAATTTTT CTTCAAGT

Seq ID NO: B59 Protein sequence 
Protein Accession #: NP_115942.1 

1 11 21 31 41 51

| | | | | |

MPANYTCTRP DGDNTDFRYF IYAVTYTVIL VPGLIGNILA LWVFYGYMKE TKRAVIFMIN 60
LAIADLLQVL SLPLRIFYYL NHDWPFGPGL CMFCFYLKYV NMYASIYFLV CISVRRFWFL 120
MYPFRFHDCK QKYDLYISIA GWLIICLACV LFPLLRTSDD TSGNRTKCFV DLPTRNVNLA 180
QSVVMMTIGE LIGFVTPLLI VLYCTWKTVL SLQDKYPMAQ DLGEKQKALK MILTCAGVFL 240
ICFAPYHFSF PLDFLVKSNE IKSCIARRVI LIFHSVALCL ASLNSCLDPV IYYFSTNEFR 300
RRLSRQDLHD SIQLHAKSFV SNHTASTMTP ELC

Seq ID NO: B60 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51

| | | | | |

GATTCGGATT TTAATCCGAC TCACTATAGG GAATTTTGGC CCTCGAGGCC AAGAATTCGG 60
CCCGAGGGTT AGACATTTAA CTTGAGTCTT TTCATATCAC GGTATGCTTT TATTTTACCC 120
TCATATATAA TTAATTTTCA TCTGGGCATA TATTTCCAGA TTAGAAATCA TTTCTTTTCA 180
TACATTTTAA GGAATTTATA CATTTGCATC TGAAATTGTA TTCATTATCT TCTAGAAACC 240
AAGAGTGGAC TTGGTATCCG TCTGATTATA TTTTCTTTGT TGGTAAAGAG CCTCTTTTTT 300
CCCTTTTTAT TTTCATCTTC TCTTTCTCTA GTGTTTATAA TCTTATTTTT ATGAATGTAG 360
TCTAATTATT TAGAACACTG TCTCCGAGTA TGTGTTTATT TGGGTTTGTG TTTTGAACAT 420
TGCCTCCAGG TGCGAGTTTA TGTGTGTTTA TGTTTCCTAT TTGAGGTTGG ACACTTAGAA 480
AATTTTCTCA ATTCGAAAAC TATTTTCCTT CAGCACTCAG AAATTTTTAT CAAAGATTAT 540
TTCCACGATT ATTTCCTGTC CTTGGTTTTT CGTGTTCTTA ACCTAGAATT CAATTGTTTA 600






PCT/US02/36810 



ATGTTAGACC TTACACATCT TCAATGTATC ATATATATTT CACTCATATT ATTTATTGTC 660
ATATCTTTTG GACTATTGTG TGAAAGCAAT AAATGTATAT TACTTTAAAA AAAAAAAAAA 720
AAAACTHCAT TTTCAAATTT AATACACACA AATTGTGCAT ACCCACACAC ATATACCACA 780
GATATATTCA TTTTCAGGAT TTCTTTAAGT GTTATTTTAA AAATAATCAT ATTCTTATTT 840
TGTGAACTGT GGGATTTCTT GAAACTTTTT GAAAACTCTC ATTAGTCTGT GTTTGTGTGG 900
TTTCTTTAAA TTATCTCTTC CTTCAGAATT ACTTTTGTGT TTTGAATTTG AGTCTTTGCC 960
ATTCATAAGG CGGGCTTTCC TTGAATACCA GATAACTCTG AGTTTCCTGT TCATATTTAA 1020
GAATAAATAA CCTAATTTGA AAAAAAAAAA



Seq ID NO: B61 DNA sequence 

Nucleic Acid Accession #: NM_014522.1 

Coding sequence: 846..3911 

1 11 21 31 41 51 

CTGGTGGTCC AGTACCTCCA AAGATATGGA ATACACTCCT GAAATATCCT GAAAACTTTT 
TTTTTTCAGA ATCCTTTAAT AAGCAGTTAT GTCAATCTGA AAGTTGCTTA CTTGTACTTT 
ATATTAATAG CTATTCTTGT TTTTCTTATC CAAAGAAAAA TCCTCTAATC CCCTTTTCAC 
ATGATAGTTG TTACCATGTT TAGGCATTAG TCACATCAAC CCCTCTCCTC TCCCAAACTT 
CTCTTCTTCA AATCAAACTT TATTAGTCCC TCCTTTATAA TGATTCCTTG CCTCGTTTTA 
TCCAGATCAA TTTTTTTTCA CTTTGATGCC CAGAGCTGAA GAAATGGACT ACTGTATAAA 
TTATTCATTG CCAAGAGAAT AATTGCATTT TAAACCCATA TTATAACAAA GAATAATGAT 
TATATTTTGT GATTTGTAAC AAATACCCTT TATTTTCCCT TAACTATTGA ATTAAATATT 
TTAATTATTT GTATTCTCTT TAACTATCTT GGTATATTAA AGTATTATCT TTTATATATT 
TATCAATGGT GGACACTTTT ATAGGTACTC TGTGTCATTT TTGATACTGT AGGTATCTTA 
TTTCATTTAT CTTTATTCTT AATGTACGAA TTCATAATAT TTGATTCAGA ACAAATTTAT 
CACTAATTAA CAGAGTGTCA ATTATGCTAA CATCTCATTT ACTGATTTTA ATTTAAAACA 
GTTTTTGTTA ACATGCATGT TTAGGGTTGG CTTCTTAATA ATTTCTTCTT CCTCTTCTCT 
CTCTCCTCTT CTTTTGGTCA GTGTTGTGCG GGTTAATACA ACAAACTGTA ACAAGTGTAC 
CTGGTATGGA CTTGTTGTCC GGGACGTACA TTTTCGCGGT CCTGCTAGCA TGCGTGGTGT 
TCCACTCTGG CGCCCAGGAG AAAAACTACA CCATCCGAGA AGAAATGCCA GAAAACGTCC 
TGATAGGCGA CTTGTTGAAA GACCTTAACT TGTCGCTGAT TCCAAACAAG TCCTTGACAA 
CTGCTATGCA GTTCAAGCTA GTGTACAAGA CCGGAGATGT GCCACTGATT CGAATTGAAG 
AGGATACTGG TGAGATCTTC ACTACTGGCG CTCGCATTGA TCGTGAGAAA TTATGTGCTG 
GTATCCCAAG GGATGAGCAT TGCTTTTATG AAGTGGAGGT TGCCATTTTG CCGGATGAAA 
TATTTAGACT GGTTAAGATA CGTTTTCTGA TAGAAGATAT AAATGATAAT GCACCATTGT 
TCCCAGCAAC AGTTATCAAC ATATCAATTC CAGAGAACTC GGCTATAAAC TCTAAATATA 
CTCTCCCAGC GGCTGTTGAT CCTGACGTAG GAATAAACGG AGTTCAAAAC TACGAACTAA 
TTAAGAGTCA AAACATTTTT GGCCTCGATG TCATTGAAAC ACCAGAAGGA GACAAGATGC 
CACAACTGAT TGTTCAAAAG GAGTTAGATA GGGAAGAGAA GGATACCTAC GTGATGAAAG 
TAAAGGTTGA AGATGGTGGC TTTCCTCAAA GATCCAGTAC TGCTATTTTG CAAGTGAGTG 
TTACTGATAC AAATGACAAC CACCCAGTCT TTAAGGAGAC AGAGATTGAA GTCAGTATAC 
CAGAAAATGC TCCTGTAGGC ACTTCAGTGA CACAGCTCCA TGCCACAGAT GCTGACATAG 
GTGAAAATGC CAAGATCCAC TTCTCTTTCA GCAATCTAGT CTCCAACATT GCCAGGAGAT 
TATTTCACCT CAATGCCACC ACTGGACTTA TCACAATCAA AGAACCACTG GATAGGGAAG 
AAACACCAAA CCACAAGTTA CTGGTTTTGG CAAGTGATGG TGGATTGATG CCAGCAAGAG 
CAATGGTGCT GGTAAATGTT ACAGATGTCA ATGATAATGT CCCATCCATT GACATAAGAT 
ACATCGTCAA TCCTGTCAAT GACACAGTTG TTCTTTCAGA AAATATTCCA CTCAACACCA 
AAATTGCTCT CATAACTGTG ACGGATAAGG ATGCGGACCA TAATGGCAGG GTGACATGCT 
TCACAGATCA TGAAATCCCT TTCAGATTAA GGCCAGTATT CAGTAATCAG TTCCTCCTGG 
AGACTGCAGC ATATCTTGAC TATGAGTCCA CAAAAGAATA TGCCATTAAA TTACTGGCTG 
CAGATGCTGG CAAACCTCCT TTGAATCAGT CAGCAATGCT CTTCATCAAA GTGAAAGATG 
AAAATGACAA TGCTCCAGTT TTCACCCAGT CTTTCGTAAC TGTTTCTATT CCTGAGAATA 
ACTCTCCTGG CATCCAGTTG ACGAAAGTAA GTGCAATGGA TGCAGACAGT GGGCCTAATG 
CTAAGATCAA TTACCTGCTA GGCCCTGATG CTCCACCTGA ATTCAGCCTG GATTGTCGTA 
CAGGCATGCT GACTGTAGTG AAGAAACTAG ATAGAGAAAA AGAGGATAAA TATTTATTCA 
CAATTCTGGC AAAAGATAAC GGGGTACCAC CCTTAACCAG CAATGTCACA GTCTTTGTAA 
GCATTATTGA TCAGAATGAC AATAGCCCAG TTTTCACTCA CAATGAATAC AACTTCTATG 
TCCCAGAAAA CCTTCCAAGG CATGGTACAG TAGGACTAAT CACTGTAACT GATCCTGATT 
ATGGAGACAA TTCTGCAGTT ACGCTCTCCA TTTTAGATGA GAATGATGAC TTCACCATTG 
ATTCACAAAC TGGTGTCATC CGACCAAATA TTTCATTTGA TAGAGAAAAA CAAGAATCTT 
ACACTTTCTA TGTAAAGGCT GAGGATGGTG GTAGAGTATC ACGTTCTTCA AGTGCCAAAG 
TAACCATAAA TGTGGTTGAT GTCAATGACA ACAAACCAGT TTTCATTGTC CCTCCTTCCA 
ACTGTTCTTA TGAATTGGTT CTACCGTCCA CTAATCCAGG CACAGTGGTC TTTCAGGTAA 
TTGCTGTTGA CAATGACACT GGCATGAATG CAGAGGTTCG TTACAGCATT GTAGGAGGAA 
ACACAAGAGA TCTGTTTGCA ATCGACCAAG AAACAGGCAA CATAACATTG ATGGAGAAAT 
GTGATGTTAC AGACCTTGGT TTACACAGAG TGTTGGTCAA AGCTAATGAC TTAGGACAGC 
CTGATTCTCT CTTCAGTGTT GTAATTGTCA ATCTGTTCGT GAATGAGTCG GTGACCAATG 
CTACACTGAT TAATGAACTG GTGCGCAAAA GCACTGAAGC ACCAGTGACC CCAAATACTG 
AGATAGCTGA TGTATCCTCA CCAACTAGTG ACTATGTCAA GATCCTGGTT GCAGCTGTTG 
CTGGCACCAT AACTGTCGTT GTAGTTATTT TCATCACTGC TGTAGTAAGA TGTCGCCAGG 
CACCACACCT TAAGGCTGCT CAGAAAAACA AGCAGAATTC TGAATGGGCT ACCCCAAACC 
CAGAAAACAG GCAGATGATA ATGATGAAGA AAAAGAAAAA GAAGAAGAAG CATTCCCCTA 
AGAACTTGCT GCTTAATTTT GTCACTATTG AAGAAACTAA GGCAGATGAT GTTGACAGTG 
ATGGAAACAG AGTCACACTA GACCTTCCTA TTGATCTAGA AGAGCAAACA ATGGGAAAGT 
ACAATTGGGT AACTACACCT ACTACTTTCA AGCCCGACAG CCCTGATTTG GCCCGACACT 
ACAAATCTGC CTCTCCACAG CCTGCCTTCC AAATTCAGCC TGAAACTCCC CTGAATTCGA 
AGCACCACAT CATCCAAGAA CTGCCTCTCG ATAACACCTT TGTGGCCTGT GACTCTATCT 
CCAAGTGTTC CTCAAGCAGT TCAGATCCCT ACAGCGTTTC TGACTGTGGC TATCCAGTGA 
CGACCTTCGA GGTACCTGTG TCCGTACACA CCAGACCGGT AGGTATCCAA GTTTCTAACA 
CAACTTTCTA ACTATTTTTT TATTATTATT TTCAGTTGAT GTAGAACTTT ACAAAATCTA 
TTGACTTCAA AGAGGGATCA AAACAATCAT ATTCTACAGA TGTACCCAAT AGATATATGG 
ATTCAATTAA GTTTGGTAGA AGATGAGAAC AAAATAACTA CTGATTTAGG AAAATTGGAT 
GCAGAATAAT AATTATAGTA GGGGCAATTT TGTCTGTAGA TGGCAGTATG ACAATTCTTG 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 






CTAGAGAATA TATTGAAAAA AACTTCAACA CAAAGGGTTG TAGCACTGTC CTCAGTACCA 4200 

TTGTGTGCAT GAGGATCAGA ATAGTCTGGG CTAGATACAT CACATTAAAG CTTTTCAGAA 4260 

TCTGATAAAT AGCTCTAAAT ACTAATGATA TTGAGAAGCC TAGCTTCACT TGGGAAAATC 4320 

TGTGGCTGTT CACAGAAATT CAGCACCAAG TTATTCCCCC CATACTCTAC CAGGCCTTCA 4380 

GGTCCTCATA AAGAAAAGTG TCGTTTTCAG ATTAGGAACT CAAAATTATT TTGGTGCATC 4440 

AAATCTACAG TCACACAATA TAACAAGAAT GGGATTAGAA AAATGAAAGC CTACTCATTC 4500 

TCATCTTTAA GCCAGAGAAT GAAATATATA TGAGGTCTCT GGATAGCTAT TTAAATATTT 4560 
GCATATTTAT GCAAGGTATT TTGAGCCCTT CAGAAGACAT TCT 

Seq ID NO: B62 Protein sequence 
Protein Accession #: NP_055337.1 

1 11 21 31 41 51 

| | | | | | 

MDLLSGTYIF AVLLACVVFH SGAQEKNYTI REEMPENVLI GDLLKDLNLS LIPNKSLTTA 60 
MQFKLVYKTG DVPLIRIEED TGEIFTTGAR IDREKLCAGI PRDEHCFYEV EVAILPDEIF 120 
RLVKIRFLIE DINDNAPLFP ATVINISIPE NSAINSKYTL PAAVDPDVGI NGVQNYELIK 180 
SQNIFGLDVI ETPEGDKMPQ LIVQKELDRE EKDTYVMKVK VEDGGFPQRS STAILQVSVT 240 
DTNDNHPVFK ETEIEVSIPE NAPVGTSVTQ LHATDADIGE NAKIHFSFSN LVSNIARRLF 300 
HLNATTGLIT IKEPLDREET PNHKLLVLAS DGGLMPARAM VLVNVTDVND NVPSIDIRYI 360 
VNPVNDTVVL SENIPLNTKI ALITVTDKDA DHNGRVTCFT DHEIPFRLRP VFSNQFLLET 420 
AAYLDYESTK EYAIKLLAAD AGKPPLNQSA MLFIKVKDEN DNAPVFTQSF VTVSIPENNS 480 
PGIQLTKVSA MDADSGPNAK INYLLGPDAP PEFSLDCRTG MLTVVKKLDR EKEDKYLFTI 540 
LAKDNGVPPL TSNVTVFVSI IDQNDNSPVF THNEYNFYVP ENLPRHGTVG LITVTDPDYG 600 
DNSAVTLSIL DENDDFTIDS QTGVIRPNIS FDREKQESYT FYVKAEDGGR VSRSSSAKVT 660 
INVVDVNDNK PVFIVPPSNC SYELVLPSTN PGTVVFQVIA VDNDTGMNAE VRYSIVGGNT 720 
RDLFAIDQET GNITLMEKCD VTDLGLHRVL VKANDLGQPD SLFSVVIVNL FVNESVTNAT 780 
LINELVRKST EAPVTPNTEI ADVSSPTSDY VKILVAAVAG TITVVVVIFI TAVVRCRQAP 840 
HLKAAQKNKQ NSEWATPKPE NRQMIMMKKK KKKKKHSPKN LLLNFVTIEE TKADDVDSDG 900 
NRVTLDLPID LEEQTMGKYN WVTTPTTFKP DSPDLARKYK SASPQPAFQI QPETPLNSKH 960 
HIIQELPLDN TFVACDSISK CSSSSSDPYS VSDCGYPVTT FEVPVSVHTR PVGIQVSNTT 1020 
F 

Seq ID NO: B63 DNA sequence 

Nucleic Acid Accession #: XMJ5591B0.2 

Coding sequence: 276..3740 

1 11 21 31 41 51 

GCGGCGGCCG CGGAGTATCC TGGAGCTGCA GACAGTGCGG GCCTGCGCCC AGTCCCGGCT 60 

GTCCTCGCCG CGACCCCTCC TCAGCCCTGG GCGCGCGCAC GCTGGGGCCC CGCGGGGCTG 120 

GCCGCCTAGC GAGCCTGCCG GTCGACCCCA GCCAGCGCAG CGACGGGGCG CTGCCTGGCC 180 

CAGGCGCACA CGGAAGTGCG CTTCTCTGAA GTAGCTTTGG AAAGTAGAGA AGAAAATCCA 240 

GTTTGCTTCT TGGAGAACAC TGGACAGCTG AATAAATGCA GTATCTAAAT ATAAAAGAGG 300 

ACTGCAATGC CATGGCTTTC TGTGCTAAAA TGAGGAGCTC CAAGAAGACT GAGGTGAACC 360 

TGGAGGCCCC TGAGCCAGGG GTGGAAGTGA TCTTCTATCT GTCGGACAGG GAGCCCCTCC 420 

GGCTGGGCAG TGGAGAGTAC ACAGCAGAGG AACTGTGCAT CAGGGCTGCA CAGGCATGCC 480 

GTATCTCTCC TCTTTGTCAC AACCTCTTTG CCCTGTATGA CGAGAACACC AAGCTCTGGT 540 

ATGCTCCAAA TCGCACCATC ACCGTTGATG ACAAGATGTC CCTCCGGCTC CACTACCGGA 600 

TGAGGTTCTA TTTCACCAAT TGGCATGGAA CCAACGACAA TGAGCAGTCA GTGTGGCGTC 660 

ATTCTCCAAA GAAGCAGAAA AATGGCTACG AGAAAAAAAA GATTCCAGAT GCAACCCCTC 720 

TCCTTGATGC CAGCTCACTG GAGTATCTGT TTGCTCAGGG ACAGTATGAT TTGGTGAAAT 780 

GCCTGGCTCC TATTCGAGAC CCCAAGACCG AGCAGGATGG ACATGATATT GAGAACGAGT 840 
GTCTAGGGAT GGCTGTCCTG GCCATCTCAC ACTATGCCAT GATGAAGAAG ATGCAGTTGC 900 
CAGAACTGCC CAAGGACATC AGCTACAAGC GATATATTCC AGAAACATTG AATAAGTCCA 960 

TCAGACAGAG GAACCTTCTC ACCAGGATGC GGATAAATAA TGTTTTCAAG GATTTCCTAA 1020 

AGGAATTTAA CAACAAGACC ATTTGTGACA GCAGCGTGTC CACGCATGAC CTGAAGGTGA 1080 

AATACTTGGC TACCTTGGAA ACTTTGACAA AACATTACGG TGCTGAAATA TTTGAGACTT 1140 

CCATGTTACT GATTTCATCA GAAAATGAGA TGAATTGGTT TCATTCGAAT GACGGTGGAA 1200 

ACGTTCTCTA CTACGAAGTG ATGGTGACTG GGAATCTTGG AATCCAGTGG AGGCATAAAC 1260 

CAAATGTTGT TTCTGTTGAA AAGGAAAAAA ATAAACTGAA GCGGAAAAAA CTGGAAAATA 1320 

AACACAAGAA GGATGAGGAG AAAAACAAGA TCCGGGAAGA GTGGAACAAT TTTTCTTACT 1380 

TCCCTGAAAT CACTCACATT GTAATAAAGG AGTCTGTGGT CAGCATTAAC AAGCAGGACA 1440 

ACAAGAAAAT GGAACTGAAG CTCTCTTCCC ACGAGGAGGC CTTGTCCTTT GTGTCCCTGG 1500 

TAGATGGCTA CTTCCGGCTC ACAGCAGATG CCCATCATTA CCTCTGCACC GACGTGGCCC 1560 

CCCCGTTGAT CGTCCACAAC ATACAGAATG GCTGTCATGG TCCAATCTGT ACAGAATACG 1620 

CCATCAATAA ATTGCGGCAA GAAGGAAGCG AGGAGGGGAT GTACGTGCTG AGGTGGAGCT 1680 

GCACCGACTT TGACAACATC CTCATGACCG TCACCTGCTT TGAGAAGTCT GAGCAGGTGC 1740 

AGGGTGCCCA GAAGCAGTTC AAGAACTTTC AGATCGAGGT GCAGAAGGGC CGCTACAGTC 1800 

TGCACGGTTC GGACCGCAGC TTCCCCAGCT TGGGAGACCT CATGAGCCAC CTCAAGAAGC 1860 

AGATCCTGCG CACGGATAAC ATCAGCTTCA TGCTAAAACG CTGCTGCCAG CCCAAGCCCC 1920 

GAGAAATCTC CAACCTGCTG GTGGCTACTA AGAAAGCCCA GGAGTGGCAG CCCGTCTACC 1980 

CCATGAGCCA GCTGAGTTTC GATCGGATCC TCAAGAAGGA TCTGGTGCAG GGCGAGCACC 2040 

TTGGGAGAGG CACGAGAACA CACATCTATT CTGGGACCCT GATGGATTAC AAGGATGACG 2100 

AAGGAACTTC TGAAGAGAAG AAGATAAAAG TGATCCTCAA AGTCTTAGAC CCCAGCCACA 2160 

GGGATATTTC CCTGGCCTTC TTCGAGGCAG CCAGCATGAT GAGACAGGTC TCCCACAAAC 2220 

ACATCGTGTA CCTCTATGGC GTCTGTGTCC GCGACGTGGA GAATATCATG GTGGAAGAGT 2280 

TTGTGGAAGG GGGTCCTCTG GATCTCTTCA TGCACCGGAA AAGCGATGTC CTTACCACAC 2340 

CATGGAAATT CAAAGTTGCC AAACAGCTGG CCAGTGCCCT GAGCTACTTG GAGGATAAAG 2400 

ACCTGGTCCA TGGAAATGTG TGTACTAAAA ACCTCCTCCT GGCCCGTGAG GGCATCGACA 2460 

GTGAGTGTGG CCCATTCATC AAGCTCAGTG ACCCCGGCAT CCCCATTACG GTGCTGTCTA 2520 

GGCAAGAATG CATTGAACGA ATCCCATGGA TTGCTCCTGA GTGTGTTGAG GACTCCAAGA 2580 

ACCTGAGTGT GGCTGCTGAC AAGTGGAGCT TTGGAACCAC GCTCTGGGAA ATCTGCTACA 2640 

ATGGCGAGAT CCCCTTGAAA GACAAGACGC TGATTGAGAA AGAGAGATTC TATGAAAGCC 2700 






GGTGCAGGCC AGTGACACCA TCATGTAAGG AGCTGGCTGA CCTCATGACC CGCTGCATGA 2760
ACTATGACCC CAATCAGAGG CCTTTCTTCC GAGCCATCAT GAGAGACATT AATAAGCTTG 2820
AAGAGCAGAA TCCAGATATT GTTTCAGAAA AAAAACCAGC AACTGAAGTG GACCCCACAC 2880
ATTTTGAAAA GCGCTTCCTA AAGAGGATCC GTGACTTGGG AGAGGGCCAC TTTGGGAAGG 2940
TTGAGCTCTG CAGGTATGAC CCCGAAGGGG ACAATACAGG GGAGCAGGTG GCTGTTAAAT 3000
CTCTGAAGCC TGAGAGTGGA GGTAACCACA TAGCTGATCT GAAAAAGGAA ATCGAGATCT 3060
TAAGGAACCT CTATCATGAG AACATTGTGA AGTACAAAGG AATCTGCACA GAAGACGGAG 3120
GAAATGGTAT TAAGCTCATC ATGGAATTTC TGCCTTCGGG AAGCCTTAAG GAATATCTTC 3180
CAAAGAATAA GAACAAAATA AACCTCAAAC AGCAGCTAAA ATATGCCGTT CAGATTTGTA 3240
AGGGGATGGA CTATTTGGGT TCTCGGCAAT ACGTTCACCG GGACTTGGCA GCAAGAAATG 3300
TCCTTGTTGA GAGTGAACAC CAAGTGAAAA TTGGAGACTT CGGTTTAACC AAAGCAATTG 3360
AAACCGATAA GGAGTATTAC ACCGTCAAGG ATGACCGGGA CAGCCCTGTG TTTTGGTATG 3420
CTCCAGAATG TTTAATGCAA TCTAAATTTT ATATTGCCTC TGACGTCTGG TCTTTTGGAG 3480
TCACTCTGCA TGAGCTGCTG ACTTACTGTG ATTCAGATTC TAGTCCCATG GCTTTGTTCC 3540
TGAAAATGAT AGGCCCAACC CATGGCCAGA TGACAGTCAC AAGACTTGTG AATACGTTAA 3600
AAGAAGGAAA ACGCCTGCCG TGCCCACCTA ACTGTCCAGA TGAGGTTTAT CAACTTATGA 3660
GGAAATGCTG GGAATTCCAA CCATCCAATC GGACAAGCTT TCAGAACCTT ATTGAAGGAT 3720
TTGAAGCACT TTTAAAATAA GAAGCATGAA TAACATTTAA ATTCCACAGA TTATCAAGTC 3780
CTTCTCCTGC AACAAATGCC CAAGTCATTT TTTAAAAATT TCTAATGAAA GAAGTTTGTG 3840
TTCTGTCCAA AAAGTCACTG AACTCATACT TCAGTACATA TACATGTATA AGGCACACTG 3900
TAGTGCTTAA TATGTGTAAG GACTTCCTCT TTAAATTTGG TACCAGTAAC TTAGTGACAC 3960
ATAATGACAA CCAAAATATT TGAAAGCACT TAAGCACTCC TCCTTGTGGA AAGAATATAC 4020
CACCATTTCA TCTGGCTAGT TCACCATCAC AACTGCATTA CCAAAAGGGG ATTTTTGAAA 4080
ACGAGGAGTT GACCAAAATA ATATCTGAAG ATGATTGCTT TTCCCTGCTG CCAGCTGATC 4140
TGAAATGTTT TGCTGGCACA TTAATCATAG ATAAAGAAAG ATTGATGGAC TTAGCCCTCA 4200
AATTTCAGTA TCTATACAGT ACTAGACCAT GCATTCTTAA AATATTAGAT ACCAGGTAGT 4260
ATATATTGTT TCTGTACAAA AATGACTGTA TTCTCTCACC AGTAGGACTT AAACTTTGTT 4320
TCTCCAGTGG CTTAGCTCCT GTTCCTTTGG GTGATCACTA GCACCCATTT TTGAGAAAGC 4380
TGGTTCTACA TGGGGGGATA GCTGTGGAAT AGATAATTTG CTGCATGTTA ATTCTCAAGA 4440
ACTAAGCCTG TGCCAGTGCT TTCCTAAGCA GTATACCTTT AATCAGAACT CATTCCCAGA 4500
ACCTGGATGC TATTACACAT GCTTTTAAGA AACGTCAATG TATATCCTTT TATAACTCTA 4560
CCACTTTGGG GCAAGCTATT CCAGCACTGG TTTTGAATGC TGTATGCAAC CAGTCTGAAT 4620
ACCACATACG CTGCACTGTT CTTAGAGGGT TTCCATACTT ACCACCGATC TACAAGGGTT 4680
GATCCCTGTT TTTACCATCA ATCATCACCC TGTGGTGCAA CACTTGAAAG ACCCGGCTAG 4740
AGGCACTATG GACTTCAGGA TCCACTAGAC AGTTTTCAGT TTGCTTGGAG GTAGCTGGGT 4800
AATCAAAAAT GTTTAGTCAT TGATTCAATG TGAACGATTA CGGTCTTTAT GACCAAGAGT 4860
CTGAAAATCT TTTTGTTATG CTGTTTAGTA TTCGTTTGAT ATTGTTACTT TTCACCTGTT 4920
GAGCCCAAAT TCAGGATTGG TTCAGTGGCA GCAATGAAGT TGCCATTTAA ATTTGTTCAT 4980
AGCCTACATC ACCAAGGTCT CTGTGTCAAA CCTGTGGCCA CTCTATATGC ACTTTGTTTA 5040
CTCTTTATAC AAATAAATAT ACTAAAGACT TT



Seq ID NO: B64 Protein sequence 
Protein Accession #: A39577 



1 11 21 31 41 51

| | | | | |

MQYLNIKEDC NAMAFCAKMR SSKKTEVNLE APEPGVEVIF YLSDREPLRL GSGEYTAEEL 60
CIRAAQACRI SPLCHNLFAL YDENTKLWYA PNRTITVDDK MSLRLHYRMR FYFTNWHGTN 120
DNEQSVWRHS PKKQKNGYEK KKIPDATPLL DASSLEYLFA QGQYDLVKCL APIRDPKTEQ 180
DGHDIENECL GMAVLAISHY AMMKKMQLPE LPKDISYKRY IPETLNKSIR QRNLLTRKRI 240
NNVFKDFLKE FNNKTICDSS VSTHDLKVKY LATLETLTKH YGAEIFETSM LLISSENEMN 300
WFHSNDGGNV LYYEVMVTGN LGIQWRHKPN VVSVEKEKNK LKRKKLEKKD KKDEEKNKIR 360
EEWNNFSFFP EITHIVIKES VVSINKQDNK KMELKLSSHE EALSFVSLVD GYFRLTADAH 420
HYLCTDVAPP LIVHNIQNGC HGPICTEYAI NKLRQEGSEE GMYVLRWSCT DFDNILMTVT 480
CFEKSEQVQG AQKQFKKFQI EVQKGRYSLH GSDRSFPSLG DLMSKLKKQI LRTDNISFML 540
KRCCQPKPRE ISNLLVATKK AQEWQPVYPM SQLSFDRILK KDLVQGEHLG RGTRTHIYSG 600
TLMDYKDDEG TSEEKKIKVI LKVLDPSHRD ISLAFFEAAS MMRQVSHKHI VYLYGVCVRD 660
VENIMVEEFV EGGPLDLFMH RKSDVLTTPW KFKVAKQLAS ALSYLEDKDL VHGNVCTKNL 720
LLAREGIDSE CGPFIKLSDP GIPITVLSRQ ECIERIPWIA PECVEDSKNL SVAADKWSFG 780
TTLWEICYNG EIPLKDKTLI [missing] RPVTPSCKEL ADLMTRCMNY DPNQRPFFRA 840
IMRDINKLEE QNPDIVSRKK NQPTEVDPTH FEKRFLKRIR [missing] ELCRYDPEDN 900
TGEQVAVKSL KPESGGNHIA DLKKEIEILR KLYHENIVKY KGICTEDGGN GIKLIMEFLP 960
SGSLKEYLPK NKNKINLKQQ LKYAVQICKG MDYLGSRQYV HRDLAARNVL VESEHQVKIG 1020
DFGLTKAIET DKEYYTVKDD RDSPVFWYAP ECLMQSKFYI ASDVWSFGVT LHELLTYCDS 1080
DSSPMALFLK MIGPTHGQMT VTRLVNTLKE GKRLPCPPNC PDEVYQLMRK CWEFQPSNRT 1140
SPQNLIEGFE ALLK



Seq ID NO: B65 DNA sequence 

Nucleic Acid Accession #: NM_004867.1 

Coding sequence: 140..931 



1 11 21 31 41 51

| | | | | |

GATCCCAGAC CTCGGCTTGC AGTAGTGTTA GACTGAAGAT AAAGTAAGTG CTGTTTGGGC 60
TAACAGGATC TCCTCTTGCA GTCTGCAGCC CAGGACGCTG ATTCCAGCAG CGCCTTACCG 120
CGCAGCCCGA AGATTCACTA TGGTGAAAAT CGCCTTCAAT ACCCCTACCG CCGTGCAAAA 180
GGAGGAGGCG CGGCAAGACG TGGAGGGCCT CCTGAGCCGC ACGGTCAGAA CTCAGATACT 240
GACCGGGAAG GAGCTCCGAG TTGCCACCCA GGAAAAAGAG GGCTCCTCTG GGAGATGTAT 300
GCTTACTCTC TTAGGCCTTT CATTCATCTT GGCAGGACTT ATTGTTGGTG GAGCCTGCAT 360
TTACAAGTAC TTCATGCCCA AGAGCACCAT TTACCGTGGA GAGATGTGCT TTTTTGATTC 420
TGAGGATCCT GCAAATTCCC TTCGTGGAGG AGAGCCTAAC TTCCTGCCTG TGACTGAGGA 480
GGCTGACATT CGTGAGGATG ACAACATTGC AATCATTGAT GTGCCTGTCC CCAGTTTCTC 540
TGATAGTGAC CCTGCAGCAA TTATTCATGA CTTTGAAAAG GGAATGACTG CTTACCTGGA 600
CTTGTTGCTG GGGAACTGCT ATCTGATGCC CCTCAATACT TCTATTGTTA TGCCTCCAAA 660
AAATCTGGTA GAGCTCTTTG GCAAACTGGC GAGTGGCAGA TATCTGCCTC AAACTTATGT 720 

GGTTCGAGAA GACCTAGTTG CTGTGGAGGA AATTCGTGAT GTTAGTAACC TTGGCATCTT 780 

TATTTACCAA CTTTGCAATA ACAGAAAGTC CTTCCGCCTT CGTCGCAGAG ACCTCTTGCT 840 

GGGTTTCAAC AAACGTGCCA TTGATAAATG CTGGAAGATT AGACACTTCC CCAACGAATT 900 

TATTGTTGAG ACCAAGATCT GTCAAGAGTA AGAGGCAACA GATAGAGTGT CCTTGGTAAT 960 

AAGAAGTCAG AGATTTACAA TATGACTTTA ACATTAAGGT TTATGGGATA CTCAAGATAT 1020 

TTACTCATGC ATTTACTCTA TTGCTTATGC CGTAAAAAAA AAAAAAAAAA AAAAAAAAAA 1080 
AA 

Seq ID NO: B66 Protein sequence 
Protein Accession #: NP_004858.1 

1 11 21 31 41 51 

MVKIAFNTPT AVQKEEARQD VEALLSRTVR TQILTGKELR VATQEKEGSS GRCMLTLLGL 60 

SFILAGLIVG GACIYKYFMP KSTIYRGEMC FFDSEDPANS LRGGEPNFLP VTEEADIRED 120 

DNIAIIDVPV PSFSDSDPAA IIHDFEKGMT AYLDLLLGNC YLMPLNTSIV MPPKNLVELF 180 

GKLASGRYLP QTYVVREDLV AVEEIRDVSN LGIFIYQLCN NRKSFRLRRR DLLLGFNKRA 240 
IDKCWKIRHF PNEFIVETKI CQE 



Seq ID NO: B67 DNA sequence 

Nucleic Acid Accession #: XM_083862.1 

Coding sequence: 121..813 



1 11 21 31 41 51 

ACGCGGGAGG TCTGAGCTGT GGGCTGAGGC AGCGCAGCCG CTGCGCCAGG GTGCGCGATG 60 

CCTTGAACCT GGGAAACTAT GTGAAGCAAC ACTCTGGATT TTGAAAGACA TCTTTTCATC 120 

ATGGGACAGC AAATTTCGGA TCAGACACAG TTGGTTATTA ACAAGTTACC AGAAAAAGTA 180 

GCAAAACATG TTACGTTGGT TCGAGAGAGT GGCTCCTTAA CTTATGAAGA ATTTCTCGGG 240 

AGAGTAGCTG AGCTTAATGA TGTAACGGCT AAAGTGGCTT CTGGCCAGGA AAAACATCTT 300 

CTCTTTGAGG TACAACCTGG GTCTGATTCC TCTGCTTTTT GGAAAGTGGT TGTACGGGTG 360 

GTCTGTACCA AGATTAACAA AAGCAGTGGC ATTGTGGAGG CATCACGGAT CATGAATTTA 420 

TACCAGTTTA TTCAACTTTA TAAAGATATC ACAAGTCAAG CAGCAGGAGT ATCGGCACAG 480 

AGCTCCACCT CTGAAGAACC TGATGAAAAC TCATCCTCTG TAACATCTTG TCAGGCTAGT 540 

CTTTGGATGG GAAGGGTGAA GCAGCTGACC GATGAGGAGG AGTGTTGTAT CTGTATGGAT 600 

GGGCGGGCTG ACCTCATCCT GCCTTGTGCT CACAGCTTTT GTCAGAAGTG TATTGATAAA 660 

TGGAGTGATC GACACAGGAA TTGCCCTATT TGTCGCCTAC AGATGACTGG AGCAAATGAA 720 

TCTTGGGTGG TATCAGATGC ACCCACTGAA GATGATATGG CTAACTATAT TCTTAACATG 780 

GCTGATGAGG CAGGCCAGCC CCACAGGCCA TGACCTTGAA GTGAAAGTCT TCTGTTGCTA 840 

TTGTGGGCTC AAATATTTGG TCATGGGGGA AGAATGTAGG GTTGTGGCAC TGGCACAGAC 900 

ACAGGAAAAT CCATTTTCCC CACTCTTTTA TTTTTGCTAT TCTGATCATT TGTCCCCCTT 960 
TTAAAAATAA ACTTCCCATG TCTTCCAAAA AAAAAAAAAA AAAAA 

Seq ID NO: B68 Protein sequence 
Protein Accession #: XP_083862.1 

1 11 21 31 41 51 

MGQQISDQTQ LVINKLPEKV AKHVTLVRES GSLTYEEFLG RVAELNDVTA KVASGQEKHL 60 

LFEVQPGSDS SAFWKVVVRV VCTKINKSSG IVEASRIMNL YQFIQLYKDI TSQAAGVLAQ 120 

SSTSEEPDEN SSSVTSCQAS LWMGRVKQLT DEEECCICMD GRADLILPCA HSFCQKCIDK 180 
WSDRHRNCPI CRLQMTGANE SWVVSDAPTE DDMANYILNM ADEAGQPHRP 
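
Each DNA record in this listing gives a "Coding sequence: start..end" range, and the protein record that follows it should correspond to the translation of that range. The sketch below is one way to check a record, not part of the original listing: it assumes the coordinates are 1-based and inclusive, that the standard genetic code applies, and that only sequence rows (no header lines) are passed in. The helper names `clean_listing` and `translate_cds` are hypothetical. The sample row is the B67 row covering positions 121 to 180, which begins the coding sequence.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_listing(text):
    """Keep only A/C/G/T, dropping spaces and trailing position numbers.

    Assumption: the input holds sequence rows only. Any other character
    (for example an OCR artifact) is silently dropped, which would shift
    the reading frame, so damaged rows should be repaired first.
    """
    return "".join(ch for ch in text.upper() if ch in "ACGT")


def translate_cds(seq, start, end):
    """Translate seq[start..end] (1-based, inclusive) up to the first stop codon."""
    cds = seq[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# B67 row for positions 121..180; on its own it is numbered from 1 here.
row = "ATGGGACAGC AAATTTCGGA TCAGACACAG TTGGTTATTA ACAAGTTACC AGAAAAAGTA 180"
print(translate_cds(clean_listing(row), 1, 60))  # MGQQISDQTQLVINKLPEKV
```

With a whole record concatenated row by row, the same call with the listed range (for B67, 121 and 813) would be expected to give the full protein, provided the rows are free of extraction damage.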

Seq ID NO: B69 DNA sequence 

Nucleic Acid Accession #: NM_002975.1 
Coding sequence: 180..1151 

1 11 21 31 41 51 

| | | | | | 

CGACCAACGG ACCGGACAGA GACGAGGAGA GGAACAGGAA GAGAGAAGCT GGGAGAATCG 60 

GGAACCTGGG GGCTAGTGAC CTGCACACAG GGCAGGGGCA CTCGGCAGTT CCCAGAGGCC 120 

ACCCCTCCCA CCCCAGACAT CCAGACATCT GGAACTTTGG GTGCCAAGAG TCCAGCTTAA 180 

TGCAGGCAGC CTGGCTTTTG GGGGCTTTGG TGGTCCCCCA GCTCTTGGGC TTTGGCCATG 240 

GGGCTCGGGG AGCAGAGAGG GAGTGGGAGG GAGGCTGGGG AGGTGCCCAG GAGGAGGAGC 300 

GGGAGAGGGA GGCCCTGATG CTGAAGCATC TGCAGGAAGC CCTAGGACTG CCTGCTGGGA 360 

GGGGGGATGA GAATCCTGCC GGAACTGTTG AGGGAAAAGA GGACTGGGAG ATGGAGGAGG 420 

ACCAGGGGGA GGAAGAGGAG GAGGAAGCAA CGCCAACCCC ATCCTCCGGC CCCAGCCCCT 480 

CTCCCACCCC TGAGGACATC GTCACTTACA TCCTGGGCCG CCTGGCCGGC CTGGACGCAG 540 

GCCTGCACCA GCTGCACGTC CGTCTGCACG CGTTGGACAC CCGCGTGGTC GAGCTGACCC 600 

AGGGGCTGCG GCAGCTGCGG AACGCGGCAG GCGACACCCG CGATGCCGTG CAAGCCCTGC 660 

AGGAGGCGCA GGGTCGCGCC GAGCGCGAGC ACGGCCGCTT GGAGGGCTGC CTGAAGGGGC 720 

TGCGCCTGGG CCACAAGTGC TTCCTGCTCT CGCGCGACTT CGAAGCTCAG GCGGCGGCGC 780 

AGGCGCGGTG CACGGCGCGG GGCGGGAGCC TGGCGCAGCC GGCAGACCGC CAGCAGATGG 840 

AGGCGCTCAC TCGGTACCTG CGCGCGGCGC TCGCTCCCTA CAACTGGCCC GTGTGGCTGG 900 

GCGTGCACGA TCGGCGCGCC GAGGGCCTCT ACCTCTTCGA AAACGGCCAG CGCGTGTCCT 960 

TCTTCGCCTG GCATCGCTCA CCCCGCCCCG AGCTCGGCGC CCAGCCCAGC GCCTCGCCGC 1020 

ATCCGCTCAG CCCGGACCAG CCCAACGGTG GCACGCTCGA GAACTGCGTG GCGCAGGCCT 1080 

CTGACGACGG CTCCTGGTGG GACCACGACT GCCAGCGGCG TCTCTACTAC GTCTGCGAGT 1140 

TCCCCTTCTA GCGGGGCCGG TACCCCGCCT CCTTGCCCAT CCCACCACCC GGCCTTTCCC 1200 

TGCGCCGTGC CCACCCTCCT CCGGAATCGC CCTTCCCTTC CTGGCCAGGA ATGGCAGCGT 1260 

CCTCCCCGAC CCCCAGTCTG GGCGCTTCTG GGAGGGCTCT TGCGGTGCCG GCACTCCTCC 1320 

TTGTTAGTGT CTTTCCTTGA AGGGGCGGGC ACCAGGCTAG GTCCGGTGCC AATAAATCCT 1380 
TGTGGAATCT GAAAAAAAAA AAAAAAA 

Seq ID NO: B70 Protein sequence 
Protein Accession #: NP_002966.1 

1 11 21 31 41 51 

MQAAWLLGAL VVPQLLGFGH GARGAEREWE GGWGGAQEEE REREALMLKH LQEALGLPAG 60 

RGDENPAGTV EGKEDWEMEE DQGEEEEEEA TPTPSSGPSP SPTPEDIVTY ILGRLAGLDA 120 

GLHQLHVRLH ALDTRVVELT QGLRQLRNAA GDTRDAVQAL QEAQGRAERE HGRLEGCLKG 180 

LRLGHKCFLL SRDFEAQAAA QARCTARGGS LAQPADRQQM EALTRYLRAA LAPYNWPVWL 240 

GVHDRRAEGL YLFENGQRVS FFAWHRSPRP ELGAQPSASP HPLSPDQPNG GTLENCVAQA 300 
SDDGSWWDHD CQRRLYYVCE FPF 



Seq ID NO: B71 DNA sequence 

Nucleic Acid Accession #: BC000839.1 

1 11 21 31 41 51 

CTGGCCAGGG CGACTGGCGG ATAAGGTCTT GTGCGTGGCC TCGAGGCTTA AAAGTAGCAG 60 

TGGGGCTTTG TGAAGGACAA AATGGCGATG GCGGGCCGTG TAGGTCCCCC TTCCTATGAT 120 

GAGGACCTTT TCACAGACCT GTACTGAGCT CCGTGAGGAT AAGTAACTCT GAGGAGATGG 180 

GCCCTGCAAG CCTCCTTCTT AGCCGTCTGT TCAGAAAATA GCGTTTTCGA AATGCCCTGA 240 

GTTGACCTAA TGTCTTATTG GGCTCCTGTC TGCAGGATTT ACGCGCACGT TGGAACCGAA 300 

GAGAGCTCTG TTGTTGCAAT GTTCAGCCCA CAAGAGCTTA CTGGTGAAGG AATGGGACAA 360 

GACCCATCTT TATGCAAAGC CAGCGTTACA GTAATGTTCC AGCATCTCAT AATCTATCCT 420 

GGGGAATTCA GCTGCCTCCC AGGGTGAATA CAGGTATTCC TGATGACAGT CTGCCTCTAT 480 

CTTACAGAGC AGCTTGTTGC TATATACCAT TGAAAAGCCT TCAGAGCTGA GAGGTACTAC 540 

TAACCAATAA CCTGCTTGGC TCAAAGGGCC AGCACCTTCT CTCTAAAGCC CAAGAGGAGT 600 

TTGAGGAAAA CTAGGTGTCT GTGTTCACTC CAGGCTGAAG TTACAGGTCT GAGCAAATAA 660 

GGTGTATAAA AAATGGAATC TGTCTTGGAG GACATCAGAA GGTGAATTTT CCAAGTTCTT 720 

GGACAACCTA GCTGTTGAAA AGCTTTCTGG GTTTGGGGGG TATTTCAGAT GTACCTTAAA 780 

GTGTTAGCAG ATACAGATTA AGACACTGGG AGCCAATGAA ACAGCAGTTG AGGGTTTGCT 840 

GTGTATCACA TTTCTGTATT TTATCACCCC CTTCCTGCAA CATTATTTAT CTGGAATCTA 900 

CCTGCCCTTT TGTTTTTTAG ATACAAGGGC TTGGTTTTGT TACCCAGGCT GGTTTCAAGG 960 

CCATAGCTTT AAGAGATCCT CTCACCACAG ATTTCCAAAG TGCTGGGATT GCAGGTGTGA 1020 

TTCATGGCAC CCAGACTTTG CTGCCTTTCT TACATGATCC AGGCCCAGAA CCCAAACTCA 1080 

GGCACTGTAT AGATGACCAC TTTCGTAAAC TACTGACCTA GCTTGTTGCC AATTGTTGAT 1140 

TGAACTTCCC ATAACTCCAC TTCGTGTCTG TTCCTCTGTA TACAGCCACC TTCTGTTCCC 1200 

GTCATGAGCC TTTAGGTCTC CATTTGCATA TTGCAAATAC TATGTTCCAT GTAGGTAGCT 1260 

CATTCAGGGC CTTGCTCTTC ACTTCAAAAA AGGTTCCCTT GAGGACTGGC TGTCAATTTG 1320 

TGTTGCTGTG TTGGTTGTTG ATGAAAATAA TAAAATGATT GATTACATAA AAAAAAAAAA 1380 
AAAAAAA 



Seq ID NO: B72 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

TGCTAGTAGG GCCTGGGTTA ATCGGCCGAG GGTGGCTTCG TGGTCTTTAT AGCTGTTACT 60 

CTTTGTACTT GTCTTTTTCT TTTATTTTCT TTTGAGCGAT TGTGOGAACA TAGCATAGCA 120 

CGCACTATGG CAOGAGGCTC GTGCTGCCTG GCCAGGGCGA CTGGCGGATA AGGTCTTGTG 180 

CGTGGCCTCG AGGCTTAAAA GTAGCAGTGG GGCTTTGTGA AGGACAAAAT GGCGATGGCG 240 

GGCCGTGTAG GTCCCCCTTC CTATGATGAG GACCTTTTCA CAGACCTGTA CTGAGCTCCG 300 

TGAGGATAAA TAACTCTGAG GAGATGGGCC CTGCAAGCCT CTTGCTTAGC CGTCTGTTCA 360 

GAAAATAGCG TTTTCGAAAT GCCCTGAGTT GACCTAATGT CTTATTGGGC TCCTGTCTGC 420 

AGGATTTACG CGCACGTTGG AACCGAAGAG AGCTCTGTTG TTGCAATGTT CAGCCCACAA 480 

GAGCTTACTG GTGAAGGAAT GGGACAAGAC CCATCTTTAT GCAAAGCCAG CGTTACAGTA 540 

ATGTTCCAGC ATCTCATAAT CTATCCTGGG GAATTCAGCT GCCTCCCAGG GTGAATACAG 600 

GTATTCCTGA TGACAGTCTG CCTCTATCTT ACAGAGCAGC TTGTTGCTAT ATACCATTGA 660 

AAAGCCTTCA GAGCTGAGAG GTACTACTAA CCAATAACCT GCTTGGCTCA AAGGGCCAGC 720 

ACCTTCTCTC TAAAGCCCAA GAGGAGTTTG AGGAAAACTA GGTGTCTGTG TTCACTCCAG 780 

GCTGAAGTTA CAGGTCTGAG CAAATAAGGT GTATAAAAAA TGGAATCTGT CTTGGAGGAC 840 

ATCAGAAGGT GAATTTTCCA AGTTCTTGGA CAACCTAGCT GTTGAAAAGC TTTCTGGGTT 900 

TGGGGGGTAT TTCAGATGTA CCTTAAAGTG TTAGCAGACA CAGATTAAGA CACTGGGAGC 960 

CAATGAAACA GCAGTTGAGG GTTTGCTGTG TATCACATTT CTGTATTTTA TCACCCCCTT 1020 

CCTGCAACAT TATTTATCTG GAATCTACCT GCCCTTTTGT TTTTTAGATA CAAGGGCTTG 1080 

GTTTTGTTAC CCAGGCTGGT TTCAAGGCCA TAGCTTTAAG AGATCCTCTC ACCACAGATT 1140 

TCCAAAGTGC TGGGATTGCA GGTGTGATTC ATGGCACCCA GACTTTGCTG CCTTTCTTAC 1200 

ATGATCCAGG CCCAGAACCC AAACTCAGGC ACTGTATAGA TGACCACTTT CGTAAACTAC 1260 

TGACCTAGCT TGTTGCCAAT TGTTGATTGA ACTTCCCATA ACTCCACTTC GTGTCTGTTC 1320 

CTCTGTATAC AGCCACCTTC TGTTCCCGTC ATGAGCCTTT AGGTCTCCAT TTGCATATTG 1380 

CAAATACTAT GTTCCATGTA GGTAGCTCAT TCAGGGCCTT GCTCTTCACT TCAAAAAAGG 1440 

TTCCCTTGAG GACTGGCTGT CAATTTGTGT TGCTGTGTTG GTTGTTGATG AAAATAATAA 1500 

AATGATTGAT TACATAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACAAA CACAAAAAAC 1560 
CGCCGCGCTT TTTCCCGGGC GCACAAAGTT ATAAAACGCC GTCCATC 

Seq ID NO: B73 DNA sequence 

Nucleic Acid Accession #: NM_000222.1 

Coding sequence: 22..2952 

1 11 21 31 41 51 

GATCCCATCG CAGCTACCGC GATGAGAGGC GCTCGCGGCG CCTGGGATTT TCTCTGCGTT 60 

CTGCTCCTAC TGCTTCGCGT CCAGACAGGC TCTTCTCAAC CATCTGTGAG TCCAGGGGAA 120 




WO 03/042661 



PCT/US02/36810 



CCGTCTCCAC CATCCATCCA TCCAGGAAAA TCAGACTTAA TAGTCCGCGT GGGCGACGAG 
ATTAGGCTGT TATGCACTGA TCCGGGCTTT GTCAAATGGA CTTTTGAGAT CCTGGATGAA 
ACGAATGAGA ATAAGCAGAA TGAATGGATC ACGGAAAAGG CAGAAGCCAC CAACACCGGC 
AAATACACGT GCACCAACAA ACACGGCTTA AGCAATTCCA TTTATGTGTT TGTTAGAGAT 
CCTGCCAAGC TTTTCCTTGT TGACCGCTCC TTGTATGGGA AAGAAGACAA CGACACGCTG 
GTCCGCTGTC CTCTCACAGA CCCAGAAGTG ACCAATTATT CCCTCAAGGG GTGCCAGGGG 
AAGCCTCTTC CCAAGGACTT GAGGTTTATT CCTGACCCCA AGGCGGGCAT CATGATCAAA 
AGTGTGAAAC GCGCCTACCA TCGGCTCTGT CTGCATTGTT CTGTGGACCA GGAGGGCAAG 
TCAGTGCTGT CGGAAAAATT CATCCTGAAA GTGAGGCCAG CCTTCAAAGC TGTGCCTGTT 
GTGTCTGTGT CCAAAGCAAG CTATCTTCTT AGGGAAGGGG AAGAATTCAC AGTGACGTGC 
ACAATAAAAG ATGTGTCTAG TTCTGTGTAC TCAACGTGGA AAAGAGAAAA CAGTCAGACT 
AAACTACAGG AGAAATATAA TAGCTGGCAT CAOGGTGACT TCAATTATGA ACGTCAGGCA 
ACGTTGACTA TCAGTTCAGC GAGAGTTAAT GATTCTGGAG TGTTCATGTG TTATGCCAAT 
AATACTTTTG GATCAGCAAA TGTCACAACA ACCTTGGAAG TAGTAGATAA AGGATTCATT 
AATATCTTCC CCATGATAAA CACTACAGTA TTTGTAAACG ATGGAGAAAA TGTAGATTTG 
ATTGTTGAAT ATGAAGCATT CCCCAAACCT GAACACCAGC AGTGGATCTA TATGAACAGA 
ACCTTCACTG ATAAATGGGA AGATTATCCC AAGTCTGAGA ATGAAAGTAA TATCAGATAC 
GTAAGTGAAC TTCATCTAAC GAGATTAAAA GGCACCGAAG GAGGCACTTA CACATTCCTA 
GTGTCCAATT CTGACGTCAA TGCTGCCATA GCATTTAATG TTTATGTGAA TACAAAACCA 
GAAATCCTGA CTTACGACAG GCTCGTGAAT GGCATGCTCC AATGTGTGGC AGCAGGATTC 
CCAGAGCCCA CAATAGATTG GTATTTTTGT CCAGGAACTG AGCAGAGATG CTCTGCTTCT 
GTACTGCCAG TGGATGTGCA GACACTAAAC TCATCTGGGC CACCGTTTGG AAAGCTAGTG 
GTTCAGAGTT CTATAGATTC TAGTGCATTC AAGCACAATG GCACGGTTGA ATGTAAGGCT 
TACAACGATG TGGGCAAGAC TTCTGCCTAT TTTAACTTTG CATTTAAAGG TAACAACAAA 
GAGCAAATCC ATCCCCACAC CCTGTTCACT CCTTXGCTGA TTGGTTTCGT AATCGTAGCT 
GGCATGATGT GCATTATTGT GATGATTCTG ACCTACAAAT ATTTACAGAA ACCCATGTAT 
GAAGTACAGT GGAAGGTTGT TGAGGAGATA AATGGAAACA ATTATGTTTA CATAGACCCA 
ACACAACTTC CTTATGATCA CAAATGGGAG TTTCCCAGAA ACAGGCTGAG TTTTGGGAAA 
ACCCTGGGTG CTGGAGCTTT CGGGAAGGTT GTTGAGGCAA CTGCTTATGG CTTAATTAAG 
TCAGATGCGG CCATGACTGT CGCTGTAAAG ATGCTCAAGC OGAGTGCCCA TTTGACAGAA 
CGGGAAGCCC TCATGTCTGA ACTCAAAGTC CTGAGTTACC TTGGTAATCA CATGAATATT 
GTGAATCTAC TTGGAGCCTG CACCATTGGA GGGCCCACCC TGGTCATTAC AGAATATTGT 
TGCTATGGTG ATCTTTTGAA TTTTTTGAGA AGAAAACGTG ATTCATTTAT TTGTTCAAAG 
CAGGAAGATC ATGCAGAAGC TGCACTTTAT AAGAATCTTC TGCATTCAAA GGAGTCTTCC 
TGCAGCGATA GTACTAATGA GTACATGGAC ATGAAACCTG GAGTTTCTTA TGTTGTCCCA 
ACCAAGGCCG ACAAAAGGAG ATCTGTGAGA ATAGGCTCAT ACATAGAAAG AGATGTGACT 
CCCGCCATCA TGGAGGATGA CGAGTTGGCC CTAGACTTAG AAGACTTGCT GAGCTTTTCT 
TACCAGGTGG CAAAGGGCAT GGCTTTCCTC GCCTCCAAGA ATTGTATTCA CAGAGACTTG 
GCAGCCAGAA ATATCCTCCT TACTCATGGT CGGATCACAA AGATTTGTGA TTTTGGTCTA 
GCCAGAGACA TCAAGAATGA TTCTAATTAT GTGGTTAAAG GAAACGCTCG ACTACCTGTG 
AAGTGGATGG CACCTGAAAG CATTTTCAAC TGTGTATACA CGTTTGAAAG TGACGTCTGG 
TCCTATGGGA TTTTTCTTTG GGAGCTGTTC TCTTTAGGAA GCAGCCCCTA TCCTGGAATG 
CCGGTCGATT CTAAGTTCTA CAAGATGATC AAGGAAGGCT TCCGGATGCT CAGCCCTGAA 
CACGCACCTG CTGAAATGTA TGACATAATG AAGACTTGCT GGGATGCAGA TCCCCTAAAA 
AGACCAACAT TCAAGCAAAT TGTTCAGCTA ATTGAGAAGC AGATTTCAGA GAGCACCAAT 
CATATTTACT CCAACTTAGC AAACTGCAGC CCCAACCGAC AGAAGCCCGT GGTAGACCAT 
TCTGTGCGGA TCAATTCTGT CGGCAGCACC GCTTCCTCCT CCCAGCCTCT GCTTGTGCAC 
GACGATGTCT GAGCAGAATC AGTGTTTGGG TCACCCCTCC AGGAATGATC TCTTCTTTTG 
GCTTCCATGA TGGTTATTTT CTTTTCTTTC AACTTGCATC CAACTCCAGG ATAGTGGGCA 
CCCCACTGCA ATCCTGTCTT TCTGAGCACA CTTTAGTGGC CGATGATTTT TGTCATCAGC 
CACCATCCTA TTGCAAAGGT TCCAACTGTA TATATTCCCA ATAGCAACGT AGCTTCTACC 
ATGAACAGAA AACATTCTGA TTTGGAAAAA GAGAGGGAGG TATGGACTGG GGGCCAGAGT 
CCTTTCCAAG GCTTCTCCAA TTCTGCCCAA AAATATGGTT GATAGTTTAC CTGAATAAAT 
GGTAGTAATC ACAGTTGGCC TTCAGAACCA TCCATAGTAG TATGATGATA CAAGATTAGA 
AGCTGAAAAC CTAAGTCCTT TATGTGGAAA ACAGAACATC ATTAGAACAA AGGACAGAGT 
ATGAACACCT GGGCTTAAGA AATCTAGTAT TTCATGCTGG GAATGAGACA TAGGCCATGA 
AAAAAATGAT CCCCAAGTGT GAACAAAAGA TGCTCTTCTG TGGACCACTG CATGAGCTTT 
TATACTACCG ACCTGGTTTT TAAATAGAGT TTGCTATTAG AGCATTGAAT TGGAGAGAAG 
GCCTCCCTAG CCAGCACTTG TATATACGCA TCTATAAATT GTCCGTGTTC ATACATTTGA 
GGGGAAAACA CCATAAGGTT TCGTTTCTGT ATACAACCCT GGCATTATGT CCACTGTGTA 
TAGAAGTAGA TTAAGAGCCA TATAAGTTTG AAGGAAACAG TTAATACCAT TTTTTAAGGA 
AACAATATAA CCACAAAGCA CAGTTTGAAC AAAATCTCCT CTTTTAGCTG ATGAACTTAT 
TCTGTAGATT CTGTGGAACA AGCCTATCAG CTTCAGAATG GCATTGTACT CAATGGATTT 
GATGCTGTTT GACAAAGTTA CTGATTCACT GCATGGCTCC CACAGGAGTG GGAAAACACT 
GCCATCTTAG TTTGGATTCT TATGTAGCAG GAAATAAAGT ATAGGTTTAG CCTCCTTCGC 
AGGCATGTCC TGGACACCGG GCCAGTATCT ATATATGTGT ATGTACGTTT GTATGTGTGT 
AGACAAATAT TTGGAGGGGT ATTTTTGCCC TGAGTCCAAG AGGGTCCTTT AGTACCTGAA 
AAGTAACTTG GCTTTCATTA TTAGTACTGC TCTTGTTTCT TTTCACATAG CTGTCTAGAG 
TAGCTTACCA GAAGCTTCCA TAGTGGTGCA GAGGAAGTGG AAGGCATCAG TCCCTATGTA 
TTTGCAGTTC ACCTGCACTT AAGGCACTCT GTTATTTAGA CTCATCTTAC TGTACCTGTT 
CCTTAGACCT TCCATAATGC TACTGTCTCA CTGAAACATT TAAATTTTAC CCTTTAGACT 
GTAGCCTGGA TATTATTCTT GTAGTTTACC TCTTTAAAAA CAAAACAAAA CAAAACAAAA 
AACTCCCCTT CCTCACTGCC CAATATAAAA GGCAAATGTG TACATGGCAG AGTTTGTGTG 
TTGTCTTGAA AGATTCAGGT ATGTTGCCTT TATGGTTTCC CCCTTCTACA TTTCTTAGAC 
TACATTTAGA GAACTGTGGC CGTTATCTGG AAGTAACCAT TTGCACTGGA GTTCTATGCT 
CTCGCACCTT TCCAAAGTTA ACAGATTTTG GGGTTGTGTT GTCACCCAAG AGATTGTTGT 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 
AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 
AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 
AATGTCTTTT GAATATTCCC AAGCCCATGA GTCCTTGAAA ATATTTTTTA TATATACAGT 
AACTTTATGT GTAAATACAT AAGCGGCGTA AGTTTAAAGG ATGTTGGTGT TCCACGTGTT 
TTATTCCTGT ATGTTGTCCA ATTGTTGACA GTTCTGAAGA ATTC 

Seq ID NO: B74 Protein sequence 
Protein Accession #: NP_000213.1 



1 11 21 31 41 51 

MRGARGAWDF LCVLLLLLRV QTGSSQPSVS PGEPSPPSIH PGKSDLIVRV GDEIRLLCTD 
PQFVKWTFEI LDETNENKQN EWITEKAEAT NTGKYTCTNK HGLSNSIYVF VRDPAKLFLV 
DRSLYGKEDN DTLVRCPLTD PEVTNYSLKG CQGKPLPKDL RFIPDPKAGI MIKSVKRAYH 
RLCLHCSVDQ EGKSVLSEKP ILKVRPAFKA VPWSVSKAS YLLREGEEFT VTCTIKDVSS 
SVYSTWKREN SQTKLQEKYN SWHHGDFNYE RQATLTISSA RVKDSGVFMC YANNTFGSAN 
VTTTLEWDK GFINIFPMIN TTVFVNDGEN VDLIVEYEAF PKPEHQQWIY MNRTFTDKWE 
DYPKSEMESN IRYVSELHLT RLKGTEGGTY TFLVSNSDVN AAIAFNVYVN TKPEILTYDR 
LVNGMLQCVA AGFPEPTIDW YFCPGTEQRC SASVLPVDVQ TLNSSGPPFG KLWQSSIDS 
SAFKHNGTVE CKAYNDVGKT SAYFNFAFKG NNKEQIHPHT LFTPLLIGFV IVAGMMCIIV 
MILTYKYLQK PMYEVQWKW EEINGNNYVY IDPTQLPYDH KWEFPRNRLS FGKTLGAGAF 
QKWEATAYG LIKSDAAMTV AVKMLKPSAH LTEREALMSE LKVLSYLGNH MNIVNLLGAC 
TIGGPTLVIT EYCCYGDLLN FLRRKRDSFI CSKQEDHAEA ALYKNLLHSK ESSCSDSTNE 
YMDMKPGVSY WPTKADKRR SVRIGSYIER DVTPAIMEDD ELALDLEDLL SFSYQVAKGM 
AFLASKNCIH RDLAARNILL THGRITKICD FGLARDIKND SNYWKGNAR LPVKWMAPES 
IFNCVYTFES DVWSYGIFLW ELFSLGSSPY PGMPVDSKFY KMIKEGFRML SPEHAPAEMY 
DIMKTCWDAD PLKRPTFKQI VQLIEKQISE STNHIYSNLA NCSPNRQKPV VDHSVRINSV 
GSTASSSQPL LVHDDV 

Seq ID NO: B75 DNA sequence 
Nucleic Acid Accession #: NM_004456 
Coding sequence: 58..2298 



1 11 21 31 41 51 

| | | I I I 

GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 
GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGCGGAAGCG TGTAAAATCA 
GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 
TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 
CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 
GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 
AATGCAGTTG CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 
GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 
GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 
GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGCCCT TGGTCAATAT 
AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 
GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 
AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 
GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 
CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 
CATACGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 
ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 
CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 
CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 
AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 
AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 
GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 
CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 
GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 
ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 
GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACOGGTT GTGGGCTGCA 
CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTATCAA 
CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT 
TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC GGGATGCCGC 
TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC 
CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC 
AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG 
GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 
TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC 
ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT 
AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG 
GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 
CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA 
GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 
CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG 
AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 
GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA 

Seq ID NO: B76 Protein sequence 
Protein Accession #: NP_004447 

1 11 21 31 41 51 

| | | | | | 

MGQTGKKSEK GPVCWRKRVK SEYMRLRQLK RFRRADEVKS MFSSNRQKIL ERTEILNQEW 
KQRRIQPVHI LTSVSSLRGT RECSVTSDLD FPTQVIPLKT LNAVASVPIM YSWSPLQQNF 
MVEDETVLHN IPYMGDEVLD QDGTFIEELI KNYDGKVHGD RECGFINDEI FVELVNALGQ 
YNDDDDDDDG DDPEEREEKQ KDLEDHRDDK ESRPPRKFPS DKILEAISSM FPDKGTAEEL 
KEKYKELTEQ QLPGALPPEC TPNIDGPNAK SVQREQSLHS FHTLPCRRCF KYDCFLHPFH 
ATPNTYKRKN TETALDNKPC GPQCYQHLEG AKEFAAALTA ERIKTPPKRP GGRRRGRLPN 

NSSRPSTPTI NVLESKDTDS DREAGTETGG ENNDKEEEEK KDETSSSSEA NSRCQTPIKM 420 

KPNIEPPENV EWSGAEASMF RVLIGTYYDN FCAIARLIGT KTCRQVYEFR VKESSIIAPA 480 

PAEDVDTPPR KKKRKHRLWA AHCRKIQLKK DGSSNHVYNY QPCDHPRQPC DSSCPCVIAQ 540 

NPCEKFCQCS SECQNRFPGC RCKAQCNTKQ CPCYLAVREC DPDLCLTCGA ADHWDSKNVS 600 

CKNCSIQRGS KKHLLLAPSD VAGWGIFIKD PVQKNEFISE YCGEIISQDE ADRRGKVYDK 660 

YMCSFLFNLN NDFWDATRK GNKIRFANHS VNPNCYAKVM MVNGDHRIGI FAKRAIQTGE 720 
ELFVDYRYSQ ADALKYVGIE REMEIP 



Seq ID NO: B77 DNA sequence 
Nucleic Acid Accession #: NM_007015 
Coding sequence: 1..1005 



1 11 21 31 41 51 

ATGACAGAGA ACTCCGACAA AGTTCCCATT GCCCTGGTGG GACCTGATGA CGTGGAATTC 
TGCAGCCCCC CGGCGTACGC TACGCTGACG GTGAAGCCCT CCAGCCCCGC GCGGCTGCTC 
AAGGTGGGAG CCGTGGTCCT CATTTCGGGA GCTGTGCTGC TGCTCTTTGG GGCCATCGGG 
GCCTTCTACT TCTGGAAGGG GAGCGACAGT CACATTTACA ATGTCCATTA CACCATGAGT 
ATCAATGGGA AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 
TTTAAAATGG GAAGTGGAGC TGAAGAAGCA ATTGCAGTTA ATGATTTCCA GAATGGCATC 
ACAGGAATTC GTTTTGCTGG AGGAGAGAAG TGCTACATTA AAGCGCAAGT GAAGGCTCGT 
ATTCCTGAGG TGGGCGCCGT GACCAAACAG AGCATCTCCT CCAAACTGGA AGGCAAGATC 
ATGCCAGTCA AATATGAAGA AAATTCTCTT ATCTGGGTGG CTGTAGATCA GCCTGTGAAG 
GACAACAGCT TCTTGAGTTC TAAGGTGTTA GAACTCTGCG GTGACCTTCC TATTTTCTGG 
CTTAAACCAA CCTATCCAAA AGAAATCCAG AGGGAAAGAA GAGAAGTGGT AAGAAAAATT 
GTTCCAACTA CCACAAAAAG ACCACACAGT GGACCACGGA GCAACCCAGG CGCTGGAAGA 
CTGAATAATG AAACCAGACC CAGTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 
CCTTATCATC AGCAGGAAGG GGAAAGCATG ACATTOGACC CTAGACTGGA TCACGAAGGA 
ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 
GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 
ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 
CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 
ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 
TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 
AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 
CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 
AGTTTGCC 

Seq ID NO: B78 Protein sequence 
Protein Accession #: NP_008946 



1 11 21 31 41 51 

MTENSDKVPI ALVGPDDVEF CSPPAYATLT VKPSSPARLL KVGAWLISG AVLLLFGAIG 60 
AFYFWKGSDS HIYNVHYTMS INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 120 
TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 180 
DNSFLSSKVL ELCGDLPIFW LKPTYPKEIQ RERREWRKI VPTTTKRPHS GPRSNPGAGR 240 
LNNETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 300 
GGYYPWPYNY QGCRSACRVI MPCSWWVARI LGMV 



Seq ID NO: B79 DNA sequence 

Nucleic Acid Accession #: NM_012449.1 

Coding sequence: 66..1085 



1 11 21 31 41 51 

CCGAGACTCA CGGTCAAGCT AAGGCGAAGA GTGGGTGGCT GAAGCCATAC TATTTTATAG 
AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 
GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 
AAAGACCTGT GCTTTTGCAT TTGCACCAAA CAGCCCATGC TGATGAATTT GACTGCCCTT 
CAGAACTTCA GCACACACAG GAACTCTTTC CACAGTGGCA CTTGCCAATT AAAATAGCTG 
CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAG GGAAGTAATT CACCCTTTAG 
CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC AAAGTCTTGC 
CAATGGTTTC CATCACTCTC TTGGCATTGG TTTACCTGCC AGGTGTGATA GCAGCAATTG 
TCCAACrrCA TAATGGAACC AAGTATAAGA AGTTTCCACA TTGGTTGGAT AAGTGGATGT 
TAACAAGAAA GCAGTTTGGG CTTCTCAGTT TCTTTTTTGC TGTACTGCAT GCAATTTATA 
GTCTGTCTTA CCCAATGAGG CGATCCTACA GATACAAGTT GCTAAACTGG GCATATCAAC 
AGGTCCAACA AAATAAAGAA GATGCCTGGA TTGAGCATGA TGTTTGGAGA ATGGAGATTT 
ATGTGTCTCT GGGAATTGTG GGATTGGCAA TACTGGCTCT GTTGGCTGTG ACATCTATTC 
CATCTGTGAG TGACTCTTTG ACATGGAGAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 
TTGTTTCCCT TCTACTGGGC ACAATACACG CATTGATTTT TGCCTGGAAT AAGTGGATAG 
ATATAAAACA ATTTGTATGG TATACACCTC CAACTTTTAT GATAGCTGTT TTCCTTCCAA 
TTGTTGTCCT GATATTTAAA AGCATACTAT TCCTGCCATG CTTGAGGAAG AAGATACTGA 
AGATTAGACA TGGTTGGGAA GACGTCACCA AAATTAACAA AACTGAGATA TGTTCCCAGT 
TGTAGAATTA CTGTTTACAC ACATTTTTGT TCAATATTGA TATATTTTAT CACCAACATT 
TCAAGTTTGT ATTTGTTAAT AAAATGATTA TTCAAGGAAA AAAAAAAAAA 

Seq ID NO: B80 Protein sequence 
Protein Accession #: NP_036581.1 

1 11 21 31 41 51 

| 1 I I I I 

MESRKDITNQ EELWKMKPRR NLEEDDYLHK DTGETSMLKR PVLLHLHQTA HADEFDCPSE 
LQHTQELFPQ WHLPIKIAAI IASLTFLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 

VSITLLALVY LPGVIAAIVQ LHNGTKYKKP PHWLDKWMLT RKQFGLLSFF FAVLHAIYSL 
SYPMRRSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL ALLAVTSIPS 
VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 
VLIFKSILFL PCLRKKILKI RHGWEDVTKI NKTEICSQL 

Seq ID NO: B81 DNA sequence 
Nucleic Acid Accession #: NM_000684 
Coding sequence: 87..1520 

1 11 21 31 41 51 

| | | | | | 

TGCTACCCGC GCCCGGGCTT CTGGGGTGTT CCCCAACCAC GGCCCAGCCC TGCCACACCC 60 
CCCGCCCCCG GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGCGCCTCCG 120 
AGCCCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCGGCCACC GCGGCGCGGC 180 
TGCTGGTGCC CGCGTCGCCG CCCGCCTCGT TGCTGCCTCC CGCCAGCGAA AGCCCCGAGC 240 
CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 300 
TCGTGGCGGG CAATGTGCTG GTGATCGTGG CCATCGCCAA GACGCCGCGG CTGCAGACGC 360 
TCACCAACCT CTTCATCATG TCCCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGTGG 420 
TGCCGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC TTCTTCTGCG 480 
AGCTGTGGAC CTCAGTGGAC GTGCTGTGCG TGACGGCCAG CATCGAGACC CTGTGTGTCA 540 
TTGCCCTGGA CCGCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAGC CTGCTGACGC 600 
GCGCGCGGGC GCGGGGCCTC GTGTGCACCG TGTGGGCCAT CTCGGCCCTG GTGTCCTTCC 660 
TGCCCATCCT CATGCACTGG TGGCGGGCGG AGAGCGACGA GGCGCGCCGC TGCTACAACG 720 
ACCCCAAGTG CTGCGACTTC GTCACCAACC GGGCCTACGC CATCGCCTCG TCCGTAGTCT 780 
CCTTCTACGT GCCCCTGTGC ATCATGGCCT TCGTGTACCT GCGGGTGTTC CGCGAGGCCC 840 
AGAAGCAGGT GAAGAAGATC GACAGCTGCG AGCGCCGTTT CCTCGGCGGC CCAGCGCGGC 900 
CGCCCTCGCC CTCGCCCTCG CCCGTCCCCG CGCCCGGGCC GCCGCCCGGA CCCCCGOGCC 960 
CCGCCGCCGC CGCCGCCACC GCCCCGCTGG CCAACGGGCG TGCGGGTAAG CGGCGGCCCT 1020 
CGCGCCTCGT GGCCCTACGC GAGCAGAAGG CGCTCAAGAC GCTGGGCATC ATCATGGGCG 1080 
TCTTCACGCT CTGCTGGCTG CCCTTCTTCC TGGCCAACGT GGTGAAGGCC TTCCACCGCG 1140 
AGCTGGTGCC CGACCGCCTC TTCGTCTTCT TCAACTGGCT GGGCTACGCC AACTCGGCCT 1200 
TCAACCCCAT CATCTACTGC CGCAGCCCCG ACTTCCGCAA GGCCTTCCAG GGACTGCTCT 1260 
GCTGCGCGCG CAGGGCTGCC CGCCGGCGCC ACGCGACCCA CGGAGACCGG CCGCGCGCCT 1320 
CGGGCTGTCT GGCCCGGCCC GGACCCCCGC CATCGCCCGG GGCCGCCTCG GACGACGACG 1380 
ACGACGATGT CGTCGGGGCC ACGCCGCCCG CGCGCCTGCT GGAGCCCTGG GCCGGCTGCA 1440 
ACGGCGGGGC GGCGGCGGAC AGCGACTCGA GCCTGGACGA GCCGTGCCGC CCCGGCTTCG 1500 
CCTCGGAATC CAAGGTGTAG GGCCCGGCGC GGGGCGCGGA CTCCGGGCAC GGCTTCCCAG 1560 
GGGAACGAGG AGATCTGTGT TTACTTAAGA CCGATAGCAG GTGAACTCGA AGCCCACAAT 1620 
CCTCGTCTGA ATCATCCGAG GCAAAGAGAA AAGCCACGGA CCGTTGCACA AAAAGGAAAG 1680 
TTTGGGAAGG GATGGGAGAG TGGCTTGCTG ATGTTCCTTG TTG 

Seq ID NO: B82 Protein sequence 
Protein Accession #: NP_000675.1 

1 11 21 31 41 51 

MGAGVLVLGA SEPGNLSSAA PLPDGAATAA RLLVPASPPA SLLPPASESP EPLSQQWTAG 60 
KGLLMALIVL LIVAGNVLVI VAIAKTPRLQ TLTNLFIMSL ASADLVMGLL WPFGATIW 120 
WGRWEYGSFF CELWTSVDVL CVTASIETLC VIALDRYLAI TSPFRYQSLL TRARARGLVC 180 
TVWAISALVS FLPILMHWWR AESDEARRCY NDPKCCDFVT NRAYAIASSV VSFYVPLCIM 240 
AFVYLRVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPSPV PAPAPPPGPP RPAAAAATAP 300 
LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPF FLANWKAFH RELVPDRLFV 360 
FFNWLGYANS AFNPIIYCRS PDFRKAFQGL LCCARRAARR RHATHGDRPR ASGCLARPGP 420 
PPSPGAASDD DDDDWGATP PARLLEPWAG CNGGAAADSD SSLDEPCRPG FASESKV 



Seq ID NO: B83 DNA sequence 

Nucleic Acid Accession #: NM_000729.2 

Coding sequence: 2..421 



1 11 21 31 41 51 

GGCTCAGCTG CCGGGCTGCT CCGGTTGGAA ACGCCAAGCC AGCTGCCGTC CTAATCCAAA 60 
AGCCATGAAC AGCGGCGTGT GCCTGTGCGT GCTGATGGCG GTACTGGCGG CTGGCGCCCT 120 
GACGCAGCCG GTGCCTCCCG CAGATCCCGC GGGCTCCGGG CTGCAGCGGG CAGAGGAGGC 180 
GCCCCGTAGG CAGCTGAGGG TATCGCAGAG AACGGATGGC GAGTCCCGAG CGCACCTGGG 240 
CGCCCTGCTG GCAAGATACA TCCAGCAGGC CCGGAAAGCT CCTTCTGGAC GAATGTCCAT 300 
CGTTAAGAAC CTGCAGAACC TGGACCCCAG CCACAGGATA AGTGACCGGG ACTACATGGG 360 
CTGGATGGAT TTTGGCCGTC GCAGTGCCGA GGAGTATGAG TACCCCTCCT AGAGGACCCA 420 
GCCGCCATCA GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA 480 
TCACACTCAT AACTCATTGT CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC 540 
TCAATGTGAA AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCr CACCAGAAGT 600 
TGTGCAAACT GAAGACAAAA I W IT TT CIT CATCTGTGAC TCCTGTTCTG AAAATGTTGT 660 
TATGCTATTA AAGTGATTTC ATTCTGCC 

Seq ID NO: B84 Protein sequence 
Protein Accession #: NP_000720.1 

1 11 21 31 41 51 

| | | | | | 

MNSGVCLCVL MAVLAAGALT QPVPPADPAG SGLQRAEEAP RRQLRVSQRT DGESRAHLGA 
LLARYIQQAR KAPSGRMSIV KNLQNLDPSH RISDRDYMGW MDFGRRSAEE YEYPS 






TABLE 74 Gene Sequences Up- or Down-Regulated in Cancer 

Angiogenesis 

A1 DNA SEQUENCE 
Gene name: endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 
Unigene number: Hs.154210 
Probeset Accession #: M31210 
Nucleic Acid Accession #: M31210 
Coding sequence: 251-1396 

TCTAAAGGTC GGGGGCAGCA GCAAGATGCG AAGCGAGCCG TACAGATCCC GGGCTCTCCG 60 
AACGCAACTT CGCCCTGCTT GAGCGAGGCT GCGGTTTCCG AGGCCCTCTC CAGCCAAGGA 120 
AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180 
CTCGCCTCGC CCTCTAGCGT TCGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 240 
GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCACCGCA GCTCGGTCTC 300 
TGACTACGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 360 
CAGCGCGGAC AAGGAGAACA GCATTAAACT GACCTCGGTG GTGTTCATTC TCATCTGCTG 420 
CTTTATCATC CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 480 
CCGACCCATG TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 540 
CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 
GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAGCC TCCGTGTTCA GTCTCCTCGC 660 
CATCGCCATT GAGCGCTATA TCACAATGCT GAAAATGAAA CTCCACAACG GGAGCAATAA 720 
CTTCCGCCTC TTCCTGCTAA TCAGCGCCTG CTGGGTCATC TCCCTCATCC TGGGTGGCCT 780 
GCCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT 840 
CTACCACAAG CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 900 
CGTCATTCTG TACTGCAGAA TCTACTCCTT GGTCAGGACT CGGAGCCGCC GCCTGACGTT 960 
CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAT GTGGCGCTGC TCAAGACCGT 1020 
AATTATCGTC CTGAGCGTCT TCATCGCCTG CTGGGCACCG CTCTTCATCC TGCTCCTGCT 1080 
GGATGTGGGC TGCAAGGTGA AGACCTGTGA CATCCTCTTC AGAGCGGAGT ACTTCCTGGT 1140 
GTTAGCTGTG CTCAACTCCG GCACCAACCC CATCATTTAC ACTCTGACCA ACAAGGAGAT 1200 
GCGTCGGGCC TTCATCCGGA TCATGTCCTG CTGCAAGTGC CCGAGCGGAG ACTCTGCTGG 1260 
CAAATTCAAG CGACCCATCA TCGCCGGCAT GGAATTCAGC CGCAGCAAAT CGGACAATTC 1320 
CTCCCACCCC CAGAAAGACG AAGGGGACAA CCCAGAGACC ATTATGTCTT CTGGAAACGT 1380 
CAACTCTTCT TCCTAGAACT GGAAGCTGTC CACCCACCGG AAGCGCTCTT TACTTGGTCG 1440 
CTGGCCACCC CAGTGTTTGG AAAAAAATCT CTGGGCTTCG ACTGCTGCCA GGGAGGAGCT 1500 
GCTGCAAGCC AGAGGGAGGA AGGGGGAGAA TACGAACAGC CTGGTGGTGT CGGGTGTTGG 1560 
TGGGTAGAGT TAGTTCCTGT GAACAATGCA CTGGGAAGGG TGGAGATCAG GTCCCGGCCT 1620 
GGAATATATA TTCTACCCCC CTGGAGCTTT GATTTTGCAC TGAGCCAAAG GTCTAGCATT 1680 
GTCAAGCTCC TAAAGGGTTC ATTTGGCCCC TCCTCAAAGA CTAATGTCCC CATGTGAAAG 1740 
CGTCTCTTTG TCTGGAGCTT TGAGGAGATG TTTTCCTTCA CTTTAGTTTC AAACCCAAGT 1800 
GAGTGTGTGC ACTTCTGCTT CTTTAGGGAT GCCCTGTACA TCCCACACCC CACCCTCCCT 1860 
TCCCTTCATA CCCCTCCTCA ACGTTCTTTT ACTTTATACT TTAACTACCT GAGAGTTATC 1920 
AGAGCTGGGG TTGTGGAATG ATCGATCATC TATAGCAAAT AGGCTATGTT GAGTACGTAG 1980 
GCTGTGGGAA GATGAAGATG GTTTGGAGGT GTAAAACAAT GTCCTTCGCT GAGGCCAAAG 2040 
TTTCCATGTA AGCGGGATCC GTTTTTTGGA ATTTGGTTGA AGTCACTTTG ATTTCTTTAA 2100 
AAAACATCTT TTCAATGAAA TGTGTTACCA TTTCATATCC ATTGAAGCCG AAATCTGCAT 2160 
AAGGAAGCCC ACTTTATCTA AATGATATTA GCCAGGATCC TTGGTGTCCT AGGAGAAACA 2220 
GACAAGCAAA ACAAAGTGAA AACCGAATGG ATTAACTTTT GCAAACCAAG GGAGATTTCT 2280 
TAGCAAATGA GTCTAACAAA TATGACATCC GTCTTTCCCA CTTTTGTTGA TGTTTATTTC 2340 
AGAATCTTGT GTGATTCATT TCAAGCAACA ACATGTTGTA TTTTGTTGTG TTAAAAGTAC 2400 
TTTTCTTGAT TTTTGAATGT ATTTGTTTCA GGAAGAAGTC ATTTTATGGA TTTTTCTAAC 2460 
CCGTGTTAAC TTTTCTAGAA TCCACCCTCT TGTGCCCTTA AGCATTACTT TAACTGGTAG 2520 
GGAACGCCAG AACTTTTAAG TCCAGCTATT CATTAGATAG TAATTGAAGA TATGTATAAA 2580 
TATTACAAAG AATAAAAATA TATTACTGTC TCTTTAGTAT GGTTTTCAGT GCAATTAAAC 2640 
CGAGAGATGT CTTGTTTTTT TAAAAAGAAT AGTATTTAAT AGGTTTCTGA CTTTTGTGGA 2700 
TCATTTTGCA CATAGCTTTA TCAACTTTTA AACATTAATA AACTGATTTT TTTAAAG 



A2 Protein sequence 
Gene name: endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 

Unigene number: Hs.154210 

Probeset Accession #: M31210 

Protein Accession #: AAA52336 

Transmembrane domains: [illegible], 122-140, 160-177, 202-222, 251-269, 283-30[illegible] 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

MGPTSVPLVK AHRSSVSDYV NYDIIVRHYN YTGKLNISAD KENSIKLTSV VFILICCFII 60 

LENIFVLLTI WKTKKFHRPM YYFIGNLALS DLIAGVAYTA NLLLSGATTY KLTPAQWFLR 120 

EGSMFVALSA SVFSLLAIAI ERYITMLKMK LHNGSNNFRL FLLISACWVI SLILGGLPIM 180 

GWNCISALSS CSTVLPLYHK HYILFCTTVF TLLLLSIVIL YCRIYSLVRT RSRRLTFRKN 240 

ISKASRSSEN VALLKTVIIV LSVFIACWAP LFILLLLDVG CKVKTCDILF RAEYFLVLAV 300 

LNSGTNPIIY TLTNKEMRRA FIRIMSCCKC PSGDSAGKFK RPIIAGMEFS RSKSDNSSHP 360 
QKDEGDNPET IMSSGNVNSS S 

A3 DNA SEQUENCE 

Gene name: G protein-coupled receptor 51 
Unigene number: Hs.198612 
Probeset Accession #: AA452928 
Nucleic Acid Accession #: NM_004624.1 
Coding sequence: 1-2826 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | | 

ATGGCTTCCC CGCGGAGGTC CGGGCAGCCA GGGCGGCCGC CGCCGCCGCC ACCGCCGCCC 60 
GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG 120 
GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 
CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240 
GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 
CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 
ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 
ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 
CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 
GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600 
CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 
GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 
AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 780 
GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 
ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900 
CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960 
CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 1020 
GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT 1080 
GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140 
CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 
AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 1260 
GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 
GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 
TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 
CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500 
TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560 
AACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 
GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 
ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740 
ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800 
GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860 
CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920 
ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980 
TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040 
GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG 2100 
ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 
TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220 
GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280 
CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAAOGT CCACCTCGGT CACCAGTGTG 2340 
AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 2400 
AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 
GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 
GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 
AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 
GATATAAACT CTCCAGAACA CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 
CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 
CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 
CTGTAA 

A4 Protein sequence 
Gene name: G protein-coupled receptor 51 
Unigene number: Hs.198612 
Probeset Accession #: AA452928 
Protein Accession #: NP_005449.1 
Signal sequence: 1-42 
Pfam domains: 7tm_3 [481-754], ANF receptor [130-204] 
Transmembrane domains: 22-44, 477-499, 517-539, 552-574, 595-617, 653-675, 692-713, 722-744 
Cellular Localization: plasma membrane 

1 11 21 31 41 51 

| | | | | | 

MASPRRSGQP GRPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPRPP PSSPPLSIMG 60 
LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 120 
IKYGPNHLMV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFFRTVPSDN 180 
AVNPAILKLL KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY GEDIEISDTE SFSNDPCTSV 240 
KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IPGWYEPSWW EQVHTEANSS 300 
RCLRKNLLAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER EYKNKRSGVG PSKFHGYAYD 360 
GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQWFRNG 420 
ERMGTIKFTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 480 
LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN HLIILGGMLS YASIFLFGLD 540 
GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 600 
VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV 660 
YAYKGLLMLP GCFLAWETRN VSIPALKDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 720 
FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 780 
NQASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQKH YQELNDILNL 840 
GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 900 
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L* 





WO 03/042661 



PCT/US02/36810 



A5 DNA SEQUENCE 

Gene name: ESTs 
Unigene number: Hs.293616 

Probeset Accession #: AW043782 
Nucleic Acid Accession #: none found 

Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 
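
The coding-sequence coordinates in these listings are 1-based and inclusive, so a region such as 38-1075 spans 1038 bases (346 codons including the stop). The following is a minimal Python sketch, not part of the original listing, of how such a region could be cut out of a formatted sequence and translated. The function names `extract_cds` and `translate` are hypothetical, the standard genetic code is assumed, and any codon containing an ambiguity symbol is reported as X.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def extract_cds(sequence, start, end):
    """Return the region given by 1-based, inclusive coordinates."""
    cleaned = "".join(sequence.split()).upper()  # drop the 10-base spacing
    if not 1 <= start <= end <= len(cleaned):
        raise ValueError("coordinates fall outside the sequence")
    return cleaned[start - 1:end]


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X: ambiguous codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Toy input (not taken from the listing): start codon at position 3.
toy = "CCATGGCTTA AGG"
cds = extract_cds(toy, 3, 11)
print(cds, translate(cds))
```

Applied to a listing, the same two calls give a quick consistency check between a DNA entry and its paired protein entry, since the translated length should equal the residue count of the protein table.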



1          11         21         31         41         51

|          |          |          |          |          |

AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 60 

CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 180 

GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCTAA 240 

GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACTGCACAGC 360 

AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 480 

AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660 

GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720 

CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780 

GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840 

TGCGTGGTAT GACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080 

AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800 

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860 

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640 

TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAAGGG GTAAAACGTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 

OCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 2820 

AACCTGTTTG ACGCTAATTA AAACAGAGCC TGCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880 

TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 
AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 
TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 
GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 
TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 
AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 
TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 
GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCOGGG 4020 
GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 




A6 Protein sequence:
Gene name: ESTs
Unigene number: Hs.293616
Probeset Accession #: AW043782
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: 169-191
LDLa domains: 28-66, 70-108, 112-149
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60
EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120
KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180
VIFVLVVALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL VVLDHPHHCN VTYNVNNGIQ 240
YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300
NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV
300 



A7 DNA SEQUENCE
Gene name: Integrin, alpha 5 (fibronectin receptor, alpha polypeptide)
Unigene number: Hs.149609
Probeset Accession #: X06256
Nucleic Acid Accession #:
Coding sequence: 1-3150 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |

10 LraMrAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 
30 KSS cTcTSS^CC GCTGCTGTTG CTGC.SSTGC CGCCGCCACC CAGGGTOGGG 
GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 

SSScag tggagtttta ccggccggga acagacgggg tcagtgtgct ggtgggagca 
fc^S^t aIacSgSa gccaggagto ctccagggtg gtgctgtcta cctctgtcct 
35 ££££££ Scccacaca gtgcaccccc attgaatttg acagcaaagg ctctcggctc 
SSactot cactgtccag ctcagaggga gaggagccig tggagtacaa gtccttgcag 
S^tSS Sacagttcg agcccatggc tcctccatct tggcatgcgc tccacigtac 

IS^CA CAGAGAAGGA GCCACTGAGC GACCCOGTGG GCACCTGCTA CCTCTCCACA 
iSrS OCWATTCT GGAGTATGCA CCCTGCOGCT CAGATTTCAG «GGGCAGCA 
40 GGACAGGGTT ACTGCCAAGG AGGCITCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 
™£gtggac caggaagcta TTTCTGGCAA GGCCAGATCC TGTCTGCCAC tcaggagcag 
I^Sgaat cttattaccc cgagtacctg atcaacctgg ttcaggggca gctgcagact 

GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 
S^Sg ATGACACAGA agactttgtt GCTGGTGTGC CCAAAGGGAA cctcacttac 
45 SSI TGGCTCAGAC ATTCGATCCC TCTACAACTT £«^GAA 

SStcgcct CCTACTTTCG CTATGCAGTG GCCGCCACAG ACGTCAATGG ggacgggctg 

SSSc Sgtgggggc acccctgctc atggatogga cccctgacgg gcggcct^ 
?AfirTGGGCA gggtctacgt ctacctgcag cacccagccg gcatagagcc »ogcccacc 
S SSgI tgagtttggc ggatttggca gctottgac cooktgggg 
50 gacctggacc aggatggcta caatgatgtg gccatcgggg cicccrrroG TGGGGAGACC 
DU Stttgt atttcctggg ggccc^gag ogctggg™ ™S«TICC 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 
^GGAGGCC GAGACCTGGA TGGCAATGGA TATCCTCATC TGATTGTGGG ^CCTTTGGT 
£S*GG CTGTGGTATA CAGGGGCCGC CCCATOGTGT CCGCTAGTGC ctccctcacc 

55 ATCTTCTCCG CWTGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTC 
SaS» ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 
^wScAG TGGAACTTCA GCTGGACTGG CAQAAGCAGA AGGGAGGGGT ACGGOTGGCA 
Sotqg CCTCCAGGCA GGCAACCCTG ACCCAGACCC tgctcatcca gaatggggct 
S^aggat? gcagagagat gaagatctac ctcaggaaog agtcagaatt tcgagacaaa 

60 SStcgSga TrataSSsc tctcaacttc tccttggacc cccaagcccc agtggacagc 

SSS^CA ScCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 

SSg actgtggaga agacaacatc tgtgtgccig acctgcagct ggaagtgttt 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTT^TGCC 
GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACGGC CCCTCCAGAG 
65 GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 

W Sg^^S Scagagccg cctgctggtg tgtgacctgg gcaaccccat gaac^ 

GCCAGTCTCT GGGGTGGCCT TCGGTTTACA GTCOCTCATC TCCGGGACAC TOASdUUWlA- 
ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGAOTTGGTT 

^S^Sgc tctccgtgga ggctcaggcc caggtcaccc tgaacggtgt ctccaagcct 
70 SSSSgc ?attcccact aagcgactgg catccccgag accagcctca gaaggaggag 

«^Sc CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 
IwSSStg TGCTGGAACT CAGCTGTCCC CAGGCTCTCG aaggtcagca gctcctatat 
SSSgact caactgcacc accaatcacc ccattaaccc aaagggcctg 
?T^£SS° ^Scttc cctgcaccac cagcaaaaac gggaagctcc aagccgcagc 
75 £K£ SSSS gS?ctgaaa tgcccggagg ctgagtgttt o^gctgcgc 

nrrrCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 
££££££ SSS GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 
£S2£5c SaSS SaSc CTGOCTCGGC AGCTGCCCCA AAAAGAGCGT 
S^GCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACIGTGG 
80 SSSS^S TAGCmrCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 
tISSg SSSI S^cctc CCATATGGCA CCGCCATGGA AAAAGCTCAG 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 



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A8 Protein sequence: 
Gene name: Integrin, alpha 5 (fibronectin receptor, alpha polypeptide)
Unigene number: Hs.149609
Probeset Accession #: X06256
Protein Accession #: NP_002196
Signal sequence: 1-42
Transmembrane domains: 998-1020
Integrin alpha domains: 56-115, 268-318, 322-384, 388-444, 452-503, 1022-1036
Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GPSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGVVFVFPG GPGGLGSRPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

VDKAVVYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAFVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDVV 780 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 



A9 DNA SEQUENCE 

Gene name: Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12) 

Unigene number: Hs. 287797 

Probeset Accession #: X07979 
Nucleic Acid Accession #: NM_002211.1 

Coding sequence: 1-2397 (underlined sequences correspond to start and stop codons) 
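
Each sequence row in these listings carries six blocks of ten symbols followed by a running count of the symbols printed so far. The sketch below, added for illustration only, shows one way to rejoin such rows into a single string while using the running count as a checksum. The helper names `parse_listing_row` and `join_listing` are hypothetical, and the code assumes that a purely numeric final token on a row is always the running count.

```python
def parse_listing_row(row):
    """Split one listing row into (symbols, running_count_or_None)."""
    tokens = row.split()
    count = None
    if tokens and tokens[-1].isdigit():
        count = int(tokens.pop())  # trailing number is the running count
    return "".join(tokens).upper(), count


def join_listing(rows):
    """Concatenate rows, checking every running count against the length."""
    sequence = ""
    for row in rows:
        if not row.strip():
            continue  # blank separator lines carry no data
        symbols, count = parse_listing_row(row)
        sequence += symbols
        if count is not None and count != len(sequence):
            raise ValueError(
                f"running count {count} does not match length {len(sequence)}"
            )
    return sequence


# Toy rows in the listing layout (shortened to two blocks per row).
rows = ["ATGAATTTAC AACCAATTTT 20", "", "CTGGATTGGA CTGATCAGTT 40"]
print(join_listing(rows))
```

A mismatch raised by `join_listing` points at the first row whose length disagrees with its printed count, which is usually where a block was dropped or a stray character was introduced.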



1 11 21 31 41 51 

I I I I I I 

ATGAATTTAC AACCAATTTT CTGGATTGGA CTGATCAGTT CAGTTTGCTG TGTGTTTGCT 60 

CAAACAGATG AAAATAGATG TTTAAAAGCA AATGCCAAAT CATGTGGAGA ATGTATACAA 120 

GCAGGGCCAA ATTGTGGGTG GTGCACAAAT TCAACATTTT TACAGGAAGG AATGCCTACT 180 

TCTGCACGAT GTGATGATTT AGAAGCCTTA AAAAAGAAGG GTTGCCCTCC AGATGACATA 240 

GAAAATCCCA GAGGCTCCAA AGATATAAAG AAAAATAAAA ATGTAACCAA CCGTAGCAAA 300 

GGAACAGCAG AGAAGCTCAA GCCAGAGGAT ATTACTCAGA TCCAACCACA GCAGTTGGTT 360 

TTGCGATTAA GATCAGGGGA GCCACAGACA TTTACATTAA AATTCAAGAG AGCTGAAGAC 420 

TATCCCATTG ACCTCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAGA CGATTTGGAG 480 

AATGTAAAAA GTCTTGGAAC AGATCTGATG AATGAAATGA GGAGGATTAC TTCGGACTTC 540 

AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC CTTACATTAG CACAACACCA 600 

GCTAAGCTCA GGAACCCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT TAGCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACTTGTTGG AAAACAGCGC 720 

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG GAGGAATGTT ACACGGCTGC TGGTGTTTTC CACAGATGCC 840 

GGGTTTCACT TTGCTGGAGA TGGGAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 900 

TGTCACCTGG AAAATAATAT GTACACAATG AGCCATTATT ATGATTATCC TTCTATTGCT 960 

CACCTTGTCC AGAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTGCAGT TACTGAAGAA 1020 

TTTCAGCCTG TTTACAAGGA GCTGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080 

TCTGCAAATT CTAGCAATGT AATTCAGTTG ATCATTGATG CATACAATTC CCTTTCCTCA 1140 

GAAGTCATTT TGGAAAACGG CAAATTGTCA GAAGGAGTAA CAATAAGTTA CAAATCTTAC 1200 

TGCAAGAACG GGGTGAATGG AACAGGGGAA AATGGAAGAA AATGTTCCAA TATTTCCATT 1260 

GGAGATGAGG TTCAATTTGA AATTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATGCCA AAGCGAAGGC ATCCCTGAAA GTCCCAAGTG TCATGAAGGA 1440 

AATGGGACAT TTGAGTGTGG CGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 1500 

GAATGCAGCA CAGATGAAGT TAACAGTGAA GACATGGATG CTTACTGCAG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAATGGAGAG TGCGTCTGCG GACAGTGTGT TTGTAGGAAG 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTGATAA TTTCAACTGT 1680 

GATAGATCCA ATGGCTTAAT TTGTGGAGGA AATGGTGTTT GCAAGTGTCG TGTGTGTGAG 1740 

TGCAACCCCA ACTACACTGG CAGTGCATGT GACTGTTCTT TGGATACTAG TACTTGTGAA 1800 

GCCAGCAACG GACAGATCTG CAATGGCCGG GGCATCTGCG AGTGTGGTGT CTGTAAGTGT 1860 

ACAGATCCGA AGTTTCAAGG GCAAACGTGT GAGATGTGTC AGACCTGCCT TGGTGTCTGT 1920 

GCTGAGCATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAGAAAA GAAAGACACA 1980 

TGCACACAGG AATGTTCCTA TTTTAACATT ACCAAGGTAG AAAGTCGGGA CAAATTACCC 2040 

CAGCCGGTCC AACCTGATCC TGTGTCCCAT TGTAAGGAGA AGGATGTTGA CGACTGTTGG 2100 

TTCTATTTTA GGTATTCAGT GAATGGGAAC AACGAGGTCA TGGTTCATGT TGTGGAGAAT 2160 

CCAGAGTGTC CCACTGGTCC AGACATCATT CCAATTGTAG CTGGTGTGGT TGCTGGAATT 2220 






GTTCTTATTG GCCTTGCATT ACTGCTGATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 
AGGGAGTTTG CTAAATTTGA AAAGGAGAAA ATGAATGCCA AATGGGACAC GGGTGAAAAT 2340 
CCTATTTATA AGAGTGCCGT AACAACTGTG GTCAATCCGA AGTATGAGGG AAAATGA 



A10 Protein sequence:
Gene name: Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12)
Unigene number: Hs.287797
Probeset Accession #: X07979
Protein Accession #: NP_002202.1
Signal sequence: 1-21
Transmembrane domains: 732-754
INB domain: 34-464
PSI domain: 26-76
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MNLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT 60
SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTAEKLKPED ITQIQPQQLV 120
LRLRSGEPQT FTLKFKRAED YPIDLYYLMD LSYSMKDDLE NVKSLGTDLM NEMRRITSDF 180
RIGFGSFVEK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNELVGKQR 240
ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GFHFAGDGKL GGIVLPNDGQ 300
CHLENNMYTM SHYYDYPSIA HLVQKLSENN IQTIFAVTEE FQPVYKELKN LIPKSAVGTL 360
SANSSNVIQL IIDAYNSLSS EVILENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 420
GDEVQFEISI TSNKCPKKDS DSFKIRPLGF TEEVEVILQY ICECECQSEG IPESPKCHEG 480
NGTFECGACR CNEGRVGRHC ECSTDEVNSE DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 540
RDNTNEIYSG KFCECDNFNC DRSNGLICGG NGVCKCRVCE CNPNYTGSAC DCSLDTSTCE 600
ASNGQICNGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT 660
CTQECSYFNI TKVESRDKLP QPVQPDPVSH CKEKDVDDCW FYFTYSVNGN NEVMVHVVEN 720
PECPTGPDII PIVAGVVAGI VLIGLALLLI WKLLMIIHDR REFAKFEKEK MNAKWDTGEN 780
PIYKSAVTTV VNPKYEGK
780 



A11 DNA SEQUENCE 

Gene name: selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs. 89546 

Probeset Accession #: M24736 

Nucleic Acid Accession #: NM_000450 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 



I 



ATGA TTGCTT 
GCCTGGTCTT 
CAGCAAAGGT 
TCCATATTGA 
TGGGTCTGGG 
GAACCCAACA 
GATGTGGGCA 
GCTGCCTGTA 
TACACTTGCA 
ACAGCCCTGG 
AGCTACAATT 
ACCATGCAGT 
GAGTGTGATG 
AGCTTCCCAT 
GCCCAGAGCC 
GCTGTGACAT 
CCTGCTGGAG 
TTGCAGGGAC 
GTTTGTGAAG 
CTTCCTAGTG 
GGTTTTGTGT 
GAGAAGCCCA 
. GTGAGGTGTG 
TGTGAGGAGG 
TGGACAGAAG 
AAGATCAACA 
CCTGAAGGAT 
TCTGGCCTGC 
CTTTCTGCTG 
TGCTTACGGA 
GGAAGCTACC 



11 
I 

CACAGTTTCT 
ACAACACCTC 
ACACACACCT 
GCTATTCACC 
TAGGAACCCA 
ATAGGCAAAA 
TGTGGAATGA 
CCAATACATC 
AGTGTGACCC 
AATCCCCTGA 



GTATGTCCTC 
CTGTGACAAA 
GGAACACAAC 
TTCAGTGTAC 
GCAGGGCCGT 
AGTTCACCTT 
CAGCCCAGGT 
CTTTCCAGTG 
CTTCTGGCAG 
TGAAGGGATC 
CATGTGAAGC 
CTCATTCCCC 
GATTTGAATT 
AGGTTCCTTC 
TGAGCTGCAG 
GGACGCTCAA 
TACCTACCTG 
CTGGACTCTC 
AAGCAAAGAA 
AAAAGCCTTC 



21 
I 

CTCAGCTCTC 
CACGGAAGCT 
GGTTGCAATT 
AAGTTATTAC 
GAAACCTCTG 
AGATGAGGAC 
TGAGAGGTGC 
CTGCAGTGGC 
TGGCTTCAGT 
GCATGGAAGC 
TATCAGCTGT 
TGGAGAATGG 
TCCAGCCAAT 
CTGTACATTT 
CTCATCTGGG 
CCGCCAGCCT 
CAAATCATCC 
TGAATGCACC 
CACAGCCTTG 
TTTCCGTTAT 
CAAAAGGCTC 
TGTGAGATGC 
TATTGGAGAA 
ATATGGATCA 
CTGCCAAGTG 
TGGGGAGCCC 
TGGCTCTGCA 
TGAAGCTCCC 
CCTCCTGACA 
ATTTGTTCCT 
TTACATCCTT 



31 
I 

ACTTTGGTGC 
ATGACTTATG 
CAAAACAAAG 
TGGATTGGAA 
ACAGAAGAAG 
TGCGTGGAGA 
AGCAAGAAGA 
CACGGTGAAT 
GGACTCAAGT 
CTGGTTTGCA 
GATAGGGGTT 
AGTGCTCCTA 
GGGTTCGTGG 
GACTGTGAAG 
AATTGGGACA 
CAGAATGGCT 
TGCAACTTCA 
ACTCAAGGGC 
TCCAACCCCG 
GGGTCCAGCT 
CAATGTGGCC 
GATGCTGTCC 
TTCACCTACA 
ACTCAACTTG 
GTAAAATGTT 
GTGTTTGGCA 
GCTCGGACAT 
ACTGAGTCCA 
TTAGCACCAT 
GCCAGCAGCT 
TAA 



41 



TTCTCATTAA 
ATGAGGCCAG 
AAGAGATTGA 
TCAGAAAAGT 
CCAAGAACTG 
TCTACATCAA 
AGCTTGCCCT 
GTGTAGAGAC 
GTGAGCAAAT 
GTCACCCACT 
ACCTGCCAAG 
TTCCAGCCTG 
AATGTTTCCA 
AAGGATTTGA 
ACGAGAAGCC 
CTGTGAGGTG 
CCTGTGAGGA 
AGTGGACACA 
AGCGAGGCTA 
GTGAGTTCTC 
CCACAGGGGA 
ACCAGCCCCC 
AGTCCTCTTG 
AGTGCACATC 
CAAGCCTGGC 
CTGTGTGCAA 
GTGGAGCCAC 
ACATTCCCTT 
TTCTCCTCTG 
GCCAAAGCCT 



51 
I 

AGAGAGTGGA 
TGCTTATTGT 
GTACCTAAAC 
CAACAATGTG 
GGCTCCAGGT 
GAGAGAAAAA 
ATGCTACACA 
CATCAATAAT 
TGTGAACTGT 
GGGAAACTTC 
CAGCATGGAG 
CAATGTGGTT 
AAACCCTGGA 
ACTAATGGGA 
AACGTGTAAA 
CAGCCATTCC 
AGGCTTCATG 
GCAAATCCCA 
CATGAATTGT 
CTGTGAGCAG 
GTGGGACAAC 
GAAGGGTTTG 
TGCCTTCAGC 
TCAGGGACAA 
AGTTCCGGGA 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



A12 Protein sequence:
Gene name: Selectin E (endothelial adhesion molecule 1)
Unigene number: Hs.89546
Probeset Accession #: M24736
Protein Accession #: NP_000441
Signal sequence: 1-22
Transmembrane domains: 555-573
C-lectin domain: 23-139
Cellular Localization: plasma membrane



i 
1 

MIASQFLSAL 
SILSYSPSYY 
DVGMWNDERC 
TALESPEHGS 
ECDAVTNPAN 
AVTCRAVRQP 
VCEAFQCTAL 
EKPTCEAVRC 
WTEEVPSCQV 
SGLLPTCEAP 
GSYQKPSYIL 



11 
I 

TLVLLIKESG 
WIGIRKVNNV 
SKKKLALCYT 
LVCSHPLGNF 
GFVECFQNPG 
QNGSVRCSHS 
SNPERGYMNC 
DAVHQPPKGL 
VKCSSLAVPG 
TESNIPLVAG 



21 
I 

AWSYNTSTEA 
WVWVGTQKPL 
AACTNTSCSG 
SYNSSCSISC 
SFPWNTTCTF 
PAGEFTFKSS 
LPSASGSFRY 
VRCAHSPIGE 
KINMSCSGEP 
LSAAGLSLLT 



31 
I 

MTYDEASAYC 
TEEAKNWAPG 
HGECVETINN 
DRGYLPSSME 
DCEEGFELMG 
CNFTCEEGFM 
GSSCEFSCEQ 
FTYKSSCAFS 
VPGTVCKFAC 
LAPFLLWLRK 



41 
I 

QQRYTHLVAI 
EPNNRQKDED 
YTCKCDPGFS 
TMQCMSSGEW 
AQSLQCTSSG 
LQGPAQVECT 
GFVLKGSKRL 
CEEGFELYGS 
PEGWTLNGSA 
CLRKAKKFVP 



51 
I 

QNKEEIEYLN 
CVEIYIKREK 
GLKCEQIVNC 
SAPIPACNW 
NWDNEKPTCK 
TQGQWTQQIP 
QCGPTGEWDN 
TQLECTSQGQ 
ARTCGATGHW 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



A13 DNA SEQUENCE
Gene name: G protein-coupled receptor 39
Unigene number: Hs.85339
Probeset Accession #: AA349893
Nucleic Acid Accession #: NM_001508
Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)



ATGG CTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
ATGTGCTGGA 
ACGCGGCCTC 
ACCATCATCT 
ATTCGGAGGA 
GCGTACATGA 
CCGCTCCTGT 
TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 

I 

CCAGCCTCCC 
AGGTGGCCAC 
GCCTTCTGGG 
TGCAGAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 
CCCTGGTGGC 
CCAGCCACCG 
CCTCCAATAT 
TCTTCGGCGC 
ACATGATGCA 
CGCAGCTGAG 
TCCTGAGGCT 
TCATGGCTGC 
TCCTCCTCCC 
ACACGGTGTC 
CGCTGCAGCA 
GCGCCCGCTT 
GAACTGAGAA 
AGTCATTGAG 
CAGAGAATGG 



21 



GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGCC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACACTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTCGCAGCAG 
CGCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 

t 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CACATGGTGA 
TTCTACAGCA 
CACACTTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 



ACATTGGCCG 
AAGCACGACT 
ACGTTTTTCT 
TTTCGGCGGG 
GAGAAGCGCC 
CCGTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATGAAGTTT 



41 
I 

TCATTGATCA 
TTCTGGTGTA 
TCACCCAGGT 
GTTTGGCTTG 
TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGTC 
ACCTCAGCTC 
TGTTCGTGCA 
TGCGCGTACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACTCAGGCGC 
GA 



SI 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



A14 Protein sequence:
Gene name: G protein-coupled receptor 39
Unigene number: Hs.85339
Probeset Accession #: AA349893
Protein Accession #: NM_001508, NP_001409
Signal sequence: none found
Pfam domains: 7tm_1 [72-172, 224-344]
Transmembrane domains: 32-54, 68-90, 111-133, 151-173, 221-243, 280-301, 320-342
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSA TIRVTQVLQK 60
KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF 240
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ 300
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360
CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV



AI Prostate 



A15 DNA sequence
Gene name: CEGP1
Unigene number: Hs.222399
Probeset Accession #: AA256485
Nucleic Acid Accession #: AJ400877
Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51

|          |          |          |          |          |

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG COGCCTCTGC 60 

CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 

GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 

GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 

GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCOGATG 780 

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 

AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 

ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 

ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 

GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 

GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 

CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 

GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 

AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 

CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 

GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 

GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGGTGG 2520 

GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 

AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 

CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 

CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 

TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 

AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 

TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 

GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 

CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 

CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 

TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 

TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 

CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

CARGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 

ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 

AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

A16 Protein sequence 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
PFAM domains: 

Cellular Localization: 



CEGP1 
HS. 222399 
AA256485 
CAB92285 
1-31 
• none 

EGF-like_domainS [49-84,132-167,177-213,286-321.407-442] 
CUB_domain [809-918] 
may be secreted 



1          11         21         31         41         51

|          |          |          |          |          |

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 

LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 

HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180 

DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 

PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 






STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 

SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 

TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 

SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 

PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 

TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 

PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 

KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 

PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSKSVTTY ETCQTYERPI AFTSRSKKLW 900 

IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 



A17 DNA sequence 

Gene name: ESTs 

Unigene number: Hs.293102 

Probeset Accession #: AI951118, AW170035, AL044891, AI908272 

Nucleic Acid Accession #: none found 

Coding sequence: 1-489 (entire sequence is an open reading frame) 



1          11         21         31         41         51
|          |          |          |          |          |
CAAAAAGAAA TAGATAAAAT AAATGGAAAA TTAGAAGAGT CTCCTGATAA TGATGGTTTT 60
CTGAAGGCTC CCTGCAGAAT GAAAGTTTCT ATTCCAACTA AAGCCTTAGA ATTGATGGAC 120
ATGCAAACTT TCAAAGCAGA GCCTCCCGAG AAGCCATCTG CCTTCGAGCC TGCCATTGAA 180
ATGCAAAAGT CTGTTCCAAA TAAAGCCTTG GAATTGAAGA ATGAACAAAC ATTGAGAGCA 240
GATCAGATGT TCCCTTCAGA ATCAAAACAA AAGAASGTTG AAGAAAATTC TTGGGATTCT 300
GAGAGTCTCC GTGAGACTGT TTCACAGAAG GATGTGTGTG TACCCAAGGC TACACATCAA 360
AAAGAAATGG ATAAAATAAG TGGAAAATTA GAAGATTCAA CTAGCCTATC AAAAATCTTG 420
GATACAGTTC ATTCTTGTGA AAGAACAAGG GAACTTCAAA AAGACCCCTG TGACCCACGT 480
TCAGGAAAA



A18 Protein sequence 
Gene name: ESTs 
Unigene number: Hs.293102 
Probeset Accession #: AI951118, AW170035, AL044891, AI908272 
Protein Accession #: none found 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: nuclear 



1 11 21 31 41 51 

I I I I I I 

QKEIDKINGK LEESPDNDGF LKAPCRMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE 60 
MQKSVPNKAL ELKNEQTLRA DQMFPSESKQ KXVEENSWDS ESLRETVSQK DVCVPKATHQ 120 
KEMDKISGKL EDSTSLSKIL DTVHSCERTR ELQKDPCDPR SGK 
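
The coding range given for a DNA record can be checked against the length of the paired protein record. The following is a minimal illustrative sketch and not part of the original listing: the function name `codon_count` is hypothetical, and it assumes the coordinates in a "Coding sequence" field are 1-based and inclusive.

```python
def codon_count(coding_range: str) -> int:
    """Return the number of complete codons in a range such as '1-489'.

    Assumption (not stated in the listing): coordinates are 1-based and
    inclusive, so the range covers end - start + 1 nucleotides.
    """
    start_text, end_text = coding_range.split("-")
    start, end = int(start_text), int(end_text)
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {coding_range!r} is not a whole number of codons")
    return length // 3


# A17 is described as an open reading frame over its entire length (1-489).
print(codon_count("1-489"))  # prints 163
```

For A17 the result is 163 codons, which agrees with the 163 residues listed for A18 (60 + 60 + 43). For records whose range includes a stop codon, the expected protein length would be one less than the codon count.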



Breast ' 



A19 DNA SEQUENCE 

Gene name: TMPRSS3a 

Unigene number: Hs.298241 

Probeset Accession #: AI538613 

Nucleic Acid Accession #: AB038157 

Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 



GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 
AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 
TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560 
ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620 
TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 
TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 
CAAGCTGCTT [illegible] GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 
AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGCGATTCTC 1860 
TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920 
TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 
CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040 
ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 
AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 
ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 
GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 
CCAAACCCAC CCTTTCTACT TGCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 
ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 
TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 



A20 Protein sequence: 
Gene name: TMPRSS3a 
Unigene number: Hs.298241 
Probeset Accession #: AI538613 
Protein Accession #: BAB20077 
Signal sequence: none found 
Transmembrane domains: 43-65, 239-261 
Tryp_SPc domain: 216-444 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA 360 
AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 420 
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT 



A21 DNA SEQUENCE 

Gene name: ESTs; opposite strand to TRPS1 

Unigene number: none 
Probeset Accession #: AA428090 
Nucleic Acid Accession #: AA428090 

Coding sequence: 1-558 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGAAGCCCA GTTTGAAACA CTGGGCTGAT ATAAAAATGT TTTCTGAAAT AGACCAAAGG 60 

CAAGTTGTTG GAGAAGAAAT TCATCTACAA GTAGTATCTG TATCTTATCT TGTAGAAAAC 120 

TTTTCAGACA CAGATGATCT TATGTCCATA GTGGAGGAAT CAGACAGCTG CTACAACCGT 180 

GATTCTGTTG GGCTTCCTGG TAGAGCTGGA GACAGACTGC AACCCAAGAC AAAACCCAGA 240 

GGAAAAACAA CTAACCTCAG CCATTTGCCC AATTTCCACC TTGCTAGTTA TCAGAGTTCA 300 

CTATTGTTTA AGCTCCAGGG GTCATACTAT GGCAATCTAC TGGTGGAATT TATTCTGAGC 360 

AAGTGTTTTG TTCAGCTTGC AGTATTAAAA CAAAAAAAAC ATTGCCTCCA GCTGCAAAGC 420 

AAGGGCATTG CCATTATGAA AGCCCCTCAA AGACTCTCTG CTATTTTCAA AACATGGAAA 480 

GAAAAAGGGA AAAAAGAAAA AAAAATAATA ATTAGAAGGA TTTGTTCCTT AATTTGGGCT 540 

CCCAAAAATG AGAAATGAAG ATTGTATAAT GAGGGGAGAT ACTAATTATT TTAAACTCTC 600 

CAAGCAAATC TTCTGAAGCA ATCAATTATT TATATACTTT ATGTTCTGTC TTTTTGTATT 660 

TTTCCTTCTG GTTAAAAACA TGCAGGTGAG TCTTGCCAAC GTCCTTTCCT ATCTGGATCT 720 

GTTCTGCCTC ATTTCTCTTT CAAAGTCATC TTTCAGGGAA CTTGCCCTGA TTAATTTGAT 780 

TTTAACCAAA CAAATAAGAT ATTTGATATA TTAATTTAAA CTTTTTGAGA TGATTGATTA 840 

GGAATTGCAT CATGTTCACA TGAGTATACC GAATTCAAAG TTAAACTTCA TAAGCAGGAG 900 

TTTTTACACA TCGTAACATA ATCATTACCC AATACTCGAC ACTCAATATT TGATACTCAA 960 

CTGAATGTTT TTGAAATAAA CACATTTTTA TGTTATCTCT CTGGAGAAAG TAGTATATAT 1020 

CTTTTTACAC AAAATATATC AGTGAGAGAG TGTTTGTTTA AGAAAAAAAA TCAAAGCACA 1080 

ACAAGTTGAG AGAGTCCAGG CTTTATCAAT ATAAGTAATA ATTTTTTAGA ATGGTGATTT 1140 

GATTTCACCA TTTCAATTCA GCAGAGCCTG TATATATATA TATATATATA TATATATATA 1200 

TATATATATA TATATATATA TATATTACAA TGATCTGTAT TTCCTATTGC TAGAAGGATG 1260 

AAAGTGAATC CATATAAACC ATACCAACGC CGTTATGTGT AACTGGTGGT AAAACTTTAT 1320 

TATTCAAGTT TAGATGTAAC AGACATCTTT GCTGCCTGAA GATTGTTTGC ATAAGAAATA 1380 

CACCAAGAAC ATGTTTGTGA GTAGAAATGA ACATGCACTA TGAAAACAAA ATAAAATAAA 1440 

ACGAAAAAAT TTCATGTGTT GTAAGAACAG AACTATTATA GCCAACATTC TAGTATTCAA 1500 

ATCAGGACTA CAAATTGAAT TCTTTTTCTT AGCAACATGA AATCATTCCA TATGAAAGAC 1560 

ATTTTCTGCT GGTGAATATT GCTGTAAGTT AAATTTTACA TTGGCATTTT GAGATGTTCC 1620 
CCCCTCATGC CTCCCCCAAA GTTTTCCATG TGGTTGTCAA ATAGTCCGC 



A22 Protein sequence: 



Gene name: ESTs; opposite strand to TRPS1 
Unigene number: none 
Probeset Accession #: AA428090 
Protein Accession #: none found 
Signal sequence: none found 
Transmembrane domains: 113-129 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MKPSLKHWAD IKMFSEIDQR QVVGEEIHLQ VVSVSYLVEN FSDTDDLMSI VEESDSCYNR 60 
DSVGLPGRAG DRLQPKTKPR GKTTNLSHLP NFHLASYQSS LLFKLQGSYY GNLLVEFILS 120 
KCFVQLAVLK QKKHCLQLQS KGIAIMKAPQ RLSAIFKTWK EKGKKEKKII IRRICSLIWA 180 
PKNEK 



A23 DNA SEQUENCE 

Gene name: ESTs 
Unigene number: Hs.29383 
Probeset Accession #: AW207206 
Nucleic Acid Accession #: AL133619 

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 
CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 
ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 
GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 
GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 
ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 



A24 Protein sequence: 
Gene name: ESTs 
Unigene number: Hs.29383 
Probeset Accession #: AW207206 
Protein Accession #: T43457 
Signal sequence: none found 
Transmembrane domains: 303-322 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 



A25 DNA SEQUENCE 
Gene name: ESTs; prolactin receptor 
Unigene number: Hs.25252 
Probeset Accession #: AA057193 
Nucleic Acid Accession #: NM_000949 
Coding sequence: 285-2153 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 
TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 
CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 
TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1800 
CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA TGTAGATTTA TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGCTTAGTAT CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 
AAAAGCACAC TTTGGCATGG TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA AAGAAAATTT GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA CCTTTACACT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA TGCATTCCAG CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 
AAAGAAAGAA GCAAGTAAGA AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT AAT 



A26 Protein sequence: 
Gene name: ESTs; prolactin receptor 
Unigene number: Hs.25252 
Probeset Accession #: AA057193 
Protein Accession #: NP_000940 
Signal sequence: 1-23 
Transmembrane domains: 237-253 
FN3 domains: 28-112, 127-215 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLYEI RLKPEKAAEW 180 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLSAVICL IIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 



A27 DNA SEQUENCE 

Gene name: Human neuropeptide Y receptor Y1 (NPYY1) 
Unigene number: Hs.169266 
Probeset Accession #: L07615 
Nucleic Acid Accession #: NM_000909.1 
Coding sequence: 209-1363 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CATTCCCACC CTTCCTTCTT TAATAAGCAG GAGCGAAAAA GACAAATTCC AAAGAGGATT 60 

GTTCAGTTCA AGGGAATGAA GAATTCAGAA TAATTTTGGT AAATGGATTC CAATATCGGG 120 

AATAAGAATA AGCTGAACAG TTGACCTGCT TTGAAGAAAC ATACTGTCCA TTTGTCTAAA 180 

ATAATCTATA ACAACCAAAC CAATCAAAAT GAATTCAACA TTATTTTCCC AGGTTGAAAA 240 

TCATTCAGTC CACTCTAATT TCTCAGAGAA GAATGCCCAG CTTCTGGCTT TTGAAAATGA 300 

TGATTGTCAT CTGCCCTTGG CCATGATATT TACCTTAGCT CTTGCTTATG GAGCTGTGAT 360 

CATTCTTGGT GTCTCTGGAA ACCTGGCCTT GATCATAATC ATCTTGAAAC AAAAGGAGAT 420 

GAGAAATGTT ACCAACATCC TGATTGTGAA CCTTTCCTTC TCAGACTTGC TTGTTGCCAT 480 

CATGTGTCTC CCCTTTACAT TTGTCTACAC ATTAATGGAC CACTGGGTCT TTGGTGAGGC 540 

GATGTGTAAG TTGAATCCTT TTGTGCAATG TGTTTCAATC ACTGTGTCCA TTTTCTCTCT 600 

GGTTCTCATT GCTGTGGAAC GACATCAGCT GATAATCAAC CCTCGAGGGT GGAGACCAAA 660 

TAATAGACAT GCTTATGTAG GTATTGCTGT GATTTGGGTC CTTGCTGTGG CTTCTTCTTT 720 

GCCTTTCCTG ATCTACCAAG TAATGACTGA TGAGCCGTTC CAAAATGTAA CACTTGATGC 780 

GTACAAAGAC AAATACGTGT GCTTTGATCA ATTTCCATCG GACTCTCATA GGTTGTCTTA 840 

TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT TGTTTTATAT TTATTTGCTA 900 

CTTCAAGATA TATATACGCC TAAAAAGGAG AAACAACATG ATGGACAAGA TGAGAGACAA 960 

TAAGTACAGG TCCAGTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTGTGGTAGC 1020 

ATTTGCAGTC TGCTGGCTCC CTCTTACCAT CTTTAACACT GTGTTTGATT GGAATCATCA 1080 

GATCATTGCT ACCTGCAACC ACAATCTGTT ATTCCTGCTC TGCCACCTCA CAGCAATGAT 1140 

ATCCACTTGT GTCAACCCCA TATTTTATGG GTTCCTGAAC AAAAACTTCC AGAGAGACTT 1200 

GCAGTTCTTC TTCAACTTTT GTGATTTCCG GTCTCGGGAT GATGATTATG AAACAATAGC 1260 

CATGTCCACG ATGCACACAG ATGTTTCCAA AACTTCTTTG AAGCAAGCAA GCCCAGTCGC 1320 

ATTTAAAAAA ATCAACAACA ATGATGATAA TGAAAAAATC TGAAACTACT TATAGCCTAT 1380 

GGTCCCGGAT GACATCTGTT TAAAAACAAG CACAACCTGC AACATACTTT GATTACCTGT 1440 

TCTCCCAAGG AATGGGGTTG AAATCATTTG AAAATGACTA AGATTTTCTT GTCTTGCTTT 1500 

TTACTGCTTT TGTTGTAGTT GTCATAATTA CATTTGGAAC AAAAGGTGTG GGCTTTGGGG 1560 

TCTTCTGGAA ATAGTTTTGA CCAGACATCT TTGAAGTGCT TTTTGTGAAT TTATGCATAT 1620 

AATATAAAGA CTTTTATACT GTACTTATTG GAATGAAATT TCTTTAAAGT ATTACGATGC 1680 

GCTGACTTCA GAAGTACCTG CCATCCAATA CGGTCATTAG ATTGGGTCAT CTTGATTAGA 1740 

TTAGATTAGA TTAGATTGTC AACAGATTGG GCCATCCTTA CTTTATGATA GGCATCATTT 1800 

TAGTGTGTTA CAATAGTAAC AGTATGCAAA AGCAGCATTC AGGAGCCGAA AGATAGTCTT 1860 

GAAGTCATTC AGAAGTGGTT TGAGGTTTCT GTTTTTTGGT GGTTTTTGTT TGTTTTTTTT 1920 

TTTTTTCACC TTAAGGGAGG CTTTCATTTC CTCCCGACTG ATTGTCACTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTGGGCACCC 2040 

ACAGGAATGA AGAGAGAAAG CAGCTCCCCA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAGCATTTT AGAGTAATTA ATTTAATAAA GTAAATTAGT ATTGCTGCAA ATAGCTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAGAGAT TTTCCATTTT TTTTACAGAC TGTTCAGTGT 2220 

TTGTCAAGCT TCTGGTCTAA TATGTACTCG AAAGACTTTC CGCTTACAAT TTGTAGAAAC 2280 

ACAAATATCG TTTTCCATAC AGCAGTGCCT ATATAGTGAC TGATTTTAAC TTTCAATGTC 2340 

CATCTTTCAA AGGAAGTAAC ACCAAGGTAC AATGTTAAAG GAATATTCAC TTTACCTAGC 2400 

AGGGAAAAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

GTGTGACTTG TGGCGTCTTA TAAATAATGC ACTGTAAAGA TTACTGAATA GTTGTGTCAT 2520 

GTTAATGTGC CTAATTTCAT GTATCTTGTA ATCATGATTG AGCCTCAGAA TCATTTGGAG 2580 

AAACTATATT TTAAAGAACA AGACATACTT CAATGTATTA TACAGATAAA GTATTACATG 2640 

TGTTTGATTT TAAAAGGGCG GACATTTTAT TAAAATCAAT ATTGTTTTTG CTTTTTCTGA 2700 
GGAGTCTCTT TCAGTTTCAT TTTTTCTCAT CCCATGACTT CCCTCCGATG GT 



A28 Protein sequence: 
Gene name: Human neuropeptide Y receptor Y1 (NPYY1) mRNA, exon 2-3 and complete cds 
Unigene number: Hs.169266 
Probeset Accession #: L07615 
Protein Accession #: NP_000900.1 
Signal sequence: none found 
Pfam domains: 7tm_1 [57-91] 
Transmembrane domains: 39-61, 77-99, 118-139, 157-179, 212-234, 264-286, 300-322 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MNSTLFSQVE NHSVHSNFSE KNAQLLAFEN DDCHLPLAMI FTLALAYGAV IILGVSGNLA 60 

LIIIILKQKE MRNVTNILIV NLSFSDLLVA IMCLPFTFVY TLMDHWVFGE AMCKLNPFVQ 120 

CVSITVSIFS LVLIAVERHQ LIINPRGWRP NNRHAYVGIA VIWVLAVASS LPFLIYQVMT 180 

DEPFQNVTLD AYKDKYVCFD QFPSDSHRLS YTTLLLVLQY FGPLCFIFIC YFKIYIRLKR 240 

RNNMMDKMRD NKYRSSETKR INIMLLSIVV AFAVCWLPLT IFNTVFDWNH QIIATCNHNL 300 

LFLLCHLTAM ISTCVNPIFY GFLNKNFQRD LQFFFNFCDF RSRDDDYETI AMSTMHTDVS 360 

A29 DNA SEQUENCE 

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175) 
Unigene number: Hs.208229 

Probeset Accession #: AI819198 
Nucleic Acid Accession #: NM_032551.1 

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGCACACCG TGGCTACGTC CGGACCCAAC GCGTCCTGGG GGGCACCGGC CAACGCCTCC 60 

GGCTGCCCGG GCTGTGGCGC CAACGCCTCG GACGGCCCAG TCCCTTCGCC GCGGGCCGTG 120 

GACGCCTGGC TCGTGCCGCT CTTCTTCGCG GCGCTGATGC TGCTGGGCCT GGTGGGGAAC 180 

TCGCTGGTCA TCTACGTCAT CTGCCGCCAC AAGCCGATGC GGACCGTGAC CAACTTCTAC 240 

ATCGCCAACC TGGCGGCCAC GGACGTGACC TTCCTCCTGT GCTGCGTCCC CTTCACGGCC 300 

CTGCTGTACC CGCTGCCCGG CTGGGTGCTG GGCGACTTCA TGTGCAAGTT CGTCAACTAC 360 

ATCCAGCAGG TCTCGGTGCA GGCCACGTGT GCCACTCTGA CCGCCATGAG TGTGGACCGC 420 

TGGTACGTGA CGGTGTTCCC GTTGCGCGCC CTGCACCGCC GCACGCCCCG CCTGGCGCTG 480 

GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCGCCGGT GCTCGCCCTG 540 

CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG CCTTCCCCAG CCGCGCCCTG 600 

GAGCGCGCCT TCGCACTGTA CAACCTGCTG GCGCTGTACC TGCTGCCGCT GCTCGCCACC 660 

TGCGCCTGCT ATGCGGCCAT GCTGCGCCAC CTGGGCCGGG TCGCCGTGCG CCCCGCGCCC 720 

GCCGATAGCG CCCTGCAGGG GCAGGTGCTG GCAGAGCGCG CAGGCGCCGT GCGGGCCAAG 780 

GTCTCGCGGC TGGTGGCGGC CGTGGTCCTG CTCTTCGCCG CCTGCTGGGG CCCCATCCAG 840 

CTGTTCCTGG TGCTGCAGGC GCTGGGCCCC GCGGGCTCCT GGCACCCACG CAGCTACGCC 900 

GCCTACGCGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 960 

CTGCTCTACG CCTTCCTGGG CTCGCACTTC CGACAGGCCT TCCGCCGCGT CTGCCCCTGC 1020 

GCGCCGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 1080 

GCGGAGCTGC ACCGCCTGGG GTCCCACCCG GCCCCCGCCA GGGCGCAGAA GCCAGGGAGC 1140 
AGTGGGCTGG CCGCGCGCGG GCTGTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA 



A30 Protein sequence 

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175), mRNA 

Unigene number: Hs. 208229 

Protein Accession #: AI819198 

Signal sequence: none found 

Pfam domains: 7tm_1 [59-323] 

Transmembrane domains: 43-65, 86-108, 122-144, 159-181, 203-225, 260-282 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MHTVATSGPN ASWGAPANAS GCPGCGANAS DGPVPSPRAV DAWLVPLFFA ALMLLGLVGN 60 

SLVIYVICRH KPMRTVTNFY IANLAATDVT FLLCCVPFTA LLYPLPGWVL GDFMCKFVNY 120 

IQQVSVQATC ATLTAMSVDR WYVTVFPLRA LHRRTPRLAL AVSLSIWVGS AAVSAPVLAL 180 

HRLSPGPRAY CSEAFPSRAL ERAFALYNLL ALYLLPLLAT CACYAAMLRH LGRVAVRPAP 240 

ADSALQGQVL AERAGAVRAK VSRLVAAVVL LFAACWGPIQ LFLVLQALGP AGSWHPRSYA 300 

AYALKTWAHC MSYSNSALNP LLYAFLGSHF RQAFRRVCPC APRRPRRPRR PGPSDPAAPH 360 
AELHRLGSHP APARAQKPGS SGLAARGLCV LGEDNAPL 

A31 DNA SEQUENCE 

Gene name: CH22_FGENES.706; cadherin, EGF LAG seven-pass G-type receptor 1, flamingo (Drosophila) homolog (CELSR1) 

Unigene number: Hs.252387 

Probeset Accession #: NM_014246 

Nucleic Acid Accession #: NM_014246 

Coding sequence: 1-9045 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGCGCCGC CGCCGCCGCC CGTGCTGCCC GTGCTGCTGC TCCTGGCCGC CGCCGCCGCC 60 

CTGCCGGCGA TGGGGCTGCG AGCGGCCGCC TGGGAGCCGC GCGTACCCGG CGGGACCCGC 120 

GCCTTCGCCC TCCGGCCCGG CTGTACCTAC GCGGTGGGCG CCGCTTGCAC GCCCCGGGCG 180 

CCGCGGGAGC TGCTGGACGT GGGCCGCGAT GGGCGGCTGG CAGGACGTCG GCGCGTCTCG 240 

GGCGCGGGGC GCCCGCTGCC GCTGCAAGTC CGCTTGGTGG CCCGCAGTGC CCCGACGGCG 300 

CTGAGCCGCC GCCTGCGGGC GCGCACGCAC CTTCCCGGCT GCGGAGCCCG TGCCCGGCTC 360 

TGCGGAACCG GTGCCCGGCT CTGCGGGGCG CTCTGCTTCC CCGTCCCCGG CGGCTGCGCG 420 

GCCGCGCAGC ATTCGGCGCT CGCAGCTCCG ACCACCTTAC CCGCCTGCCG CTGCCCGCCG 480 

CGCCCCAGGC CCCGCTGTCC CGGCCGTCCC ATCTGCCTGC CGCCGGGCGG CTCGGTCCGC 540 

CTGCGTCTGC TGTGCGCCCT GCGGCGCGCG GCTGGCGCCG TCCGGGTGGG ACTGGCGCTG 600 

GAGGCCGCCA CCGCGGGGAC GCCCTCCGCG TCGCCATCCC CATCGCCGCC CCTGCCGCCG 660 

AACTTGCCCG AAGCCCGGGC GGGGCCGGCG CGACGGGCCC GGCGGGGCAC GAGCGGCAGA 720 

GGGAGCCTGA AGTTTCCGAT GCCCAACTAC CAGGTGGCGT TGTTTGAGAA CGAACCGGCG 780 

GGCACCCTCA TCCTCCAGCT GCACGCGCAC TACACCATCG AGGGCGAGGA GGAGCGCGTG 840 

AGCTATTACA TGGAGGGGCT GTTCGACGAG CGCTCCCGGG GCTACTTCCG AATCGACTCT 900 

GCCACGGGCG CCGTGAGCAC GGACAGCGTA CTGGACCGCG AGACCAAGGA GACGCACGTC 960 

CTCAGGGTGA AAGCCGTGGA CTACAGTACG CCGCCGCGCT CGGCCACCAC CTACATCACT 1020 

GTCTTGGTCA AAGACACCAA CGACCACAGC CCGGTCTTCG AGCAGTCGGA GTACCGCGAG 1080 

CGCGTGCGGG AGAACCTGGA GGTGGGCTAC GAGGTGCTGA CCATCCGCGC CAGCGACCGC 1140 
GACTCGCCCA TCAACGCCAA CTTGCGTTAC CGCGTGTTGG GGGGCGCGTG GGACGTCTTC 1200 

CAGCTCAACG AGAGCTCTGG CGTGGTGAGC ACACGGGCGG TGCTGGACGG GGAGGAGGCG 1260 

GCCGAGTACC AGCTCCTGGT GGAGGCCAAC GACCAGGGGC GCAATCCGGG CCCGCTCAGT 1320 

GCCACGGCCA CCGTGTACAT CGAGGTGGAG GACGAGAACG ACAACTACCC CCAGTTCAGC 1380 

GAGCAGAACT ACGTGGTCCA GGTGCCCGAG GACGTGGGGC TCAACACGGC TGTGCTGCGA 1440 

GTGCAGGCCA CGGACCGGGA CCAGGGCCAG AACGCGGCCA TTCACTACAG CATCCTCAGC 1500 

GGGAACGTGG CCGGCCAGTT CTACCTGCAC TCGCTGAGCG GGATCCTGGA TGTGATCAAC 1560 

CCCTTGGATT TCGAGGATGT CCAGAAATAC TCGCTGAGCA TTAAGGCCCA GGATGGGGGC 1620 

CGGCCCCCGC TCATCAATTC TTCAGGGGTG GTGTCTGTGC AGGTGCTGGA TGTCAACGAC 1680 

AACGAGCCTA TCTTTGTGAG CAGCCCCTTC CAGGCCACGG TGCTGGAGAA TGTGCCCCTG 1740 

GGCTACCCCG TGGTGCACAT TCAGGCGGTG GACGCGGACT CTGGAGAGAA CGCCCGGCTG 1800 

CACTATCGCC TGGTGGACAC GGCCTCCACC TTTCTGGGGG GCGGCAGCGC TGGGCCTAAG 1860 

AATCCTGCCC CCACCCCTGA CTTCCCCTTC CAGATCCACA ACAGCTCCGG TTGGATCACA 1920 

GTGTGTGCCG AGCTGGACCG CGAGGAGGTG GAGCACTACA GCTTCGGGGT GGAGGCGGTG 1960 

GACCACGGCT CGCCCCCCAT GAGCTCCTCC ACCAGCGTGT CCATCACGGT GCTGGACGTG 2040 

AATGACAACG ACCCGGTGTT CACGCAGCCC ACCTACGAGC TTCGTCTGAA TGAGGATGCG 2100 

GCCGTGGGGA GCAGCGTGCT GACCCTGCAG GCCCGCGACC GTGACGCCAA CAGTGTGATT 2160 

ACCTACCAGC TCACAGGCGG CAACACCCGG AACCGCTTTG CACTCAGCAG CCAGAGAGGG 2220 

GGCGGCCTCA TCACCCTGGC GCTACCTCTG GACTACAAGC AGGAGCAGCA GTACGTGCTG 2280 

GCGGTGACAG CATCCGACGG CACACGGTCG CACACTGCGC ATGTCCTAAT CAACGTCACT 2340 

GATGCCAACA CCCACAGGCC TGTCTTTCAG AGCTCCCATT ACACAGTGAG TGTCAGTGAG 2400 

GACAGGCCTG TGGGCACCTC CATTGCTACC CTCAGTGCCA ACGATGAGGA CACAGGAGAG 2460 

AATGCCCGCA TCACCTACGT GATTCAGGAC CCCGTGCCGC AGTTCCGCAT TGACCCCGAC 2520 

AGTGGCACCA TGTACACCAT GATGGAGCTG GACTATGAGA ACCAGGTCGC CTACACGCTG 2580 

ACCATCATGG CCCAGGACAA CGGCATCCCG CAGAAATCAG ACACCACCAC CCTAGAGATC 2640 

CTCATCCTCG ATGCCAATGA CAATGCACCC CAGTTCCTGT GGGATTTCTA CCAGGGTTCC 2700 

ATCTTTGAGG ATGCTCCACC CTCGACCAGC ATCCTCCAGG TCTCTGCCAC GGACCGGGAC 2760 

TCAGGTCCCA ATGGGCGTCT GCTGTACACC TTCCAGGGTG GGGACGAGGG CGATGGGGAC 2820 

TTCTACATCG AGCCCACGTC CGGTGTGATT CGCACCCAGC GCCGGCTGGA CCGGGAGAAT 2880 

GTGGCCGTGT ACAACCTTTG GGCTCTGGCT GTGGATCGGG GCAGTCCCAC TCCCCTTAGC 2940 

GCCTCGGTAG AAATCCAGGT GACCATCTTG GACATTAATG ACAATGCCCC CATGTTTGAG 3000 

AAGGACGAAC TGGAGCTGTT TGTTGAGGAG AACAACCCAG TGGGGTCGGT GGTGGCAAAG 3060 

ATTCGTGCTA ACGACCCTGA TGAAGGCCCT AATGCCCAGA TCATGTATCA GATTGTGGAA 3120 

GGGGACATGC GGCATTTCTT CCAGCTGGAC CTGCTCAACG GGGACCTGCG TGCCATGGTG 3180 

GAGCTGGACT TTGAGGTCCG GCGGGAGTAT GTGCTGGTGG TGGAGGCCAC GTCGGCTCCG 3240 

CTGGTGAGCC GAGCCACGGT GCACATCCTT CTCGTGGACC AGAATGACAA CCCGCCTGTG 3300 

CTGCCCGACT TCCAGATCCT CTTCAACAAC TATGTCACCA ACAAGTCCAA CAGTTTCCCC 3360 

ACCGGCGTGA TCGGCTGCAT CCCGGCCCAT GACCCCGACG TGTCAGACAG CCTCAACTAC 3420 

ACCTTCGTGC AGGGCAACGA GCTGCGCCTG TTGCTGCTGG ACCCCGCCAC GGGCGAACTG 3480 

CAGCTCAGCC GCGACCTGGA CAACAACCGG CCGCTGGAGG CGCTCATGGA GGTGTCTGTG 3540 

TCTGATGGCA TCCACAGCGT CACGGCCTTC TGCACCCTGC GTGTCACCAT CATCACGGAC 3600 

GACATGCTGA CCAACAGCAT CACTGTCCGC CTGGAGAACA TGTCCCAGGA GAAGTTCCTG 3660 

TCCCCGCTGC TGGCCCTCTT CGTGGAGGGG GTGGCCGCCG TGCTGTCCAC CACCAAGGAC 3720 

GACGTCTTCG TCTTCAACGT CCAGAACGAC ACCGACGTCA GCTCCAACAT CCTGAACGTG 3780 

ACCTTCTCGG CGCTGCTGCC TGGCGGGGTC CGCGGCCAGT TCTTCCCGTC GGAGGACCTG 3840 

CAGGAGCAGA TCTACCTGAA TCGGACGCTG CTGACCACCA TCTCCACGCA GCGCGTGCTG 3900 

CCCTTCGACG ACAACATCTG CCTGCGCGAG CCCTGCGAGA ACTACATGAA GTGCGTGTCC 3960 

GTTCTGCGAT TCGACAGCTC CGCGCCCTTC CTCAGCTCCA CCACCGTGCT CTTCCGGCCC 4020 

ATCCACCCCA TCAACGGCCT GCGCTGCCGC TGCCCGCCCG GCTTCACCGG CGACTACTGC 4080 

GAGACGGAGA TCGACCTCTG CTACTCCGAC CCGTGCGGCG CCAACGGCCG CTGCCGCAGC 4140 

CGCGAGGGCG GCTACACCTG CGAGTGCTTC GAGGACTTCA CTGGAGAGCA CTGTGAGGTG 4200 

GATGCCCGCT CAGGCCGCTG TGCCAACGGG GTGTGCAAGA ACGGGGGCAC CTGCGTGAAC 4260 

CTGCTCATCG GCGGCTTCCA CTGCGTGTGT CCTCCTGGCG AGTATGAGAG GCCCTACTGT 4320 

GAGGTGACCA CCAGGAGCTT CCCGCCCCAG TCCTTCGTCA CCTTCCGGGG CCTGAGACAG 4380 

CGCTTCCACT TCACCATCTC CCTCACGTTT GCCACTCAGG AAAGGAACGG CTTGCTTCTC 4440 

TACAACGGCC GCTTCAATGA GAAGCACGAC TTCATCGCCC TGGAGATCGT GGACGAGCAG 4500 

GTGCAGCTCA CCTTCTCTGC AGGCGAGACA ACAACGACCG TGGCACCGAA GGTTCCCAGT 4560 

GGTGTGAGTG ACGGGCGGTG GCACTCTGTG CAGGTGCAGT ACTACAACAA GCCCAATATT 4620 

GGCCACCTGG GCCTGCCCCA TGGGCCGTCC GGGGAAAAGA TGGCCGTGGT GACAGTGGAT 4680 

GATTGTGACA CAACCATGGC TGTGCGCTTT GGAAAGGACA TGGGGAACTA CAGCTGCGCT 4740 

GCCCAGGGCA CTCAGACCGG CTCCAAGAAG TCCCTGGATC TGACCGGCCC TCTACTCCTG 4800 

GGGGGTGTCC CCAACCTGCC AGAAGACTTC CCAGTGCACA ACCGGCAGTT CGTGGGCTGC 4860 

ATGCGGAACC TGTCAGTCGA CGGCAAAAAT GTGGACATGG CCGGATTCAT CGCCAACAAT 4920 

GGCACCCGGG AAGGCTGCGC TGCTCGGAGG AACTTCTGCG ATGGGAGGCG GTGTCAGAAT 4980 

GGAGGCACCT GTGTCAACAG GTGGAATATG TATCTGTGTG AGTGTCCACT CCGATTCGGC 5040 

GGGAAGAACT GTGAGCAAGC CATGCCTCAC CCCCAGCTCT TCAGCGGTGA GAGCGTCGTG 5100 

TCCTGGAGTG ACCTGAACAT CATCATCTCT GTGCCCTGGT ACCTGGGGCT CATGTTCCGG 5160 

ACCCGGAAGG AGGACAGCGT TCTGATGGAG GCCACCAGTG GTGGGCCCAC CAGCTTTCGC 5220 

CTCCAGATCC TGAACAACTA CCTCCAGTTT GAGGTGTCCC ACGGCCCCTC CGATGTGGAG 5280 

TCCGTGATGC TGTCCGGGTT GCGGGTGACC GACGGGGAGT GGCACCACCT GCTGATCGAG 5340 

CTGAAGAATG TTAAGGAGGA CAGTGAGATG AAGCACCTGG TCACCATGAC CTTGGACTAT 5400 

GGGATGGACC AGAACAAGGC AGATATCGGG GGCATGCTTC CCGGGCTGAC GGTAAGGAGC 5460 

GTGGTGGTCG GAGGCGCCTC TGAAGACAAG GTCTCCGTGC GCCGTGGATT CGGAGGCTGC 5520 

ATGCAGGGAG TGAGGATGGG GGGGACGCCC ACCAACGTCG CCACCCTGAA CATGAACAAC 5580 

GCACTCAAGG TCAGGGTGAA GGACGGCTGT GATGTGGACG ACCCCTGTAC CTCGAGCCCC 5640 

TGTCCCCCCA ATAGCCGCTG CCACGACGCC TGGGAGGACT ACAGCTGCGT CTGTGACAAA 5700 

GGGTACCTTG GAATAAACTG TGTGGATGCC TGTCACCTGA ACCCCTGCGA GAACATGGGG 5760 

GCCTGCGTGC GCTCCCCCGG CTCCCCGCAG GGCTACGTGT GCGAGTGTGG GCCCAGTCAC 5820 

TACGGGCCGT ACTGTGAGAA CAAACTCGAC CTTCCGTGCC CCAGAGGCTG GTGGGGGAAC 5880 

CCCGTCTGTG GACCCTGCCA CTGTGCCGTC AGCAAAGGCT TTGATCCCGA CTGTAATAAG 5940 

ACCAACGGCC AGTGCCAATG CAAGGAGAAT TACTACAAGC TCCTAGCCCA GGACACCTGT 6000 

CTGCCCTGCG ACTGCTTCCC CCATGGCTCC CACAGCCGCA CTTGCGACAT GGCCACCGGG 6060 

CAGTGTGCCT GCAAGCCCGG CGTCATCGGC CGCCAGTGCA ACCGCTGCGA CAACCCGTTT 6120 

GCCGAGGTCA CCACGCTCGG CTGTGAAGTG ATCTACAATG GCTGTCCCAA AGCATTTGAG 6180 
GCCGGCATCT GGTGGCCACA GACCAAGTTC GGGCAGCCGG CTGCGGTGCC ATGCCCTAAG 6240 
GGATCCGTTG GAAATGCGGT CCGACACTGC AGCGGGGAGA AGGGCTGGCT GCCCCCAGAG 6300 
CTCTTTAACT GTACCACCA* CTCCTTCGTG GACCTCAGGG CCATGAATGA GAAGCTGAGC 6360 
CGCAATGAGA CGCAGGTGGA CGGCGCCAGG GCCCTGCAGC TGGTGAGGGC GCTGCGCAGT 6420 
GCTACACAGC ACACGGGCAC GCTCTTTGGC AATGACGTGC GCACGGCCTA CCAGCTGCTG 6480 
GGCCACGTCC TTCAGCACGA GAGCTGGCAG CAGGGCTTCG ACCTGGCAGC CACGCAGGAC 6540 
GCCGACTTTC ACGAGGACGT CATCCACTCG GGCAGCGCCC TCCTGGCCCC AGCCACCAGG 6600 
GCGGCGTGGG AGCAGATCCA GCGGAGCGAG GGCGGCACGG CACAGCTGCT CCGGCGCCTC 6660 
GAGGGCTACT TCAGCAACGT GGCACGCAAC GTGCGGCGGA CGTACCTGCG GCCCTTCGTC 6720 
ATCGTCACCG CCAACATGAT TCTTGCTGTC GACATCTTTG ACAAGTTCAA CTTTACGGGA 6780 
GCCAGGGTCC CGCGATTCGA CACCATCCAT GAAGAGTTCC CCAGGGAGCT GGAGTCCTCC 6840 
GTCTCCTTCC CAGCCGACTT CTTCAGACCA CCTGAAGAAA AAGAAGGCCC CCTGCTGAGG 6900 
CCGGCTGGCC GGAGGACCAC CCCGCAGACC ACGCGCCCGG GGCCTGGCAC CGAGAGGGAG 6960 
GCCCCGATCA GCAGGCGGAG GCGACACCCT GATGACGCTG GCCAGTTCGC CGTCGCTCTG 7020 
GTCATCATTT ACCGCACCCT GGGGCAGCTC CTGCCCGAGC GCTACGACCC CGACCGTCGC 7080 
AGCCTCCGGT TGCCTCACCG GCCCATCATT AATACCCCGA TGGTGAGCAC GCTGGTGTAC 7140 
AGCGAGGGGG CTCCGCTCCC GAGACCCCTG GAGAGGCCCG TCCTGGTGGA GTTCGCCCTG 7200 
CTGGAGGTGG AGGAGCGAAC CAAGCCTGTC TGCGTGTTCT GGAACCACTC CCTGGCCGTT 7260 
GGTGGGACGG GAGGGTGGTC TGCCCGGGGC TGCGAGCTCC TGTCCAGGAA CCGGACACAT 7320 
GTCGCCTGCC AGTGCAGCCA CACAGCCAGC TTTGCGGTGC TCATGGATAT CTCCAGGCGT 7380 
GAGAACGGGG AGGTCCTGCC TCTGAAGATT GTCACCTATG CCGCTGTGTC CTTGTCACTG 7440 
GCAGCCCTGC TGGTGGCCTT CGTCCTCCTG AGCCTGGTCC GCATGCTGCG CTCCAACCTG 7500 
CACAGCATTC ACAAGCACCT CGCCGTGGCG CTCTTCCTCT CTCAGCTGGT GTTCGTGATT 7560 
GGGATCAACC AGACGGAAAA CCCGTTTCTG TGCACAGTGG TTGCCATCCT CCTCCACTAC 7620 
ATCTACATGA GCACCTTTGC CTGGACCCTC GTGGAGAGCC TGCATGTCTA CCGCATGCTG 7680 
ACCGAGGTGC GCAACATCGA CACGGGGCCC ATGCGGTTCT ACTACGTCGT GGGCTGGGGC 7740 
ATCCCGGCCA TTGTCACAGG ACTGGCGGTC GGCCTGGACC CCCAGGGCTA CGGGAACCCC 7800 
GACTTCTGCT GGCTGTCGCT TCAAGACACC CTGATTTGGA GCTTTGCGGG GCCCATCGGA 7860 
GCTGTTATAA TCATCAACAC AGTCACTTCT GTCCTATCTG CAAAGGTTTC CTGCCAAAGA 7920 
AAGCACCATT ATTATGGGAA AAAAGGGATC GTCTCCCTGC TGAGGACCGC ATTCCTCCTG 7980 
CTGCTGCTCA TCAGCGCCAC CTGGCTGCTG GGGCTGCTGG CTGTGAACCG CGATGCACTG 8040 
AGCTTTCACT ACCTCTTCGC CATCTTCAGC GGCTTACAGG GCCCCTTCGT CCTCCTTTTC 8100 
CACTGCGTGC TCAACCAGGA GGTCCGGAAG CACCTGAAGG GCGTGCTCGG CGGGAGGAAG 8160 
CTGCACCTGG AGGACTCCGC CACCACCAGG GCCACCCTGC TGACGCGCTC CCTCAACTGC 8220 
AACACCACCT TCGGTGACGG GCCTGACATG CTGCGCACAG ACTTGGGCGA GTCCACCGCC 8280 
TCGCTGGACA GCATCGTCAG GGATGAAGGG ATCCAGAAGC TCGGCGTGTC CTCTGGGCTG 8340 
GTGAGGGGCA GCCACGGAGA GCCAGACGCG TCCCTCATGC CCAGGAGCTG CAAGGATCCC 8400 
CCTGGCCACG ATTCCGACTC AGATAGCGAG CTGTCCCTGG ATGAGCAGAG CAGCTCTTAC 8460 
GCCTCCTCAC ACTCGTCAGA CAGCGAGGAC GATGGGGTGG GAGCTGAGGA AAAATGGGAC 8520 
CCGGCCAGGG GCGCCGTCCA CAGCACCCCC AAAGGGGACG CTGTGGCCAA CCACGTTCCG 8580 
GCCGGCTGGC CCGACCAGAG CCTGGCTGAG AGTGACAGTG AGGACCCCAG CGGCAAGCCC 8640 
CGCCTGAAGG TGGAGACCAA GGTCAGCGTG GAGCTGCACC GCGAGGAGCA GGGCAGTCAC 8700 
CGTGGAGAGT ACCCCCCGGA CCAGGAGAGC GGGGGCGCAG CCAGGCTTGC TAGCAGCCAG 8760 
CCCCCAGAGC AGAGGAAAGG CATCTTGAAA AATAAAGTCA CCTACCCGCC GCCGCTGACG 8820 
CTGACGGAGC AGACGCTGAA GGGCCGGCTC CGGGAGAAGC TGGCCGACTG TGAGCAGAGC 8880 
CCCACATCCT CGCGCACGTC TTCCCTGGGC TCTGGCGGCC CCGACTGCGC CATCACAGTC 8940 
AAGAGCCCTG GGAGGGAGCC GGGGCGTGAC CACCTCAACG GGGTGGCCAT GAATGTGCGC 9000 
ACTGGGAGCG CCCAGGCCGA TGGCTCCGAC TCTGAGAAAC CGTGAGGCAA GCCCGTCACC 9060 
CCACACAGGC TGCGGCATCA CCCTCAGACC TTGGAGCOCA AGGGGCCACT GCCCTTGAAG 9120 
TGGAGTGGGC CCAGAGTGTG GCGGTCCCCA TGGTGGCAGC CCCCCGACTG ATCATCCAGA 9180 
CACAAAGGTC TTGGTTCTCC CAGGAGCTCA GGGCCTGTCA GACCTGGTGA CAAGTGCCAA 9240 
AGGCCACAGG CATGAGGGAG GGGTGGACCA CTGGGCCAGC ACCGCTGAGT CCTAAGACTG 9300 
CAGTCAAAGC CAGAACTGAG AGGGGACCCC AGACTGGGCC CAGAGGCTGG CCAGAGTTCA 9360 
GGAACGCCGG GCACAGACCA AAGACCGCGG TCCAGCCCCG CCCAGGCGGG CATCTCATGG 9420 
CAGTGCGGAC CCGTGGCTGG CAGCCCGGGC AGTCCTTTGC AAAGGCACCC CTTGTCTTAA 9480 
AATCACTTCG CTATGTGGGA AAGGTGGAGA TACTTTTATA TATTTGTATG GGACTCTGAG 9540 
GAGGTGCAAC CTGTATATAT ATTGCATTCG TGCTGACTTT GTTATCCCGA GAGATCCATG 9600 
CAATGATCTC TTGCTGTCTT CTCTGTCAAG ATTGCACAGT TGTACTTGAA TCTGGCATGT 9660 
GTTGACGAAA CTGGTGCCCC AGCAGATCAA AGGTGGGAAA TACGTCAGCA GTGGGGCTAA 9720 
AACCAAGCGG CTAGAAGCCC TACAGCTGCC TTCGGCCAGG AAGTGAGGAT GGTGTGGGCC 9780 
CTCCCCGCOG GCCCCCTGGG TCCCCAGTGT TCGCTGTGTG TGCGTTTGTC CTCTGCTGCC 9840 
ATCTGCCCCG GCTGTGTGAA TTCAAGACAG GGCAGTGCAG CACTAGGCAG GTGTGAGGAG 9900 
CCCTGCTGAG GTCACTGTGG GGCACGGTTG CCACACGGCT GTCATTTTTC ACCTGGTCAT 9960 
TCTGTGACCA CCACCCCCTC CCCTCACCGC CTCCCAGGTG GCCCGGGAGC TGCAGGTGGG 10020 
GATGGCTTTG TCCTTTGCTC CTGCTCCCCG TGGGACCTGG GACCTTAAAG CGTTGCAGGT 10080 
TCCTGATTTG GACAGAGGTG TGGGGCCTTC CAGGCCGTTA CATACCTCCT GCCAATTCTC 10140 
TAACTCTCTG AGACTGCGAG GATCTCCAGG CAGGGTTCTC CCCTCTGGAG TCTGACCAAT 10200 
TACTTCATTT TGCTTCAAAT GGCCAATTGT GCAGAGGGAC AAAGCCACAG CCACACTCTT 10260 
CAACGGTTAC CAAACTGTTT TTGGAAATTC ACACCAAGGT OGGGCCCACT GCAGGCAGCT 10320 
GGCACAGCGT GGCCCGAGGG GCTGTGGAAC GGGTCCCGGA ACTGTCAGAC ATGTTTGATT 10380 
TTAGCGTTTC CTTTGTTCTT CAAATCAGGT GCCCAAATAA GTGATCAGCA CAGCTGCTTC 10440 
CAAATAGGAG AAACCATAAA ATAGGATGAA AATCAAGTAA AATGCAAAGA TGTCCACACT 10500 
GTTTTAAACT TGACCCTGAT GAAAATGTGA GCACTGTTAG CAGATGCCTA TGGGAGAGGA 10560 
AAAGCGTATC TGAAAATGGT CCAGGACAGG AGGATGAAAT GAGATCCCAG AGTCCTCACA 10620 
CCTGAATGAA TTATACATGT GCCTTACCAG GTGAGTGGTC TTTCGAAGAT AAAAAACTCT 10680 
AGTCCCTTTA AACGTTTGCC CCTGGCGTTT CCTAAGTACG AAAAGGTTTT TAAGTCTTCG 10740 
AACAGTCTCC TTTCATGACT TTAACAGGAT TCTGCCCCCT GAGGTGTAAT TTTTTTGTTC 10800 
TATTTTTTTC CACGTACTCC ACAGCCAACA TCACGAGGTG TAATTTTTAA TTTGATCAGA 10860 
ACTGTTACCA AAAAACAACT GTCAGTTTTA TTGAGATGGG AAAAATGTAA ACCTATTTTT 10920 
ATTACTTAAG ACTTTATGGG AGAGATTAGA CACTGGAGGT TTTTAACAGA ACGTGTATTT 10980 
ATTAATGTTC AAAACACTGG AATTACAAAT GAGAAGAGTC TACAATAAAT TAAGATTTTT 11040 
GAATTTGTAC TTCTGCGGTG CTGGTOTTC TCCACAAACA CCCCCGCCCC TCCCCATGCC 11100 
CAGGGTGGCC GTGGAAGGGA CGGTTTACGG ACGTGCAGCT GAGCTGTCCG TGTCCCATGC 11160 
TCCCTCAGCC AGTGGAAOGT GCCGGAACTT TTTGTCCATT CCCTAGTAGG CCTGCCACAG 11220 
CCTAGATGGG CAGTTTTTGT CTTTCACCAA ATTTGAGGAC TTTTTTTTTT TGCCATTATT 11280 
TCTTCAGTTT TCTTTTCTTG CACTGATCTT TCTCCTCTCC TTCTGTGACT CCAGTGACTC 11340 
AGACGTTAGA CCTCTTGATG TTTTCCCACT GGTCCCTGAG GCTCTGTTC 



A32 Protein Sequence 
Gene name: CH22_FGENES.706; cadherin, EGF LAG seven-pass G-type receptor 1, flamingo (Drosophila) homolog (CELSR1) 
Unigene number: Hs. 252387 
Protein Accession #: NP_055061 
Signal sequence: 1-20 
Cadherin domains: 250-344, 358-449, 464-556, 570-678, 692-780, 794-883, 897-990, 1004-1092, 1110-1199 
Pfam domains: Laminin EGF [2003-2048], 7tm_2 [2465-2708] 
Latrophilin/GPS domains: 2407-2460 
Transmembrane domains: 1219-1238, 2473-2492, 2507-2522, 2529-2547, 2616-2634, 2659-2675, 2687-2704 
Cellular Localization: plasma membrane 

1          11         21         31         41         51 
|          |          |          |          |          | 
MAPPPPPVLP VLLLLAAAAA LPAMGLRAAA WEPRVPGGTR AFALRPGCTY AVGAACTPRA 60 
PRELLDVGRD GRLAGRRRVS GAGRPLPLQV RLVARSAPTA LSRRLRARTH LPGCGARARL 120 
CGTGARLCGA LCPPVPGGCA AAQHSALAAP TTLPACRCPP RPRPRCPGRP ICLPPGGSVR 180 
LRLLGALRRA AGAVRVGLAL EAATAGTPSA SPSPSPPLPP NLPEARAGPA RRARRGTSGR 240 
GSLKFPMPNY QVALFENEPA GTLILQLHAH YTIEGEEERV SYYMEGLFDE RSRGYFRIDS 300 
ATGAVSTDSV LDRETKETHV LRVKAVDYST PPRSATTYIT VLVKDTNDHS PVFEQSEYRE 360 
RVRENLEVGY EVLTIRASDR DSPINANLRY RVLGGAWDVF QLNESSGVVS TRAVLDREEA 420 
AEYQLLVEAN DQGRNPGPLS ATATVYIEVE DENDNYPQFS EQNYVVQVPE DVGLNTAVLR 480 
VQATDRDQGQ NAAIHYSILS GNVAGQFYLH SLSGILDVIN PLDFEDVQKY SLSIKAQDGG 540 
RPPLINSSGV VSVQVLDVND NEPIPVSSPP QATVLENVPL GYPVVHIQAV DADSGENARL 600 
HYRLVDTAST FLGGGSAGPK NPAPTPDFPP QIHNSSGWIT VCAELDREEV EHYSFGVEAV 660 
DHGSPPMSSS TSVSITVLDV NDNDPVFTQP TYELRLNEDA AVGSSVLTLQ ARDRDANSVI 720 
TYQLTGGNTR NRFALSSQRG GGLITLALPL DYKQEQQYVL AVTASDGTRS HTAHVLINVT 780 
DANTHRPVFQ SSHYTVSVSE DRPVGTSIAT LSANDEDTGE NARITYVIQD PVPQFRIDPD 840 
SGTMYTMMEL DYENQVAYTL TIMAQDNGIP QKSDTTTLEI LILDANDNAP QFLWDFYQGS 900 
IFEDAPPSTS ILQVSATDRD SGPNGRLLYT FQGGDDGDGD FYIEPTSGVI RTQRRLDREN 960 
VAVYNLWALA VDRGSPTPLS ASVEIQVTIL DINDNAPMFE KDELELFVEE NNPVGSVVAK 1020 
IRANDPDEGP NAQIMYQIVE GDMRHFFQLD LLNGDLRAMV ELDFEVRREY VLVVQATSAP 1080 
LVSRATVHIL LVDQNDMPPV LPDFQILFNN YVTNKSNSFP TGVIGCIPAH DPDVSDSLNY 1140 
TFVQGNELRL LLLDPATGEL QLSRDLDNNR PLEALMEVSV SDGIHSVTAF CTLRVTIITD 1200 
DMLTNSITVR LENMSQEKFL SPLLALFVEG VAAVLSTTKD DVFVFNVQND TDVSSNILNV 1260 
TFSALLPGGV RGQFFPSEDL QEQIYLNRTL LTTISTQRVL PFDDNICLRE PCENYMKCVS 1320 
VLRFDSSAPF LSSTTVLFRP IHPINGLRCR CPPGFTGDYC ETEIDLCYSD PCGANGRCRS 1380 
REGGYTCECF EDFTGEHCEV DARSGRCANG VCKNGGTCVN LLIGGFHCVC PPGEYERPYC 1440 
EVTTRSFPPQ SFVTFRGLRQ RFHFTISLTF ATQERNGLLL YNGRFNEKHD FIALEIVDEQ 1500 
VQLTFSAGET TTTVAPKVPS GVSDGRWHSV QVQYYNKPNI GHLGLPHGPS GEKMAVVTVD 1560 
DCDTTMAVRF GKDIGNYSCA AQGTQTGSKK SLDLTGPLLL GGVPNLPEDF PVHNRQFVGC 1620 
MRNLSVDGKN VDMAGFIAKN GTREGCAARR NFCDGRRCQN GGTCVNRWNM YLCECPLRFG 1680 
GKNCEQAMPH PQLFSGESVV SWSDLNIIIS VPWYLGLMFR TRKEDSVLME ATSGGPTSFR 1740 
LQILNNYLQF EVSHGPSDVE SVMLSGLRVT DGEWHHLLIE LKNVKEDSEM KHLVTMTLDY 1800 
GMDQNKADIG GMLPGLTVRS VVVGGASEDK VSVRRGFRGC MQGVRMGGTP TNVATLNMNN 1860 
ALKVRVKDGC DVDDPCTSSP CPPNSRCHDA WEDYSCVCDK GYLGINCVDA CHLNPCENMG 1920 
ACVRSPGSPQ GYVCECGPSH YGPYCENKLD LPCPRGWWGN PVCGPCHCAV SKGFDPDCNK 1980 
TNGQCQCKEN YYKLLAQDTC LPCDCFPHGS HSRTCDMATG QCACKPGVIG RQCNRCDNPF 2040 
AEVTTLGCEV IYNGCPKAFE AGIWWPQTKF GQPAAVPCPK GSVGNAVRHC SGEKGWLPPE 2100 
LFNCTTISFV DLRAMNEKLS RNETQVDGAR ALQLVRALRS ATQHTGTLFG NDVRTAYQLL 2160 
GHVLQHESWQ QGFDLAATQD ADFHEDVIHS GSALLAPATR AAWEQIQRSE GGTAQLLRRL 2220 
EGYFSNVARN VRRTYLRPFV IVTANMILAV DIFDKFNFTG ARVPRFDTIH EEFPRELESS 2280 
VSFPADFFRP PEEKEGPLLR PAGRRTTPQT TRPGPGTERE APISRRRRHP DDAGQFAVAL 2340 
VIIYRTLGQL LPERYDPDRR SLRLPHRPII NTPMVSTLVY SEGAPLPRPL ERPVLVEFAL 2400 
LEVEERTKPV CVFWNHSLAV GGTGGWSARG CELLSRNRTH VACQCSHTAS FAVLMDISRR 2460 
ENGEVLPLKI VTYAAVSLSL AALLVAFVLL SLVRMLRSNL HSIHKHLAVA LFLSQLVFVI 2520 
GINQTENPFL CTVVAILLHY IYMSTFAWTL VESLHVYRML TEVRNIDTGP MRFYYVVGWG 2580 
IPAIVTGLAV GLDPQGYGNP DFCWLSLQDT LIWSFAGPIG AVIIINTVTS VLSAKVSCQR 2640 
KHHYYGKKGI VSLLRTAFLL LLLISATWLL GLLAVNRDAL SFHYLFAIFS GLQGPFVLLF 2700 
HCVLNQEVRK HLKGVLGGRK LHLEDSATTR ATLLTRSLNC NTTFGDGPDM LRTDLGESTA 2760 
SLDSIVRDEG IQKLGVSSGL VRGSHGEPDA SLMPRSCKDP PGHDSDSDSE LSLDEQSSSY 2820 
ASSHSSDSED DGVGAEEKWD PARGAVHSTP KGDAVANHVP AGWPDQSLAE SDSEDPSGKP 2880 
RLKVETKVSV ELHREEQGSH RGEYPPDQES GGAARLASSQ PPEQRKGILK NKVTYPPPLT 2940 
LTEQTLKGRL REKLADCEQS PTSSRTSSLG SGGPDCAITV KSPGREPGRD HLNGVAMNVR 3000 
TGSAQADGSD SEKP 



A33 DNA SEQUENCE 

Gene name: CXCR3 (G protein-coupled receptor 9); chemokine (C-X-C) receptor 3 
Unigene number: Hs. 198252 
Probeset Accession #: X95876 
Nucleic Acid Accession #: X95876 
Coding sequence: 69-1175 (underlined sequences correspond to start and stop codons) 

1          11         21         31         41         51 
|          |          |          |          |          | 

CCAACCACAA GCACCAAAGC AGAGGGGCAG GCAGCACACC ACCCAGCAGC CAGAGCACCA 60 
GCCCAGCCAT GGTCCTTGAG GTGAGTGACC ACCAAGTGCT AAATGACGCC GAGGTTGCCG 120 






WO 03/042661 



PCT/US02/36810 



CCCTCCTGGA GAACTTCAGC TCTTCCTATG ACTATGGAGA AAACGAGAGT GACTCGTGCT 180 

GTACCTCCCC GCCCTGCCCA CAGGACTTCA GCCTGAACTT CGACCGGGCC TTCCTGCCAG 240 

CCCTCTACAG CCTCCTCTTT CTGCTGGGGC TGCTGGGCAA CGGCGCGGTG GCAGCCGTGC 300 

TGCTGAGCCG GCGGACAGCC CTGAGCAGCA CCGACACCTT CCTGCTCCAC CTAGCTGTAG 360 

CAGACACGCT GCTGGTGCTG ACACTGCCGC TCTGGGCAGT GGACGCTGCC GTCCAGTGGG 420 

TCTTTGGCTC TGGCCTCTGC AAAGTGGCAG GTGCCCTCTT CAACATCAAC TTCTACGCAG 480 

GAGCCCTCCT GCTGGCCTGC ATCAGCTTTG ACCGCTACCT GAACATAGTT CATGCCACCC 540 

AGCTCTACCG CCGGGGGCCC CCGGCCCGCG TGACCCTCAC CTGCCTGGCT GTCTGGGGGC 600 

TCTGCCTGCT TTTCGCCCTC CCAGACTTCA TCTTCCTGTC GGCCCACCAC GACGAGCGCC 660 

TCAACGCCAC CCACTGCCAA TACAACTTCC CACAGGTGGG CCGCACGGCT CTGCGGGTGC 720 

TGCAGCTGGT GGCTGGCTTT CTGCTGCCCC TGCTGGTCAT GGCCTACTGC TATGCCCACA 780 

TCCTGGCCGT GCTGCTGGTT TCCAGGGGCC AGCGGCGCCT GCGGGCCATG CGGCTGGTGG 840 

TGGTGGTCGT GGTGGCCTTT GCCCTCTGCT GGACCCCCTA TCACCTGGTG GTGCTGGTGG 900 

ACATCCTCAT GGACCTGGGC GCTTTGGCCC GCAACTGTGG CCGAGAAAGC AGGGTAGACG 960 

TGGCCAAGTC GGTCACCTCA GGCCTGGGCT ACATGCACTG CTGCCTCAAC CCGCTGCTCT 1020 

ATGCCTTTGT AGGGGTCAAG TTCCGGGAGC GGATGTGGAT GCTGCTCTTG CGCCTGGGCT 1080 

GCCCCAACCA GAGAGGGCTC CAGAGGCAGC CATCGTCTTC CCGCCGGGAT TCATCCTGGT 1140 

CTGAGACCTC AGAGGCCTCC TACTCGGGCT TGTGAGGCCG GAATCCGGGC TCCCCTTTCG 1200 

CCCACAGTCT GACTTCCCCG CATTCCAGGC TCCTCCCTCC CTCTGCCGGC TCTGGCTCTC 1260 

CCCAATATCC TCGCTCCCGG GACTCACTGG CAGCCCCAGC ACCACCAGGT CTCCCGGGAA 1320 

GCCACCCTCC CAGCTCTGAG GACTGCACCA TTGCTGCTCC TTAGCTGCCA AGCCCCATCC 1380 

TGCCGCCCGA GGTGGCTGCC TGGAGCCCCA CTGCCCTTCT CATTTGGAAA CTAAAACTTC 1440 

ATCTTCCCCA AGTGCGGGGA GTACAAGGCA TGGCGTAGAG GGTGCTGCCC CATGAAGCCA 1500 

CAGCCCAGGC CTCCAGCTCA GCAGTGACTG TGGCCATGGT CCCCAAGACC TCTATATTTG 1560 

CTCTTTTATT TTTATGTCTA AAATCCTGCT TAAAACTTTT CAATAAACAA GATCGTCAGG 1620 

ACCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA BHCLITERAT 1680 

UREBHCBLAS TPBHCORTHO LOGUESMOUS ECXCRMYLEV SERQVLDASD FAFLLENSTS 1740 

PYDYGENESD FSDSPPCPQD FSLNFDRTFL PALYSLLFLL GLLGNGAVAA VLLSQRTALS 1800 

STDTFLLHLA VADVLLVLTL PLWAVDAAVQ WVFGPGLCKV AGALFNINFY AGAFLLACIS 1860 

FDRYLSIVHA TQIYRRDPRV RVALTCIWW GLCLLFALPD FIYLSANYDQ RLNATHCQYN 1920 

FPQVGRTALR VLQLVAGFLL PLLVMAYCYA HILAVLLVSR GQRRFRAMRL VWWAAFAV 1980 

CWTPYHLWIj VDILMDVGVL ARNCGRESHV DVAKSVTSGM GYMHCCLNPli LYAFVGVKFR 2040 
EQMWMLFTRL GRSDQRGPQR QPSSSRRESS WSETTEASYL GL 



A34 Protein sequence 
Gene name: CXCR3 (G protein-coupled receptor 9); chemokine (C-X-C) receptor 3 
Unigene number: Hs. 198252 
Protein Accession #: P49682 
Signal sequence: none found 
Pfam domains: 7tm_1 [70-318] 
Transmembrane domains: 57-78, 92-113, 129-147, 169-190, 222-243, 257-275, 307-323 
DRY box: 148-149 
Cellular Localization: plasma membrane 

1          11         21         31         41         51 
|          |          |          |          |          | 

MVLEVSDHQV LNDAEVAALL ENFSSSYDYG ENESDSCCTS PPCPQDFSLN FDRAFLPALY 60 

SLLFLLGLLG NGAVAAVLLS RRTALSSTDT FLLHLAVADT LLVLTLPLWA VDAAVQWVFG 120 

SGLCKVAGAL FNINFYAGAL LLACISFDRY LNIVHATQLY RRGPPARVTL TCLAVWGLCL 180 

LFALPDFIFL SAHHDERLNA THCQYNFPQV GRTALRVLQL VAGFLLPLLV MAYCYAHILA 240 

VLLVSRGQRR LRAMRLVVVV VVAFALCWTP YHLVVLVDIL MDLGALARNC GRESRVDVAK 300 

SVTSGLGYMH CCLNPLLYAF VGVKFRERMW MLLLRLGCPN QRGLQRQPSS SRRDSSWSET 360 
SEASYSGL 



A35 DNA SEQUENCE 

Gene name: Differentially expressed C016 gene (clone MGC:5257) 
Unigene number: Hs. 69517 
Probeset Accession #: AA447522 
Nucleic Acid Accession #: BC001291 
Coding sequence: 44-541 (start and stop codons are underlined) 

1          11         21         31         41         51 
|          |          |          |          |          | 

GGGGGCGCCG CGOGCTGACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCTCGCCTT 60 

GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120 

AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT AGAGTGTGGT GTCATGTTTG 180 

TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGCCATA 240 

CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGCTC 300 

CGCTGGTTGT GCAGCGATGG AGAGACCCAA GCCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360 

GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGGGGCC 420 

ACCTATCAAC TCATCAGTGT TCAAAGAATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480 

GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCTCA GCCTGTCTTG 540 

AGCCACGGGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTCCA GACOGTTGTC 600 

ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTG GTTTGACTTC CCAGGGTCTT 660 

GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720 

ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG GTGACAAGTT TTTCTCTTTG 780 

AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCTG 840 

CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG TACCACTCAT GGAGAGTATG 900 

TGCTGAGATG CTTCCGACCT TTCAGGTGAC GCAGGAACAC TGGGGGAGTC TGAATGATTG 960 

GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG GGGGCAGTGG GGCACACGTT 1020 

AGGGCTGCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG CTGCTTTTCC TCAACCTTTC 1080 






CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTG AAGAAACTTA GACTTCACCC 1140 

ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA 1200 

ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTGTT AAGATGCAGC 1260 

CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGGAC 1320 
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



A36 Protein sequence: 
Gene name: Differentially expressed C016 gene (clone MGC:5257) 
Unigene number: Hs. 69517 
Probeset Accession #: AA447522 
Protein Accession #: AAH01291 
Signal sequence: 1-17 (first underlined sequence) 
Transmembrane domain: 146-162 
Cellular localization: plasma membrane 

1          11         21         31         41         51 
|          |          |          |          |          | 

MALLALLLVV ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 60 
KWTEPYCVIA AVKIFPRFFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF FYLKCCKIRY 120 
CNLEGPPINS SVFKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS 

A37 DNA SEQUENCE 

Gene name: ESTs 
Unigene number: Hs. 293616 
Probeset Accession #: AW043782 
Nucleic Acid Accession #: none found 
Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 

1          11         21         31         41         51 
|          |          |          |          |          | 

AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 60 

CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 180 

GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCTAA 240 

GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACTGCACAGC 360 

AAACCCTCTG CTTTGCTCCA CCGCCGGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 480 

AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660 

GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720 

CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780 

GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840 

TGCGTGGTAT GACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080 

AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800 

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860 

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TJT n' mm TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT T Tr C TnTTT 2640 

TTCTCTGTGT CCAGTCAGCC ACAGGGCCOG CCTCCCTGCA GGAATAAGGG GTAAAACGTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 

CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 2820 

AACCTGTTTG ACGCTAATTA AAACAGAGCC TGCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880 

TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 



AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 
CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 
CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TJTGATGGGG CCCCTTCTTC 3180 
TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 
GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 
AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 
TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 
TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 
TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 
AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 
AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 
TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 
GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 
TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 
AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 
TGGGGCGGOG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 
GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 
GGAAACCGCA GAGTGTTGOG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 

A38 Protein sequence: 
Gene name: ESTs 
Unigene number: Hs. 293616 
Probeset Accession #: AW043782 
Protein Accession #: none found 
Signal sequence: 1-17 
Transmembrane domains: 169-191 
LDLa domains: 28-66, 70-108, 112-149 
Cellular Localization: plasma membrane 

1          11         21         31         41         51 
|          |          |          |          |          | 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60 
EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 
KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180 
VIFVLVVALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL VVLDHPHHCN VTYNVNNGIQ 240 
YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV 



A39 DNA sequence 

Gene name: ESTs 

Unigene number: Hs. 128899 

Probeset Accession #: AA983251 

Nucleic Acid Accession #: AA983251 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1          11         21         31         41         51 
|          |          |          |          |          | 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGGAA AAOCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
2100 






TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A40 Protein sequence 
Gene name: ESTs 
Unigene number: Hs. 128899 
Protein Accession #: none found 
Signal sequence: 1-11 
Transmembrane domains: 402-424 
Cellular localization: not determined 

1          11         21         31         41         51 
|          |          |          |          |          | 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 
GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 



A41 DNA SEQUENCE: 
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 
Unigene number: Hs. 100686 
Probeset Accession #: AA487468 
Nucleic Acid Accession #: AA487468 
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons) 

1          11         21         31         41         51 
|          |          |          |          |          | 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAQAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 
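
The coding-sequence coordinates given in each DNA record can be checked against the length of the matching protein record. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the range ends on the stop codon, as the record headers indicate. The function names (`clean_row`, `cds_length`, `expected_protein_length`, `extract_cds`) are illustrative only and do not come from the listing.

```python
import re


def clean_row(row: str) -> str:
    """Drop block spacing and the trailing position number from one listing row."""
    return re.sub(r"[\s\d]+", "", row).upper()


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coding range."""
    if start < 1 or end < start:
        raise ValueError(f"invalid coding range {start}-{end}")
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by a coding range that includes the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a whole number of codons")
    return n // 3 - 1  # the final codon is the stop codon


def extract_cds(rows: list[str], start: int, end: int) -> str:
    """Join cleaned listing rows and slice out the 1-based, inclusive range."""
    seq = "".join(clean_row(r) for r in rows)
    return seq[start - 1:end]
```

For the A41 record (coding sequence 55-555) this gives 166 residues, which agrees with the A42 protein record: two full rows of 60 residues plus a final row of 46.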



A42 Protein sequence: 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 
Unigene number: Hs. 100686 
Probeset Accession #: AA487468 
Protein Accession #: none found 
Signal sequence: 1-23 
Transmembrane domains: none found 
Cellular Localization: secreted 

1          11         21         31         41         51 
|          |          |          |          |          | 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 

A43 DNA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number: Hs. 106778 

Probeset Accession #: N51919 
Nucleic Acid Accession #: AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



1          11         21         31         41         51 

|          |          |          |          |          | 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900 

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960 

ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020 

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080 

GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA TGCTGTAATT 1200 

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 

ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 1320 

GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CACCGAACAC AGCAGGACAG ACCAGAGATT 1380 

TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 1440 

GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500 

ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560 
ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620

ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680 

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800 

TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTCGCTACA GAAGAACGGT 1860 

TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920 

ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 

AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100 

ATCTCATTGG CTACATTAAT GAACTTTCCT AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160 

ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 2220 

GATGTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340 

GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520 

CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2580 

GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640 

AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 2700 
CTTGAAGTAT GA 
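
The coding-sequence coordinates stated in a DNA record (for example 1-2712 for A43) can be checked mechanically against the printed rows. The following is a minimal sketch under stated assumptions, not part of the original listing: the function name `check_cds` and the toy sequence are hypothetical, and coordinates are taken to be 1-based and inclusive, as the record headers appear to use them.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(listing_text, start, end):
    """Check a coding region given 1-based, inclusive coordinates.

    Whitespace and the position numbers printed at the end of each row
    are dropped. Letters are kept as they are, so a misread base still
    occupies its position and does not shift the coordinates.
    """
    bases = "".join(ch for ch in listing_text.upper() if ch.isalpha())
    cds = bases[start - 1:end]
    return {
        "length": len(cds),
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
    }


# Toy example (hypothetical sequence, not taken from the listing):
# the coding region ATGAAATGA occupies positions 3-11.
print(check_cds("CCATGAAATG ACC 13", 3, 11))
```

Applied to A43, a region of 2712 bases is 904 codons, which is consistent with a 903-residue protein followed by one stop codon.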



A44 Protein sequence:
Gene name: ATPase, Ca++ transporting, type 2C, member 1
Unigene number: Hs.106778
Probeset Accession #: N51919
Protein Accession #: AAF27813
Signal sequence: none found
Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878
Pfam domains: Cation ATPase N [6-83], E1-E2 ATPase [89-324], Hydrolase [328-649], [744-889]
Cellular Localization: not determined



1 11 21 31 41 51
| | | | | |
MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI 60
SQFKNPLIML LLASAVISVL MHQFDDAVSI TVAILIVVTV AFVQEYRSEK SLEELSKLVP 120
PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDES SLTGETTPCS 180
KVTAPQPAAT NGDLASRSNI AFMGTLVRCG KAKGVVIGTG ENSEFGEVFK MMQAEEAPKT 240
PLQKSMDLLG KQLSFYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIVVTV 300
TLALGVMRMV KKRAIVKKLP IVETLGCCNV ICSDKTGTLT KNEMTVTHIF TSDGLHAEVT 360
GVGYNQFGEV IVDGDVVHGF YNPAVSRIVE AGCVCNDAVI RNNTLMGKPT EGALIALAMK 420
MGLDGLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK 480
GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIID PPRTGVKEAV 540
TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 600
FYRASPRHKM KIIKSLQKNG SVVAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660
LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMNFP NPLNAMQILW 720
INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKNLI LKILVSSIII VCGTLFVFWR 780
ELRDNVITPR DTTMTFTCFV FFDMFNALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840
LVIYFPPLQK VFQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSSSF 900
LEV



A45 DNA sequence
Gene name: ESTs
Unigene number: Hs.157601
Probeset Accession #: W07459
Nucleic Acid Accession #: AC005383
Coding sequence: 328-2751 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51
| | | | | |
GACAGTGTTC GCGGCTGCAC CGCTCGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60
TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC TATGCACCTC 120
CCTGGCGGTA GTTCCTCCGA CCTCAGCCGG GTCGGGTCGT GCCGCCCTCT CCCAGGAGAG 180
ACAAACAGGT GTCCCACGTG GCAGCCGCGC CCCGGGCGCC CCTCCTGTGA TCCCGTAGCG 240
CCCCCTGGCC CGAGCCGCGC CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300
TCGCCGCTCT CCTTCCGTTA TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360
GTTTTCCTGT TTTCCAGAGT GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420
GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 480
ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540
CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600
GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660
CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720
CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780
CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840
CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900
GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGCAGGTG 960
GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020
ACGCCAGACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGACGCTGGA GATGGTCCGG 1080
GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140
GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200
AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTGTT 1260
CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320
TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380
GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440
GCCGTGCTGA GCGAGGACTC TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500
CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560
GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620
CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680
CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCCAG CGCGTCACGC AAGGGCGCGA 1740
GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCGGTGCGGG CAGAGCTGGA GGAGATCACA 1800
GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860
GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920
CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980
AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040
CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100
GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG GTGGGGTGGG CTCAGCCGGC 2160
ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220
GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280
GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA 2340
AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400
GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460
CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520
GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC CCCACTGCGA GAACCGTGAG 2580
TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640
ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700
GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCCTTA GAATGTCTGC 2760
TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820
ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 2880
TTGATGTGTA AGTAAATACC CACTTTCTOT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940
CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGOGGCCTGA 3000
CGTTCCTTTG CACACAATCA ATGCTCGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060
AGGCCTTTAC TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120
GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180
CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240
GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300
TGTGCATGGG CCCAGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360
ACCTTGAAGG TCTTC
3360 



A46 Protein sequence
Gene name: ESTs
Unigene number: Hs.157601
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: none found
VGW domains: 49-223; 341-518; 529-706
EGF domains: 298-333; 71[illegible]-748
Cellular Localization: secreted



1 11 21 31 41 51 

| | | | | |
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNKGI 660
SVLVVGVGPV LSEGLRRLAG PRDSLIRVAA YADLRYHQDV LIEWLCGEAK QFVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780
RTPPSNYREG LGTEMVPTFW NVCAPGP



COLON 



A47 DNA SEQUENCE:
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Nucleic Acid Accession #: AA487468
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



A48 Protein sequence:
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Protein Accession #: none found
Signal sequence: 1-23
Transmembrane domains: none found
Cellular Localization: secreted

1 11 21 31 41 51
| | | | | |
MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL
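
Each protein record in this listing corresponds to the coding region of the preceding DNA record (here, A48 to positions 55-555 of A47). As a minimal sketch, not part of the original listing, the standard genetic code can be used to translate a coding region and compare the result with the printed protein. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the standard nuclear code with translation stopping at the first stop codon.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing a non-ACGT letter (for example a misread base)
    are reported as 'X' instead of raising an error.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the A47 coding region (positions 55-84).
print(translate("ATGATGCTACACTCAGCTTTGGGTCTCTGC"))
```

Comparing such a translation residue by residue with the printed protein row is one way to spot single-character misreadings in either sequence.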



A49 DNA SEQUENCE 

Gene name: G protein-coupled receptor 56
Unigene number: Hs.6527
Probeset Accession #: AA478599
Nucleic Acid Accession #: NM_005682

Coding sequence: 163-2244 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | |

CGGCAGCAGG GTCTCGCTCT GTCACACAGG CTGGAGTGCA GTGGTGTGAT CTTGGCTCAT 60 
CGTAACCTCC ACCTCCCGGG TTCAAGTGAT TCTCATGCCT CAGCCTCCCG AGTAGCTGGG 120 
ATTACAGGTG GTGACTTCCA AGAGTGACTC OGTCGGAGGA AAATGACTCC CCAGTCGCTG 180 






CTGCAGACGA CACTGTTCCT GCTGAGTCTG CTCTTCCTGG TCCAAGGTGC CCACGGCAGG 240 

GGCCACAGGG AAGACTTTCG CTTCTGCAGC CAGCGGAACC AGACACACAG GAGCAGCCTC 300 

CACTACAAAC CCACACCAGA CCTGCGCATC TCCATCGAGA ACTCCGAAGA GGCCCTCACA 360 

GTCCATGCCC CTTTCCCTGC AGCCCACCCT GCTTCCCGAT CCTTCCCTGA CCCCAGGGGC 420 

CTCTACCACT TCTGCCTCTA CTGGAACCGA CATGCTGGGA GATTACATCT TCTCTATGGC 480 

AAGCGTGACT TCTTGCTGAG TGACAAAGCC TCTAGCCTCC TCTGCTTCCA GCACCAGGAG 540 

GAGAGCCTGG CTCAGGGCCC CCCGCTGTTA GCCACTTCTG TCACCTCCTG GTGGAGCCCT 600 

CAGAACATCA GCCTGCCCAG TGCCGCCAGC TTCACCTTCT CCTTCCACAG TCCTCCCCAC 660 

ACGGCCGCTC ACAATGCCTC GGTGGACATG TGCGAGCTCA AAAGGGACCT CCAGCTGCTC 720 

AGCCAGTTCC TGAAGCATCC CCAGAAGGCC TCAAGGAGGC CCTCGGCTGC CCCCGCCAGC 780 

CAGCAGTTGC AGAGCCTGGA GTCGAAACTG ACCTCTGTGA GATTCATGGG GGACATGGTG 840 

TCCTTCGAGG AGGACCGGAT CAACGCCACG GTATGGAAGC TCCAGCCCAC AGCCGGCCTC 900 

CAGGACCTGC ACATCCACTC CCGGCAGGAG GAGGAGCAGA GCGAGATCAT GGAGTACTCG 960 

GTGCTGCTGC CTCGAACACT CTTCCAGAGG ACGAAAGGCC GGAGCGGGGA GGCTGAGAAG 1020 

AGACTCCTCC TGGTGGACTT CAGCAGCCAA GCCCTGTTCC AGGACAAGAA TTCCAGCCAA 1080 

GTCCTGGGTG AGAAGGTCTT GGGGATTGTG GTACAGAACA CCAAAGTAGC CAACCTCACG 1140 

GAGCCCGTGG TGCTCACTTT CCAGCACCAG CTACAGCCGA AGAATGTGAC TCTGCAATGT 1200 

GTGTTCTGGG TTGAAGACCC CACATTGAGC AGCCCGGGGC ATTGGAGCAG TGCTGGGTGT 1260 

GAGACCGTCA GGAGAGAAAC CCAAACATCC TGCTTCTGCA ACCACTTGAC CTACTTTGCA 1320 

GTGCTGATGG TCTCCTCGGT GGAGGTGGAC GCCGTGCACA AGCACTACCT GAGCCTCCTC 1380 

TCCTACGTGG GCTGTGTCGT CTCTGCCCTG GCCTGCCTTG TCACCATTGC CGCCTACCTC 1440

TGCTCCAGGG TGCCCCTGCC GTGCAGGAGG AAACCTCGGG ACTACACCAT CAAGGTGCAC 1500 

ATGAACCTGC TGCTGGCCGT CTTCCTGCTG GACACGAGCT TCCTGCTCAG CGAGCCGGTG 1560 

GCCCTGACAG GCTCTGAGGC TGGCTGCCGA GCCAGTGCCA TCTTCCTGCA CTTCTCCCTG 1620 

CTCACCTGCC TTTCCTGGAT GGGCCTCGAG GGGTACAACC TCTACCGACT CGTGGTGGAG 1680 

GTCTTTGGCA CCTATGTCCC TGGCTACCTA CTCAAGCTGA GCGCCATGGG CTGGGGCTTC 1740 

CCCATCTTTC TGGTGACGCT GGTGGCCCTG GTGGATGTGG ACAACTATGG CCCCATCATC 1800 

TTGGCTGTGC ATAGGACTCC AGAGGGCGTC ATCTACCCTT CCATGTGCTG GATCCGGGAC 1860 

TCCCTGGTCA GCTACATCAC CAACCTGGGC CTCTTCAGCC TGGTGTTTCT GTTCAACATG 1920 

GCCATGCTAG CCACCATGGT GGTGCAGATC CTGCGGCTGC GCCCCCACAC CCAAAAGTGG 1980 

TCACATGTGC TGACACTGCT GGGCCTCAGC CTGGTCCTTG GCCTGCCCTG GGCCTTGATC 2040 

TTCTTCTCCT TTGCTTCTGG CACCTTCCAG CTTGTCGTCC TCTACCTTTT CAGCATCATC 2100

ACCTCCTTCC AAGGCTTCCT CATCTTCATC TGGTACTGGT CCATGCGGCT GCAGGCCCGG 2160 

GGTGGCCCCT CCCCTCTGAA GAGCAACTCA GACTGCGCCA GGCTCCCCAT CAGCTCGGGC 2220 

AGCACCTCGT CCAGCCGCAT CTAGGCCTCC AGCCCACCTG CCCATGTGAT GAAGCAGAGA 2280 

TGCGGCCTCG TCGCACACTG CCTGTGGCCC CCGAGCCAGG CCCAGCCCCA GGCCAGTCAG 2340 

CCGCAGACTT TGGAAAGCCC AACGACCATG GAGAGATGGG CCGTTGCCAT GGTGGACGGA 2400 

CTCCCGGGGC TGGGGCTTTT GAATTGGCCT TGGGGACTAC TCGGCTCTCA CTCAGCTCCC 2460 

ACGGGACTCA GAAGTGCGCC GCCATGCTGC CTAGGGTACT GTCCCCACAT CTGTCCCAAC 2520 

CCAGCTGGAG GCCTGGTCTC TCCTTACAAC CCCTGGGCCC AGCCTCATTG CTGGGGGCCA 2580 

GGCCTTGGAT CTTGAGGGTC TGGCACATCC TTAATCCTGT GCCCCTGCCT GGGACAGAAA 2640 

TGTGGCTCCA GTTGCTCTGT CTCTCGTGGT CACCCTGAGG GCACTCTGCA TCCTCTGTCA 2700 

TTTTAACCTC AGGTGGCACC CAGGGCGAAT GGGGCCCAGG GCAGACCTTC AGGGCCAGAG 2760 

CCCTGGCGGA GGAGAGGCCC TTTGCCAGGA GCACAGCAGC AGCTCGCCTA CCTCTGAGCC 2820 
G 



A50 Protein sequence
Gene name: G protein-coupled receptor 56
Unigene number: Hs.6527
Protein Accession #: NM_005682.1
Signal sequence: 1-26
GPS domain: 342-394
Pfam domain: 7tm_2 [400-665]
Transmembrane domains: 410-432, 446-468, 482-504, 517-539, 575-597, 608-630, 638-659
Cellular Localization: plasma membrane



1 11 21 31 41 51 

| | | | | |
MTPQSLLQTT LFLLSLLFLV QGAHGRGHRE DFRFCSQRNQ THRSSLHYKP TPDLRISIEN 60
SEEALTVHAP FPAAHPASRS FPDPRGLYHF CLYWNRHAGR LHLLYGKRDF LLSDKASSLL 120
CFQHQEESLA QGPPLLATSV TSWWSPQNIS LPSAASFTFS FHSPPHTAAH NASVDMCELK 180
RDLQLLSQFL KHPQKASRRP SAAPASQQLQ SLESKLTSVR FMGDMVSFEE DRINATVWKL 240
QPTAGLQDLH IHSRQEEEQS EIMEYSVLLP RTLFQRTKGR SGEAEKRLLL VDFSSQALFQ 300
DKNSSQVLGE KVLGIVVQNT KVANLTEPVV LTFQHQLQPK NVTLQCVFWV EDPTLSSPGH 360
WSSAGCETVR RETQTSCFCN HLTYFAVLMV SSVEVDAVHK HYLSLLSYVG CVVSALACLV 420
TIAAYLCSRV PLPCRRKPRD YTIKVHMNLL LAVFLLDTSF LLSEPVALTG SEAGCRASAI 480
FLHFSLLTCL SWMGLEGYNL YRLVVEVFGT YVPGYLLKLS AMGWGFPIFL VTLVALVDVD 540
NYGPIILAVH RTPEGVIYPS MCWIRDSLVS YITNLGLFSL VFLFNMAMLA TMVVQILRLR 600
PHTQKWSHVL TLLGLSLVLG LPWALIFFSF ASGTFQLVVL YLFSIITSFQ GFLIFIWYWS 660
MRLQARGGPS PLKSNSDCAR LPISSGSTSS SRI
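
Domain annotations such as the transmembrane ranges of A50 are given as residue positions in the protein sequence. The sketch below is illustrative only and not part of the original listing. It assumes the ranges are 1-based and inclusive, and the helper names `parse_ranges` and `slice_segments` are hypothetical. It parses a comma-separated range list and extracts the corresponding segments from a sequence printed in blocks of ten.

```python
def parse_ranges(text):
    """Parse '410-432, 446-468' into [(410, 432), (446, 468)]."""
    ranges = []
    for part in text.split(","):
        low, high = part.strip().split("-")
        ranges.append((int(low), int(high)))
    return ranges


def slice_segments(protein_text, ranges):
    """Return the residues covered by each 1-based, inclusive range.

    Spaces and the position numbers printed at the end of each row
    are ignored.
    """
    residues = "".join(ch for ch in protein_text if ch.isalpha())
    return [residues[low - 1:high] for low, high in ranges]


tm_ranges = parse_ranges(
    "410-432, 446-468, 482-504, 517-539, 575-597, 608-630, 638-659"
)
# Length of each annotated segment in residues.
print([high - low + 1 for low, high in tm_ranges])

# Toy slice on the first twenty residues of A50 (positions 3-5).
print(slice_segments("MTPQSLLQTT LFLLSLLFLV 20", [(3, 5)]))
```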



A51 DNA SEQUENCE

Gene name: Hypothetical protein FLJ20063
Unigene number: Hs.5940
Probeset Accession #: AA053660
Nucleic Acid Accession #: AA053660
Coding sequence: 218-1360 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

CCCCCATGAC TTTGTACAGC TACTTCACTG CTTCCCCCCA ATTAGTACAC ATAGTTCCTC 60



CACAATTCCT ACACCTGCTC CCCCCATAAT CAGTACACAT AGTTCCTCCA CAATTCCTAT 120 

ACCTACTGCT GCAGACAGTG AGTCAACCAC AAATGTAAAT TCATTAGCTA CCTCTGACAT 180 

AATCACCGCT TCATCTCCAA ATGATGGATT AATCACAATG GTTCCTTCTG AAACACAAAG 240 

TAACAATGAA ATGTCCCCCA CCACAGAAGA CAATCAATCA TCAGGGCCTC CCACTGGCAC 300 

CGCTTTATTG GAGACCAGCA CCCTAAACAG CACAGGTCCC AGCAATCCTT GCCAAGATGA 360 

TCCCTGTGCA GATAATTCGT TATGTGTTAA GCTGCATAAT ACAAGTTTTT GCCTGTGTTT 420 

AGAAGGGTAT TACTACAACT CTTCTACATG TAAGAAAGGA AAGGTATTCC CTGGGAAGAT 480 

TTCAGTGACA GTATCAGAAA CATTTGACCC AGAAGAGAAA CATTCCATGG CCTATCAAGA 540

CTTGCATAGT GAAATTACTA GCTTGTTTAA AGATGTATTT GGCACATCTG TTTATGGACA 600 

GACTGTAATT CTTACTGTAA GCACATCTCT GTCACCAAGA TCTGAAATGC GTGCTGATGA 660 

CAAGTTTGTT GATGTAACAA TAGTAACAAT TTTGGCAGAA ACCACAAGTG ACAATGAGAA 720 

GACTGTGACT GAGAAAATTA ATAAAGCAAT TAGAAGTAGC TCAAGCAACT TTCTAAACTA 780 

TGATTTGACC CTTCGGTGTG ATTATTATGG CTGTAACCAG ACTGGGGATG ACTGCCTCAA 840 

TGGTTTAGCA TGCGATTGCA AATCTGACCT GCAAAGGCCT AACCCACAGA GCCCTTTCTG 900 

CGTTGCTTCC AGTCTCAAGT GTCCTGATGC CTGCAACGCA CAGCACAAGC AATGCTTAAT 960 

AAAGAAGAGT GGTGGGGCCC CTGAGTGTGC GTGCGTGCCC GGCTACCAGG AAGATGCTAA 1020 

TGGGAACTGC CAAAAGTGTG CATTTGGCTA CAGTGGACTC GACTGTAAGG ACAAATTTCA 1080 

GCTGATCCTC ACTATTGTGG GCACCATCGC TGGCATTGTC ATTCTCAGCA TGATAATTGC 1140 

ATTGATTGTC ACAGCAAGAT CAAATAACAA AACGAAGCAT ATTGAAGAAG AGAACTTGAT 1200 

TGACGAAGAC TTTCAAAATC TAAAACTGCG GTCGACAGGC TTCACCAATC TTGGAGCAGA 1260 

AGGGAGCGTC TTTCCTAAGG TCAGGATAAC GGCCTCCAGA GACAGCCAGA TGCAAAATCC 1320 

CTATTCAAGA CACAGCAGCA TGCCCCGCCC TGACTATTAG AATCATAAGA ATGTGGAACC 1380 

CGCCATGGCC CCCAACCAAT GTACAAGCTA TTATTTAGAG TGTTTAGAAA GACTGATGGA 1440 

GAAGTGAGCA CCAGTAAAGA TCTGGCCTCC GGGGTTTTTC TTCCATCTGA CATCTGCCAG 1500

CCTCTCTGAA TGGAAGTTGT GAATGTTTGC AACGAATCCA GCTCACTTGC TAAATAAGAA 1560 

TCTATGACAT TAAATGTAGT AGATGCTATT AGCGCTTGTC AGAGAGGTGG TTTTCTTCAA 1620 

TCAGTACAAA GTACTGAGAC AATGGTTAGG GTTGTTTTCT TAATTCTTTT CCTGGTAGGG 1680 

CAACAAGAAC CATTTCCAAT CTAGAGGAAA GCTCCCCAGC ATTGCTTGCT CCTGGGCAAA 1740 

CATTGCTCTT GAGTTAAGTG ACCTAATTCC CCTGGGAGAC ATACGCATCA ACTGTGGAGG 1800 

TCCGAGGGGA TGAGAAGGGA TACCCACCAT CTTTCAAGGG TCACAAGCTC ACTCTCTGAC 1860

AAGTCAGAAT AGGGACACTG CTTCTATCCC TCCAATGGAG AGATTCTGGC AACCTTTGAA 1920 

CAGCCCAGAG CTTGCAACCT AGCCTCACCC AAGAAGACTG GAAAGAGACA TATCTCTCAG 1980 

CTTTTTCAGG AGGCGTGCCT GGGAATCCAG GAACTTTTTG ATGCTAATTA GAAGGCCTGG 2040 

ACTAAAAATG TCCACTATGG GGTGCACTCT ACAGTTTTTG AAATGCTAGG AGGCAGAAGG 2100 

GGCAGAGAGT AAAAAACATG ACCTGGTAGA AGGAAGAGAG GCAAAGGAAA CTGGGTGGGG 2160 

AGGATCAATT AGAGAGGAGG CACCTGGGAT CCACCTTCTT CCTTAGGTCC CCTCCTCCAT 2220 

CAGCAAAGGA GCACTTCTCT AATCATGCCC TCCCGAAGAC TGGCTGGGAG AAGGTTTAAA 2280 

AACAAAAAAT CCAGGAGTAA GAGCCTTAGG TCAGTTTGAA ATTGGAGACA AACTGTCTGG 2340 

CAAAGGGTGC GAGAGGGAGC TTGTGCTCAG GAGTCCAGCC GTCCAGCCTC GGGGTGTAGG 2400 

TTTCTGAGGT GTGCCATTGG GGCCTCAGCC TTCTCTGGTG ACAGAGGCTC AGCTGTGGCC 2460 

ACCAACACAC AACCACACAC ACACAACCAC ACACACAAAT GGGGGCAACC ACATCCAGTA 2520

CAAGCTTTTA CAAATGTTAT TAGTGTCCTT TTTTATTTCT AATGCCTTGT CCTCTTAAAA 2580 

GTTATTTTAT TTGTTATTAT TATTTGTTCT TGACTGTTAA TTGTGAATGG TAATGCAATA 2640 
AAGTGCCTTT GTTAGATGGT GAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 



A52 Protein sequence:
Gene name: Hypothetical protein FLJ20063
Unigene number: Hs.5940
Probeset Accession #: AA053660
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 289-311
EGF domain: 45-74
SEA domain: 80-196
Cellular Localization: plasma membrane

1 11 21 31 41 51
| | | | | |
MVPSETQSKN EMSPTTEDNQ SSGPPTGTAL LETSTLNSTG PSNPCQDDPC ADNSLCVKLH 60
NTSFCLCLEG YYYNSSTCKK GKVFPGKISV TVSETFDPEE KHSMAYQDLH SEITSLFKDV 120
FGTSVYGQTV ILTVSTSLSP RSEMRADDKF VDVTIVTILA ETTSDNEKTV TEKINKAIRS 180
SSSNFLNYDL TLRCDYYGCN QTADDCLNGL ACDCKSDLQR PNPQSPFCVA SSLKCPDACN 240
AQHKQCLIKK SGGAPECACV PGYQEDANGN CQKCAFGYSG LDCKDKFQLI LTIVGTIAGI 300
VILSMIIALI VTARSNNKTK HIEEENLIDE DFQNLKLRST GFTNLGAEGS VFPKVRITAS 360
RDSQMQNPYS RHSSMPRPDY



A53 DNA SEQUENCE
Gene name: TMPRSS3a
Unigene number: Hs.298241
Probeset Accession #: AI538613
Nucleic Acid Accession #: AB038157
Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 



TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300
GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360
ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420
TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480
GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540
AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600
AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660
GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720
CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780
TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840
AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900
CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960
ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020
CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080
AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140
CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200
GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260
TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320
GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380
GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440
AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500
TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560
ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620
TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680
TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740
CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800
AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGCGATTCTC 1860
TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920
TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980
CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040
ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100
AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160
ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220
GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280
CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340
ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400
TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460
AAAAAAAA

A54 Protein sequence:
Gene name: TMPRSS3a
Unigene number: Hs.298241
Probeset Accession #: AI538613
Protein Accession #: BAB20077
Signal sequence: none found
Transmembrane domains: 43-65, 239-261
Tryp_SPc domain: 216-444
Cellular Localization: plasma membrane

1 11 21 31 41 51
| | | | | |
MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA 360
AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 420
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT



A55 DNA SEQUENCE

Gene name: Putative G protein-coupled receptor GPCR150
Unigene number: Hs.97101

Probeset Accession #: AA215333
Nucleic Acid Accession #: NM_014373

Coding sequence: 322-1338 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51
| | | | | |
GTGGCCTCGA GGTGGTGGCA GGGCCGCCCC CTGCAGTCCG GAGACGAACG CACGGACCGG 60
GCCTCCGGAG GCAGGTTCGG CTGGAAGGAA CCGCTCTCGC TTCGTCCTAC ACTTGCGCAA 120
ATGTCTCCGA GCTTACTCAC ATAGCATATT GGTATATCAA AATGAAATGC AAGGAACCAA 180
AAATAACATA ATTGAAGGCA GTAAAAGTGA AATTAAATAG GAAGATCATC AGTCAAGGAA 240
GACCCACTGG AGAGGACAGA AAATGAAGCA GTGTTTTATC ATGTGTATTT CAGCAGGTCT 300
TCTTGAAATT TAACTAAAAA TATGACTGCT CTCTCTTCAG AGAACTGCTC TTTTCAGTAC 360
CAGTTACGTC AAACAAACCA GCCCCTAGAC GTTAACTATC TGCTATTCTT GATCATACTT 420
GGGAAAATAT TATTAAATAT CCTTACACTA GGAATGAGAA GAAAAAACAC CTGTCAAAAT 480
TTTATGGAAT ATTTTTGCAT TTCACTAGCA TTCGTTGATC TTTTACTTTT GGTAAACATT 540
TCCATTATAT TGTATTTCAG GGATTTTGTA CTTTTAAGCA TTAGGTTCAC TAAATACCAC 600
ATCTGCCTAT TTACTCAAAT TATTTCCTTT ACTTATGGCT TTTTGCATTA TCCAGTTTTC 660
CTGACAGCTT GTATAGATTA TTGCCTGAAT TTCTCTAAAA CAACCAAGCT TTCATTTAAG 720
TGTCAAAAAT TATTTTATTT CTTTACAGTA ATTTTAATTT GGATTTCAGT CCTTGCTTAT 780



GTTTTGGGAG ACCCAGCCAT CTACCAAAGC CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

TGTCCTTTCT ATGTCAGCAT TCAGAGTTAC TGGCTGTCAT TTTTCATGGT GATGATTTTA 900 

TTTGTAGCTT TCATAACCTG TTGGGAAGAA GTTACTACTT TGGTACAGGC TATCAGGATA 960 

ACTTCCTATA TGAATGAAAC TATCTTATAT TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

GTGAGATCTA AAAAAATATT CTTATCCAAG CTCATTGTCT GTTTTCTCAG TACCTGGTTA 1080 

CCATTTGTAC TACTTCAGGT AATCATTGTT TTACTTAAAG TTCAGATTCC AGCATATATT 1140 

GAGATGAATA TTCCCTGGTT ATACTTTGTC AATAGTTTTC TCATTGCTAC AGTGTATTGG 1200 

TTTAATTGTC ACAAGCTTAA TTTAAAAGAC ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260 

TGGAAGTGCT GCTTCATTCC ACTTACAATT CCTAATCTTG AGCAAATTGA AAAGCCTATA 1320 

TCAATAATGA TTTGTTAATA TTATTAATTA AAAGTTACAG CTGTCATAAG ATCATAATTT 1380 

TATGAACAGA AAGAACTCAG GACATATTAA AAAATAAACT GAACTAAAAC AACTTTTGCC 1440 

CCCTGACTGA TAGCATTTCA GAATGTGTCT TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500 

GTTTTATTTT AAAAACAAAA TAATTCCAAG AAGTTTTTAT AGTTATTCAG GGACACTATA 1560 

TTACAAATAT TACTTTGTTA TTAACACAAA AAGTGATAAG AGTTAACATT TGGCTATACT 1620 

GATGTTTGTG TTACTCAAAA AAACTACTGG ATGCAAACTG TTATGTAAAT CTGAGATTTC 1680 

ACTGACAACT TTAAGATATC AACCTAAACA TTTTTATTAA ATGTTCAAAT GTAAGCAAGA 1740 
AAAAAAAAA 



A56 Protein sequence
Gene name: Putative G protein-coupled receptor GPCR150
Unigene number: Hs.97101
Protein Accession #: NP_055188
Signal sequence: none found
Transmembrane domains: 23-45, 59-81, 97-119, 138-160, 184-206, 241-263, 276-297
Cellular Localization: plasma membrane

1 11 21 31 41 51
| | | | | |
MTALSSENCS FQYQLRQTNQ PLDVNYLLFL IILGKILLNI LTLGMRRKNT CQNFMEYFCI 60
SLAFVDLLLL VNISIILYFR DFVLLSIRFT KYHICLFTQI ISFTYGFLHY PVFLTACIDY 120
CLNFSKTTKL SFKCQKLFYF FTVILIWISV LAYVLGDPAI YQSLKAQNAY SRHCPFYVSI 180
QSYWLSFFMV MILFVAFITC WEEVTTLVQA IRITSYMNET ILYFPFSSHS SYTVRSKKIF 240
LSKLIVCFLS TWLPFVLLQV IIVLLKVQIP AYIEMNIPWL YFVNSFLIAT VYWFNCHKLN 300
LKDIGLPLDP FVNWKCCFIP LTIPNLEQIE KPISIMIC



A57 DNA SEQUENCE

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H. sapiens]

Unigene number: Hs.19322 
Probeset Accession #: AA088458 
Nucleic Acid Accession #: AA088458 

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | |

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC OGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 



856 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 



TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280
GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340
GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400
AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460
CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520
GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580
GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640
GCGTGCACAC TGTGATGACA CCOGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700
CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760
TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG COGGGCGTGG TGGTTCAOGC 2820
CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880
CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940
TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000
TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA






A58 DNA sequence
Gene name: ESTs
Unigene number: Hs.157601
Probeset Accession #: W07459
Nucleic Acid Accession #: AC005383
Coding Sequence: 328-2751 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51
I I I I I I
GACAGTGTTC GCGGCTGCAC CGCTCGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60
TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC TATGCACCTC 120
CCTGGCGGTA GTTCCTCCGA CCTCAGCCGG [illegible] GCCGCCCTCT CCCAGGAGAG 180
ACAAACAGGT GTCCCACGTG GCAGCCGCGC CCCGGGCGCC CCTCCTGTGA TCCCGTAGCG 240
CCCCCTGGCC CGAGCCGCGC CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300
TCGCCGCTCT CCTTCCGTTA TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360
GTTTTCCTGT TTTCCAGAGT GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420
GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT [illegible] TGCAGTGGAC 480
ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540
CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600
GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660
CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720
CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780
CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840
CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900
GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGCAGGTG 960
GAGGATGCCA CCAAGGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020
ACGCCAGACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGACGCTGGA GATGGTCCGG 1080
GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140
GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200
AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTGTT 1260
CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320
TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380
GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440
GCCGTGCTGA GCGAGGACTC TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500
CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560
GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620
CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680
CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCCAG CGCGTCACGC AAGGGCGCGA 1740
GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800
GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860
GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920
CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980
AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040
CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100
GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG GTGGGGTGGG CTCAGCCGGC 2160
ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC [illegible] 2220
GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280
GCCCAGAAGC TGAGGAACAA TGGCATCTCT [illegible] TGGGCGTGGG GCCTGTCCTA 2340
AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400
GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460
CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520
GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC CCCACTGCGA GAACCGTGAG 2580
TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640
ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700
GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCCTTA GAATGTCTGC 2760
TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820
ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 2880
TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940
CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGCGGCCTGA 3000
CGTTCCTTTG CACACAATCA ATGCTOGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060
AGGCCTTTAC TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120
GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180
CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240
GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300
TGTGCATGGG CCCAGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360
ACCTTGAAGG TCTTC






A59 Protein sequence
Gene name: ESTs
Unigene number: Hs.157601
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: none found
VGW domains: 49-223; 341-518; 529-706
EGF domains: 298-333; 71?-748
Cellular Localization: secreted



1 11 21 31 41 51 

I I I I I 1 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420

LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540

SVGPENFAQM QSPVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660

SVLVVGVGFV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 
RTPPSNYREG LGTEMVPTFW NVCAPGP 

A60 DNA SEQUENCE

Gene name: Homo sapiens type II membrane serine protease mRNA

Unigene number: Hs. 63325 

Probeset Accession #: AA411502 
Nucleic Acid Accession #: NM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 



A61 Protein sequence:
Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs.63325
Probeset Accession #: AA411502
Protein Accession #: NP_057509
Signal sequence: none found
Transmembrane domains: 31-53
LDLa domain: 54-94
Tryp_SPc domain: 204-429
Cellular Localization: plasma membrane / ER



1 11 21 31 41 51 

I I I I 1 I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60

YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180

GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 
KVSAYLNWIY NVWKAEL 






A62 DNA SEQUENCE 

Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens] 

Unigene number: Hs. 105484 

Probeset Accession #: AA314779

Nucleic Acid Accession #: none found

Coding sequence: 103-579 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i I I I I 1 

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAAGCGCT AGTAAGGTCT 60 

CTGAGATCCT TGCACTAGCT ACATCCTCAG GGTAGGAGGA AGATGGCTTC CAGAAGCATG 120 

CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAG TCCTGGGTGA TATCATCATG 180 

AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA ATTGCTATGG TTACTTCAGG 240 

AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTCAGT CTTACGGAAA CGGAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAGTGGCTAT 360 

CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 420 

TGGATTGATG GGGCCATGTA TCTGTACAGA TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 480 

AAGCACTGTG CTGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540 

AACAAGCGCC AACACTTCCT GTGCAAGTAC CGACCATAGA GCAAGAATCA AGATTCTGCT 600 

AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 

GGCTTAGAGA CAGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 

GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 

AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 

ACACCCTTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020 

TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A63 Protein sequence:
Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens]
Unigene number: Hs.105484
Probeset Accession #: AA314779
Protein Accession #: none found
Signal sequence: 1-22
Transmembrane domains: none found
C-type lectin domain: 47-156
Cellular Localization: secreted



1 11 21 31 41 51

I I I I I I

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC AFGWFYHKSN CYGYFRKLRN WSDAELECQS 60 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 120 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 

A64 DNA SEQUENCE 

Gene name: Cadherin 3, P-cadherin (placental) 

Unigene number: Hs.2877

Probeset Accession #: X63629

Nucleic Acid Accession #: X63629

Coding sequence: 54-2543 (start and stop codons are underlined) 



1 11 21 31 41 51 

i i I I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 

TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 

CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGOGAG ACAGTCCAGG 300 

AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 

GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 

CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 

AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 

TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 

GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 

AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 

AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 

CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 






CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680
ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC [illegible] TTCGACTATG 2400
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640
GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820
TGCTCAACCC TGTGTCCTGQ GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A



A65 Protein sequence:
Gene name: Cadherin 3, P-cadherin (placental)
Unigene number: Hs.2877
Probeset Accession #: X63629
Protein Accession #: CAA45177
Signal sequence: 1-24
Transmembrane domain: 659-675 (second underlined sequence)
Cellular localization: plasma membrane



1 11 21 31 41 51 

I I I I I I 

MGLPRGPIAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGGA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240

DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300

TDMDGDGSTT TAVAVVEILD AKDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420

PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480

DPAGWLAMDP DSGQVTAVGT LDREDEQPVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540

VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600

TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720

GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



A66 DNA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number: Hs.106778

Probeset Accession #: N51919

Nucleic Acid Accession #: AF189723

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60

ATTCTCCAAG CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 






ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900
ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960
ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020
AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080
GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140
TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA TGCTGTAATT 1200
AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260
ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 1320
GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CACCGAACAC AGCAGGACAG ACCAGAGATT 1380
TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 1440
GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500
ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560
ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620
ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680
GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740
GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800
TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTCGCTACA GAAGAACGGT 1860
TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920
ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980
CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040
AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100
ATCTCATTGG CTACATTAAT GAACTTTCCT AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160
ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 2220
GATGTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280
CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340
GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400
TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460
GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520
CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2580
GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640
AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 2700
CTTGAAGTAT GA



A67 Protein sequence:
Gene name: ATPase, Ca++ transporting, type 2C, member 1
Unigene number: Hs.106778
Probeset Accession #: N51919
Protein Accession #: AAF27813
Signal sequence: none found
Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878
Pfam domains: Cation ATPase N [6-83], E1-E2 ATPase [89-324], Hydrolase [328-649], Cation ATPase C [744-889]
Cellular Localization: not determined



1 11 21 31 41 51
I I I I I I
MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI 60
SQFKNPLIML LLASAVISVL MHQFDDAVSI TVAILIVVTV AFVQEYRSEK SLEELSKLVP 120
PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDES SLTGETTPCS 180
KVTAPQPAAT NGDLASRSNI AFMGTLVRCG KAKGVVIGTG ENSEFGEVFK MMQAEEAPKT 240
PLQKSMDLLG KQLSFYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIVVTV 300
TLALGVMRMV KKRAIVKKLP IVETLGCCNV ICSDKTGTLT KNEMTVTHIF TSDGLHAEVT 360
GVGYNQFGEV IVDGDVVHGF YNPAVSRIVE AGCVCNDAVI RNNTLMGKPT EGALIALAMK 420
MGLDGLQQDY IRKAEYPPSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK 480
GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIID PPRTGVKEAV 540
TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 600
FYRASPRHKM KIIKSLQKNG SVVAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660
LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMMFP NPLNAMQILW 720
INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKNLI LKILVSSIII VCGTLFVFWR 780
ELRDNVITPR DTTMTFTCFV FFDMFNALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840
LVIYFPPLQK VFQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSSSF 900
LEV



A68 DNA SEQUENCE
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Nucleic Acid Accession #: NM_001719
Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

1 I I I I I 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGOG 120 

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCOGA ATTCCGGATC TACAAGGACT 660
ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780 

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840 

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG OGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320 

AGCTCAATGC CATCTCCGTC CTCTACTTOG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCOGAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAAOGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860 
CAATAAAACG AATGAATG 



A69 Protein sequence: 

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)

Unigene number: Hs.170195

Probeset Accession #: BE616633 

Protein Accession #: NP_001710.1 

Signal sequence: 1-30 

Pfam domains: TGFb_propeptide (37-2813 

Transmembrane domains: none found 

Cellular Localization: secreted 



1 11 21 31 41 51 

1 I I I I I 

MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60

ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120

LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 180

IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWVVNPR 240

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 300

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360

GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 420
RNMVVRACGC H

Cervical 

A70 DNA sequence 

Gene name: bone morphogenetic protein receptor IB (ALK-6) 

Unigene number: Hs. 87223 

Probeset Accession #: AA250737
Nucleic Acid Accession #: NM_001203

Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I i I t I I 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 

GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 

GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 

CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 

AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 

AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600

CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 

ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 

AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 

CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 

GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 

ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 

AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 

GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560

CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



A71 Protein sequence 

Gene name: bone morphogenetic protein receptor IB (ALK-6)

Unigene number: Hs.72472 / Hs.87223

Probeset Accession #: AA250737 / U89326

Protein Accession #: NP_001194

Signal sequence: 1-13

Transmembrane domains: 128-144

PFAM domains: activin_receptor [30-111], protein kinase [204-491]
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 

DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180 

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 



Bladder 

A72 DNA SEQUENCE 

Gene name: Homo sapiens type II membrane serine protease mRNA 

Unigene number: Hs.63325 

Probeset Accession #: AA411502 

Nucleic Acid Accession #: NM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 



A73 Protein sequence:
Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs.63325
Probeset Accession #: AA411502
Protein Accession #: NP_057509
Signal sequence: none found
Transmembrane domains: 31-53
LDLa domain: 54-94
Tryp_SPc domain: 204-429
Cellular Localization: plasma membrane / ER



1 11 21 31 41 51 

I I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180

GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 
KVSAYLNWIY NVWKAEL 

A74 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H. sapiens]
Unigene number: Hs.19322

Probeset Accession #: AA088458
Nucleic Acid Accession #: AA088458

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I 

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGAOGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGGGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280 

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTOCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460 

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580 

GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640 

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGCGTGG TGGTTCACGC 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 



A75 DNA SEQUENCE 

Gene name: SCCA2 or 2b, serine (or cysteine) proteinase inhibitor 

Unigene number: Hs.227948

Probeset Accession #: AB035089
Nucleic Acid Accession #: AB035089

Coding sequence: 9645-10219 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240
AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300
GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 
AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 
GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 
AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 
AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 
AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 
GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 
GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 
CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 
TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 
TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 
CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 
TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 
CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 
CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 
TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 
CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 
TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 
CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 
ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 
GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 
CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 
AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 
TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 
TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 
TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 
ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 
ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980
AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 
GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 
ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120
ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 
CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC COGCTTGTGA AATCTGAAGT 3600 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 
AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 
AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 
AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 
GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 
TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 
AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 
AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 
ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAAOGATAC 6360 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 
TGAGTAAGAG TAATAAGTAA TAAGATACCA TOGATAAACT GGCACTGACT CAGTCACATA 6480
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 
CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020 
AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCAOGTGTTA TTTGTTGGAC 8580
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 



A76 Protein sequence:

Gene name: SCCA2 or 2b, serine (or cysteine) proteinase inhibitor

Unigene number: Hs.227948

Probeset Accession #: AB035089

Protein Accession #: BAB21525

Signal sequence: none found

Transmembrane domains: none found

Serine Proteinase Inhibitor domain: 13-390

Cellular Localization: secreted 



1 11 21 31 41 51 

I I I I I I 

MNSLSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120 

DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180 

YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240 

LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 300 

TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAVV VVELSSPSTN 360
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP 



A77 DNA SEQUENCE

Gene name: hypothetical protein FLJ13459

Unigene number: none found

Probeset Accession #: XM_047266
Nucleic Acid Accession #: XM_047266

Coding sequence: 485-1471 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51

I I I I I I

CTGACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG [missing] AGGTGTGAGC 60
CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC [missing] AACAGAGGTA 120
ATAAGATCAG TATCTTCTTA ATGGAAGCAC CTGGACTACA [missing] TCATTGTTAT 180
CATAAATGAG GACTAACCTG TCTCCCGTTG GGAGTTTTGA [missing] CATGTCTTCA 240
TGACGTCATC ACTGCCCCAG GCCCAGCTGT GTCCCTACAC [missing] TGACGCATCT 300
TCTTTTTCTG CCTGTAGAGA TGGTTACAAT GCCTGGCGTG [missing] GCCTTCGCAG 360
ATCCTGGCGG GGCTGTGCCA ACGCTGTGGC CTCCCTGCCC [missing] AGCCGGTGCT 420
GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACCGGAGA [missing] AGGGATCTCT 480
TCACATGTGG ATTGACATCT TTCCTCAAGA TGTGCCTGCT CCACCCCCAG TTGACATCAA 540
GCCTCGGCAG CCAATCAGCT ATGAGCTCAG AGTTGTCATC TGGAACACGG AGGATGTGGT 600
TCTGGATGAC GAGAATCCAC TCACCGGAGA GATGTCGAGT GACATCTATG TGAAGAGCTG 660
GGTGAAGGGG TTGGAGCATG ACAAGCAGGA GACAGACGTT CACTTCAACT CCCTGACTGG 720
GGAGGGGAAC TTCAATTGGC GCTTTGTGTT CCGCTTTGAC TACCTGCCCA CGGAGCGGGA 780
GGTGAGCGTC TGGCGCAGGT CTGGACCCTT TGCCCTGGAG GAGGCGGAGT TCCGGCAGCC 840
TGCAGTGCTG GTCCTGCAGG TCTGGGACTA TGACCGCATC TCTGCCAATG ACTTCCTTGG 900
ATCCCTGGAG TTGCAGCTAC CAGACATGGT GCGTGGGGCC CGGGGCCCCG AGCTCTGCTC 960
TGTGCAGCTG GCCCGCAATG GGGCCGGGCC GAGGTGCAAT CTGTTTCGCT GCCGCCGCCT 1020
GAGGGGCTGG TGGCCGGTAG TGAAGCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGCGCA 1080
GGAGGCTCAG GCTGGCAAGA AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC GGCCAGAAGA 1140
CCTGGAGTTC ACAGACATGG GTGGCAATGT GTACATCCTC ACGGGCAAGG TGGAGGCAGA 1200
GTTTGAGCTG CTGACTGTGG AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGCGGAAGCA 1260
GCCAGAGCCT CTGGAGAAAC CCAGCCGCCC CAAAACTTCC TTCAACTGGT TTGTGAACCC 1320
GCTGAAGACC TTTGTCTTCT TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT 1380
ACTGGTGCTG CTCACCGTCT TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAGATCAG 1440
CCAGGTCATC TTCCGTCCCC TCCACAAGTG ACTCTCGCTG ACCTTGGACA CTCACCCAGG 1500
GTGCCAACCC TTCAATGCCT GCTCCTGGAA GTCTTTCTTA CCCATGTGAG CTACCCCAGA 1560
GTCTAGTGCT TCCTCTGAAT AAACCTATCA CAGCCACTG



A78 Protein sequence:

Gene name: hypothetical protein FLJ13459
Unigene number: none found
Probeset Accession #: XM_047266
Protein Accession #: XP_047266
Signal sequence: none found
Transmembrane domains: 291-313
C2 domain: 27-86
Cellular Localization: plasma membrane /



1 11 21 31 41 51 

I 1 I I I I 

MWIDIFPQDV PAPPPVDIKP RQPISYELRV VIWNTEDVVL DDENPLTGEM SSDIYVKSWV 60
KGLEHDKQET DVHFNSLTGE GNFNWRFVFR FDYLPTEREV SVWRRSGPFA LEEAEFRQPA 120
VLVLQVWDYD RISANDFLGS LELQLPDMVR GARGPELCSV QLARNGAGPR CNLFRCRRLR 180
GWWPVVKLKE AEDVEREAQE AQAGKKKRKQ RRRKGRPEDL EFTDMGGNVY ILTGKVEAEF 240
ELLTVEEAEK RPVGKGRKQP EPLEKPSRPK TSFNWFVNPL KTFVFFIWRR YWRTLVLLLL 300
VLLTVFLLLV FYTIPGQISQ VIFRPLHK



A79 DNA SEQUENCE 

Gene name: Homo sapiens mRNA; cDNA DKFZp434K0322 (from clone DKFZp434K0322) 

Unigene number: Hs.161031

Probeset Accession #: AL137708 
Nucleic Acid Accession #: AL137708 

Coding sequence: 1315-1791 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

GGCATTGATG CTGTGTGCGC GTGCGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 60 
GTGTGTGTGT GTCTGGAGTC ATGGCAGGGT CCCTTTCTGT CTGTCTCCTT GCTCTGCCCC 120 
AGACTGGGGG GCTGCAGAGG TGAGGGTATC TGGCCTCAAC AGCTGCTTAT TCCCGATGGG 180 
ATGGCCTGGG CTGGGCCCCT GAGGCCAGGC TGACTTGGAC ATGGCAAGAG GGGTCCCAGG 240 
CTCTTGTGGG CAAAGCAGGG GAGGCGCCAA TGTGGAGGAA CAGAGTCTCC TGGCTGGCTG 300 
CTGCTCTCTG GAGCGGGTGG AGTCAGGGAA GAGCTGAGCT GGGGAGTCAC CCTGGGCCTG 360 
GGGTCACCGT AGGCCCCATG TAGCACCCTG GTTCCCCTGC CTGTAGGTGA CAGGAGCCAG 420 
CCCAGCCAGG TGTGCTCCCT CCCCAGGCCC TAGGCAGGCG GGTACAGGGG CCAGCAGCTG 480 
CGCCCGCCCC ACCTTCCTTC CCACCCACAT GCCGAAGGGT GGCCAGGCAG GCAGGTGGAC 540 
GAGTCCAGGC AGCGGCTGAG TCAGTGTGTG TGGAATGTTC TGGCCGCTCC CAGCTGCACC 600 
CTGCCCCTAC CTGCCACCAC CTCACCTTCA TCCTCAGGCG CTGCGGCCCT GAGCCCCTGC 660 
CAGGAATGCA CCTTTAGCCC AGGCCTGCTC AGTGAGCTCC GCCGACAGCC AGCCCTGCTC 720 
CTCCCGCCAT GACCCTGCAG ACCCCTCTGG GCTTCCAAGT TCCTGGGGGC TGCAGTGAAC 780 
ATGCTCCACC TGCATGGCTG GCAAACCATG GTGGGCCCCA GCTGTGGTGC GTGCTGGGGT 840 
AGAGGCAAGG AAGTGATGGG ACCGCAGAGA TGAGACCCCC AGGGATGAGA TGGGACCCCC 900 
AGGCAGGGCC CAGGGTCCAG GGCCCAGGAG AGAGAAGCAG GGAGGGAGAG AGCTTCCTGG 960 
TGGAGGACGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTCCGGTGAA GGCAGGGACT 1020 
AGGCTGCCCA GGCCCTGCCT GCTTGGCTGG GGCTGGGGGC TGCTGGGAGG TGGCTGGGAG 1080 
GCTGGGCCTG GGCAGCTAAG CTGGAGCTTT GGCCAGGGTC CAGAGCCTCC CTCCCTTCAG 1140 
CTTCCTGCTG CACAGAACCC TCGCCCCTGG CCACCCCGTG CTGCCTCCTT GCCCTGGCAG 1200 
ACCCAGCACT GGCTGCTGCT AGTCAGATGG GGTAGCGGGC AGGGGCCGGA GGGGCCACCC 1260
TCCCAGCTGA CCCAGCCTCC TGGGCCGCTT CTTCCAAACC AGCAGGGTAG AAAGATGGGG 1320 
CACCCACCAG TCTCTCCCAG TGCCCCGGCC CCAGCTGGCA CCACAGCTAT ACCTGGGCTT 1380 
ATTCCAGACC TTGTCGCCGG GACCCCCTGT GAGTTGTGGG ATTCCCAAGA GGGGTGTGGG 1440 
GATAACCCAG CCAAGTGGGG GCTGCAGCTG TCCACAGATG CACTCAGCCT GGCCTCTACC 1500 
CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG CGGGCGTCCT CCTCGTCTCC 1560 
TGCCTCCTCT GTGCTGCCTG CTGCTGCTGC CGCCGCCACA GGAAGAAGCC CAGGGACAAG 1620 
GAGTCCGTGG GTCTGGGCAG TGCCCGCGGC ACCACCACCA CCCACCTGGT GAGGAGCGGC 1680 
TCCTTGCTCA CTCAGTCCAG AGAGGGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGCGA 1740 
GGCGAGTTCA GCCCCAGGGA TGGTTTAACC CCCACAGAGG CAGGGCGTTG AGGACCTTCC 1800
TGGCAGGGAA AGTGGGTGAA CAGAGGTGAG AAGGAGGCCA TGCAACAGGG GCTGCCCCAT 1860
GGGCCCGAGG GAGCCACAGC GGGTTCTTGA GGAAGGCAGG GGGTACCCCA GATGCCACGT 1920 
TTTGGGTGGG TTTGGCCGGT CTCACAGAGC GAAGCCGACG ATTTGTGCCT GTTGGGTGGC 1980 
CTGGCCTGGA GGCGGGGGGT CTTGACCCAT GTCATGCAAG GGCTGCCCGG GAGCCCAGGG 2040 
CTCTGATGAG GCATGATGTC AGCACCACCT GCCCCTTGTC CCAACTCACT CCAGGTGCAA 2100 
CCTGATGTGG ATGGCCTGGA GTCCAGCCCG GGGGATGCTC AGCAATGGGG GTGCCTGCAG 2160 
CTCTCCCTGG AGTTCGACTT TGGAAGCCAG GAGGTGAAGG GCCCCGCTGC GCAGGACCAG 2220 
CGGTTCTGCG AGTTTCCGGA AAGGGTGACG GGGGAAGGGC AGACCCCATG CCCTGGGTGG 2280 
TGGGGAGCTG ACAGGGCAGG GGCCCTTGGC TGAGCCCACC CCGCTGGCTC CCAGATCAGG 2340 
GTGGGCCTGA GGCAGGCAGC CGACCTGAGG CCTGGGGGCA CCGTGGACCC CTATGCCCGG 2400 
GTCAGCGTCT CCACCCAGGC CGGACACAGA CATGAGACAA AAGTGCACCG AGGCACGCTC 2460 
TGCCCCGTGT TTGACGAGAC CTGCTGCTTC CACGTGAGTC AGGGATGGTC GGCTGGGTGG 2520 
GCCTGGACGG CTGGATGGGC CTGGGCTGGG TGGGCCTGGG CAGCTGGGTG GGCCTGGGCA 2580 
GCTGGGTGGG CCTGAGCTAG GGCAGCAGGG CCTGGCTCAC GCCGCTGCCT CAGATCCCGC 2640 
AGGCGGAGCT GCCAGGGGCC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG CGCTTCTCGG 2700 
GGCATGAGCC CCTGGGTGAG CTCCGTCTGC CACTGGGCAC CGTGGATCTG CAGCATGTTC 2760 
TGGAGCACTG GTACCTGCTG GGCCCGCCGG CTGCCACTCA GGTGAGGTGC TGGTCACCAG 2820 
GCCACAGCCC AAGGCAGAGC TGGCAGGGAC CCTGCCCTAT GGGCCATCGG AAAGACAGGC 2880 
CTGATGGGCA GCATTTTCGG GGGTCTGAGC CCCAACTCGG CCAGAATCAC CCTCCCGGGC 2940 
TGAAGCCCCT CTTGCTGCCC ACAGCCCGAG CAGGTCGGGG AGCTGTGCTT CTCTCTCCGG 3000 
TACGTGCCCA GCTCAGGCCG GCTGACOGTG GTGGTGCTGG AGGCTCGAGG CCTGCGTCCA 3060 
GGACTTGCAG AGCCCTACGT GAAGGTCCAG CTCATGCTGA ACCAGAGGAA GTGGAAGAAG 3120 
AGAAAGACAG CCACCAAAAA GGGCACGGCG GCCCCCTACT TCAATGAGGC CTTCACCTTC 3180 
TCAGCCAGGT CCAGAATGTG GACCTGGTGC TGGCTGTCTG GGACCGCAGC 3240 
GAACTGAGCC CGTAGGCAAG GTGCACCTGG GTGCCCGGGC CTOGGGGCAG 3300 

CCCCTGCAGC ACTGGGCAGA CATGCTGGCC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360

CCCCTGCGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTGCCCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 3480 
GCACTTGCCC AGGCCGCCCT GCAGGACCAC TGCAATAAAC GCCTTCTCCT GCC 



A80 Protein sequence:

Gene name: Homo sapiens mRNA; cDNA DKFZp434K0322 (from clone DKFZp434K0322)
Unigene number: Hs.161031
Probeset Accession #: AL137708
Protein Accession #: CAB70885
Signal sequence: none found
Transmembrane domains: 69-85
Cellular Localization: plasma membrane

1 11 21 31 41 51

I I I I I I

MGHPPVSPSA PAPAGTTAIP GLIPDLVAGT PCELWDSQEG CGDNPAKWGL QLSTDALSLA 60
STPGPRWALI AGALAAGVLL VSCLLCAACC CCRRHRKKPR DKESVGLGSA RGTTTTHLVR 120
SGSLLTQSRE GLKSRLQSPG QRGEFSPRDG LTPTEAGR



A81 DNA SEQUENCE 

Gene name: 
Unigene number: 
Probeset Accession # : 
Nucleic Acid Accession # : 
Coding sequence: 



Cadherin 3, P-cadherin (placental) 

Hs.2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



11 



21 



31 41 51 

I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA' GC CATGG GGC 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 
CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATQCA GGTGACAGCC ACAGATGAGG 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 
GCCATGAGGT GCAGAGGCTG AOGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 
ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 
ACGGTGGCGG GGAGGACGAC TAGGOGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 
GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CX^ CTG GCCG 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
A82 Protein sequence:
Gene name: Cadherin 3, p-cadherin (placental)
Unigene number: Hs.2877
Probeset Accession #: X63629
Protein Accession #: CAA45177
Signal sequence: 1-24
Transmembrane domain: 659-675 (second underlined sequence)
Cellular localization: plasma membrane



MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRIiNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFL 
GAVLALLFLL 
GLEARPEWL 
DYEGSGSDAA 



11 
I 

LLLLQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVP 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLUjLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
I 

CAASEPCRAV 
VQERRSLKER 
IPYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF 
ANDNAPMFDP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRHVLNITDK 
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYliN 



31 

i 

PREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRFKKLA 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQIiTDDSDIY 
HGHVETCPGP 



NLKAANTDPT 
DMYGGGEDD 



51 
1 

LGKVFMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
WTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR 
APPYDTLLVF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A83 DNA SEQUENCE

Gene name: putative G-protein coupled receptor
Unigene number: Hs.16085

Probeset Accession #: F07953
Nucleic Acid Accession #: NM_016334

Coding sequence: 104-1471 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



AGCACCTGGG 
GGGGAGTGGG 
CTCCAGCATC 
GCGCCAATTG 
GACGTTTGCA 
GAATAGCAGC 
GGTTTTCATG 
TAAACAACGA 
ACTAGGAGAT 
CATCAGCCGG 
TGTCAACTGC 
TCTAGCCCTG 
GATGGCAATG 
TTTCTGGGGA 
TATTCAACAG 
TGATCTATAT 
TAATTTTCTT 
CAATATTGTT 
TGTGAATTAT 
TCTTGTTGGA 
CTTTTATGCC 
AATGGGCATG 
CCGCACCATA 
TGATGTGATC 
ACAGGCACCA 
CAGTGGTTTC 
ATAAACAAAC 
AGGTGATACT 
ACAATACTCA 
CCAAGAAACT 
TGAGCCAAAC 
TTGAGATTGA 



AAAAGGCAGA 
AAGTGGAGGC 
ATGATTACCT 
TTTAAAGACT 
TTTTCTTGCA 
TCCCGTTATT 
GTGCCTTTTT 
CTGCTTTTTT 
CCCTTTCCCA 
GTTGGTGTGA 
CCATACACTT 
GAACGGCGAC 
GCACGGAGAA 
ATGATAAAAA 
GAAGTGGATG 
GCTACCAAGG 
GGTTACTTTT 
TTTGATCGAG 
CTGGGAATCC 
ATAATCATCG 
ATCTCTAGCA 
TACTTTGTCT 
ATCACTGAAG 
TTCCTGGTCA 
GAGAAGCAAA 
AAAATTTAGA 
AAAATGCTAT 
ATGACCATGA 
GCAGAGAGCA 
AAAGGTGAAA 
ACGTAGGATT 
CTCATTAAAA 



CCGTGTGAGG 
AGGAGCCTTC 
CCCAAATACT 
ATGAGATACG 
CCATGTTTGA 
TTCACTGGAA 
ACATTGGCTA 
CCTGTCTCTT 
TTCTCAGCCC 
TTGGAGTGAC 
ACATGTCTTA 
TGCTGCAAAC 
CAATGTTCCA 
GTGTTACCAC 
CTTTGGAAGA 
AGAGAATAGA 
TCTCTATTTA 
TTGGGAAAAC 
AATTTGATGT 
TCACATCCAT 
GTAAGTCCTC 
CCTCTGTGCT 
TCCTTGGAGA 
GCGCTCTCTC 
TGGCACCTTG 
TATAAGAGGG 
GGTAGCATTT 
GTAGCATCAG 
TCCCGTGTGG 
AATACACTGG 
TCCGTTTTAA 
TCAGAGACTG 



GGGCCTGTGG 
CTTACACTTC 
ATTTTTTGGA 
TCAGTATGTT 
GCTCATCATC 
AATGAACCTG 
TTTTATTGTG 
ATGGCTGACC 
AAAACATGGG 
TCTCATGGCT 
CTTCCTCAGG 
CATGGATATG 
GAAGGGGGAA 
TTCAGCATCA 
ATTAAGCAGG 
ATACTCCAAA 
CTGTGTTTGG 
GGATCCTGTC 
GAAGTTTTGG 
CAGAGGATTG 
CAATGTCATT 
GCTGATCCGA 
ACTGCAGTTC 
TAGCATACTC 
AACTTAAGCC 
GGGAAAAATG 
TTCACCTTCA 
CCAGAACATG 
ATATGAGGCT 
AACTCTGGGG 
GGTTCACATG 
T 



41 
i 

CCCCAGCGTG 
GCCATGAGTT 
TTTGGGTGGC 
GTACAGGTGA 
TTTGAAATCT 
TGCGTAATTC 
AGCAATATCC 
TTTATGTATT 
ATCTTATCCA 
CTTCTTTCTG 
AATGTGACTG 
ATCATAAGCA 
GTGCATAACA 
GGAAGTGAAA 
CAGCTTTTTC 
ACCTTCAAGG 
AAAATTTTCA 
ACAAGAGGCA 
TCCCAACACA 
CTGATCACTC 
GTCCTGCTAT 
ATGAGTATGC 
AACTTCTATC 
TTCCTCTATT 
TACTACAGAC 
GAACCAGGGC 
TAGCATACTC 
AGAGGGAGAA 
GGTGTAGAGG 
CAAGACATGT 
GAAAAGGTTA 



51 
I 

CTGTGGCCTC 
TCCTGATCGA 
TTTTCTTCAT 
TCTTCTCCGT 
TAGGAGTATT 
TGCTGATCCT 
GACTACTGCA 
TCTTCTGGAA 
TAGAACAGCT 
GATTTGGTGC 
ACACAGATAT 
AAAAGAAAAG 
AACCATCAGG 
ATCTTACTCT 
TGGAAACAGC 
GGAAATATTT 
TGGCTACCAT 
TTGAGATCAC 
TTTCCTTCAT 
TTACCAAGTT 
TAGCACAGAT 
CTTTAGAATA 
ACCGTTGGTT 
TGGCTCACAA 
TGTTAGAGGC 
CTGACATTTT 
CTTCCCCCTC 
CTAACTCAAG 
CGGAGAGGAG 
CTATGGTAGC 
TAGCTTTGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860



A84 Protein sequence
Gene name: putative G-protein coupled receptor
Unigene number: Hs.16085
Protein Accession #: NP_057418.1
Signal sequence: none found
Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 342-364, 380-402, 424-446
Cellular Localization: plasma membrane

1 11 21 31 41 51
I I I I I I

MSFLIDSSIM ITSQILFFGF GWLFFMRQLF KDYEIRQYVV QVIFSVTFAF SCTMFELIIF 60

EILGVLNSSS RYFHWKMNLC VILLILVFMV PFYIGYFIVS NIRLLHKQRL LFSCLLWLTF 120

MYFFWKLGDP FPILSPKHGI LSIEQLISRV GVIGVTLMAL LSGFGAVNCP YTYMSYFLRN 180 

VTDTDILALE RRLLQTMDMI ISKKKRMAMA RRTMFQKGEV HNKPSGFWGM IKSVTTSASG 240 

SENLTLIQQE VDALEELSRQ LFLETADLYA TKERIEYSKT FKGKYFNFLG YFFSIYCVWK 300 

IFMATINIVF DRVGKTDPVT RGIEITVNYL GIQFDVKFWS QHISFILVGI IIVTSIRGLL 360

ITLTKFFYAI SSSKSSNVIV LLLAQIMGMY FVSSVLLIRM SMPLEYRTII TEVLGELQFN 420 
FYHRWFDVIF LVSALSSILF LYLAHKQAPE KQMAP



A85 DNA SEQUENCE:
Gene name: TTK protein kinase
Unigene number: Hs.169840
Probeset Accession #: M86699
Nucleic Acid Accession #: NM_003318
Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons)
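
As an illustration of how the stated coding-sequence coordinates relate to the protein listing, the following minimal Python sketch translates the first ten codons starting at position 1026, using the row of this listing that covers positions 1021-1080. The names `CODON_TABLE`, `translate` and `row_1021` are illustrative assumptions, not part of the original disclosure, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption of this sketch).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; stop codons are rendered as '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Row of the listing covering positions 1021-1080, with the spaces removed.
row_1021 = "CCATAATGAACAAAGTGAGAGACATTAAAAATAAGTTTAAAAATGAAGACCTTACTGATG"
row_start = 1021  # 1-based coordinate of the first base in the row
cds_start = 1026  # start of the coding sequence given in the header

offset = cds_start - row_start
first_codons = row_1021[offset:offset + 30]
peptide = translate(first_codons)
print(peptide)  # MNKVRDIKNK
```

The printed peptide agrees with the first ten residues of the TTK protein kinase protein sequence, which is a quick check that the coordinates are 1-based and that the reading frame begins at the underlined start codon.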



1 11 21 31 41 51 

I I 1 I I I I 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 
CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 
CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 
CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 
ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 
TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 
AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 
GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480 
CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 
CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 
AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 
GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 
AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 
GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 
TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 
ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 
TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 
CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 
AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 
TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 
ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 
TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 
GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 
CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 
CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 
TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 
TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 
TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680
CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 
ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 
CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860 
GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 
AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 
TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 
CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 
AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 
GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 
AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 
AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 
GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 
TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 
AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 
ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580
GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 
TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 
TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 
ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 
AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880
TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 
TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 
CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 
TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 
GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 
AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 
AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 
ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 
CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 
AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 
ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 
GAAAAAAATG ATTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600
GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 
ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 
TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780
TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 






WO 03/042661 



PCT/US02/36810 



TAAATAAAGT TTTGTGGCTA AAATGA 



A86 Protein sequence:
Gene name: TTK protein kinase
Unigene number: Hs.169840
Probeset Accession #: M86699
Protein Accession #: NP_003309
Signal sequence: none found
Transmembrane domains: none found
Protein Kinase Domain: 510-775
Cellular Localization: cytoplasmic and nuclear



1 11 21 31 41 51

II I I I I 

MNKVRDIKNK FKNEDLTDEL SLNKISADTT DNSGTVNQIM MMANNPEDWL SLLLKLEKNS 60 

VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DARDYFQMAR 120

ANCKKFAFVH ISFAQFELSQ GNVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS 180

EEEKKNLSAS TVLTAQESFS GSLGHLQNRN NSCDSRGQTT KARFLYGENM PPQDAEIGYR 240 

NSLRQTNKTK QSCPFGRVPV NLLNSPDCDV KTDDSVVPCF MKRQTSRSEC RDLVVPGSKP 300

SGNDSCELRN LKSVQNSHFK EPLVSDEKSS ELIITDSITL KNKTESSLLA KLEETKEYQE 360 

PEVPESNQKQ WQAKRKSECI NQNPAASSNH WQIPELARKV NTEQKHTTFE QPVFSVSKQS 420

PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPVVKNDFP PACQLSTPYG QPACFQQQQH 480

QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 540 

LEEADNQTLD SYRNEIAYLN KLQQHSDKII RLYDYEITDQ YIYMVMECGN IDLNSWLKKK 600 

KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IANQMQPDTT 660 

SVVKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ 720

IINQISKLHA IIDPNHEIEF PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH 780

PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL KAAKTLYEHY SGGESHNSSS SKTFEKKRGK 840
K 



A87 DNA SEQUENCE 

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7) 

Unigene number: Hs.170195

Probeset Accession #: BE616633

Nucleic Acid Accession #: NM_001719

Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons) 
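
The coding-sequence range is given in 1-based, inclusive coordinates against the numbered rows of the listing. A minimal Python sketch of how such a range can be applied is shown below; it assumes rows laid out as blocks of ten bases followed by a running position number, and the helper names `clean_listing` and `cds_slice` are hypothetical, introduced only for illustration.

```python
import re


def clean_listing(block: str) -> str:
    """Join listing rows into one sequence, dropping spaces and position numbers."""
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


def cds_slice(seq: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive range start..end of seq."""
    return seq[start - 1:end]


# First three rows of the A87 listing (positions 1-180).
rows = """
GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60
CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120
CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180
"""

seq = clean_listing(rows)
start_codon = cds_slice(seq, 123, 125)
print(len(seq), start_codon)  # 180 ATG
```

Applied to the complete listing, `cds_slice(seq, 123, 1418)` would return 1296 bases, that is 432 codons including the stop codon.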



1 11 21 31 41 51 

I I I I 1 I 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660 

ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780 

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840 

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900

AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860 
CAATAAAACG AATGAATG 



A88 Protein sequence:
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Protein Accession #: NP_001710.1
Signal sequence: 1-30
Pfam domains: TGFb_propeptide [37-281]
Transmembrane domains: none found
Cellular Localization: secreted






1 11 21 31 41 51
I I I I I I

MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60

ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120

LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 180

IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWVVNPR 240

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 300

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360

GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 420

RNMVVRACGC H



A89 DNA SEQUENCE:
Gene name: Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763)
Unigene number: Hs.27373
Probeset Accession #: F13036
Nucleic Acid Accession #: AC012478
Coding sequence: 1-1896 (underlined sequences correspond to start and stop codons)



ATGOGCGCCG 
GCTCCCGCCG 
CGCGAGTCGC 
GCGGCGGGCG 
ATTTCCATCC 
TTTACCACCC 
AAGAAGACAC 
CTAAATGAAG 
TCCTTGCCGG 
GTGCCCCCAC 
GGTGGAATCA 
TCAGCTGCAA 
GAAACCAAAA 
TCAGACTGTC 
TTTGGGCATC 
CTGAATCTCA 
GCCAGGTGCT 
CCTTGGTGGC 
AGTACCTTGG 
AGAGCAGTGA 
GCCCTAGAGA 
ACCCATCCTG 
AGGGTGTTTC 
TGCCTTTTGG 
ATCTGTCTCC 
GTCCTTGCGT 
AAACGAGGGC 
GCGAACCTGC 
CACCCTGGAG 
GATGGCAGAT 
TGTGAGAGGC 
TCCCCCCGAC 



11 
I 

TGCCGCTGCC 
CCCGCGCCAG 
GGCCACCGCC 
GCAGCGGCAG 
TCCTCCGCGA 
TCCTCATCGC 
GCAAGTATGA 
AGGATGATGA 
CTGCACTGAG 
CCTTCATCCT 
GACCTGGTAA 
CCTGGGGTGT 
CGAACGTTAT 
ACTGGCAAGC 
CCTTTAAAGT 
TGGAAAAGCT 
TGCCACTGGT 
ACTTCAGCGC 
GCTTGGATGT 
CTAAGGTTCT 
GTGGGTTTCC 
TCAGGTTGGC 
GGCGTCCGCG 
TTTTGAAGAT 



CATTTGAGCA 
CCAGTGGGCA 
AGACAATTCC 
GAGACTTGGA 
GCCAGAAGAT 
TCACAGGTTC 
AGCCCCTGTT 



21 
I 

CGCCCCGCTC 
CAGAGCCGAG 
CGGCCCGGGG 
CTCCAGCTCC 
CCTACCCACC 
CTGCCTGCTG 
TATCATCACC 
AGATGAGGAC 
ACGTCAGCTG 
CGACATTGAC 
AACCTGTTTC 
GAAGGACTGG 
GTATAAAACC 
TCGTTTCCAC 
GCCCCCTACT 
GGATTCCTCT 
CCTGGCAGAA 
CACAGGCTCT 
TTTCTGTGGT 
CCAGGGTAGC 
CCATCATCTC 
TCGTTCAGAT 
GCAGTCTCTG 
TCTGTTGAGG 
CGTGGAACAC 
GAGCCCACAA 
ATTGACCAGA 
AGATACCCAA 
TGGGGTGGCA 
GGTCCTGATG 
CCATCACTTC 
TCTGTCCAGG 



31 



CTGCCGCTGC 
TCCGTCTCCG 
CCCGGGAACA 
AACAGCAGTG 
CTCAAGGCAG 
CTGCGCGTCT 
ACTCCAGCAG 
TCCACAGTAT 
CCAGGGTGCC 
CTTCCAGCAA 
CCAGCCTGGT 
ACCTGGAAGC 
CCAGCTCCAT 
GTCACCACAA 
TCTACTCCCC 
GCCTTACGCA 
ATGGCGGCTG 
CCAATAAAAA 
GCCGGCCAGC 
TCTTTCTCCA 
AGGCTTCTCA 
GCCCGGGGAC 
CATGGCGGAG 
CGCCATCCTC 
CTACGGGAAG 
AAGGCAGCTG 
CACACATGCC 
GGCCAGGAAG 
AATTTCTATT 
TCTGAGGAAG 
TCCAGCCATT 
CCCTGA 



41 
I 

TGCTGCTCGC 
CGCCGTGGCC 
CCACCCGGTT 
GCGACGCCTT 
CCGTGATCGT 
TCAGGTCGGG 
AGCGAGTGGA 
TCGACATCAA 
AGACGCTACT 
GATGCAGTGG 
GGCATCCTGT 
CCTCTTGCGT 
CGTGCGTGTC 
TGGAGTTGCT 
ATGGTTTTCG 
GAAACACCGG 
CTGAAAGTGA 
CCCTTTACAC 
GGGGCACCTT 
AACAGCTGCG 
GAGAGTGTCC 
AAGCCAGCCT 
GGTCAGCGGG 
ACCTTGACCT 
CCAAGAGAAG 
CTGCCCACGG 
CTGGCTGGGG 
GCCCACGTGA 
TGGAGGAAGA 
GGCCACCTAG 
CCAAGTCTTG 



51



GCTCCTGGCC 
CGAACCCGAG 
TGGGTCTGGG 
GGTGACCCGC 
GGCGTTCGCC
AAAGAGGTTA 
AATGGCGCCA 
ATACAGAGTG 
GACAGTTCCT 
AAGGCCTGAT 
GGAAAGTTGG 
CGGAGGTGTT 
AGGCATCTGC 
TCTGCCACCC 
ACAACTGCAG 
GGCTCCATCT 
CCTTCCAAAT 
ACAAACCATG 
TTGTGAAGAC 
CTGGAAGCCA 
TCCGCTGAGC 
GACGGGGAGG 
TACCGCAACT 
CTTCTACAAA 
CTCAGTGACT 
GGAGCCTGTC 
GATCACACAT 
GGATGTCACT 
GGGTTTCCAG 
TTTGACAGGA 
GTCCTTCCTT 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



A90 Protein sequence:
Gene name: Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763)
Unigene number: Hs.27373
Probeset Accession #: F13036
Protein Accession #: FGENESH predicted
Signal sequence: 1-27
Transmembrane domains: 94-115, 448-469
Cellular Localization: not determined



1 
I 

MRAVPLPAPL 
AAGGSGSSSS 
KKTRKTDIIT 
VPPPFILDID 
ETKTNVMYKT 
LNLMEKLDSS 
STLGLDVFCG 
, THP VRIiARSD 
ICLPCCAVEH 
ANLQTIPDTQ 
CERLTGSHHF 



11 

I 

LPLLLLALLA 
NSSGDALVTR 
TPAERVEMAP 
LPARCSGRPD 
PAPSCVSGIC 
ALRRNTRAPS 
AGQRGTFCED 
ARGQASLTGR 
LREAKRSSVT 
GQEGPREDVT 



21 
I 

APAARASRAE 
ISZLIiRDLPT 
LNEEDDEDED 
GGIRPGKTCF 
SDCHWQARFH 
ARCLPLVLAE 
RAVTKVLQGS 
RVFRRPRQSli 
VLASFEQSPQ 
HPGGDLDGVA 
SPRQPLFLSR 



31 
I 

SVSAPWPEPE 
LKAAVIVAFA 
STVFDIKYRV 
PAWWHPVESW 
VTTMELLLPP 
MAAAESDLPN 
SFSKQLRWKP 
HGGGSAGTAT 
KAAAAHGEPV 
NFYIiEEEGFQ 
P 



41 

1 

RESRPPPGPG 
FTTLLIACLL 
SLPAALRRQL 
SAATWGVKDW 
FGHPFKVPPT 
PWWHFSATGS 
ALESGFPHHL 
CLLVLKILLR 
KRGPSGQLTR 
DGRCQKMVLM 



51 
I 

PGNTTRFGSG 
LRVFRSGKRL 
PGCQTLLTVP 
TWKPSCVGGV 
STPHGFRQLQ 
PIKTLYTQTM 
RLLRECPPLS 
RHPHLDLFYK 
HTCPGWGITH 
SEEGPPSLTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



EWING 



A91 DNA SEQUENCE 

Gene name: G protein-coupled receptor 64
Unigene number: Hs.184942
Probeset Accession #: AA435577
Nucleic Acid Accession #: NM_005756
Coding sequence: 73-3117 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
I I I I I I

AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180
TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 240
GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 300
AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360
GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420
GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480
CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540
GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 600
GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660
TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720
TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780
CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840
TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900
GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020
CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080
CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140
AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200
CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 1260
GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320
GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380
ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440
TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500
CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560
CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620
TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680
GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740
AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800
GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860
ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920
GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980
TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040
TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100
CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220
TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280
ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340
AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400
ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460
CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520
CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATCGC TGGCCTTACA 2580
TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640
TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700
TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760
CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820
GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880
TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940
TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCCGT 3060
ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGT^TTC 3120
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180
CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240
AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360
ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420
TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480
TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540
GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600
TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660
AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720
TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780
TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840
TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900
GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960
CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020
AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080
ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140
CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200
ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320
GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380
TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440
TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500
ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560
CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620






AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



A92 Protein sequence
Gene name: G protein-coupled receptor 64
Unigene number: Hs.184942
Protein Accession #: NP_005747.1
Signal sequence: 1-38
GPS domain: 564-615
Transmembrane domains: 624-646, 660-682, 688-710, 733-755, 783-805, 828-850, 858-880
Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HVVLVTSLEE DTDNSSLSPP PAKLSVVSFA 60

PSSNEVETTS LNDVTLSLLP SNETEKTKIT IVKTFNASGV KPQRNICNLS SICNDSAFFR 120

GEIMFQYDKE STVPQNQHIT NGTLTGVLSL SELKRSELNK TLQTLSETYF IMCATAEAQS 180 

TLNCTFTIKL NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEELG KLQCDLQDPI 240

VCLADHPRGP PFSSSQSIPV VPRATVLSQV PKATSFAEPP DYSPVTHNVP SPIGEIQPLS 300

PQPSAPIASS PAIDMPPQSE TISSPMPQTH VSGTPPPVKA SFSSPTVSAP ANVNTTSAPP 360 

VQTDIVNTSS ISDLENQVLQ MEKALSLGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL 420 

LKVVDDIGLQ LNFSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 480

NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVANL 540 

TVRNLTRNVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLNETICTC 600 

SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIFL SVTLVTYIAF EKIRRDYPSK 660

ILIQLCAALL LLNLVFLLDS WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 720 

VKVFNTYIRK YILKFCIVGW GVPAVVVTII LTISPDNYGL GSYGKFPNGS PDDFCWINNN 780

AVFYITVVGY FCVIFLLNVS MFIVVLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 840

ITWGFAFFAW GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKQWRRY LCCGKLRLAE 900 

NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGNASTER 960 
NGVSFSVQNG DVCLHDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF IEQM 



Fibrosis 



A93 DNA SEQUENCE
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: N67551
Nucleic Acid Accession #: NM_015507
Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons)



CCGCAGAGGA 
CGCCCCTGCC 
CQAGTGGAGC 
GGGTCCGGCC 
ATGCCTCTGC 
GGGAACGCGG 
TGTCACTATG 
TGTGAAGCTA 
AGATGCTTTC 
AAACCCCGGC 
CTCAGTGGCC 
ATAAACTGTC 
TCAGGACTCC 
GGTAAAGTCA 
AAATGTCACA 
AATGAATGTA 
GGGTCCTTCA 
ATCCCTGAAA 
AAGAAGTTGC 
CCAGAACCCA 
ATAGTTTCCA 
GAGGGGCTTG 
AGCCTGCGAG 
CTGGTCCAAA 
GACTGCAGCT 
TGGAATCCTG 
GGTCACAAGA 
AACTTCTGTT 
TTTGTGAAAA 
TGGAAGACAG 
GAAGCAGAAC 
TCAGGCTTAT 
TTGACTTTGT 
TTAGAATTAC 
TCTTGTATAA 
TTTCTGAATC 
CAGTATATCT 
TAGAAAAAAA 
TATGACATCA 
TGTATATTTA 



11 
I 

GCCTCGGCCA 
GCGGTGCCTG 
GGAGGACCCG 
GGCGCCCTCC 
CCTGGAGCCT 
CCAGTGCAAG 
GAACTAAACT 
CATGCGAACC 
CAGGATACAC 
CATGCCAACA 
ACATGCTCAT 
AGTACAGCTG 
GCCTGGCCCC 
TCTGTCCCTA 
TTGGTTTCGA 
CTATGGATAG 
AGTGTAAATG 
ATTCTGTGAA 
TTGCTCACAA 
CCAGGACTCC 
GAGGCGGGAA 
AGGATGAGAA 
GAGATGTGTT 
GGAAAGCGCT 
TCAATCATGG 
CTGATCGAGA 
AAGACATTGG 
TGCTCTTTGA 
ACAGTAACAA 
GGAAAATTCA 
GTGGCAAGGG 
GTCCAGATAG 
ATGTCAGTTC 
TAGCTGAAAA 
GATATGCCAA 
TTTCCACATT 
GATTTGTATA 
AGCACAGAGA 
AAGATAGACT 
AATTCTTTGT 



21 



31 



GGCTAGCCAG 
GCCTCCCCTC 
AGCGGCTGAG 
CGAGGGGGGC 
TGCGCTCCCG 
GCATCACGGG 
GGCCTGCTGC 
TGGATGTAAG 
CGGGAAAACC 
CAGATGTGTG 
GCCAGATGCT 
TGAAGACACA 
AAATGGAAGA 
CAATCGAAGA 
ACTGCAATAT 
CCATACGTGC 
CAAGCAGGGA 
GGAAGTCCTC 
AAACAGCATG 
TACCCCTAAG 
CTCTCATGGA 
AAGAGAAGAG 
TTTCCCTAAG 
AACTTCCAAA 
GATCTGTGAC 
TAATGCTATT 
CCGATTGAAA 
TTACCGGCTG 
TGCCCTGGCA 
GTTGTATCAA 
CAAAACCGGC 
CCTTTTATCT 
CCTGGTTTTT 
ATTGTAATGT 
TATTTGCTTT 
ATATTATAAA 
AGTAAGTTGA 
AATGTTTAAC 
TTTGCCTAAG 
AATAATAATA 



GGCGCCCCCA 
CCAGACTGCA 
GAGAGAGGAG 
TCAGGAGGAG 
CTGCTGCTCT 
TTGTTAGCAT 
TACGGCTGGA 
TTTGGTGAGT 
TGCAGTCAAG 
AATACACACG 
ACGTGTGTGA 
GAAGAAGGGC 
GACTGTCTAG 
TGTGTGAACA 
ATCAGTGGAC 
AGCCACCATG 
TATAAAGGCA 
AGAGCACCTG 
AAAAAGAAGG 
GTGAACTTGC 
GGTAAAAAAG 
AAAGCCCTGA 
GTGAATGAAG 
CTGGAACATA 
TGGAAACAGG 
GGCTTCTATA 
CTTCTCCTAC 
GCCGGAGACA 
TGGGAGAAGA 
GGAACTGATG 
GAAATCGCAG 
GTGGATGACT
TTGATATTGC 
ACCAACAGAA 
AAATATCATA 
ATATGGAAAT 
TGAGCTTCTC 
TGTTTGACTC 
TGGCTTAGCT 
TCCAAATCAT 



41 
I 

GCCCCTCCCC 
GGGACAGCAC 
GCGGCGGCTT 
GAAGGAGGAC 
CCTGGGTGGC 
CGGCACGTCA 
GAAGAAACAG 
GCGTGGGACC 
ATGTGAATGA 
GAAGCTACAA 
ACTCTAGGAC 
CACAGTGCCT 
ATATTGATGA 
CATTTGGAAG 
GATATGACTG 
CCAATTGCTT 
ATGGACTTCG 
GTACCATCAA 
CAAAAATTAA 
AGCCCTTCAA 
GGAATGAAGA 
AGAATGACAT 
CAGGTGAATT 
AAGATTTAAA 
ATAGAGAAGA 
TGGCAGTTCC 
CTGACCTGCA 
AAGTCGGGAA 
CCACGAGTGA 
CTACCAAAAG 
TGGATGGCGT 
GAATGTTACT 
ATCATAGGAC 
ATATTATTGT 
TCACTGTATC 
GTCAGTTTAT 
TCTACAACAT 
TTATGATACT 
GGGTCTTTCA 
CAAAAAAAAA 



51 
I 

AGGCCGCGAG 
CCGGTAACTG 
AGCTGCTACG 
CCGTGCGAGA 
AGGTGGTTTC 
GCCTGGGGTC 
CAAGGGAGTC 
AAACAAATGC 
GTGTGGAATG 
GTGCTTTTGC 
ATGTGCCATG 
GTGTCCATCC 
ATGTGCCTCT 
CTACTACTGC 
TATAGATATA 
CAATACCCAA 
GTGTTCTGCT 
AGACAGAATC 
AAATGTTACC 
CTATGAAGAG 
GAAAATGAAA 
AGAGGAGCGA 
CGGCCTGATT 
TATCTCGGTT 
TGATTTTGAC 
GGCCTTGGCA 
ACCCCAAAGC 
ACTTCGAGTG 
GGATGAAAAG 
CATCATTTTT 
CTTGCTTGTT 
ATCTTTATAT 
CTCTGGCATT 
AAGATGCCTT 
TTCTCAGTCA 
CTCCCCTCCT 
TTCTAGAAAA 
TCTTGGAAAC 
TAGCCAAACT 
AAAAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 






A94 Protein sequence:
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: N67551
Protein Accession #: NP_056322.2
Signal sequence: 1-21
Transmembrane domains: none found
MAM domain: 402-546
EGF domain: 80-259
Cellular Localization: secreted



MPLPWSLALP 
CEATCEPGCK 
LSGHMLMPDA 
GKVICPYNRR 
GSPKCKCKQG 
PEPTRTPTPK 
SLRGDVPFPK 
WNPADRDNAI 
FVKNSNNALA 
SGLCPDSLLS 



11 
I 

LLLSWVAGGF 
FGECVGPNKC 
TCVNSRTCAM 
CVNTFGSYYC 
YKGNGLRCSA 
VNLQPFNYEE 
VNEAGEFGLI 
GFYMAVPALA 
WEKTTSEDEK 
VDD 



21 
I 

GNAASARHHG 
RCFPGYTGKT 
INCQYSCEDT 
KCHIGFELQY 
IPEHSVKEVL 
IVSRGGNSHG 
LVQRKALTSK 
GHKKDIGRLK 
WKTGKIQLYQ 



31 
I 

LLASARQPGV 
CSQDVNECGM 
EEGPQCLCPS 
ISGRYDCIDI 
RAPGTIKDRI 
GKKGNEEKMK 
LEHKDLNISV 
LLLPDLQPQS 
GTDATKSIIP 



41 

I 

CHYGTKLACC 
KPRPCQHRCV 
SGLRLAPNGR 
NECTMDSHTC 
KKLLAHKNSM 



51 
I 

YGWRRNSKGV 
NTHGSYKCFC 
DCLDIDECAS 
SHHANCFNTQ 
KKKAKIKNVT 
KALKNDIEER 



DCSFNHGICD 
NFCLLFDYRL 



AGDKVGKLRV 
E I AVDGVLLV 



60 
120 
180 
240 
300 
360 
420 
480 
540 



GLIOBLASTOMA 
A95 DNA SEQUENCE 

Gene name: ESTs 

Unigene number: Hs.293616

Probeset Accession #: AW043782 

Nucleic Acid Accession #: none found

Coding sequence: 38-1075 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

I I I I I I
AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 60
CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120
GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 180
GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCTAA 240
GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300
CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACTGCACAGC 360
AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420
GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 480
AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540
TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600
CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660
GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720
CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780
GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840
TGCGTGGTAT GACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900
CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960
CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020
GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080
AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140
TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200
AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260
TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320
CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380
AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440
CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500
ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560
ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620
AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680
AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740
CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800
GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860
TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920
ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980
GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040
CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100
ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160
CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220
GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280
AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340
TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400
AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460
ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520
ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580
AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640
TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAAGGG GTAAAACGTT 2700
AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760
CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 2820
AACCTGTTTG ACGCTAATTA AAACAGAGCC TGCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880






TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900

TGGGGCGGCG GGGCCCAGGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 



A96 Protein sequence:
Gene name: ESTs
Unigene number: Hs. 293616
Probeset Accession #: AW043782
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: 169-191
LDLa domains: 28-66, 70-108, 112-149
Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60 
EKECPKAKSK CGPTPFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 
KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180 
VIFVLVVALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL VVLDHPHHCN VTYNVNNGIQ 240
YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV 



LUNG 



A97 DNA SEQUENCE

Gene name: putative GPCR, Weakly similar to dJ365012.1
Unigene number: Hs. 256897

Probeset Accession #: BE001836
Nucleic Acid Accession #: BE001836 

Coding sequence: 1-2955 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51
I I I I I I
ATGGCCGTCA AATCTTTTTC CTTCAACACT GGGAATGATG GGCTAGACCC AGACCCAATC 60
ATGTTAAGTA TGATAGGGAT CAGTGGCCCA GCTGGTGTGG AAGACAAGTG GGTCATACTG 120
GGTTTAGGCC CACATAGCAC TAAAGGTCTT CAACATGCAA TTAGAAGCTC AAATGTTGAC 180
GGGAATATTG TTCACTCTAA AAAAGATGTT TCTATTAGAA TTTACTTACT CTTTCATGAA 240
AACATAGATG CTTCCTCTTT CCTGATTAGT GATGGCCATC AGTTAACCCA AGTGCACTCA 300
GAGAAGTCAA ATTCTGACAC AATCCAGCAA GTAACTATAA AAACTGATGG CCCAGTCGAA 360
GAATATCAGC TGCTGCTTCA GGTGACCTAT AGAGATTCCA AGGAGAAAAG AGATTTGAGA 420
AATTTTCTGA AGCTCTTGAA GCCTCCATTA TTATGGTCAC ATGGGCTAAT TAGAATTATC 480
AGAGCAAAGG CTACCACAGA CTGCAACAGC CTGAATGGAG TCCTGCAGTG TACCTGTGAA 540
GACAGCTACA CCTGGTTTCC TCCCTCATGC CTTGATCCCC AGAACTGCTA CCTTCACACG 600
GCTGGAGCAC TCCCAAGCTG TGAATGTCAT CTCAACAACC TCAGCCAGAG TGTCAATTTC 660
TGTGAGAGAA CAAAGATTTG GGGCACTTTC AAAATTAATG AAAGGTTTAC AAATGACCTT 720
TTGAATTCAT CTTCTGCTAT ATACTCCAAA TATGCAAATG GAATTGAAAT TCAACTTAAA 780
AAAGCATATG AAAGAATTCA AGGTTTTGAG TCGGTTCAGG TCACCCAATT TCGAAATGGA 840
AGCATCGTTG CTGGGTATGA AGTTGTTGGC TCCAGCAGTG CATCTGAACT GCTGTCAGCC 900
ATTGAACATG TTGCCGAGAA GGCTAAGACA GCCCTTCACA AGCTGTTTCC ATTAGAAGAC 960
GGCTCTTTCA GAGTGTTCGG AAAAGCCCAG TGTAATGACA TTGTCTTTGG ATTTGGGTCC 1020
AAGGATGATG AATATACCCT GCCCTGCAGC AGTGGCTACA GGGGAAACAT CACAGCCAAG 1080
TGTGAGTCCT CTGGGTGGCA GGTCATCAGG GAGACTTGTG TGCTCTCTCT GCTTGAAGAA 1140
CTGAACAAGA ATTTCAGTAT GATTGTAGGC AATGCCACTG AGGCAGCTGT GTCATCCTTC 1200
GTGCAAAATC TTTCTGTCAT CATTCGGCAA AACCCATCAA CCACAGTGGG GAATCTGGCT 1260
TCGGTGGTGT CGATTCTGAG CAATATTTCA TCTCTGTCAC TGGCCAGCCA TTTCAGGGTG 1320
TCCAATTCAA CAATGGAGGA TGTCATCAGT ATAGCTGACA ATATCCTTAA TTCAGCCTCA 1380
GTAACCAACT GGACAGTCTT ACTGCGGGAA GAAAAGTATG CCAGCTCACG GTTACTAGAG 1440
ACATTAGAAA ACATCAGCAC TCTGGTGCCT CCGACAGCTC TTCCTCTGAA TTTTTCTCGG 1500
AAATTCATTG ACTGGAAAGG GATTCCAGTG AACAAAAGCC AACTCAAAAG GGGTTACAGC 1560
TATCAGATTA AAATGTGTCC CCAAAATACA TCTATTCCCA TCAGAGGCCG TGTGTTAATT 1620
GGGTCAGACC AATTCCAGAG ATCCCTTCCA GAAACTATTA TCAGCATGGC CTCGTTGACT 1680
CTGGGGAACA TTCTACCCGT TTCCAAAAAT GGAAATGCTC AGGTCAATGG ACCTGTGATA 1740






TCCACGGTTA TTCAAAACTA TTCCATAAAT GAAGTTTTCC TATTTTTTTC CAAGATAGAG 1800 

TCAAACCTGA GCCAGCCTCA TTGTGTGTTT TGGGATTTCA GTCATTTGCA GTGGAACGAT 1860 

GCAGGCTGCC ACCTAGTGAA TGAAACTCAA GACATCGTGA CGTGCCAATG TACTCACTTG 1920 

ACCTCCTTCT CCATATTGAT GTCACCTTTT GTCCCCTCTA CAATCTTCCC CGTTGTAAAA 1980 

TGGATCACCT ATGTGGGACT GGGTATCTCC ATTGGAAGTC TCATTTTATG CCTGATCATC 2040 

GAGGCTTTGT TTTGGAAGCA GATTAAAAAA AGCCAAACCT CTCACACACG TCGTATTTGC 2100 

ATGGTGAACA TAGCCCTGTC CCTCTTGATT GCTGATGTCT GGTTTATTGT TGGTGCCACA 2160 

GTGGACACCA CGGTGAACCC TTCTGGAGTC TGCACAGYTG CTGTGTTCTT TACACACTTC 2220 

TTCTACCTCT CTTTGTTCTT CTGGATGCTC ATGCTTGGCA TCCTGCTGGC TTACCGGATC 2280

ATCCTCGTGT TCCATCACAT GGCCCAGCAT TTGATGATGG CTGTTGGATT TTGCCTGGGT 2340 

TATGGGTGCC CTCTCATTAT ATCTGTCATT ACCATTGCTG TCACGCAACC TAGCAATACC 2400 

TACAAAAGGA AAGATGTGTG TTGGCTTAAC TGGTCCAATG GAAGCAAACC ACTCCTGGCT 2460 

TTTGTTGTCC CTGCACTGGC TATTGTGGCT GTGAACTTCG TTGTGGTGCT GCTAGTTCTC 2520 

ACAAAGCTCT GGAGGCCGAC TGTTGGGGAA AGACTGAGTC GGGATGACAA GGCCACCATC 2580 

ATCCGCGTGG GGAAGAGCCT CCTCATTCTG ACCCCTCTGC TAGGGCTCAC CTGGGGCTTT 2640 

GGAATAGGAA CAATAGTGGA CAGCCAGAAT CTGGCTTGGC ATGTTATTTT TGCTTTACTC 2700 

AATGCATTCC AGGGATTTTT TATCTTATGC TTTGGAATAC TCTTGGACAG TAAGCTGCGA 2760 

CAACTTCTGT TCAACAAGTT GTCTGCCTTA AGTTCTTGGA AGCAAACAGA AAAGCAAAAC 2820 

TCATCAGATT TATCTGCCAA ACCCAAATTC TCAAAGCCTT TCAACCCACT GCAAAACAAA 2880 

GGCCATTATG CATTTTCTCA TACTGGAGAT TCCTCCGACA ACATCATGCT AACTCAGTTT 2940 
GTCTCAAATG AATAA 



A98 PROTEIN SEQUENCE
Gene name: putative GPCR, Weakly similar to dJ365012.1
Unigene number: Hs. 256897
Protein Accession #: none found
Signal sequence: 1-17
Pfam domain: 7tm_2 [561-820]
Transmembrane domains: 545-567, 571-593, 605-627, 641-663, 677-699, 721-743, 767-789
Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I 

MHALLLCFSV LNGASGLSLL QSPVEEYQLL LQVTYRDSKE KRDLRNFLKL LKPPLLWSHG 60 

LIRIIRAKAT TDCNSLNGVL QCTCEDSYTW FPPSCLDPQN CYLHTAGALP SCECHLNNLS 120 

QSVNFCERTK IWGTFKINER FTNDLLNSSS AIYSKYANGI EIQLKKAYER IQGFESVQVT 180 

QFRNGSIVAG YEVVGSSSAS ELLSAIEHVA EKAKTALHKL FPLEDGSFRV FGKAQCNDIV 240

FGFGSKDDEY TLPCSSGYRG NITAKCESSG WQVIRETCVL SLLEELNKNF SMIVGNATEA 300

AVSSFVQNLS VIIRQNPSTT VGNLASVVSI LSNISSLSLA SHFRVSNSTM EDVISIADNI 360

LNSASVTNWT VLLREEKYAS SRLLETLENI STLVPPTALP LNFSRKFIDW KGIPVNKSQL 420

KRGYSYQIKM CPQNTSIPIR GRVLIGSDQF QRSLPETIIS MASLTLGNIL PVSKNGNAQV 480

NGPVISTVIQ NYSINEVFLF FSKIESNLSQ PHCVFWDFSH LQWNDAGCHL VNETQDIVTC 540

QCTHLTSFSI LMSPFVPSTI FPVVKWITYV GLGISIGSLI LCLIIEALFW KQIKKSQTSH 600

TRRICMVNIA LSLLIADVWF IVGATVDTTV NPSGVCTAAV FPTKFFYLSL FFHMLMLGIL 660

LAYRIILVFH HMAQHLMMAV GFCLGYGCPL IISVITIAVT QPSNTYKRKD VCWLNWSNGS 720

KPLLAFVVPA LAIVAVNFVV VLLVLTKLWR PTVGERLSRD DKATIIRVGK SLLILTPLLG 780
LTWGFGIGTI VDSQNIAWHV IFALLNAFQV RTVTITYCIV K 



A99 DNA SEQUENCE 

Gene name: putative G-protein coupled receptor 
Unigene number: Hs. 16085

Probeset Accession #: F07953
Nucleic Acid Accession #: NM_016334

Coding sequence: 104-1471 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51

I I I I I I 

AGCACCTGGG AAAAGGCAGA CCGTGTGAGG GGGCCTGTGG CCCCAGCGTG CTGTGGCCTC 60 

GGGGAGTGGG AAGTGGAGGC AGGAGCCTTC CTTACACTTC GCCATGAGTT TCCTGATCGA 120 

CTCCAGCATC ATGATTACCT CCCAAATACT ATTTTTTGGA TTTGGGTGGC TTTTCTTCAT 180 

GCGCCAATTG TTTAAAGACT ATGAGATACG TCAGTATGTT GTACAGGTGA TCTTCTCCGT 240

GACGTTTGCA TTTTCTTGCA CCATGTTTGA GCTCATCATC TTTGAAATCT TAGGAGTATT 300

GAATAGCAGC TCCCGTTATT TTCACTGGAA AATGAACCTG TGCGTAATTC TGCTGATCCT 360

GGTTTTCATG GTGCCTTTTT ACATTGGCTA TTTTATTGTG AGCAATATCC GACTACTGCA 420 

TAAACAACGA CTGCTTTTTT CCTGTCTCTT ATGGCTGACC TTTATGTATT TCTTCTGGAA 480 

ACTAGGAGAT CCCTTTCCCA TTCTCAGCCC AAAACATGGG ATCTTATCCA TAGAACAGCT 540 

CATCAGCCGG GTTGGTGTGA TTGGAGTGAC TCTCATGGCT CTTCTTTCTG GATTTGGTGC 600 

TGTCAACTGC CCATACACTT ACATGTCTTA CTTCCTCAGG AATGTGACTG ACACAGATAT 660 

TCTAGCCCTG GAACGGCGAC TGCTGCAAAC CATGGATATG ATCATAAGCA AAAAGAAAAG 720 

GATGGCAATG GCACGGAGAA CAATGTTCCA GAAGGGGGAA GTGCATAACA AACCATCAGG 780 

TTTCTGGGGA ATGATAAAAA GTGTTACCAC TTCAGCATCA GGAAGTGAAA ATCTTACTCT 840 

TATTCAACAG GAAGTGGATG CTTTGGAAGA ATTAAGCAGG CAGCTTTTTC TGGAAACAGC 900 

TGATCTATAT GCTACCAAGG AGAGAATAGA ATACTCCAAA ACCTTCAAGG GGAAATATTT 960 

TAATTTTCTT GGTTACTTTT TCTCTATTTA CTGTGTTTGG AAAATTTTCA TGGCTACCAT 1020 

CAATATTGTT TTTGATCGAG TTGGGAAAAC GGATCCTGTC ACAAGAGGCA TTGAGATCAC 1080 

TGTGAATTAT CTGGGAATCC AATTTGATGT GAAGTTTTGG TCCCAACACA TTTCCTTCAT 1140 

TCTTGTTGGA ATAATCATCG TCACATCCAT CAGAGGATTG CTGATCACTC TTACCAAGTT 1200

CTTTTATGCC ATCTCTAGCA GTAAGTCCTC CAATGTCATT GTCCTGCTAT TAGCACAGAT 1260 

AATGGGCATG TACTTTGTCT CCTCTGTGCT GCTGATCCGA ATGAGTATGC CTTTAGAATA 1320 

CCGCACCATA ATCACTGAAG TCCTTGGAGA ACTGCAGTTC AACTTCTATC ACCGTTGGTT 1380

TGATGTGATC TTCCTGGTCA GCGCTCTCTC TAGCATACTC TTCCTCTATT TGGCTCACAA 1440 






ACAGGCACCA GAGAAGCAAA TGGCACCTTG AACTTAAGCC TACTACAGAC TGTTAGAGGC 1500
CAGTGGTTTC AAAATTTAGA TATAAGAGGG GGGAAAAATG GAACCAGGGC CTGACATTTT 1560
ATAAACAAAC AAAATGCTAT GGTAGCATTT TTCACCTTCA TAGCATACTC CTTCCCCCTC 1620
AGGTGATACT ATGACCATGA GTAGCATCAG CCAGAACATG AGAGGGAGAA CTAACTCAAG 1680
ACAATACTCA GCAGAGAGCA TCCCGTGTGG ATATGAGGCT GGTGTAGAGG CGGAGAGGAG 1740
CCAAGAAACT AAAGGTGAAA AATACACTGG AACTCTGGGG CAAGACATGT CTATGGTAGC 1800
TGAGCCAAAC ACGTAGGATT TCCGTTTTAA GGTTCACATG GAAAAGGTTA TAGCTTTGCC 1860
TTGAGATTGA CTCATTAAAA TCAGAGACTG T



A100 Protein sequence
Gene name: putative G-protein coupled receptor
Unigene number: Hs. 16085
Protein Accession #: NP_057418.1
Signal sequence: none found
Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 342-364, 380-402, 424-446
Cellular Localization: plasma membrane



MSFLIDSSIM 
EILGVLNSSS 
MYFFWKLGDP 
VTDTDILALE 
SENLTLIQQE 
IFMATINIVF 
ITLTKPPYAI 
FYHRWFDVIF 



11 

I 

ITSQILFFGF 
RYFHWKMNLC 
FPILSPKHGI 
RRLLQTMDMI 
VDALEELSRQ 
DRVGKTDPVT 



LVSALSSILF 



21 
I 

GWLFFMRQLF 
VILLILVFMV 
LSIEQLISRV
ISKKKRMAMA 
LFLETADLYA 
RGIEITVNYL 
LLLAQIMGMY 
LYLAHKQAPE 



31 
I 

KDYEIRQYVV
PFYIGYFIVS
GVIGVTLMAL 
RRTMFQKGEV 
TKERIEYSKT 
GIQFDVKFWS 
FVSSVLLIRM 
KQMAP 



41 
I 

QVIFSVTFAF 
NIRLLHKQRL 
LSGFGAVNCP
HNKPSGFWGM 
FKGKYFNFLG 
QHISFILVGI 
SMPLEYRTII



51 
I 

SCTMFELIIF 60 

LFSCLLWLTF 120 

YTYMSYFLRN 180 

IKSVTTSASG 240 

YFPSIYCVWK 300 

IIVTSIRGLL 360 

TEVLGELQFN 420 



A101 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H. sapiens]
Unigene number: Hs.19322
Probeset Accession #: AA088458
Nucleic Acid Accession #: AA088458

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons)



1 
I 

GCCCTTGGAC 
CTGAAGAAAA 
GCGCGGGGCC 
CTGGGCCAGA 
CGGCTACTGC 
TGTGCCAGCC 
ACCTCACCCC 
CTCACCCAGG 
GCGCTCATTA 
GATTCCACCT 
AGCCCTTCGA 
GCCCAGGCAC 
GCCTGCCCCC 
ACATGGGCTG 
TGGACAGTGG 
GGTCCCATCT 
AGAGGGCGCG 
CAGGACGAGG 
GTAAGCGGGG 
CTGGCCAAGG 
GGCCTGCATG 
TTGCCCCACG 
GACAGCTCCC 
CTGGGGTCCT 
GAGAGGCCAC 
GGCAGGTCCC 
GGAGTACGCA 
GAACCAGGGG 
TCAGTGTGTG 
CCCGATGCGG 
ACACTGTCCC 
CCTTCCGGAG 
TGCTGCACCT 
GCCCTCCTAC 
ACCTCCTGGG 
AGGTGGACTG 

TGGGGGATCC 
GGTGACTTCA 
GAGACAGGCT 
AAAGAAATAG 
CACGAGGGGA 
GCAGACCCTG 
GAGCAGCGTC 
GCGTGCACAC 
CAGAAGTGTC 
TTTTGTGTTG 



11 21 

I I 
ACTGACATGG ACTGAAGGAG 
AGGAGCTGGA GCAGGAGAAG 
GCGACTGGTA CCAGCAGCAG 
GCAGAGCCAG CGCCGACTTT 
CCAAGGTACA AGAGGTGGCC 
GGGCCCTGCC CCCGTCCTCC 
CGGTCTGGCA GCAGCAGACC 
AGGTGACCGA GAAGAGTGAG 
AGCAGCTGTT TGAGGCCCGC 
TCATCTAGTC CTTGTGGGCC 
GGGTGGGCGC CCCATCGCAC 
AGTCCCGGAG TGGGCGCCTT 
GGCTGGTCCC CGCACCGAGC 
GGGGCTCTCT TGAGTCCGCA 
GGTACCCCTC CATGAGTTAG 
TCAGGGAAAG GCACTGCCCA 
GGGCGGCTCC GACGCGGGTC 
TGGCTGTAGC TCGGACGGAC 
GGTGCCTGCC TGGCTGGGGA 
CTGAGGGACC CTGGCTGCAG 
TGCCTCCCAC AGACCCTGGG 
TTGAGTCCCA CACAACATCC 
AGGCACGTCA TAGGCAAAGC 
GCTCACCCCC CTTTGCTCTC 
CTCCCTCAGC CAAGGAAAAC 
CTTGGGTGTC ACTCCCTCAG 
CTGGTGGGGG GGCCCTGCTC 
CACGGCAACA GCATCGATGG 
TGGGGCGCAG GGCCTCCGAT 
GGTCAGTGCG TGGGGGGCGC 
ACAAGGCACC TGTCTCAGAG 
CCCAGCTCCA TGCTAACCTG 
GGTCTGCAGG GGTGTCCCAG 
CCTGAAGATG GGAGTGGGCT 
CAGGAAAGGG TGCAGGTCCT 
CAGCGCAGTG GGTGGGCCAG 
TGCCCACCAG GGCCTCCCCA 
TGGCATCTTT ACTGGACTGG 
TCAGGAGACC GCCCACATAG 
GGCACCTCCG GAAAAACTGC 
GTCCTCCCAG TTTACAGCTT 
GAATTTAAAG GCCCCGGCTG 
CCTGGAGCCT GCCCTAGGAC 
CCTGGGCTCT ATCCGCGAGG 
TGTGATGACA CCCGGAAATG 
CCCAGTTGAG AATCTGCCCC 
ATCAAGTTCC AAGGAAAAGG 



31 

I 

TAGAATGGAG 
GAGGTGCTGC 
CTGCAACGAG 
GGGGCTGCAG 
CGGTGCCTGG 
TCCGGGCCCC 
ATCCTCATGC 
CGCATCACGC 
GCCCTGAGCC 
GCGTGGGCCC 
CCACCCTCTC 
CCTGCCGCCC 
GCTTGACTCC 
TAGTCCGCAG 
CGTCCCCCCG 
CGCCAGGCTG 
CAAGGGCAGC 
GGAAGTAGAT 
GCCCCAGGGA 
CGGATCGGCA 
GTGATGGCCT 
TGTGAGCCTG 
CTGTTTCCCC 
ACGCCCAGCC 
GAGAACCCCC 
CCCCTGCCCA 
AGCCCAACCT 
GTTCTGCAGC 
GCGGGGTCAG 
AGGGCCCCCT 
GAGGGGCCCT 
CCCACAGCAA 
GACAGGCCCA 
TTCCAGGGGA 
GAGGGCCTGT 
TGGCAGCCAG 
CGTCTGCCTT 
AAGCAGGAGA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAGGC 
GCAGGGTCTA 
GCTGGGCGGG 
TGCCAGTAGC 
TCTCAGGATG 
AGAGGAACAC 
AACATCTCAG 



41 
I 

CACGAGGACA 
TGCAGGGTTT 
TGCAGGAGCG 
GGAGCCCCCG 
GGGAGCTGCT 
CCTGCCCTGC 
TGAAGGAGCA 
AGCTGGAGCA 
AGCAGGACGG 
CCAGGGCCAG 
TGGCTGGAGA 
TTGCCAGATG 
GTTTKGGCTC 
CTACTACTGG 
TTTCCAGCGG 
CACTTCCAAC 
TTCCCGCTCA 
GGAGGGGGTG 
TAGCGGTCGG 
CGCCGGGTGG 
TCCCCCTCTT 
GCTCCCCAGG 
CGACTCAGGA 
TGTCCCCAGG 
AGGGTACAGG 
GGCCCACTCC 
GGAGGGTCCC 
CCAGGGCCCC 
TGCGTGGGGG 
CGTGTCCAGG 
GGCAGGCAGC 
CCCCACAGAG 
AGTCAGCCCA 
CATAAGGATG 
GCCCCACAGC 
GGAGAAGCCC 
TGAGGGTGCC 
CAGAACAGTG 
CGCAGCTGAA 
TGGTGTTCCG 
TAGTGAGTGG 
GGTGGCTGGC 
TCAGTCTCCG
GTGTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
CCGGGCGTGG 



51 
I 

CTGACATGGA 
GGAGATGATG 
CCAGCGCCGC 
CCCACTGGGG 
GGCTGCAGCC 
CCTGACGTCC 
GAACCGACTC 
GGAGAAGTCG 
GGGACCTCTG 
CCTGGCACTC 
CCCCCGGCAG 
GGCTCCCCAG 
CTGGTTGYTG 
CCGCTGTCAG 
TGCCGCCCTG 
AACGGGCAGC 
ACCAGGGCAC 
GGGACGGCCT 
ACTTCAGGTT 
GCGAGAGCTT 
GGCCGGGACG 
AGGGCCCCCA 
TTTCCAAGGC 
TTTCAGCTGG 
AGGAGGCTGG 
CGCTGGTGCT 
AGTGTCACCA 
CGATGCGGGG 
GCGCAGGGCC 
GCACTTTGGT 
GTGGCAACTC 
CCACATTCCC 
GCATGCAGCT 
TCAGGCCTGG 
CCCAGCACCC 
CCCGTCAGCA 
TGCCATGCCC 
TCTGTCCCGG 
GCGGAAATGT 
TGCAAGGTGA 
CCCTGGAGAC 
AGAGGCACAT 
TGCAGGATGT 
ACATACACGT 
TCCTTGGGGG 
GGCCTCAGGA 
TGGTTCACGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 






CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 



A102 DNA SEQUENCE
Gene name: TMPRSS3a
Unigene number: Hs. 298241
Probeset Accession #: AI538613
Nucleic Acid Accession #: AB038157
Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I I I I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560 

ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620 

TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800

AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGCGATTCTC 1860 

TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920 

TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040 

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 



A103 Protein sequence:
Gene name: TMPRSS3a
Unigene number: Hs. 298241
Probeset Accession #: AI538613
Protein Accession #: BAB20077
Signal sequence: none found
Transmembrane domains: 43-65, 239-261
Tryp_SPc domain: 216-444
Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I

MGENDPPAVE APPSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFPP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVP 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA 360 

AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 420 
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT 






A104 DNA SEQUENCE

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175)
Unigene number: Hs. 208229

Probeset Accession #: AI819198
Nucleic Acid Accession #: NM_032551.1

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51

I I I I I I 

ATGCACACCG TGGCTACGTC CGGACCCAAC GCGTCCTGGG GGGCACCGGC CAACGCCTCC 60 

GGCTGCCCGG GCTGTGGCGC CAACGCCTCG GACGGCCCAG TCCCTTCGCC GCGGGCCGTG 120 

GACGCCTGGC TCGTGCCGCT CTTCTTCGCG GCGCTGATGC TGCTGGGCCT GGTGGGGAAC 180

TCGCTGGTCA TCTACGTCAT CTGCCGCCAC AAGCCGATGC GGACCGTGAC CAACTTCTAC 240 

ATCGCCAACC TGGCGGCCAC GGACGTGACC TTCCTCCTGT GCTGCGTCCC CTTCACGGCC 300 

CTGCTGTACC CGCTGCCCGG CTGGGTGCTG GGCGACTTCA TGTGCAAGTT CGTCAACTAC 360 

ATCCAGCAGG TCTCGGTGCA GGCCACGTGT GCCACTCTGA CCGCCATGAG TGTGGACCGC 420 

TGGTACGTGA CGGTGTTCCC GTTGCGCGCC CTGCACCGCC GCACGCCCCG CCTGGCGCTG 480

GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCGCCGGT GCTCGCCCTG 540

CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG CCTTCCCCAG CCGCGCCCTG 600 

GAGCGCGCCT TCGCACTGTA CAACCTGCTG GCGCTGTACC TGCTGCCGCT GCTCGCCACC 660 

TGCGCCTGCT ATGCGGCCAT GCTGCGCCAC CTGGGCCGGG TCGCCGTGCG CCCCGCGCCC 720 

GCCGATAGCG CCCTGCAGGG GCAGGTGCTG GCAGAGCGCG CAGGCGCCGT GCGGGCCAAG 780 

GTCTCGCGGC TGGTGGCGGC CGTGGTCCTG CTCTTCGCCG CCTGCTGGGG CCCCATCCAG 840 

CTGTTCCTGG TGCTGCAGGC GCTGGGCCCC GCGGGCTCCT GGCACCCACG CAGCTACGCC 900 

GCCTACGCGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 960 

CTGCTCTACG CCTTCCTGGG CTCGCACTTC CGACAGGCCT TCCGCCGCGT CTGCCCCTGC 1020 

GCGCCGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 1080 

GCGGAGCTGC ACCGCCTGGG GTCCCACCCG GCCCCCGCCA GGGCGCAGAA GCCAGGGAGC 1140 
AGTGGGCTGG CCGCGCGCGG GCTGTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA
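
The "Coding sequence" ranges in these records (for example 1-1197 above) appear to be 1-based, inclusive nucleotide positions into the printed sequence. The sketch below is a minimal illustration of that reading, not part of the listing: the names `extract_cds` and `looks_like_cds` and the toy sequence are hypothetical, and the check only tests for an ATG start, a stop codon at the end, and a length divisible by three.

```python
# Illustrative sketch only: function names and the toy sequence are assumptions,
# not taken from the sequence listing.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Return the region start..end, read as 1-based inclusive coordinates."""
    # Remove the spaces and line breaks that separate the printed 10-base groups.
    bases = "".join(sequence.split()).upper()
    if not (1 <= start <= end <= len(bases)):
        raise ValueError("coordinates fall outside the sequence")
    return bases[start - 1:end]


def looks_like_cds(cds: str) -> bool:
    """True if the region starts with ATG, ends with a stop codon, and is in frame."""
    return len(cds) % 3 == 0 and cds.startswith("ATG") and cds[-3:] in STOP_CODONS


# Toy 20-base sequence written in the listing's grouped layout.
toy = "GGCCATGGCT TGTTGACCAA"
cds = extract_cds(toy, 5, 16)
print(cds, looks_like_cds(cds))
```

Applied to a record, the same call would take the full printed sequence and the two numbers from its "Coding sequence" line; OCR damage in the sequence text would need to be repaired first.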

A105 Protein sequence
Gene name: Homo sapiens G protein-coupled receptor (HOT7T175)
Unigene number: Hs. 208229
Protein Accession #: AI819198
Signal sequence: none found
Pfam domains: 7tm_1 [59-323]
Transmembrane domains: 43-65, 86-108, 122-144, 159-181, 203-225, 260-282
Cellular Localization: plasma membrane



1 11 21 31 41 51
I I I I I I
MHTVATSGPN ASWGAPANAS GCPGCGANAS DGPVPSPRAV DAWLVPLFFA ALMLLGLVGN 60
SLVIYVICRH KPMRTVTNFY IANLAATDVT FLLCCVPFTA LLYPLPGWVL GDFMCKFVNY 120
IQQVSVQATC ATLTAMSVDR WYVTVFPLRA LHRRTPRLAL AVSLSIWVGS AAVSAPVLAL 180
HRLSPGPRAY CSEAFPSRAL ERAFALYNLL ALYLLPLLAT CACYAAMLRH LGRVAVRPAP 240
ADSALQGQVL AERAGAVRAK VSRLVAAVVL LFAACWGPIQ LFLVLQALGP AGSWHPRSYA 300
AYALKTWAHC MSYSNSALNP LLYAFLGSHF RQAFRRVCPC APRRPRRPRR PGPSDPAAPH 360
AELHRLGSHP APARAQKPGS SGLAARGLCV LGEDNAPL

A106 DNA SEQUENCE
Gene name: integrin, beta 8
Unigene number: Hs. 52620
Probeset Accession #: AA479726
Nucleic Acid Accession #: NM_002214
Coding sequence: 680-2990 (underlined sequences correspond to start and stop codons)



11 



CCCAGAGCCG 
CTGCCGACTT
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 
CACTTGTTCT 
CCTGTGCCAG 
TTTCAGGTGG 
GCTCAGTTGA 
TTAATACCCA 
ATTTTATGCT 
ATGTCTCAGC 
CTAGAAAAAT 
AAACAGTTTC 
ACAATTTAGA 
TCACTGAGTT 



GTCTTTGCCC 
CTGCCCACCT 

TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 
TGGACTGGGC 
GTGCCTTGCG 
ATCAAGAAGT 
TTCAATAGAA 
GGTGACACCA 
GAAAGTTCAT 
ATCAATGCAC 
GGCATTTTTC 
ACCATACATT 
CTGCATGCCT 
TGAGAAAGCA 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATGTGCGGCT
CGAGGTCCCG 
CAAGGTGAAG 
CTGGGTCCAG 
GAACGTTGTG 
TACCCATCTG 
GGAGAAGTGT 
CCTCTGAAGA 
AATAATATAG 
TCCCGTGACT 
AGCATCCACC 
CCCCATGGAT 
GTTCATAGAC 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 
ACAATAGATG 
AATGTGGATG 
ATATTGTTTC 
TGCATGTTAT 
CTATCCAGCT 
AATATCCTGT 
AAAAATTAAA 
TTCGTCTTGG 
CCGAAAGGAT 
ACATCCATGT 
AGAAGATCTC 



41 
I 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC
GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 
TGCATCTTCA 
GTGTGTTCAA 
CAATTTAATA 
AATACCCACT 
GCGTCCAGGA 
GGATCTTTAT 
TTCCGTTGGA 
ATTTGGCTCA 
TCATAATCAA 
GCTGTCTTTG 
TGGAAACATA 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 
CTCGCCCGCG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 
AATGCAGCAT 
GAGGATTTCA 
AGCAAAGGCT 
GAAAATGAAA 
GCCGAAGCTA 
TATCTTGTTG 
AACGATTTAT 
TACGTTGATA 
TGCAGTGACT 
ACAGAGAACA 
GATACACCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 






AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 

AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 
ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACXGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



A107 Protein sequence:
Gene name: Integrin, beta 8
Unigene number: Hs. 52620
Probeset Accession #: AA479726
Protein Accession #: NP_002205.1
Signal sequence: 1-39
Transmembrane domains: 682-704
EGF domain: 552-584
INB domain: 54-469
Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 

LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVI IPT ENEINTQVTP 120 

GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 

SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TEN1TEPEKA 240 

VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300 

VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 360 

TIAGEIESKA ANLNNLWEA YQKLISEVKV QVEKQVQGIY FNITAICPDG SRKPGMEGCR 420 

NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480 

CVDETFLDSK CFQCDEKKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKZKLGK 540 

VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 

CSGRGTCVCG RCECTDPRSI GRFCEECPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 

CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

A108 DNA sequence

Gene name: ESTs

Unigene number: Hs.128899

Probeset Accession #: AA983251

Nucleic Acid Accession #: AA983251

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51

|          |          |          |          |          |

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 



882 



WO 03/042661 



PCT/US02/36810 



CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 

CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 

GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 

CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 

CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 

TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 

TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 

CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A109 Protein sequence
Gene name: ESTs
Unigene number: Hs.128899
Protein Accession #: none found
Signal sequence: 1-11
Transmembrane domains: 402-424
Cellular Localization: not determined

1          11         21         31         41         51

|          |          |          |          |          |

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DHERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 

SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 
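
Every full row in these listings holds 60 residues and ends with a running position counter, so a row whose letter count differs from 60 has probably lost or gained a character in transcription. The following is a minimal sketch of that check, not part of the original listing; the function name `row_count_mismatches` and the sample rows are illustrative assumptions.

```python
import re

def row_count_mismatches(rows, width=60):
    """Return (stated_end, letters_found) for numbered rows whose
    letter count differs from the expected row width."""
    mismatches = []
    for row in rows:
        # The trailing integer is the running position counter.
        match = re.search(r"(\d+)\s*$", row)
        if match is None:
            continue  # final, unnumbered row: nothing to compare against
        body = row[:match.start()]
        letters = sum(ch.isalpha() for ch in body)
        if letters != width:
            mismatches.append((int(match.group(1)), letters))
    return mismatches

# Hypothetical two-row listing with a row width of 20; the second row
# is one letter short, as happens when a character is lost.
sample_rows = [
    "AAAAAAAAAA CCCCCCCCCC 20",
    "GGGGGGGGG TTTTTTTTTT 40",
]
flagged = row_count_mismatches(sample_rows, width=20)
print(flagged)
```

Because only letters are counted, a digit that has crept into a block is also reported as a short row.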



A110 DNA SEQUENCE:
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Nucleic Acid Accession #: AA487468
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51

|          |          |          |          |          |

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 
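
The "Coding sequence" field gives 1-based, inclusive coordinates into the listed DNA. The sketch below shows one way to strip the block spacing and position counters from listing rows and slice out a coordinate range; it is an illustration under stated assumptions, not part of the original listing, and the helper names `clean_listing`, `coding_region` and `check_cds` are hypothetical. Any stray non-letter character inside a block is dropped by the cleaning step and would shift later coordinates, so the result is only as good as the transcription.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}

def clean_listing(text: str) -> str:
    """Collapse listing rows into one uppercase string of letters.

    Spaces separate 10-base blocks and trailing digits are running
    position counters, so everything that is not a letter is dropped.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()

def coding_region(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] in the listing's 1-based, inclusive coordinates."""
    return seq[start - 1:end]

def check_cds(cds: str) -> dict:
    """Basic sanity checks for a candidate coding sequence."""
    return {
        "starts_with_ATG": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        "length_multiple_of_3": len(cds) % 3 == 0,
    }

# First row of the A110 listing (positions 1-60); the coding sequence
# is stated to begin at position 55.
row = "CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60"
seq = clean_listing(row)
start_codon = coding_region(seq, 55, 57)
print(len(seq), start_codon)  # prints: 60 ATG
```

Passing the whole cleaned sequence and the full stated range to `coding_region`, then the result to `check_cds`, gives a quick consistency check on an entry.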



A111 Protein sequence:
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Protein Accession #: none found
Signal sequence: 1-23
Transmembrane domains: none found
Cellular Localization: secreted

1          11         21         31         41         51
|          |          |          |          |          |
MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL



A112 DNA SEQUENCE

Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs.63325
Probeset Accession #: AA411502
Nucleic Acid Accession #: NM_016425
Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 



A113 Protein sequence:
Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs.63325
Probeset Accession #: AA411502
Protein Accession #: NP_057509
Signal sequence: none found
Transmembrane domains: 31-53
LDLa domain: 54-94
Tryp_SPc domain: 204-429
Cellular Localization: plasma membrane/ER

1          11         21         31         41         51
|          |          |          |          |          |
MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180
GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420
KVSAYLNWIY NVWKAEL
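
Each protein entry is the conceptual translation of the coding sequence given in the preceding DNA entry, so a DNA row can be checked against its protein row by translating it codon by codon. The sketch below assumes the standard genetic code and is an illustration only, not part of the original listing; the `translate` helper is a hypothetical name, and unreadable codons are reported as `X` rather than guessed.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing anything other than A, C, G or T map to 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the A112 coding sequence (positions 1-30).
first_block = translate("ATGTTACAGGATCCTGACAGTGATCAACCT")
print(first_block)  # prints: MLQDPDSDQP
```

The result matches the first ten residues of the A113 protein row, which is the kind of agreement one would expect along the whole coding sequence.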



A114 DNA SEQUENCE:
Gene name: TTK protein kinase
Unigene number: Hs.169840
Probeset Accession #: M86699
Nucleic Acid Accession #: NM_003318
Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51

|          |          |          |          |          |

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCGGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 

GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAAATG ATTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 






TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 
TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTGTGGCTA AAATGA 






A115 Protein sequence:
Gene name: TTK protein kinase
Unigene number: Hs.169840
Probeset Accession #: M86699
Protein Accession #: NP_003309
Signal sequence: none found
Transmembrane domains: none found
Protein Kinase Domain: 510-775
Cellular Localization: cytoplasmic and nuclear

1          11         21         31         41         51
|          |          |          |          |          |
MNKVRDIKNK FKNEDLTDEL SLNKISADTT DNSGTVNQIM MMANNPEDWL SLLLKLEKNS 60
VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DARDYFQMAR 120
ANCKKFAFVH ISFAQFELSQ GNVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS 180
EEEKKNLSAS TVLTAQESFS GSLGHLQNRN NSCDSRGQTT KARFLYGENM PPQDAEIGYR 240
NSLRQTNKTK QSCPFGRVPV NLLNSPDCDV KTDDSVVPCF MKRQTSRSEC RDLVVPGSKP 300
SGNDSCELRN LKSVQNSHFK EPLVSDEKSS ELIITDSITL KNKTESSLLA KLEETKEYQE 360
PEVPESNQKQ WQAKRKSECI NQNPAASSNH WQIPELARKV NTEQKHTTFE QPVFSVSKQS 420
PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPVVKNDFP PACQLSTPYG QPACFQQQQH 480
QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 540
LEEADNQTLD SYRNEIAYLN KLQQHSDKII RLYDYEITDQ YIYMVMECGN IDLNSWLKKK 600
KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IANQMQPDTT 660
SVVKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ 720
IINQISKLHA IIDPNHEIEF PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH 780
PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL KAAKTLYEHY SGGESHNSSS SKTFEKKRGK 840
K






OVARIAN 



A116 DNA SEQUENCE
Gene name: G protein-coupled receptor 39
Unigene number: Hs.85339
Probeset Accession #: AA349893
Nucleic Acid Accession #: NM_001508
Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |
ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120
TTCGTGATGG GCCTTCTGGG GAACAGCGCC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180
AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240
TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360
GCTACGCTGC TGCACGTGCT GACACTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420
TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480
GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540
GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600
CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660
CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720
ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780
ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840
ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900
ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960
GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020
CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080
TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140
ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200
TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260
TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA



A117 Protein sequence:
Gene name: G protein-coupled receptor 39
Unigene number: Hs.85339
Probeset Accession #: AA349893
Protein Accession #: NM_001508, NP_001409
Signal sequence: none found
Pfam domains: 7tm_1 (72-172, 224-344)
Transmembrane domains: 32-54, 68-90, 111-133, 151-173, 221-243, 280-301, 320-342
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSA TIRVTQVLQK 60
KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF 240
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ 300
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360
CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV



A118 DNA sequence
Gene name: bone morphogenetic protein receptor IB (ALK-6)
Unigene number: Hs.87223
Probeset Accession #: AA250737
Nucleic Acid Accession #: NM_001203
Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |
CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCG CCACCCCCCG TCCAAAGGTC 360
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480
GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720
TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT



A119 Protein sequence
Gene name: bone morphogenetic protein receptor IB (ALK-6)
Unigene number: Hs.72472 / Hs.87223
Probeset Accession #: AA250737 / U89326
Protein Accession #: NP_001194
Signal sequence: 1-13
Transmembrane domains: 128-144
PFAM domains: activin_receptor (30-111), protein kinase (204-491)
Cellular Localization: plasma membrane

1          11         21         31         41         51

|          |          |          |          |          |

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 

DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180 

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

A120 DNA SEQUENCE

Gene name: LIV-1 protein, estrogen regulated
Unigene number: Hs.79136
Probeset Accession #: U41060
Nucleic Acid Accession #: NM_012319.2
Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |
CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120
GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420
ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780
CCAAAGATGT AAGCAGCTGC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700
TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420
GAIGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G
3420 



A121 PROTEIN SEQUENCE

Gene name: LIV-1 protein, estrogen regulated
Unigene number: Hs.79136
Protein Accession #: NP_036451
Signal sequence: 1-21
Pfam domain: Zip (591-743)
Transmembrane domains: 330-346, 352-368, 427-444, 663-679, 688-703, 730-745
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF



A122 DNA SEQUENCE
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: N67551
Nucleic Acid Accession #: NM_015507
Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |
CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720
TCAGGACTCC NNNNNNNNNN AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA



A123 Protein sequence:
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: N67551
Protein Accession #: NP_056322.2
Signal sequence: 1-21
Transmembrane domains: none found
MAM domain: 402-546
80-259
Cellular Localization: secreted


1 11 21 31 41 51 

I I I I I I 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 

CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 

LSGHMLMPDA TCVNSRTCAM IKCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 

GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 

PEPTRTPTPK VNLQPFNYKE IVSRGGNSHG GKKGNBEKMK EGLEDEKREE KALKNDIEER 360 

SLRGDVPPPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRIi AGDKVGKLRV 480 

PVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIP EAERGRGKTG E1AVDGVLLV 540 
SGLCPDSLLS VDD 
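
The coding-sequence range and the protein length of a record pair can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `expected_protein_length` is introduced here as an assumption and is not part of the listing. It assumes the range is 1-based and inclusive, and that it includes the stop codon, as the "start and stop codons" note suggests.

```python
def expected_protein_length(cds_start: int, cds_end: int) -> int:
    """Residues encoded by a 1-based, inclusive CDS range that ends with a stop codon."""
    n_nt = cds_end - cds_start + 1          # nucleotides in the coding range
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    return n_nt // 3 - 1                    # drop the stop codon


# Coding sequence 241-1902 (record A122) against the protein listing (record A123)
print(expected_protein_length(241, 1902))
```

For the range 241-1902 this gives 553 residues, which agrees with the protein listing: nine full rows of 60 residues plus a final row of 13.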

A124 DNA SEQUENCE

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)

Unigene number: Hs.170195

Probeset Accession #: BE616633
Nucleic Acid Accession #: NM_001719




WO 03/042661 



PCT/US02/36810 



Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |
GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60
CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120
CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180
CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240
GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300
CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360
CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420
GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480
GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540
TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600
TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660
ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720
AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780
AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840
GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900
AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960
TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020
GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080
AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140
GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200
AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260
AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320
AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380
ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440
TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500
CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560
AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620
TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680
GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740
TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800
GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860
CAATAAAACG AATGAATG

A125 Protein sequence
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Protein Accession #: NP_001710.1
Signal sequence: 1-30
Pfam domains: TGFb_propeptide [37-281]
Transmembrane domains: none found
Cellular Localization: secreted

1          11         21         31         41         51
|          |          |          |          |          |
MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60
ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120
LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 180
IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWVVNPR 240
HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 300
QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360
GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 420
RNMVVRACGC H



A126 DNA SEQUENCE
Gene name: integrin, beta 8
Unigene number: Hs.52620
Probeset Accession #: AA479726
Nucleic Acid Accession #: NM_002214
Coding sequence: 680-2990 (underlined sequences correspond to start and stop codons)



CCCAGAGCCG 
CTGCCGACTT 
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAQGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 



11 
I 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 



41 
I 

GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC 
GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 



51 
1 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 
CTCGCCCGCG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 






CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 
CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 
TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 
GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 
TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080 
ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 
ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200 
CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 
AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 
ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 
TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 
AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 
AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 
GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 
ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 
ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 
ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 
ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 
CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 
TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 
GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 
ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 
GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 
ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 
ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 
CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 
GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 
TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 
GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 
GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 
TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 
CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 
TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 
ATAAAATTAA GTCCTGATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 
TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 
TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 
TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 
AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 
GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 
AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 
ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 
TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 
CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 
TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 
ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 
AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 
TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 
TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 
TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



A127 Protein sequence
Gene name: Integrin, beta 8
Unigene number: Hs.52620
Probeset Accession #: AA479726
Protein Accession #: NP_002205.1
Signal sequence: 1-39
Transmembrane domains: 682-704
EGF domain: 552-584
INB domain: 54-469
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60
LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 120
GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180
SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 240
VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300
VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 360
TIAGEIESKA ANLNNLVVEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420
NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480
CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540
VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600
CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660
CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSSS 720
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF



A128 DNA SEQUENCE

Gene name: G protein-coupled receptor 64






Unigene number: Hs.184942

Probeset Accession #: AA435577

Nucleic Acid Accession #: NM_005756

Coding sequence: 73-3117 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |

AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60 
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180 
TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 240 
GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 300 
AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 
GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420 
GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480 
CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540 
GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 600 
GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 
TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720 
TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780 
CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840 
TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900 
GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960 
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020 
CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080 
CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 
AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 
CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 1260 
GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320 
GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380 
ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 
TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 
CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 
CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 
TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680 
GTTGCAAACC TCAGCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740 
AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 
GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860 
ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 
GTGCTGCCTG CTCAAATGAT GGCTCTGAGG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 
TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 
TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 
CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 
TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 
ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 
AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 
ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 
CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 
CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATCGC TGGCCTTACA 2580 
TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 
TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 
TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 
CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820 
GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 
TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940 
TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000 
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCCGT 3060 
ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 
CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 
AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 
ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 
TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 
TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 
GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 
TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660 
AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 
TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780 
TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 
TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 
GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 
CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 
AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 
ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 
CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 
ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 
GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 
TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 
TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 
ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 
CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



A129 Protein sequence
Gene name: G protein-coupled receptor 64
Unigene number: Hs.184942
Protein Accession #: NP_005747.1
Signal sequence: 1-38
GPS domain:
Transmembrane domains: 624-646, 660-682, 688-710, 733-755, 783-805, 828-850, 858-880
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HVVLVTSLEE DTDNSSLSPP PAKLSVVSFA 60
PSSNEVETTS LNDVTLSLLP SNETEKTKIT IVKTFNASGV KPQRNICNLS SICNDSAFFR 120
GEIMFQYDKE STVPQNQHIT NGTLTGVLSL SELKRSELNK TLQTLSETYF IMCATAEAQS 180
TLNCTFTIKL NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEELG KLQCDLQDPI 240
VCLADHPRGP PFSSSQSIPV VPRATVLSQV PKATSFAEPP DYSPVTHNVP SPIGEIQPLS 300
PQPSAPIASS PAIDMPPQSE TISSPMPQTH VSGTPPPVKA SFSSPTVSAP ANVNTTSAPP 360
VQTDIVNTSS ISDLENQVLQ MEKALSLGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL 420
LKVVDDIGLQ LNFSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 480
NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVANL 540
TVRNLTRNVT VTLKHINPSQ EELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLNETICTC 600
SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIFL SVTLVTYIAF EKIRRDYPSK 660
ILIQLCAALL LLNLVFLLDS WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 720
VKVFNTYIRK YILKFCIVGW GVPAVVVTII LTISPDNYGL GSYGKFPNGS PDDFCWINNN 780
AVFYITVVGY FCVIFLLNVS MFIVVLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 840
ITWGFAFFAW GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKQWRRY LCCGKLRLAE 900
NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGNASTER 960
NGVSFSVQNG DVCLHDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF IEQM
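
A nucleotide listing and its protein listing can be checked against each other by stripping the layout (ten-base groups, position rulers, row counters) and translating the stated coding range. The sketch below is a minimal illustration under stated assumptions: `parse_listing` and `translate` are hypothetical helper names introduced here, the standard genetic code is assumed, and the sample text is the first two rows of the G protein-coupled receptor 64 listing (A128, coding sequence starting at position 73) with stray OCR punctuation removed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed here)
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def parse_listing(text: str) -> str:
    """Join the base groups of a listing, dropping rulers and row counters."""
    groups = []
    for line in text.splitlines():
        for token in line.split():
            if set(token) <= set("ACGT"):   # keep only pure-nucleotide tokens
                groups.append(token)
    return "".join(groups)


def translate(seq: str, cds_start: int, cds_end: int) -> str:
    """Translate a 1-based, inclusive range; '*' marks a stop codon."""
    cds = seq[cds_start - 1:cds_end]
    usable = len(cds) - len(cds) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


ROWS = """
1          11         21         31         41         51
|          |          |          |          |          |
AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120
"""

seq = parse_listing(ROWS)
print(translate(seq, 73, 120))
```

Translating positions 73-120 of these two rows yields MVFSVRQCGHVGRTEE, which matches the first sixteen residues of the protein listing above.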





A130 DNA SEQUENCE 

Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 
Unigene number: Hs.285529 

Probeset Accession #: AA460530 
Nucleic Acid Accession #: NM_003667 

Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons) 



1          11         21         31         41         51
|          |          |          |          |          |
GTGGCGGCAA CCGGCACCTC AGTCCCCGCC GCGCTTCTCC TCGCCGCCCA CGCCGTGGGG 60
TCAGGAACGC GGCGTCTGGC GCTGCAGACG CCCGCTGAGT TGCAGAAGCC CACGGAGCGG 120
CGCCCGGCGC GCCACGGCCC GTAGCAGTCC GGTGCTGCTC TCCGCCCGCG TCCGGCTCGT 180
GGCCCCCTAC TTCGGGCACC ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG 240
TGCTGCTGCA GCTGGCGACC GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT 300
GCCCCACACA CTGTCATTGC GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG 360
ACCTGGGGCT CTCGGAGCTG CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA 420
GTATGAACAA CATCAGTCAG CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG 480
AGTTACGTCT TGCGGGAAAC GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT 540
ACAGTCTTAA AGTTCTTATG CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC 600
TGCAGAATTT GCGAAGCCTT CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC 660
CCCCAAGCTG TTTCAGTGGC CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT 720
TAACAGAAAT CCCCGTCCAG GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG 780
CCCTGAACAA AATACACCAC ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG 840
TTCTACATCT CCATAACAAT AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC 900
ACAGCCTAGA GACTTTAGAT TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA 960
GGACACTCTC CAACCTTAAA GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG 1020
AGAAAGCATT TGTAGGCAAC CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC 1080
AATTTGTTGG GAGATCTGCT TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG 1140
GTGCCTCACA AATAACTGAA TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA 1200
CTTTAACTGG AGCACAGATC TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC 1260
TCCAAGTGCT AGATCTGTCT TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC 1320
AAAAGCTTCA GAAAATTGAC CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT 1380
TCCAGCAGTT GCTTAGCCTC CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC 1440
ACCCCAATGC ATTTTCCACT TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC 1500
TGTCGTCTTT TCCTATAACT GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC 1560
ATGCCTTACA GAGCTTGATA TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC 1620
CTTATGCTTA CCAGTGCTGT GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC 1680
AATGGAATAA AGGTGACAAC AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT 1740
TTCAGGCTCA AGATGAACGT GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA 1800
AAGCCCTTCA TTCAGTGCAG TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC 1860
TGCTTGATGG CTGGCTGATC AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT 1920
GTAATGCTTT GGTGACTTCA ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC 1980
TGTTAATTGG GGTCATCGCA GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG 2040
CTGGTGTGGA TGCGTTCACT TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG 2100
GGGTTGGTTG CCATGTCATT GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC 2160
TGCTTACTCT GGCAGCCCTG GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA 2220
CGAAAGCTCC ATTTTCTAGC CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA 2280
CCATGGCCGC AGTTCCCCTG CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC 2340
CTTTGCCTTT TGGGGAGCCC AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT 2400
CCCTTTGCTT CCTCATGATG ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG 2460
GAGACCTGGA GAATATTTGG GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA 2520
CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580
CATTTATCAG TCCTGAAGTA ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640
GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700
TGAGAAAGCA AACCTACGTC TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT 2760
CTGATGATGT CGAAAAACAG TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820
CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG 2880
AGAGCTGCCA TCTTTCCTCT GTGGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940
AATGTTTTCA AAGGTTGAGA ACCTGAAAAT GTGAGATTGA GTATATCAGA GCAGTAATTA 3000
ATAAGAAGAG CTGAGGTGAA ACTCGGTTTA AA



A131 Protein sequence
Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5)
Unigene number: Hs.285529
Protein Accession #: NP_003658.1
Signal sequence: 1-22
Transmembrane domains: 557-579, 596-618, 641-663, 687-709, 724-746, 770-792, 803-825
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60
PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120
LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180
AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLVVLHLHNN RIHSLGKKCF DGLHSLETLD 240
LNYNNLDEFP TAIRTLSNLK ELGFHSNNIR SIPEKAFVGN PSLITIHFYD NPIQFVGRSA 300
FQHLPELRTL TLNGASQITE FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS 360
YNLLEDLPSF SVCQKLQKID LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 420
LPSLIKLDLS SNLLSSFPIT GLHGLTHLKL TGNHALQSLI SSENFPELKV IEMPYAYQCC 480
AFGVCENAYK ISNQWNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 540
CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTCNALVTS TVFRSPLYIS PIKLLIGVIA 600
AVNMLTGVSS AVLAGVDAFT FGSFARHGAW WENGVGCHVI GFLSIFASES SVFLLTLAAL 660
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 720
STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC 780
PVAFLSFSSL INLTFISPEV IKFILLVVVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 840
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS 900
VAFVPCL



A132 DNA SEQUENCE

Gene name: parathyroid hormone receptor 2
Unigene number: Hs.159499

Probeset Accession #: U25128
Nucleic Acid Accession #: NM_005048

Coding sequence: 143-1795 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCTC 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120 

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240 

TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 600 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720 

CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840 

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960 

CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020 

ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 1080 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200 

CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500 

CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680 

CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740 

GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 1860 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980 






GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2040
GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100
GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160
ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220
ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280
GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340
TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400
ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460
TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520
TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580
TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640
T



A133 PROTEIN SEQUENCE
Gene name: parathyroid hormone receptor 2
Unigene number: Hs.159499
Protein Accession #: NP_005039.1
Signal sequence: 1-25
Pfam domain: 7tm_2 [141-420]
Transmembrane domains: 177-197, 228-250, 253-275, 280-302, 320-342, 363-385, 398-419
Cellular Localization: plasma membrane

1          11         21         31         41         51
|          |          |          |          |          |
MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60
GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120
NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 180
MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240
VMFIYFLATN YYWILVEGLY LHNLIFVAFF SDTKYLWGFI LIGWGFPAAF VAAWAVARAT 300
LADARCWELS AGDIKWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 360
LAKSTLVLVL VFGVHYIVFV CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 420
QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 480
IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 540
GCQGETEDVL



A134 DNA sequence
Gene name: ESTs
Unigene number: Hs.128899
Probeset Accession #: AA983251
Nucleic Acid Accession #: AA983251
Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons)

1          11         21         31         41         51
|          |          |          |          |          |

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 

CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 

CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 

GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 

CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 

CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 

TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 

TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 

CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CTCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A135 Protein sequence 
Gene name: ESTs 
Unigene number: Hs.128899 
Protein Accession #: none found 
Signal sequence: 1-11 
Transmembrane domains: 402-424 
Cellular Localization: not determined 



1 11 21 31 41 51 

| 1 I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 

SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 



A136 DNA SEQUENCE 

Gene name: selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs. 89546 

Probeset Accession #: M24736 

Nucleic Acid Accession #: NM_000450 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I I I I I I 

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 60 
GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 120 
CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 180 
TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT CAACAATGTG 240 
TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 300 
GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 360 
GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 420 
GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 480 
TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 
ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 600 
AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 660 
ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 
GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 
AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTGA ACTAATGGGA 840 
GCCCAGAGCC TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900 
GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960 
CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGGCTTCATG 1020 
TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 
GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 
CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 
GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260 
GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 
GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 
TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 
TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 
AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 
CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 
TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 
CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 
TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



A137 Protein sequence: 
Gene name: Selectin E (endothelial adhesion molecule 1) 
Unigene number: Hs.89546 
Probeset Accession #: M24736 
Protein Accession #: NP_000441 
Signal sequence: 1-22 
Transmembrane domains: 555-573 
C-lectin domain: 23-139 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 

DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180 

TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNVV 240 

ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480 

WTEEVPSCQV VKCSSLAVPG KIKMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540 

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 

A138 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Unigene number: Hs.100686 

Probeset Accession #: AA487468 

Nucleic Acid Accession #: AA487468 

Coding sequence: 55-555 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I I i I I 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 
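
The coding-sequence coordinates in these entries are 1-based positions in the printed nucleotide rows, so a listed start position can be checked by translating from it. The sketch below is a minimal illustration, not part of the original listing: it assumes the standard genetic code, and the names `CODON_TABLE`, `clean`, `translate_from` and `A138_HEAD` are hypothetical helpers introduced here. It uses only the first two rows of the A138 sequence above and the listed start position 55.

```python
from itertools import product

# Standard genetic code with bases ordered T, C, A, G ("*" marks a stop codon).
# Assumption: the listing uses the standard code; nothing in the source says otherwise.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Keep only nucleotide letters, dropping layout spaces, row numbers and newlines."""
    return "".join(ch for ch in seq.upper() if ch in "ACGT")


def translate_from(seq: str, start: int) -> str:
    """Translate from a 1-based start position until a stop codon or the end of seq."""
    nt = clean(seq)
    residues = []
    for i in range(start - 1, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Positions 1-120 of the A138 listing (hypothetical variable name).
A138_HEAD = """
CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60
CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120
"""

print(translate_from(A138_HEAD, 55))  # MMLHSALGLCLLLVTVSSNLAI
```

The printed residues agree with the opening of the A139 protein sequence that follows, which is a quick way to confirm that a start coordinate and a reading frame belong together.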



A139 Protein sequence: 
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 
Unigene number: Hs.100686 
Probeset Accession #: AA487468 
Protein Accession #: none found 
Signal sequence: 1-23 
Transmembrane domains: none found 
Cellular Localization: secreted 

1 11 21 31 41 51 
I I I I I I 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 



A140 DNA SEQUENCE 
Gene name: TMPRSS3a 
Unigene number: Hs.298241 
Probeset Accession #: AI538613 
Nucleic Acid Accession #: AB038157 
Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGrG 120 
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 
TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGAOGCC 1260 

TCCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560 

ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCGA 1620 

TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 

AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGCGATTCTC 1860 

TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920 

TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040 

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AGGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

ATGACTOGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 



A141 Protein sequence: 

Gene name: TMPRSS3a 

Unigene number: Hs. 298241 

Probeset Accession #: AI538613 

Protein Accession #: BAB20077 

Signal sequence: none found 

Transmembrane domains: 43-65, 239-261 

Tryp_SPc domain: 216-444 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA GDASPVLNHA 360 

AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLWKLVGATS 420 
FGIGCAEVNK PGVYTRVTSF LDWIHEQMER DLKT 



A142 DNA SEQUENCE 

Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens] 

Unigene number: Hs.105484 

Probeset Accession #: AA314779 
Nucleic Acid Accession #: none found 

Coding sequence: 103-579 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAAGCGCT AGTAAGGTCT 60 

CTGAGATCCT TGCACTAGCT ACATCCTCAG GGTAGGAGGA AGATGGCTTC CAGAAGCATG 120 

CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAG TCCTGGGTGA TATCATCATG 180 

AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA ATTGCTATGG TTACTTCAGG 240 

AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTCAGT CTTACGGAAA CGGAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAGTGGCTAT 360 

CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 420 

TGGATTGATG GGGCCATGTA TCTGTACAGA TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 480 

AAGCACTGTG CTGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540 

AACAAGCGCC AACACTTCCT GTGCAAGTAC CGACCATAGA GCAAGAATCA AGATTCTGCT 600 

AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 



GGCTTAGAGA CAGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 

GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 

AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 

ACACCCTTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020 

TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A143 Protein sequence: 
Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens] 
Unigene number: Hs.105484 
Probeset Accession #: AA314779 
Protein Accession #: none found 
Signal sequence: 1-22 
Transmembrane domains: none found 
C-type lectin domain: 47-156 
Cellular Localization: secreted 



1 11 21 31 41 51 

I 1 I I 1 1 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 60 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 120 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 
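
Each DNA entry's coding-sequence range and the matching protein listing can be cross-checked by length: a range from start to end spans (end - start + 1) / 3 codons, one of which is the stop codon. The sketch below is an illustrative check under that assumption (the range includes the stop codon, as the "start and stop codons" wording suggests); `expected_residues`, `count_residues` and `A143_PROTEIN` are hypothetical names, not part of the original listing.

```python
def expected_residues(cds_start: int, cds_end: int) -> int:
    """Residues encoded by a 1-based inclusive range that includes the stop codon."""
    span = cds_end - cds_start + 1
    if span < 3 or span % 3 != 0:
        raise ValueError(f"range {cds_start}-{cds_end} is not a whole number of codons")
    return span // 3 - 1  # subtract the stop codon


def count_residues(listing: str) -> int:
    """Count amino-acid letters, ignoring layout spaces, row numbers and newlines."""
    return sum(ch.isalpha() for ch in listing)


# Protein rows of entry A143 as printed above (hypothetical variable name).
A143_PROTEIN = """
MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 60
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 120
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP
"""

# Entry A142 lists the coding sequence as 103-579.
print(expected_residues(103, 579), count_residues(A143_PROTEIN))  # 158 158
```

A mismatch between the two numbers usually points at a damaged coordinate or a dropped residue block rather than at the biology, so this check is most useful for spotting transcription errors.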



A144 DNA SEQUENCE 
Gene name: Cadherin 3, P-cadherin (placental) 
Unigene number: Hs.2877 
Probeset Accession #: X63629 
Nucleic Acid Accession #: X63629 
Coding sequence: 54-2543 (start and stop codons are underlined) 



11 



GCGGAACACC 
TCCCTCGTGG 
CCTCCGAGCC 
CGGAGCAGGA 
CAGCTCTGTT 
AAAGAAGGTC 
GAAGACACAA 
CCTTCCCCCA 
ACAGCATCAC 
AGACAGGCTG 
TCTTTGGCCA 
TCATCGTGAC 
GTGTCTTAGA 
ATGATGCCAT 
AGGACCCACA 
CCAGTGGCCT 
TGGATGGGGA 
ACAATGCTCC 
GCCATGAGGT 
GTGCCACCTA 
CTGAGAGCAA 
AGCACACCCT 
CCACAGCCAC 
CCTCCAAAGT 
CTGCAGAAGA 
CAGGGTGGCT 
GTGAGGATGA 
ATGGAAGCCC 
ACCATGGCCC 
ACGTGCTGAA 
TCACAGATGA 
TCTTGTCCCT 
ACCATGGCAA 
ATGTCGAAAC 
TCCTGGCTCT 
TCAAGGAGCC 
AAGAGGGGGG 
AGGCCAGGCC 
TGTACCGTCC 
AGGCGGCTAA 
AGGGCAGCGG 
ACCAAGATTA 
ACGGTGGCGG 
CACAGAGCAT 
GGAAGTGGCC 
TCCTTAGCCT 
CACCTGGGCC 
TGCTCAACCC 
CTGGAATGGA 



GGCCCGCCGT 
ACCTCTCGCG 
GTGCCGGGCG 
GCCCGGCCAG 
TAGCACTGAT 
ACTGAAGGAA 
GAGAGATTGG 
GAGACTGAAT 
GGGGCCGGGG 
GTTGTTGTTG 
CGCTGTGTCA 
CGACCAGAAT 
GGGAGTCCTA 
CTACACCTAC 
CGACCTCATG 
GGACCGGGAA 
CGGCTCCACC 
CATGTTTGAC 
GCAGAGGCTG 
CCTTATCATG 
CCAGGGCATC 
GTACGTTGAA 
CATAGTGGTC 
CGTTGAGGTC 
CCCTGACAAG 
AGCCATGGAC 
GCAGTTTGTG 
TCCCACCACT 
AGTCCCTGAG 
CATCACGGAC 
CTCAGACATC 
GAAGAAGTTC 
CAAAGAGCAG 
CTGCCCTGGA 
GCTGTTCCTC 
CCTCCTACTC 
TGGCGAAGAG 
GGAGGTGGTT 
TAGGCCAGCC 
CACAGACCCC 
CTCCGACGCC 
CGATTATCTG 
GGAGGACGAC 
CTCCAAGGGG 
GTAGCAACTT 
TTCAGGATGG 
AGGGTTGCCT 
TGTGTCCTGG 
ACCTTCTTAG 



21 
I 

CGCGGCAGCT 
TCTCTCCTCC 
GTCTTCAGGG 
GCGCTGGGGA 
AATGATGACT 
AGGAATCCAT 
GTGGTTGCTC 
CAGCTCAAGT 
GCAGACAGCC 
AATAAGCCAC 
GAGAATGGTG 
GACCACAAGC 
CCAGGTACTT 
AATGGGGTGG 
TTCACAATTC 
AAAGTCCCTG 
ACCACGGCAG 
CCCCAGAAGT 
ACGGTCACTG 
GGCGGTGACG 
CTGACAACCA 
GTGACCAACG 
CACGTGGAGG 
CAGGAGGGCA 
GAGAATCAAA 
CCAGACAGTG 
AGGAACAACA 
GGCACGGGAA 
CCCCGTCAGA 
AAGGACCTGT 
TACTGGACGG 
CTGAAGCAGG 
CTGACGGTGA 
CCCTGGAAAG 
CTGCTGGTGC 
CCAGAAGATG 
GACCAGGACT 
CTCCGCAATG 
AACCCAGATG 
ACAGCCCCGC 
GCGTCCCTGA 
AACGAGTGGG 
TAGGCGGCCT 
TCTCAGTTCC 
GGCGGAGACA 
AGGAATGTGG 
CAGAGGCCAA 
GCCTGGGCCT 
GCCTCCTGGT 



31 
I 

GCTTCACCCC 
TTCTCCAGGT 
AGGCTGAAGT 
AAGTATTCAT 
TCACTGTGCG 
TGAAGATCTT 
CAATATCTGT 
CTAATAAAGA 
CCCCTGAGGG 
TGGACCGGGA 
CCTCAGTGGA 
CCAAGTTTAC 
CTGTGATGCA 
TTGCTTACTC 
ACCGGAGCAC 
AGTACACACT 
TGGCAGTAGT 
ACGAGGCCCA 
ATCTGGACGC 
ACGGGGACCA 
GGAAGGGTTT 
AGGCCCCTTT 
ATGTGAATGA 
TCCCCACTGG 
AGATCAGCTA 
GGCAGGTCAC 
TCTATGAAGT 
CCCTTCTGCT 
TCACCATCTG 
CTCCCCACAC 
CAGAGGTCAA 
ATACATATGA 
TCAGGGCCAC 
GAGGTTTCAT 
TGCTTTTGTT 
ACACCCGTGA 
ATGACATCAC 
ACGTGGCACC 
AAATCGGCAA 
CCTACGACAC 
GCTCCCTCAC 
GCAGCCGCTT 
GCCTGCAGGG 
CCCTTCAGCT 
GGCTATGAGT 
GCAGTTTGAC 
GTTTCCAGAA 
GCTGTGACTG 
GCAACTTAAT 



41 



51 



TCTCTCTGCA 
TTGCTGGCTG 
GACCTTGGAG 
GGGCTGCCCT 
GAATGGCGAG 
CCCATCCAAA 
CCCTGAAAAT 
TAGAGACACC 
TGTCTTCGCT 
GGAGATTGCC 
GGACCCCATG 
CCAGGACACC 
GGTGACAGCC 
CATCCATAGC 
AGGCACCATC 
GACCATCCAG 
GGAGATCCTT 
TGTGCCTGAG 
CCCCAACTCA 
TTTTACCATC 
GGATTTTGAG 
TGTGCTGAAG 
GGCACCTGTG 
GGAGCCTGTG 
CCGCATCCTG 
AGCTGTGGGC 
CATGGTCTTG 
AACACTGATT 
CAACCAAAGC 
CTCCCCTTTC 
CGAGGAAGGT 
CGTGCACCTT 
TGTGTGCGAC 




GCCATGGGGC 
CAGTGCGCGG 
GCGGGAGGCG 
GGGCAAGAGC 
ACAGTCCAGG 
CGTATCTTAC 
GGCAAGGGTC 
AAGATTTTCT 
GTAGAGAAGG 
AAGTATGAGC 
AACATCTCCA 
TTCCGAGGGA 
ACAGATGAGG 
CAAGAACCAA 
AGCGTCATCT 
GCCACAGACA 
GATGCCAATG 
AATGCAGTGG 
CCAGCGTGGC 
ACCACCCACC 
GCCAAAAACC 
CTCCCAACCT 
TTTGTCCCAC 
TGTGTCTACA 
AGAGACCCAG 
ACCCTCGACC 
GCCATGGACA 
GATGTCAACG 
CCTGTGCGCC 
CAGGCCCAGC 
GACACAGTGG 
TCTCTGTCTG 
TGCCATGGCC 
CTGGGGGCTG 
AAGCGGAAGA 
TACTATGGCG 
CGAGGTCTGG 
CCGACACCCA 
GAGAACCTGA 
TTCGACTATG 
TCCGACCAAG 
GCAGACATGT 
AACGTCAGGC 
GAGCTTGTCA 
AGTGGTTGCT 
AAAACCTCTC 
TGCCGTAAAA 
GACTTTCTCT 
AATGCTATCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A145 Protein sequence: 
Gene name: Cadherin 3, P-cadherin (placental) 
Unigene number: Hs.2877 
Probeset Accession #: X63629 
Protein Accession #: CAA45177 
Signal sequence: 1-24 
Transmembrane domain: 659-675 
Cellular localization: plasma membrane 



1 11 21 31 41 51 

I I I 1 I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV PREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



A146 DNA SEQUENCE: 
Gene name: TTK protein kinase 
Unigene number: Hs.169840 
Probeset Accession #: M86699 
Nucleic Acid Accession #: NM_003318 
Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I 1 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCOGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATGATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATGTGA 900 

ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 



GCAGCAATAC CTTGGATGAT TACATGAGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAGTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAAATG ATTTGCAGTT ATTCGTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 

TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780 

TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTGTGGCTA AAATGA 



A147 Protein sequence: 
Gene name: TTK protein kinase 
Unigene number: Hs.169840 
Probeset Accession #: M86699 
Protein Accession #: NP_003309 
Signal sequence: none found 
Transmembrane domains: none found 
Protein Kinase Domain: 510-775 
Cellular Localization: cytoplasmic and nuclear 

1 11 21 31 41 51 
I I I I I I 

MNKVRDIKNK FKNEDLTDEL SLNKISADTT DNSGTVNQIM MMANNPEDWL SLLLKLEKNS 60 
VPLSDALLNK LIGRYSQAIE ALPPDKYGQN ESFARIQVRF AELKAIQEPD DARDYFQMAR 120 
ANCKKFAFVH ISFAQFELSQ GNVKKSKQLL QKAVERGAVP LEMLEIALRN LNLQKKQLLS 180 
EEEKKNLSAS TVLTAQESFS GSLGHLQNRN NSCDSRGQTT KARFLYGENM PPQDAEIGYR 240 
NSLRQTNKTK QSCPFGRVPV NLLNSPDCDV KTDDSVVPCF MKRQTSRSEC RDLVVPGSKP 300 
SGNDSCELRN LKSVQNSHFK EPLVSDEKSS ELIITDSITL KNKTESSLLA KLEETKEYQE 360 
PEVPESNQKQ WQAKRKSECI NQNPAASSNH WQIPELARKV NTEQKHTTFE QPVFSVSKQS 420 
PPISTSKWFD PKSICKTPSS NTLDDYMSCF RTPVVKNDFP PACQLSTPYG QPACFQQQQH 480 
QILATPLQNL QVLASSSANE CISVKGRIYS ILKQIGSGGS SKVFQVLNEK KQIYAIKYVN 540 
LEEADNQTLD SYRNEIAYLN KLQQHSDKII RLYDYEITDQ YIYMVMECGN IDLNSWLKKK 600 
KSIDPWERKS YWKNMLEAVH TIHQHGIVHS DLKPANFLIV DGMLKLIDFG IANQMQPDTT 660 
SVVKDSQVGT VNYMPPEAIK DMSSSRENGK SKSKISPKSD VWSLGCILYY MTYGKTPFQQ 720 
IINQISKLHA IIDPNHEIEF PDIPEKDLQD VLKCCLKRDP KQRISIPELL AHPYVQIQTH 780 
PVNQMAKGTT EEMKYVLGQL VGLNSPNSIL KAAKTLYEHY SGGESHNSSS SKTFEKKRGK 840 
K 
840 



A148 DNA SEQUENCE 
Gene name: endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 
Unigene number: Hs.258583 
Probeset Accession #: NM_012152 
Nucleic Acid Accession #: NM_012152 
Coding sequence: 43-1104 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 



GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



A149 Protein sequence: 
Gene name: endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 
Unigene number: Hs.258583 
Probeset Accession #: NM_012152 
Protein Accession #: NP_036284 
Signal sequence: none found 
Transmembrane domains: 31-53, 66-88, 150-172, 190-211, 239-261, 277-295 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I i I I 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60 
FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 120 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSLA 180 
PIYSRSYLVF WTVSNLMAFL IMVVVYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 240 
VMTVLGAFVV CWTPGLVVLL LDGLNCRQCG VQHVKRWFLL LALLNSVVNP IIYSYKDEDM 300 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 



Prostate 

A150 DNA SEQUENCE 

Gene name: ESTs 

Unigene number: Hs.293616 

Probeset Accession #: AW043782 

Nucleic Acid Accession #: none found 

Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 



1 11 21 31 41 51 

1 I I I I I 

AGCAACGACG CCGGGCAGCG GGAGOGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 60 

CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 180 

GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCTAA 240 

GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACTGCACAGC 360 

AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATGAGG AAAGCTGTGA 480 

AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660 

GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG GTCCTGGACC ACCCCCACCA 720 

CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780 

GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840 

TGOGTGGTAT GACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

CGACCTGCCC CCCTACCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080 

AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAGA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800 

GAGCCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860 

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640 

TTCTCTGTGT CCAGTCAGCC ACAGGGCCCG CCTCCCTGCA GGAATAAGGG GTAAAACGTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 

CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 2820






AACCTGTTTG ACGCTAATTA AAACAGAGCC TGCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880 

TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TGTTACAGAA 2940 

AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA AGGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 



A151 Protein sequence:
Gene name: ESTs
Unigene number: Hs.293616
Probeset Accession #: AW043782
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: 169-191
LDLa domains: 28-66, 70-108, 112-149
Cellular Localization: plasma membrane



1 11 21 31 41 51

I I I I I I 

MWLLGPLCLL LSSAAESQLL PGNNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60
EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120
KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180
VIFVLVVALL ALVLHHQRKR NNLMTLPVHR LQHPVLLSRL VVLDHPHHCN VTYNVNNGIQ 240
YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV 
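
The coding-sequence coordinates quoted for A150 (38-1075) are 1-based and inclusive, so the start of the A151 protein can be reproduced from the A150 nucleotide listing. The following is a minimal Python sketch under stated assumptions, not part of the original listing: the helper names (clean_sequence, extract_cds, translate) are hypothetical, the standard genetic code is assumed, and the character that OCR rendered as "O" at position 25 is assumed to be a C. Only the first 120 nucleotides of A150 are used, so the fragment ends at position 118 rather than 1075.

```python
import re

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(listing):
    """Drop the spaces, line breaks and running counts used in the listing."""
    return re.sub(r"[\s\d]", "", listing).upper()


def extract_cds(sequence, start, end):
    """Return positions start..end, 1-based and inclusive as in the listing."""
    return sequence[start - 1:end]


def translate(cds):
    """Translate complete codons; stop at the first stop codon.

    Codons containing ambiguity codes (N, R, W, ...) become 'X'.
    """
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two rows of A150 (position 25 read as C; this is an assumption).
LISTING = """
AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 60
CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120
"""

sequence = clean_sequence(LISTING)
fragment = extract_cds(sequence, 38, 118)  # 27 complete codons
peptide = translate(fragment)
print(peptide)
```

The printed peptide can be compared with the first 27 residues of the A151 listing; a mismatch at any position points to an OCR error in either the nucleotide or the protein rows.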



A152 DNA SEQUENCE

Gene name: Transmembrane protease, serine 2 (TMPRSS2)
Unigene number: Hs.105807
Probeset Accession #: T48536
Nucleic Acid Accession #: NM_005656.1

Coding sequence: 57-1535 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

I I I I I I

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG GGTGTCACAC TGCCCCGGCG 480

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560

CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 1620

ATGATTCAGA GGTCACTTCA TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680

TGCCATACTG TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 1740
CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860 

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460 
CTGAGTTCAA AGCCATCTT 



A153 Protein sequence:
Gene name: Transmembrane protease, serine 2 (TMPRSS2)
Unigene number: Hs.105807
Probeset Accession #: T48536
Protein Accession #: NP_005647.1
Signal sequence: none found
Transmembrane domains: 85-107
Cellular Localization: plasma membrane



1 11 21 31 41 51
I I I I I I

MALNSGSPPA IGPYYENHGY QPENPYPAQP TVVPTVYEVH PAQYYPSPVP QYAPRVLTQA 60
SNPVVCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC 120
DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY 180
GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAVVSLR 240
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300
PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360
VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420
TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480
TDWIYRQMKA NG
480 



A154 DNA SEQUENCE

Gene name: Homo sapiens cDNA FLJ13581 fis, clone PLACE1009039
Unigene number: Hs.129179

Probeset Accession #: AI694767
Nucleic Acid Accession #: AI694767

Coding sequence: 130-1086 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

I I I I I I

CAGAGAGGCT GTATTTCAGT GCAGCCTGCC AGACCTCTTC TGGAGGAAGA CTGGACAAAG 60

GGGGTCACAC ATTCCTTCCA TACGGTTGAG CCTCTACCTG CCTGGTGCTG GTCACAGTTC 120

AGCTTCTTCA TGATGGTGGA TCCCAATGGC AATGAATCCA GTGCTACATA CTTCATCCTA 180

ATAGGCCTCC CTGGTTTAGA AGAGGCTCAG TTCTGGTTGG CCTTCCCATT GTGCTCCCTC 240

TACCTTATTG CTGTGCTAGG TAACTTGACA ATCATCTACA TTGTGCGGAC TGAGCACAGC 300

CTGCATGAGC CCATGTATAT ATTTCTTTGC ATGCTTTCAG GCATTGACAT CCTCATCTCC 360

ACCTCATCCA TGCCCAAAAT GCTGGCCATC TTCTGGTTCA ATTCCACTAC CATCCAGTTT 420

GATGCTTGTC TGCTACAGAT GTTTGCCATC CACTCCTTAT CTGGCATGGA ATCCACAGTG 480

CTGCTGGCCA TGGCTTTTGA CCGCTATGTG GCCATCTGTC ACCCACTGCG CCATGCCACA 540

GTACTTACGT TGCCTCGTGT CACCAAAATT GGTGTGGCTG CTGTGGTGCG GGGGGCTGCA 600

CTGATGGCAC CCCTTCCTGT CTTCATCAAG CAGCTGCCCT TCTGCCGCTC CAATATCCTT 660

TCCCATTCCT ACTGCCTACA CCAAGATGTC ATGAAGCTGG CCTGTGATGA TATCCGGGTC 720

AATGTCGTCT ATGGCCTTAT CGTCATCATC TCCGCCATTG GCCTGGACTC ACTTCTCATC 780

TCCTTCTCAT ATCTGCTTAT TCTTAAGACT GTGTTGGGCT TGACACGTGA AGCCCAGGCC 840

AAGGCATTTG GCACTTGCGT CTCTCATGTG TGTGCTGTGT TCATATTCTA TGTACCTTTC 900

ATTGGATTGT CCATGGTGCA TCGCTTTAGC AAGCGGCGTG ACTCTCCACT GCCCGTCATC 960

TTGGCCAATA TCTATCTGCT GGTTCCTCCT GTGCTCAACC CAATTGTCTA TGGAGTGAAG 1020

ACAAAGGAGA TTCGACAGCG CATCCTTCGA CTTTTCCATG TGGCCACACA CGCTTCAGAG 1080

CCCTAGGTGT CAGTGATCAA ACTTCTTTTC CATTCAGAGT CCTCTGATTC AGATTTTAAT 1140

GTTAACATTT TGGAAGACAG TATTCAGAAA AAAAATTTCC TTAATAAAAA TACAACTCAG 1200

ATCCTTCAAA TATGAAACTG GTTGGGGAAT CTCCATTTTT TCAATATTAT TTTCTTCTTT 1260

GTTTTCTTGC TACATATAAT TATTAATACC CTGACTAGGT TGTGGTTGGA GGGTTATTAC 1320

TTTTCATTTT ACCATGCAGT CCAAATCTAA ACTGCTTCTA CTGATGGTTT ACAGCATTCT 1380

GAGATAAGAA TGGTACATCT AGAGAACATT TGCCAAAGGC CTAAGCACAG CAAAGGAAAA 1440

TAAACACAGA ATATAATAAA ATGAGATAAT CTAGCTTAAA ACTATAACTT CCTCTTCAGA 1500

ACTCCCAACC ACATTGGATC TCAGAAAAAT ACTGTCTTCA AAATGACTTC TACAGAGAAG 1560

AAATAATTTT TCCTCTGGAC ACTAGCACTT AAGGGGAAGA TTGGAAGTAA AGCCTTGAAA 1620

AGAGTACATT TACCTACGTT AATGAAAGTT GACACACTGT TCTGAGAGTT TTCACAGCAT 1680

ATGGACCCTG TTTTTCCTAT TTAATTTTCT TATCAACCCT TTAATTAGGC AAAGATATTA 1740

TTAGTACCCT CATTGTAGCC ATGGGAAAAT TGATGTTCAG TGGGGATCAG TGAATTAAAT 1800

GGGGTCATAC AAGTATAAAA ATTAAAAAAA AAAGACTTCA TGCCCAATCT CATATGATGT 1860

GGAAGAACTG TTAAAGAGAC CAACAGGGTA GTGGGTTAGA GATTTCCAGA GTCTTACATT 1920

TTCTARAGGA GGTATTTAAT TTCTTCTCAC TCATCCAGTG TTGTATTTAG GAATTTCCTG 1980

GCAACAGAAC TCATGGCTTT AATCCCACTA GCTATTGCTT ATTGTCCTGG TCCAATTGCC 2040

AATTACCTGT GTCTTGGAAG AAGTGATTTC TAGGTTCACC ATTATGGAAG ATT err ATT c 2100

AGAAAGTCTG CATAGGGCTT ATAGCAAGTT ATTTATTTTT AAAAGTTCCA TAGGTGTTTC 2160

TGATAGGCAG TGAGGTTAGG GAGCCACCAG TTATGATGGG AAGTATGGAA TGGCAGGTGT 2220

TGAAGATAAC ATTGGCCTTT TGAGTGTGAC TCGTAGCTGG AAAGTGAGGG AATCTTCAGG 2280
ACCATGCTTT ATTTGGGGCT TTGTGCAGTA TGGAACAGGG ACTTTGAGAC CGGGAAAGCA 2340

ATCTGACTTA GGCATGGGAA TCAGGCATTT TTGCTTCTGA GGGGCTATTA CCAAGGGTTA 2400 

ATAGGTTTCA TCTTCAACAG GATATGACAA CAGTCTTAAC CAAGAAACTC AAATTACATA 2460 

TACTAAAACA TGTGATCATA TATGTGGTAA GTTTCATTTT CTTTTTCAAT CCTCAGGTTC 2520 

CCTGATATGG ATTCCTATNA CATGCTTTCA TCCCCTTTTG TAATGGATAT CATATTTGGA 2580

AATGCCTATT TAATACTTGT ATTTGCTGCT GGACTGTAAG CCCATGAGGG CACTGTTTAT 2640 

TATTGAATGT CATCTCTGTT CATCATTGAC TGCTCTTTGC TCATCATTGA ATCCCCCAGC 2700 

AAAGTGCCTA GAACATAATA GTGCTTATGC TTGACACCGG TTATTTTTCA TCAAACCTGA 2760 

TTCCTTCTGT GCTGAACACA TAGCCAGGCA ATTTTCCAGC CTTCTTTGAG TTGGGTATTA 2820 

TTAAATTTTA GCCATTACTT CCAATGTGAG TGGAAGTGAC ATGTGCAATT TTTATACCTG 2880 

GCTCATAAAA CCCTCCCATG TGCAGCCTTT CATGTTGACA TTAAATGTGA CTTGGGAAGC 2940 

TATGTGTTAC ACAGAGTTAA TTAACCNGAA AGGCCTGGNA ATlTfl'TGNN AANNAAACTG 3000

TGGCOJNGAG GCCCHCAACC CTTTTTNNNA ATTTGGCAAN NTCCCACTTT GTANTTTGGT 3060 
AAGGAGGCCA GTTGGATAAG TGAAAAATAA AGTACTATTG TGTC 



A155 PROTEIN SEQUENCE
Gene name: Homo sapiens cDNA FLJ13581 fis, clone PLACE1009039
Unigene number: Hs.129179
Protein Accession #: not available
Signal sequence: none found
Pfam domain: 7tm_1 [43-293]
Transmembrane domains: 29-51, 57-79, 82-104, 203-225, 239-261, 273-295
Cellular Localization: not determined



1 11 21 31 41 51 

I I I I I I

MVDPNGNESS ATYFILIGLP GLEEAQFWLA FPLCSLYLIA VLGNLTIIYI VRTEHSLHEP 60 
MYIFLCMLSG IDILISTSSM PKMLAIFWFN STTIQFDACL LQMFAIHSLS GMESTVLLAM 120
AFDRYVAICH PLRHATVLTL PRVTKIGVAA VVRGAALMAP LPVFIKQLPF CRSNILSHSY 180
CLHQDVMKLA CDDIRVNVVY GLIVIISAIG LDSLLISFSY LLILKTVLGL TREAQAKAFG 240
TCVSHVCAVF IFYVPFIGLS MVHRFSKRRD SPLPVILANI YLLVPPVLNP IVYGVKTKEI 300 
RQRILRLFHV ATHASEP 

A156 DNA SEQUENCE 

Gene name: vasoactive intestinal peptide receptor 1 

Unigene number: Hs.198726

Probeset Accession #: X77777

Nucleic Acid Accession #: NM_004624.1

Coding sequence: 57-1544 (underlined sequences correspond to start and stop codons)



TCGGAGCCTG CGGAGGGTGG TGGTGGTGGT GGTGGTGGCC CTCGCCCGCC TCACTCATGC 60 

CTCCTCCTCC TCTGCTCTCG CTCAGGCGCC TCGGTGGCGG TTGGTCGGCG GTTACGCGGC 120 

TGGTGGTCGC GGCGGCCGGG GCTCGCTCTC GGGGAGGCCG GGGCGGATCT CGCGGCGCAG 180 

GCGGCGGCGG CCGAGGTGGG GTCGCGCGGC GGAGGCGGCT CGAGCTTCGT GCTGCGCGCT 240 

CGCTCTTGGG CTCCTCGCTG CAGGAGGAGT GTGACTATGT GCAGATGATC GAGGTGCAGC 300 

ACAAGCAGTG CCTGGAGGAG GCCCAGCTGG AGAATGAGAC AATAGGCTGC AGCAAGATGT 360 

GGGACAACCT CACCTGCTGG CCAGCCACCC CTCGGGGCCA GGTAGTTGTC TTGGCCTGTC 420 

CCCTCATCTT CAAGCTCTTC TCCTCCATTC AAGGCCGCAA TGTAAGCCGC AGCTGCACCG 480 

ACGAAGGCTG GACGCACCTG GAGCCTGGCC CGTACCCCAT TGCCTGTGGT TTGGATGACA 540 

AGGCAGCGAG TTTGGATGAG CAGCAGACCA TGTTCTACGG TTCTGTGAAG ACCGGCTACA 600 

CCATTGGCTA CGGCCTGTCC CTCGCCACCC TTCTGGTCGC CACAGCTATC CTGAGCCTGT 660 

TCAGGAAGCT CCACTGCACG CGGAACTACA TCCACATGCA CCTCTTCATA TCCTTCATCC 720 

TGAGGGCTGC CGCTGTCTTC ATCAAAGACT TGGCCCTCTT CGACAGCGGG GAGTCGGACC 780 

AGTGCTCCGA GGGCTCGGTG GGCTGTAAGG CAGCCATGGT CTTTTTCCAA TATTGTGTCA 840 

TGGCTAACTT CTTCTGGCTG CTGGTGGAGG GCCTCTACCT GTACACCCTG CTTGCCGTCT 900 

CCTTCTTCTC TGAGCGGAAG TACTTCTGGG GGTACATACT CATCGGCTGG GGGGTACCCA 960 

GCACATTCAC CATGGTGTGG ACCATCGCCA GGATCCATTT TGAGGATTAT GGTCTGCTCA 1020 

GGTGCTGGGA CACCATCAAC TCCTCACTGT GGTGGATCAT AAAGGGCCCC ATCCTCACCT 1080 

CCATCTTGGT AAACTTCATC CTGTTTATTT GCATCATCCG AATCCTGCTT CAGAAACTGC 1140 

GGCCCCCAGA TATCAGGAAG AGTGACAGCA GTCCATACTC AAGGCTAGCC AGGTCCACAC 1200 

TCCTGCTGAT CCCCCTGTTT GGAGTACACT ACATCATGTT CGCCTTCTTT CCGGACAATT 1260 

TTAAGCCTGA AGTGAAGATG GTCTTTGAGC TCGTCGTGGG GTCTTTCCAG GGTTTTGTGG 1320 

TGGCTATCCT CTACTGCTTC CTCAATGGTG AGGTGCAGGC GGAGCTGAGG CGGAAGTGGC 1380 

GGCGCTGGCA CCTGCAGGGC GTCCTGGGCT GGAACCCCAA ATACCGGCAC CCGTCGGGAG 1440 

GCAGCAACGG CGCCACGTGC AGCACGCAGG TTTCCATGCT GACCCGCGTC AGCCCAGGTG 1500 

CCCGCCGCTC CTCCAGCTTC CAAGCCGAAG TCTCCCTGGT CTGACCACCA GGATCCCAGC 1560 

CCAAGCGGCC CCTCCCGCCC CTTCCCACTC GCAGCAGACG CCGGGGACAG AGGCCTGCCC 1620 

GGGCGCGCCA GCCCCGGCCC TGGGCTCGGA GGCTGCCCCC GGCCCCCTGG TCTCTGGTCC 1680 

GGACACTCCT AGAGAACGCA GCCCTAGAGC CTGCCTGGAG CGTTTCTAGC AAGTGAGAGA 1740 

GATGGGAGCT CCTCTCCTGG AGGATGCAGG TGGAACTCAG TCATTAGACT CCTCCTCCAA 1800 

AGGCCCCCTA CGCCAATCAA GGGCAAAAAG TCTACATACT TTCATCCTGA CTCTGCCCCC 1860

TGCTGGCTCT TCTGCCCAAT TGGAGGAAAG CAACCGGTGG ATCCTCAAAC AACACTGGTG 1920 

TGACCTGAGG GCAGAAAGGT TCTGCCCGGG AAGGTCACCA GCACCAACAC CACGGTAGTG 1980 

CCTGAAATTT CACCATTGCT GTCAAGTTCC TTTGGGTTAA GCATTACCAC TCAGGCATTT 2040 

GACTGAAGAT GCAGCTCACT ACCCTATTCT CTCTTTACGC TTAGTTATCA GCTTTTTAAA 2100 

GTGGGTTATT CTGGAGTTTT TGTTTGGAGA GCACACCTAT CTTAGTGGTT CCCCACCGAA 2160 

GTGGACTGGC CCCTGGGTCA GTCTGGTGGG AGGACGGTGC AACCCAAGGA CTGAGGGACT 2220 

CTGAAGCCTC TGGGAAATGA GAAGGCAGCC ACCAGCGAAT GCTAGGTCTC GGACTAAGCC 2280 

TACCTGCTCT CCAAGTCTCA GTGGCTTCAT CTGTCAAGTG GGACTCTGTC ACACCAGCCA 2340 

TTCTTATCTC TCTGTGCTGT GGAAGCAACA GGAATCAAGA GACTGCCCTC CTTGTCCACC 2400 

CACCTATGTG CCAACTGTTG TAACTAGGCT CAGAGATGTG CACCCATGGG CTCTGACAGA 2460




AAGCAGATCC 
GTGAAAGCAC 
TTATTTGTTT 




ACACATACAG GATTTGAACT CAGATCTGTC TGATAGGAAT 2520 

TGCTAACTTT TGTGTATCGT AACCAGCCAG ATCCTCTTGG 2580 

TTATTAATGC CATTATCCCT GAATTCCCCT TGCCACCCCA 2640 

AGGAGGCCTC CATCTCATGT ATCATCTGGA TAGGAGCCTG 2700 

CTGCCCTTCA CCCCAGTGGC CACTCAGCTT CCTACCCACA 2760 

CAGGACTGCA ACAGGCTTGT GCAACAATAA ATGTTGGCTT 2820 



A157 Protein sequence:
Gene name: vasoactive intestinal peptide receptor 1
Unigene number: Hs.198726
Probeset Accession #: X77777
Protein Accession #: JC2195
Signal sequence: none found
Transmembrane domains: 181-202, 214-236, 255-277, 290-311, 332-354, 377-399, 408-430
Cellular Localization: plasma membrane



1 11 21 31 41 51

I I I I I I

MPPPPLLSLR RLGGGWSAVT RLVVAAAGAR SRGGRGGSRG AGGGGRGGVA RRRRLELRAA 60
RSLLGSSLQE ECDYVQMIEV QHKQCLEEAQ LENETIGCSK MWDNLTCWPA TPRGQVVVLA 120
CPLIFKLFSS IQGRNVSRSC TDEGWTHLEP GPYPIACGLD DKAASLDEQQ TMFYGSVKTG 180
YTIGYGLSLA TLLVATAILS LFRKLHCTRN YIHMHLFISF ILRAAAVFIK DLALFDSGES 240
DQCSEGSVGC KAAMVFFQYC VMANFFWLLV EGLYLYTLLA VSFFSERKYF WGYILIGWGV 300
PSTFTMVWTI ARIHFEDYGL LRCWDTINSS LWWIIKGPIL TSILVNFILF ICIIRILLQK 360
LRPPDIRKSD SSPYSRLARS TLLLIPLFGV HYIMFAFFPD NFKPEVKMVF ELVVGSFQGF 420
VVAILYCFLN GEVQAELRRK WRRWHLQGVL GWNPKYRHPS GGSNGATCST QVSMLTRVSP 480
GARRSSSFQA EVSLV
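
Each full row of a listing such as the one above carries six blocks of ten residues followed by a running count, which makes residues lost in transcription easy to detect (a common case is "VV" read as "W", which shortens a block by one). The following is a minimal Python sketch of such a consistency check, offered as an illustration only: the function name check_listing is hypothetical, and the second sample row is a deliberately damaged copy of the first A157 row, with an invented running count of 120, used purely to trigger the check.

```python
def check_listing(rows, block_size=10):
    """Return (row_number, block, length) for every block on a numbered row
    whose length differs from block_size.

    A row is treated as numbered when its last token is an integer (the
    running residue count). The final, unnumbered row may legitimately be
    shorter and is not checked.
    """
    problems = []
    for row_number, row in enumerate(rows, start=1):
        tokens = row.split()
        if not tokens or not tokens[-1].isdigit():
            continue  # blank row or final partial row
        blocks = tokens[:-1]  # drop the running count
        for block in blocks:
            if len(block) != block_size:
                problems.append((row_number, block, len(block)))
    return problems


SAMPLE = [
    # Intact row in the layout used by the listing.
    "MPPPPLLSLR RLGGGWSAVT RLVVAAAGAR SRGGRGGSRG AGGGGRGGVA RRRRLELRAA 60",
    # Hypothetical damaged row: the third block has lost one residue.
    "MPPPPLLSLR RLGGGWSAVT RLWAAAGAR SRGGRGGSRG AGGGGRGGVA RRRRLELRAA 120",
    # Final partial row, carrying no running count.
    "GARRSSSFQA EVSLV",
]

problems = check_listing(SAMPLE)
for row_number, block, length in problems:
    print(f"row {row_number}: block {block!r} has {length} residues")
```

A flagged block only shows that a residue is missing or extra; deciding which residue it was still requires comparison with the corresponding nucleotide listing.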


A158 DNA SEQUENCE 

Gene name: ESTs
Unigene number: Hs.29383

Probeset Accession #: AW207206
Nucleic Acid Accession #: AL133619

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60

CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 

CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 

GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360

GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540

GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 

CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660

CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 

GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 

GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260

GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 

GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 

TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 

TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560

CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 

AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 

ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 

ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860

CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 

AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 

CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 






A159 Protein sequence:

Gene name: ESTs 

Unigene number: Hs.29383 

Probeset Accession #: AW207206 

Protein Accession #: T43457

Signal sequence: none found 

Transmembrane domains: 303-322 

Cellular Localization: not determined 






1 11 21 31 41 51 

I I I I I I 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60

QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120

GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180

GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240

MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300

AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360

LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420

GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480

SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540

EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600

TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL
A160 DNA SEQUENCE

Gene name: LIV-1 protein, estrogen regulated
Unigene number: Hs.79136

Probeset Accession #: U41060
Nucleic Acid Accession #: NM_012319.2

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51

I I I I I I

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120
GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420
ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700
TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 



A161 PROTEIN SEQUENCE
Gene name: LIV-1 protein, estrogen regulated
Unigene number: Hs.79136
Protein Accession #: NP_036451
Signal sequence: 1-21
Pfam domain: Zip [591-743]
Transmembrane domains: 330-346, 352-368, 427-444, 663-679, 688-703, 730-745
Cellular Localization: plasma membrane



1 11 21 31 41 51
I I I I I I

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF



A162 DNA sequence
Gene name: bone morphogenetic protein receptor IB (ALK-6)
Unigene number: Hs.87223
Probeset Accession #: AA250737
Nucleic Acid Accession #: NM_001203
Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60

GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 

GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 

CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300

AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 

AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 

CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 

ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 

AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 

CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 

GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 

ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 

AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 

GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 

CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 

CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



A163 Protein sequence
Gene name: bone morphogenetic protein receptor IB (ALK-6)
Unigene number: Hs.72472 / Hs.87223
Probeset Accession #: AA250737 / U89326
Protein Accession #: NP_001194
Signal sequence: 1-13
Transmembrane domains: 128-144
PFAM domains: activin_receptor [30-111], protein kinase [204-491]
Cellular Localization: plasma membrane






1 11 21 31 41 51 

I I I I I I

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60

DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300

MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480
RLTALRVKKT LAKMSESQDI KL



A164 DNA sequence 

Gene name: ESTs 

Unigene number: Hs. 157601 

Probeset Accession #: W07459 
Nucleic Acid Accession #: AC005383 

Coding Sequence: 328-2751 (underlined sequences correspond to start and stop codons) 
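
The coding-sequence coordinates in these records are 1-based and inclusive, so a record can be checked by stripping the layout (group spacing, position counters), slicing the stated range, and translating it. The following is a minimal sketch of that check, assuming the standard genetic code; the helper names `clean_sequence` and `translate` are hypothetical and not part of the listing. The sample row is the 301-360 row of this record, in which position 328 of the full sequence is offset 28.

```python
import re
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text):
    """Drop position counters, spacing, and anything that is not A/C/G/T.

    Note: a misread base (for example "O") is dropped too, which shifts
    every later position, so such characters should be repaired first.
    """
    return re.sub(r"[^ACGT]", "", text.upper())


def translate(seq, start, end):
    """Translate the 1-based, inclusive range start..end, stopping at a stop codon."""
    cds = seq[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Row 301-360 of this record; the start codon begins at offset 28 of the row.
row = "TCGCCGCTCT CCTTCCGTTA TATCAA CATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360"
print(translate(clean_sequence(row), 28, 60))  # MPPFLLLEAVC
```

The translated residues can then be compared with the first residues of the corresponding protein record.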



1 11 21 31 41 51 

I I I I I I 

GACAGTGTTC GCGGCTGCAC CGCTCGGAGG CTGGGTGACC CGCGTAGAAG TGAAGTACTT 60 

TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAACG CGAGGGGCTC TATGCACCTC 120 

CCTGGCGGTA GTTCCTCCGA CCTCAGCCGG GTCGGGTCGT GCCGCCCTCT CCCAGGAGAG 180 

ACAAACAGGT GTCCCACGTG GCAGCCGCGC CCCGGGCGCC CCTCCTGTGA TCCCGTAGCG 240 

CCCCCTGGCC CGAGCCGCGC CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300 

TCGCCGCTCT CCTTCCGTTA TATCAACATG CCCCCTTTCC TGTTGCTGGA GGCCGTCTGT 360 

GTTTTCCTGT TTTCCAGAGT GCCCCCATCT CTCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 480 

ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 540 

CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600 

GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720 

CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840 

CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900 

GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTGTTGGC TGAGCAGGTG 960 

GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020 

ACGCCAGACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGACGCTGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140 

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCTGCTAC 1200 

AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACATGTGTT 1260 

CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320 

TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380 

GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440 

GCCGTGCTGA GCGAGGACTC TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500 

CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560 

GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620 

CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 

CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCGAG CGCGTCACGC AAGGGCGCGA 1740 

GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800 

GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860 

GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920 

CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100 

GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG GTGGGGTGGG CTCAGCCGGC 2160 

ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280 

GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA 2340 

AGTGAGGGTC TGCGGAGGCT TGCAGGTGCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400 

GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520 

GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC CCCACTGCGA GAACCGTGAG 2580 

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640 

ATGGCTCCCG TGCAGGAGGG CAGGAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700 

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCCTTA GAATGTCTGC 2760 

TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820 

ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AAGGATGTTG TTGAAAAGTT 2880 

TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940 

CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGCGGCCTGA 3000 

CGTTCCTTTG CACACAATCA ATGCTCGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

AGGCCTTTAC TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120 

GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180 
CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300 
TGTGCATGGG CCCAGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360 
ACCTTGAAGG TCTTC 






A165 Protein sequence 
Gene name: ESTs 
Unigene number: Hs.157601 
Protein Accession #: none found 
Signal sequence: 1-17 
Transmembrane domains: none found 
VGW domains: 49-223; 341-518; 529-706 
EGF domains: 298-333; 715-748 
Cellular Localization: secreted 

1 11 21 31 41 51 
| | | | | | 
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGRARHARA RELLLLGVGS 480 
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660 
SVLVVGVGPV LSEGLRRLAG ARDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGRS 780 
RTPPSNYREG LGTEMVPTFW NVCAPGP 



A166 DNA sequence 
Gene name: Homo sapiens beta-1 adrenergic receptor mRNA, 3' UTR 
Unigene number: Hs.37744 
Probeset Accession #: AA011176 
Nucleic Acid Accession #: AF272890 
Coding Sequence: 87-1520 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
| | | | | | 
TGCTACCCGC GCCCGGGCTT CTGGGGTGTT CCCCAACCAC GGCCCAGCCC TGCCACACCC 60 
           GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGCGCCTCCG 120 
AGCCCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCCGCCACC GCGGCGCGGC 180 
TGCTGGTGCC CGCGTCGCCG CCCGCCTCGT TGCTGCCTCC CGCCAGCGAA AGCCCCGAGC 240 
CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 300 
TCGTGGCGGG CAATGTGCTG GTGATCGTGG CCATCGCCAA GACGCCGCGG CTGCAGACGC 360 
TCACCAACCT CTTCATCATG TCCCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGTGG 420 
TGCCGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC TTCTTCTGCG 480 
AGCTGTGGAC CTCAGTGGAC GTGCTGTGCG TGACGGCCAG CATCGAGACC CTGTGTGTCA 540 
TTGCCCTGGA CCGCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAGC CTGCTGACGC 600 
GCGCGCGGGC GCGGGGCCTC GTGTGCACCG TGTGGGCCAT CTCGGCCCTG GTGTCCTTCC 660 
TGCCCATCCT CATGCACTGG TGGCGGGCGG AGAGCGACGA            TGCTACAACG 720 
ACCCCAAGTG CTGCGACTTC GTCACCAACC GGGCCTACGC CATCGCCTCG TCCGTAGTCT 780 
CCTTCTACGT GCCCCTGTGC ATCATGGCCT TCGTGTACCT GCGGGTGTTC CGCGAGGCCC 840 
AGAAGCAGGT GAAGAAGATC GACAGCTGCG AGCGCCGTTT CCTCGGCGGC CCAGCGCGGC 900 
CGCCCTCGCC CTCGCCCTCG CCCGTCCCCG CGCCCGCGCC GCCGCCCGGA CCCCCGCGCC 960 
CCGCCGCCGC CGCCGCCACC GCCCCGCTGG CCAACGGGCG TGCGGGTAAG CGGCGGCCCT 1020 
CGCGCCTCGT GGCCCTACGC GAGCAGAAGG CGCTCAAGAC GCTGGGCATC ATCATGGGCG 1080 
TCTTCACGCT CTGCTGGCTG CCCTTCTTCC TGGCCAACGT GGTGAAGGCC TTCCACCGCG 1140 
AGCTGGTGCC CGACCGCCTC TTCGTCTTCT TCAACTGGCT GGGCTACGCC AACTCGGCCT 1200 
TCAACCCCAT CATCTACTGC CGCAGCCCCG ACTTCCGCAA GGCCTTCCAG GGACTGCTCT 1260 
GCTGCGCGCG CAGGGCTGCC CGCCGGCGCC ACGCGACCCA CGGAGACCGG CCGCGCGCCT 1320 
CGGGCTGTCT GGCCCGGCCC GGACCCCCGC CATCGCCCGG GGCCGCCTCG GACGACGACG 1380 
ACGACGATGT CGTCGGGGCC ACGCCGCCCG CGCGCCTGCT GGAGCCCTGG GCCGGCTGCA 1440 
ACGGCGGGGC GGCGGCGGAC AGCGACTCGA GCCTGGACGA GCCGTGCCGC CCCGGCTTCG 1500 
CCTCGGAATC CAAGGTGTAG GGCCCGGCGC GGGGCGCGGA CTCCGGGCAC GGCTTCCCAG 1560 
GGGAACGAGG AGATCTGTGT TTACTTAAGA CCGATAGCAG GTGAACTCGA AGCCCACAAT 1620 
CCTCGTCTGA ATCATCCGAG GCAAAGAGAA AAGCCACGGA CCGTTGCACA            1680 
TTTGGGAAGG GATGGGAGAG TGGCTTGCTG ATGTTCCTTG TTG 
1680 



A167 Protein sequence 
Gene name: Homo sapiens beta-1 adrenergic receptor mRNA, 3' UTR 
Unigene number: Hs.37744 
Protein Accession #: AA011176 
Signal sequence: none found 
Transmembrane domains: 62-84, 95-117, 135-157, 177-198, 226-248 
Pfam domain: 7tm_1 [75-377] 
Cellular Localization: plasma membrane 

1 11 21 31 41 51 
| | | | | | 
MGAGVLVLGA SEPGNLSSAA PLPDGAATAA RLLVPASPPA SLLPPASESP EPLSQQWTAG 60 
MGLLMALIVL LIVAGNVLVI VAIAKTPRLQ TLTNLFIMSL ASADLVMGLL VVPFGATIVV 120 
WGRWEYGSFF CELWTSVDVL CVTASIETLC VIALDRYLAI TSPFRYQSLL TRARARGLVC 180 
TVWAISALVS FLPILMHWWR AESDEARRCY NDPKCCDFVT NRAYAIASSV VSFYVPLCIM 240 
AFVYLRVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPSPV PAPAPPPGPP RPAAAAATAP 300 
LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPF FLANVVKAFH RELVPDRLFV 360 
FFNWLGYANS AFNPIIYCRS PDFRKAFQGL LCCARRAARR RHATHGDRPR ASGCLARPGP 420 
PPSPGAASDD DDDDVVGATP PARLLEPWAG CNGGAAADSD SSLDEPCRPG FASESKV 



A168 DNA sequence 
Gene name: CEGP1 
Unigene number: Hs.222399 
Probeset Accession #: AA256485 
Nucleic Acid Accession #: AJ400877 
Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 
GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 
TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 



A169 Protein sequence 
Gene name: CEGP1 
Unigene number: Hs.222399 
Probeset Accession #: AA256485 
Protein Accession #: CAB92285 
Signal sequence: 1-31 
Transmembrane domains: none 
PFAM domains: EGF-like_domains [49-84, 132-167, 177-213, 286-321, 407-442], CUB_domain [809-918] 
Cellular Localization: may be secreted 

1 11 21 31 41 51 
| | | | | | 
MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180 
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 



A170 DNA sequence 
Gene name: DEME-6 protein (KIAA0452) 
Unigene number: Hs.125783 
Probeset Accession #: AL039402 
Nucleic Acid Accession #: AF007170 
Coding sequence: 1-1725 (underlined sequences correspond to stop codon) 

1 11 21 31 41 51 
| | | | | | 
AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 



A171 Protein sequence 
Gene name: DEME-6 protein (KIAA0452) 
Unigene number: Hs.125783 
Probeset Accession #: AL039402 
Protein Accession #: AAC39582 
Signal sequence: none 
Transmembrane domains: 210-226 
Cellular Localization: plasma membrane 

1 11 21 31 41 51 
| | | | | | 
MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120 
FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 



A172 DNA sequence 
Unigene number: Hs.200102 
Probeset Accession #: AL117406 
Nucleic Acid Accession #: none found 
Coding sequence: 1-4044 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | | 

ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC 60 

ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120 

GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 

TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT 240 

CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 

ACCCCGCTCA TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360 

TCAGTCCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 

GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 480 

ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG 540 

CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 

CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 

TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC 720 

TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 780 

ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 

GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA 900 

TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960 

ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT 1020 

GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 

TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 

ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260 

CTCCAGGAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320 

TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 

GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440 

CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1680 

GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 

CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGCGGGGCCT CAACCTCTCT 1800 

GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 

CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980 

TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 2220 

CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2280 

TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT CGTGGTGCTG 2340 

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 

GGGACCAATA GCAGCCGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 

CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTTCTTTGA CACCATCCCA 2640 

ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 

AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 

TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2880 

TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA GAATAACTAC 3000 

CTGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 

CTTGTGACCT TGGCTGTTGC CCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120 



TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CCAGCTTCCA GGCCACTGCC 3180 

CGGATTGGCT TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240 

AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 

CCACAGCATG GGGAAATCAT ATTTCAGGAT TATCACATGA AATACAGAGA CAACACACCC 3360 

ACCGTGCTTC ACGGCATCAA CCTGACCATC CGCGGCCACG AAGTGGTGGG CATCGTGGGA 3420 

AGGACGGGCT CTGGGAAGTC CTCCTTGGGC ATGGCTCTCT TCCGCCTGGT GGAGCCCATG 3480 

GCAGGCCGGA TTCTCATTGA CGGCGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540 

TCCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600 

CTAGATCCCT TTGACCGTCA CACTGACCAG CAGATCTGGG ATGCCTTGGA GAGGACATTC 3660 

CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAACGGT 3720 

GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTCGCAAC 3780 

TCCAAGATCA TCCTTATCGA TGAAGCCACA GCCTCCATTG ACATGGAGAC AGACACCCTG 3840 

ATCCAGCGCA CAATCCGTGA AGCCTTCCAG GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900 

GTCACCACTG TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGGGAA GGTGGTAGAA 3960 

TTTGATCGGC CGGAGGTACT GCGGAAGAAG CCTGGGTCAT TGTTCGCAGC CCTCATGGCC 4020 

ACAGCCACTT CTTCACTGAG ATAAGGAGAT GTGGAGACTT CATGGAGGCT GGCAGCTGAG 4080 

CTCAGAGGTT CACACAGGTG CAGCTTCGAG GCCCACAGTC TGCGACCTTC TTGTTTGGAG 4140 

ATGAGAACTT CTCCTGGAAG CAGGGGTAAA TGTAGGGGGG GTGGGGATTG CTGGATGGAA 4200 

ACCCTGGAAT AGGCTACTTG ATGGCTCTCA AGACCTTAGA ACCCCAGAAC CATCTAAGAC 4260 

ATGGGATTCA GTGATCATGT            TTAACTTACA TGCTGAATAA TTTTATAATA 4320 

AGGTAAAAGC TTATAGTTTT CTGATCTGTG TTAGAAGTGY TGCAAATGCT GTACTGACTT 4380 

TGTAAAATAT AAAACTAAGG AAAACTCAAA AAAAAAAAAA AAAAAAA 



A173 Protein sequence 
Unigene number: Hs.200102 
Probeset Accession #: AL117406 
Protein Accession #: none found 
Signal sequence: none found 
Transmembrane domains: 169-185, 199-215, 275-291, 304-320, 387-403, 770-786, 829-845, 907-923, 927-943, 1018-1034 
PFAM domains: ABC_transporter [502-673], ABC_membrane_region [163-432, 771-1060], ATP-binding domains [508-516, 1139-1147] 
Cellular Localization: plasma membrane 

1 11 21 31 41 51 
| | | | | | 
MTRKRTYWVP NSSGGLVNRG IDIGDDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 
WGKYDAALRT MIPFRPKPRF PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL 120 
SVHDASDKNV QRLHRLWEEE VSRRGIEKAS VLLVMLRFQR TRLIFDALLG ICFCIASVLG 180 
PILIIPKILE YSEEQLGNVV HGVGLCFALF LSECVKSLSF SSSWIINQRT AIRFRAAVSS 240 
FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 
YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIK LIKMYTWEKP 360 
FAKIIEGMES LTFCSKPGDG MAFSMLASLN LLRLSVFFVP IAVKGLTNSK SAVMRFKKFF 420 
LQESPVFYVQ TLQDPSKALV FEEATLSWQQ TCPGIVNGAL ELERNGHASE GMTRPRDALG 480 
PEEEGNSLGP ELHKINLVVS KGMMLGVCGN TGSGKSSLLS AILEEMHLLE GSVGVQGSLA 540 
YVPQQAWIVS GNIRENILMG GAYDKARYLQ VLHCCSLNRD LELLPFGDMT EIGERGLNLS 600 
GGQKQRISLA RAVYSDRQIY LLDDPLSAVD AHVGKHIFEE CIKKTLRGKT VVLVTHQLQY 660 
LEFCGQIILL ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 
SQALATSLEE SLNGNAVPEH QLTQEEEMEE GSLSWRVYHH YIQAAGGYMV SCIIFFFVVL 780 
IVFLTIFSFW WLSYWLEQGS GTNSSRESNG TMADLGNIAD NPQLSFYQLV YGLNALLLIC 840 
VGVCSSGIFT KVTRKASTAL HNKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQLLP 900 
IFSEQFLVLS LMVIAVLLIV SVLSPYILLM GAIIMVICFI YYMMFKKAIG VFKRLENYSR 960 
SPLFSHILNS LQGLSSIHVY GKTEDFISQF KRLTDAQNNY LLLFLSSTRW MALRLEIMTN 1020 
LVTLAVALFV AFGISSTPYS FKVMAVNIVL QLASSFQATA RIGLETEAQF TAVERILQYM 1080 
KMCVSEAPLH MEGTSCPQGW PQHGEIIFQD YHMKYRDNTP TVLHGINLTI RGHEVVGIVG 1140 
RTGSGKSSLG MALFRLVEPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGTIRFN 1200 
LDPFDRHTDQ QIWDALERTF LTKAISKFPK KLHTDVVENG GNFSVGERQL LCIARAVLRN 1260 
SKIILIDEAT ASIDMETDTL IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVMGNGKVVE 1320 
FDRPEVLRKK PGSLFAALMA TATSSLR 



A174 DNA sequence 
Gene name: 
Unigene number: Hs.128899 
Probeset Accession #: AA983251 
Nucleic Acid Accession #: AA983251 
Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | | 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT            GAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC            CCTCCCGAGG GCGAGGCAGC 120 

GACCGGGAGA GGGAGAGCCG GCCGGAGGCT            TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG            GGATCCGCGC CCAGCAGCAG 240 

CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 

CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 

GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 

CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 

CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 

TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 

TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 



TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 

CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140 

CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A175 Protein sequence 

Gene name: ESTs 

Unigene number: Hs. 128899 

Protein Accession #: none found 

Signal sequence: 1-11 

Transmembrane domains: 402-424 

Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGI*LWDRAAA 60 

GEAERGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAODP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRPPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 

ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 

SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 

LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

A176 DNA SEQUENCE 

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H. sapiens] 
Unigene number: Hs. 19322 

Probeset Accession #: AA088458 
Nucleic Acid Accession #: AA088458 

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GCCCTTGGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 




WO 03/042661 



PCT/US02/36810 



CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCOGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG COGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG OGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT OGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 2280 

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460 

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580 

GAGCAGCGTC CCTGGGCTCT ATCCGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACACGT 2640 

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGCGTGG TGGTTCACGC 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 






A177 DNA SEQUENCE 
Gene name: Cadherin 3, P-cadherin (placental) 
Unigene number: Hs.2877 
Probeset Accession #: X63629 
Nucleic Acid Accession #: X63629 
Coding sequence: 54-2543 (start and stop codons are underlined) 



1 
I 

GCGGAACACC 
TCCCTCGTGG 
CCTCCGAGCC 
CGGAGCAGGA 
CAGCTCTGTT 
AAAGAAGGTC 
GAAGACACAA 



ACAGCATCAC 
AGACAGGCTG 
TCTTTGGCCA 
TCATCGTGAC 
GTGTCTTAGA 
ATGATGCCAT 
AGGACCCACA 
CCAGTGGCCT 
TGGATGGGGA 
ACAATGCTCC 
GCCATGAGGT 
GTGCCACCTA 
CTGAGAGCAA 
AGCACACCCT 
CCACAGCCAC 



11 

I 

GGCCCGCCGT 
ACCTCTCGCG 
GTGCCGGGCG 
GCCCGGCCAG 
TAGCACTGAT 
ACTGAAGGAA 
GAGAGATTGG 
GAGACTGAAT 
GGGGCCGGGG 
GTTGTTGTTG 
CGCTGTGTCA 
CGACCAGAAT 
GGGAGTCCTA 
CTACACCTAC 
CGACCTCATG 
GGACCGGGAA 
CGGCTCCACC 
CATGTTTGAC 
GCAGAGGCTG 
CCTTATCATG 
CCAGGGCATC 
GTACGTTGAA 
CATAGTGGTC 



21 
I 

CGCGGCAGCT 
TCTCTCCTCC 
GTCTTCAGGG 
GCGCTGGGGA 
AATGATGACT 
AGGAATCCAT 
GTGGTTGCTC 
CAGCTCAAGT 
GCAGACAGCC 
AATAAGCCAC 
GAGAATGGTG 
GACCACAAGC 
CCAGGTACTT 
AATGGGGTGG 
TTCACAATTC 
AAAGTCCCTG 
ACCACGGCAG 
CCCCAGAAGT 
ACGGTCACTG 
GGCGGTGACG 
CTGACAACCA 
GTGACCAACG 
CACGTGGAGG 



31 
1 

GCTTCACCCC 
TTCTCCAGGT 
AGGCTGAAGT 
AAGTATTCAT 
TCACTGTGCG 
TGAAGATCTT 
CAATATCTGT 
CTAATAAAGA 
CCCCTGAGGG 
TGGACCGGGA 
CCTCAGTGGA 
CCAAGTTTAC 
CTGTGATGCA 
TTGCTTACTC 
ACCGGAGCAC 
AGTACACACT 
TGGCAGTAGT 
ACGAGGCCCA 
ATCTGGACGC 
ACGGGGACCA 
GGAAGGGTTT 
AGGCCCCTTT 
ATGTGAATGA 




GGAGATTGCC 
GGACCCCATG 
CCAGGACACC 
GGTGACAGCC 
CATCCATAGC 
AGGCACCATC 
GACCATCCAG 
GGAGATCCTT 
TGTGCCTGAG 
CCCCAACTCA 
TTTTACCATC 
GGATTTTGAG 
TGTGCTGAAG 
GGCACCTGTG 



51 
I 

GCCATGGGGC 
CAGTGCGCGG 
GCGGGAGGCG 
GGGCAAGAGC 
ACAGTCCAGG 
CGTATCTTAC 
GGCAAGGGTC 
AAGATTTTCT 
GTAGAGAAGG 
AAGTATGAGC 
AACATCTCCA 
TTCCGAGGGA 
ACAGATGAGG 
CAAGAACCAA 
AGCGTCATCT 
GCCACAGACA 
GATGCCAATG 
AATGCAGTGG 
CCAGCGTGGC 
ACCACCCACC 
GCCAAAAACC 
CTCCCAACCT 
TTTGTCCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 

ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A178 Protein sequence: 
Gene name: Cadherin 3, P-cadherin (placental) 
Unigene number: Hs.2877 
Probeset Accession #: X63629 
Protein Accession #: CAA45177 
Signal sequence: 1-24 
Transmembrane domain: 659-675 
Cellular localization: plasma membrane 



MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFL 
GAVLALLFLL 
GLEARPEWL 
DYEGSGSDAA 



11 
I 

LLLLQVCHZjQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
I 

CAASEPCRAV 



IFYSITGPGA 
ISIIVTDQND 
EPKDPHDIjMF 
ANDNAPMFDP 
THPESNQGIIi 
VPPSKWEVQ 
LDREDEQFVR 
VREVLNITDK 



RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNF1IE 
EWGSRFKKLA 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLHU 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 



NLKAAKTDPT 
DMYGGGEDD 



51 
I 

IjGKVFMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
VJTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR 
APPYDTLLVF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A179 DNA SEQUENCE 
Gene name: endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 
Unigene number: Hs. 258583 
Probeset Accession #: NM_012152 
Nucleic Acid Accession #: NM_012152 
Coding sequence: 43-1104 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



A180 Protein sequence: 
Gene name: endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor, 
Unigene number: Hs. 258583 
Probeset Accession #: NM_012152 
Protein Accession #: NP_036284 
Signal sequence: none found 
Transmembrane domains: 31-53, 66-88, 150-172, 190-211, 239-261, 277-295 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60 
FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 120 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSLA 180 
PIYSRSYLVF WTVSNLMAFL IMVVVYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 240 
VMTVLGAFVV CWTPGLVVLL LDGLNCRQCG VQHVKRWFLL LALLNSVVNP IIYSYKDEDM 300 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 



A181 DNA SEQUENCE 

Gene name: ESTs 

Unigene number: Hs. 162859 

Probeset Accession #: AA569531 

Nucleic Acid Accession #: AA569531 

Coding sequence: 1-504 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGACCTACA GTTACTCATT TTTCAGGCCT GAGTTGATCG TTAATCATCT TAATTATGTT 60 

CATTCTGAAG CCAACAGGAG AACCAAGACC AAAACTTTAT TGTCTCTGCT TTCATTTCTT 120 

GATGAAACCT CTGGACTAAG CACACATCTT CCTTGTTTAT CTCTCTCAAA GGAGTGTGGA 180 

GTGCTTCATC TGGACATCCA CGGGAAGAAG GAAGACATGA GAATCACCCA ACAGTCTTCC 240 

CAGCTATACC TGTGGGACAT GGGTGGTTTT ACAATATTTA AGAACCTGTG GATGAGCCTC 300 

ATACCCAGAG GGAACAAACG CTCCCCAAAA AGAGTTACAG AAACCATCCT GAGAGATTTT 360 

AAGCAGAAGC AAAGTTCAAA GATCCAAGAG GAGAGACGAA GAGAGTCTGC AGGACCAAAC 420 

CTCTCTTCAT TCTGGTTTGT GGGGAATGCT GGAAGAGGAG ACAGGCCCCA GATTTGGGCA 480 

GGAAGTAAAC AGTTTTCAGG CTGAGGCCAA TCTGAGCAGG AACATTCCAA TATTTCTTCA 540 

GCTACGTTGT CCCAGCACTT CACTGGTTAA CCTTTTATGT CCACCATTTG TGGATTTCAC 600 

AGCTACTTGT CAATGGTGAA TATTGATCAT CATCATTATC TACTGAGCTG CTACCATATC 660 

CCAGCTACTC CTTGCATGTT GTTCATTATT TTCTCAACAC TCAGCATATT TGCAATATGT 720 

TATGTAATAT CACAGACAAG GAAACTGAAC GCAGAAATGT TTTATTTCTT GCCAAACATC 780 

ACATGAGGAT GAACAATGAA ACCGATTTGA AACCAGGATT GTCTGATTCC AACATCTCTG 840 
GGTCCTTTTT CACTCTGATA TGCTGCAATT AAAAAGCCAT TTCTAAGACT GT 



A182 Protein sequence: 
Gene name: ESTs 
Unigene number: Hs. 162859 
Probeset Accession #: AA569531 
Protein Accession #: none found 
Signal sequence: 1-46 
Transmembrane domains: none found 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MTYSYSFFRP ELIVNHLNYV HSEANRRTKT KTLLSLLSFL DETSGLSTHL PCLSLSKECG 60 
VLHLDIHGKK EDMRITQQSS QLYLWDMGGF TIFKNLWMSL IPRGNKRSPK RVTETILRDF 120 
KQKQSSKIQE ERRRESAGPN LSSFWFVGNA GRGDRPQIWA GSKQFSG 
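
The "Coding sequence" coordinates in these records are 1-based and inclusive, so a record such as A181 (coding sequence 1-504) can be cross-checked against its protein record (A182) by slicing and translating. The following is a minimal sketch under stated assumptions: the helper names (`clean_listing_line`, `extract_cds`, `translate`) are hypothetical and not part of the listing, and the standard genetic code is assumed.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (assumed here).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_listing_line(line):
    """Keep only the letters of one listing row (drops spaces and the trailing position number)."""
    return "".join(ch for ch in line.upper() if ch.isalpha())


def extract_cds(sequence, start, end):
    """Slice a coding sequence using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon; unknown codons become 'X'."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the A181 DNA record (positions 1-60).
first_row = "ATGACCTACA GTTACTCATT TTTCAGGCCT GAGTTGATCG TTAATCATCT TAATTATGTT 60"
dna = clean_listing_line(first_row)
print(translate(extract_cds(dna, 1, 60)))  # MTYSYSFFRPELIVNHLNYV
```

The printed residues match the first two blocks of the A182 protein record. Characters that are not A, C, G or T (for example ambiguity codes or scanning damage) translate to `X` in this sketch rather than being guessed.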



A183 DNA SEQUENCE 
Gene name: ESTs 
Unigene number: Hs. 179809 
Probeset Accession #: N95796 
Nucleic Acid Accession #: XM_050197 
Coding sequence: 310-1971 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

TCACACGTGC CAAGGGGCTG GCTCAGCGGA ACCAGCCTGC ACGCGCTGGC TCCGGGTGAC 60 

AGCCGCGCGC CTCGGCCAGG ATCTGAGTGA TGAGACGTGT CCCCACTGAG GTGCCCCACA 120 

GCAGCAGGTG TTGAGCATGG GCTGAGAAGC TGGACCGGCA CCAAAGGGCT GGCAGAAATG 180 

GGCGCCTGGC TGATTCCTAG GCAGTTGGCG GCAGCAAGGA GGAGAGGCCG CAGCTTCTGG 240 

AGCAGAGCCG AGACGAAGCA GTTCTGGAGT GCCTGAACGG CCCCCTGAGC CCTACOCGCC 300 






TGGCCCACTA TGGTCCAGAG GCTGTGGGTG AGCCGCCTGC TGCGGCACCG GAAAGCCCAG 360 

CTCTTGCTGG TCAACCTGCT AACCTTTGGC CTGGAGGTGT GTTTGGCCGC AGGCATCACC 420 

TATGTGCCGC CTCTGCTGCT GGAAGTGGGG GTAGAGGAGA AGTTCATGAC CATGGTGCTG 480 

GGCATTGGTC CAGTGCTGGG CCTGGTCTGT GTCCCGCTCC TAGGCTCAGC CAGTGACCAC 540 

TGGCGTGGAC GCTATGGCCG CCGCCGGCCC TTCATCTGGG CACTGTCCTT GGGCATCCTG 600 

CTGAGCCTCT TTCTCATCCC AAGGGCCGGC TGGCTAGCAG GGCTGCTGTG CCCGGATCCC 660 

AGGCCCCTGG AGCTGGCACT GCTCATCCTG GGCGTGGGGC TGCTGGACTT CTGTGGCCAG 720 

GTGTGCTTCA CTCCACTGGA GGCCCTGCTC TCTGACCTCT TCCGGGACCC GGACCACTGT 780 

CGCCAGGCCT ACTCTGTCTA TGCCTTCATG ATCAGTCTTG GGGGCTGCCT GGGCTACCTC 840 

CTGCCTGCCA TTGACTGGGA CACCAGTGCC CTGGCCCCCT ACCTGGGCAC CCAGGAGGAG 900 

TGCCTCTTTG GCCTGCTCAC CCTCATCTTC CTCACCTGCG TAGCAGCCAC ACTGCTGGTG 960 

GCTGAGGAGG CAGCGCTGGG CCCCACCGAG CCAGCAGAAG GGCTGTCGGC CCCCTCCTTG 1020 

TCGCCCCACT GCTGTCCATG CCGGGCCCGC TTGGCTTTCC GGAACCTGGG CGCCCTGCTT 1080 

CCCCGGCTGC ACCAGCTGTG CTGCCGCATG CCCCGCACCC TGCGCCGGCT CTTCGTGGCT 1140 

GAGCTGTGCA GCTGGATGGC ACTCATGACC TTCACGCTGT TTTACACGGA TTTCGTGGGC 1200 

GAGGGGCTGT ACCAGGGCGT GCCCAGAGCT GAGCCGGGCA CCGAGGCCCG GAGACACTAT 1260 

GATGAAGGCG TTCGGATGGG CAGCCTGGGG CTGTTCCTGC AGTGCGCCAT CTCCCTGGTC 1320 

TTCTCTCTGG TCATGGACCG GCTGGTGCAG CGATTCGGCA CTCGAGCAGT CTATTTGGCC 1380 

AGTGTGGCAG CTTTCCCTGT GGCTGCCGGT GCCACATGCC TGTCCCACAG TGTGGCCGTG 1440 

GTGACAGCTT CAGCCGCCCT CACCGGGTTC ACCTTCTCAG CCCTGCAGAT CCTGCCCTAC 1500 

ACACTGGCCT CCCTCTACCA CCGGGAGAAG CAGGTGTTCC TGCCCAAATA CCGAGGGGAC 1560 

ACTGGAGGTG CTAGCAGTGA GGACAGCCTG ATGACCAGCT TCCTGCCAGG CCCTAAGCCT 1620 

GGAGCTCCCT TCCCTAATGG ACACGTGGGT GCTGGAGGCA GTGGCCTGCT CCCACCTCCA 1680 

CCCGCGCTCT GCGGGGCCTC TGCCTGTGAT GTCTCCGTAC GTGTGGTGGT GGGTGAGCCC 1740 

ACCGAGGCCA GGGTGGTTCC GGGCCGGGGC ATCTGCCTGG ACCTCGCCAT CCTGGATAGT 1800 

GCCTTCCTGC TGTCCCAGGT GGCCCCATCC CTGTTTATGG GCTCCATTGT CCAGCTCAGC 1860 

CAGTCTGTCA CTGCCTATAT GGTGTCTGCC GCAGGCCTGG GTCTGGTCGC CATTTACTTT 1920 

GCTACACAGG TAGTATTTGA CAAGAGCGAC TTGGCCAAAT ACTCAGCGTA GAAAACTTCC 1980 

AGCACATTGG GGTGGAGGGC CTGCCTCACT GGGTCCCAGC TCCCCGCTCC TGTTAGCCCC 2040 

ATGGGGCTGC CGGGCTGGCC GCCAGTTTCT GTTGCTGCCA AAGTAATGTG GCTCTCTGCT 2100 

GCCACCCTGT GCTGCTGAGG TGCGTAGCTG CACAGCTGGG GGCTGGGGCG TCCCTCTCCT 2160 

CTCTCCCCAG TCTCTAGGGC TGCCTGACTG GAGGCCTTCC AAGGGGGTTT CAGTCTGGAC 2220 

TTATACAGGG AGGCCAGAAG GGCTCCATGC ACTGGAATGC GGGGACTCTG CAGGTGGATT 2280 

ACCCAGGCTC AGGGTTAACA GCTAGCCTCC TAGTTGAGAC ACACCTAGAG AAGGGTTTTT 2340 

GGGAGCTGAA TAAACTCAGT CACCTGGTTT CCCATCTCTA AGCCCCTTAA CCTGCAGCTT 2400 

CGTTTAATGT AGCTCTTGCA TGGGAGTTTC TAGGATGAAA CACTCCTCCA TGGGATTTGA 2460 

ACATATGAAA GTTATTTGTA GGGGAAGAGT CCTGAGGGGC AACACACAAG AACCAGGTCC 2520 

CCTCAGCCCC ACAGGCACTG GTCTTTTTTG CTNGANTCCA CCCCCCCCCT CTTTACCCTT 2580 
TT 



A184 Protein sequence: 
Gene name: ESTs 
Unigene number: Hs. 179809 
Probeset Accession #: N95796 
Protein Accession #: XP_050197 
Signal sequence: none 
Transmembrane domains: 21-43, 53-75, 90-112, 125-147, 160-182, 199-221, 323-345, 350-372, 379-401 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 60 

PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL FLIPRAGWLA GLLCPDPRPL 120 

ELALLILGVG LLDFCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 180 

IDWDTSALAP YLGTQEECLF GLLTLIFLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 240 

CCPCRARLAF RNLGALLPRL HQLCCRMPRT LRRLFVAELC SWMALMTFTL FYTDFVGEGL 300 

YQGVPRAEPG TEARRHYDEG VRMGSLGLFL QCAISLVFSL VMDRLVQRFG TRAVYLASVA 360 

AFPVAAGATC LSHSVAVVTA SAALTGFTFS ALQILPYTLA SLYHREKQVF LPKYRGDTGG 420 

ASSEDSLMTS FLPGPKPGAP FPNGHVGAGG SGLLPPPPAL CGASACDVSV RVVVGEPTEA 480 

RVVPGRGICL DLAILDSAFL LSQVAPSLFM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 540 
VVFDKSDLAK YSA 
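
The transmembrane domain entries in these records are comma-separated residue ranges, again 1-based and inclusive. As an illustration only, the sketch below parses such a range list and pulls the corresponding residues out of a protein row; the function names (`parse_ranges`, `segment`) are hypothetical, and only the first row of the A184 record (residues 1-60) is used, so only the first listed range is extracted.

```python
def parse_ranges(text):
    """Parse '21-43, 53-75' into [(21, 43), (53, 75)] (1-based, inclusive)."""
    ranges = []
    for part in text.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start, end = part.split("-")
        ranges.append((int(start), int(end)))
    return ranges


def segment(protein, start, end):
    """Return residues start..end (1-based, inclusive)."""
    return protein[start - 1:end]


# First row of the A184 protein record; digits and spaces are dropped.
first_row = "MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 60"
protein = "".join(ch for ch in first_row if ch.isalpha())

for start, end in parse_ranges("21-43"):
    print(start, end, segment(protein, start, end))  # 21 43 VNLLTFGLEVCLAAGITYVPPLL
```

Each extracted segment should have length `end - start + 1` (23 residues here), which gives a quick sanity check on transcribed ranges.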



A185 DNA SEQUENCE 

Gene name: ESTs 

Unigene number: Hs. 11260 

Probeset Accession #: R73640 

Nucleic Acid Accession #: AK002126 

Coding sequence: 1-1593 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGTTCGCC GGGGGCTGCT TGCGTGGATT TCCCGGGTGG TGGTTTTGCT GGTGCTCCTC 60 

TGCTGTGCTA TCTCTGTCCT GTACATGTTG GCCTGCACCC CAAAAGGTGA CGAGGAGCAG 120 

CTGGCACTGC CCAGGGCCAA CAGCCCCACG GGGAAGGAGG GGTACCAGGC CGTCCTTCAG 180 

GAGTGGGAGG AGCAGCACCG CAACTACGTG AGCAGCCTGA AGCGGCAGAT CGCACAGCTC 240 

AAGGAGGAGC TGCAGGAGAG GAGTGAGCAG CTCAGGAATG GGCAGTACCA AGCCAGCGAT 300 

GCTGCTGGCC TGGGTCTGGA CAGGAGCCCC CCAGAGAAAA CCCAGGCCGA CCTCCTGGCC 360 

TTCCTGCACT CGCAGGTGGA CAAGGCAGAG GTGAATGCTG GCGTCAAGCT GGCCACAGAG 420 

TATGCAGCAG TGCCTTTCGA TAGCTTTACT CTACAGAAGG TGTACCAGCT GGAGACTGGC 480 

CTTACCCGCC ACCCCGAGGA GAAGCCTGTG AGGAAGGACA AGCGGGATGA GTTGGTGGAA 540 

GCCATTGAAT CAGCCTTGGA GACCCTGAAC AATCCTGCAG AGAACAGCCC CAATCACCGT 600 






CCTTACACGG CCTCTGATTT CATAGAAGGG ATCTACCGAA CAGAAAGGGA CAAAGGGACA 660 

TTGTATGAGC TCACCTTCAA AGGGGACCAC AAACACGAAT TCAAACGGCT CATCTTATTT 720 

CGACCATTCG GCCCCATCAT GAAAGTGAAA AATGAAAAGC TCAACATGGC CAACACGCTT 780 

ATCAATGTTA TCGTGCCTCT AGCAAAAAGG GTGGACAAGT TCCGGCAGTT CATGCAGAAT 840 

TTCAGGGAGA TGTGCATTGA GCAGGATGGG AGAGTCCATC TCACTGTTGT TTACTTTGGG 900 

AAAGAAGAAA TAAATGAAGT CAAAGGAATA CTTGAAAACA CTTCCAAAGC TGCCAACTTC 960 

AGGAACTTTA CCTTCATCCA GCTGAATGGA GAATTTTCTC GGGGAAAGGG ACTTGATGTT 1020 

GGAGCCCGCT TCTGGAAGGG AAGCAACGTC CTTCTCTTTT TCTGTGATGT GGACATCTAC 1080 

TTCACATCTG AATTCCTCAA TACGTGTAGG CTGAATACAC AGCCAGGGAA GAAGGTATTT 1140 

TATCCAGTTC TTTTCAGTCA GTACAATCCT GGCATAATAT ACGGCCACCA TGATGCAGTC 1200 

CCTCCCTTGG AACAGCAGCT GGTCATAAAG AAGGAAACTG GATTTTGGAG AGACTTTGGA 1260 

TTTGGGATGA CGTGTCAGTA TCGGTCAGAC TTCATCAATA TAGGTGGGTT TGATCTGGAC 1320 

ATCAAAGGCT GGGGCGGAGA GGATGTGCAC CTTTATCGCA AGTATCTCCA CAGCAACCTC 1380 

ATAGTGGTAC GGACGCCTGT GCGAGGACTC TTCCACCTCT GGCATGAGAA GCGCTGCATG 1440 

GACGAGCTGA CCCCOGAGCA GTACAAGATG TGCATGCAGT CCAAGGCCAT GAACGAGGCA 1500 

TCCCACGGCC AGCTGGGCAT GCTGGTGTTC AGGCACGAGA TAGAGGCTCA CCTTCGCAAA 1560 
CAGAAACAGA AGACAAGTAG CAAAAAAACA TGA 



A186 Protein sequence: 
Gene name: ESTs 
Unigene number: Hs. 11260 
Probeset Accession #: R73640 
Protein Accession #: NP_060841 
Signal sequence: 1-26 
Transmembrane domains: none found 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MVRRGLLAWI SRVVVLLVLL CCAISVLYML ACTPKGDEEQ LALPRANSPT GKEGYQAVLQ 60 
EWEEQHRNYV SSLKRQIAQL KEELQERSEQ LRNGQYQASD AAGLGLDRSP PEKTQADLLA 120 
FLHSQVDKAE VNAGVKLATE YAAVPFDSFT LQKVYQLETG LTRHPEEKPV RKDKRDELVE 180 
AIESALETLN NPAENSPNHR PYTASDFIEG IYRTERDKGT LYELTFKGDH KHEFKRLILF 240 
RPFGPIMKVK NEKLKMANTL INVIVPLAKR VDKFRQFMQN FREMCIEQDG RVHLTVVYFG 300 
KEEINEVKGI LENTSKAANF RNFTFIQLNG EFSRGKGLDV GARFWKGSNV LLFFCDVDIY 360 
FTSEFLNTCR LNTQPGKKVF YPVLFSQYNP GIIYGHHDAV PPLEQQLVIK KETGFWRDFG 420 
FGMTCQYRSD FINIGGFDLD IKGWGGEDVH LYRKYLHSNL IVVRTPVRGL FHLWHEKRCM 480 
DELTPEQYKM CMQSKAMNEA SHGQLGMLVF RHEIEAHLRK QKQKTSSKKT 



A187 DNA SEQUENCE 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number: Hs. 106778 

Probeset Accession #: N51919 
Nucleic Acid Accession #: AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



I 



ATGATTCCTG 
ATTCTCCAAG 
TTTCATGGCT 
TCTCAGTTTA 
ATGCATCAGT 
GCCTTTGTTC 
CCAGAATGCC 
CCAGGTGATA 
GAGGCTGTGG 
AAGGTGACAG 
GCCTTTATGG 
GAAAATTCTG 
CCTCTGCAGA 
ATAGGAATCA 
ATTAGTGTAA 
ACGCTAGCTC 
ATTGTTGAAA 
AAGAATGAAA 
GGAGTTGGCT 
TATAACCCAG 
AGAAACAATA 
ATGGGTCTTG 
GAGCAAAAGT 
TGTTTTATGA 
GGGCAGACCT 
ATGGGCTCAG 
ACATTTCTTG 
ACAACACTCA 
GCAGTTGCAA 
GAAGAAATAG 
TTTTACAGAG 
TCAGTTGTAG 
ATTGGAGTTG 
CTAGTGGATG 



11 
I 

TATTGACATC 
CTGATCTTCA 
GGAATGAGTT 
AAAATCCCCT 
TTGATGATGC 
AGGAATATCG 
ATTGTGTGCG 
CAGTTTGCCT 
ATCTTTCCAT 
CTCCTCAGCC 
GAACACTGGT 
AATTTGGGGA 
AGAGCATGGA 
TCATGTTGGT 



TTGGTGTTAT 
CTCTGGGCTG 
TGACTGTTAC 
ATAATCAATT 
CTGTTAGCAG 
CTCTAATGGG 
ATGGACTTCA 
GGATGGCTGT 
AAGGTGCTTA 
TGACACTTAC 
CGGGACTCAG 
GCTTGGTGGG 
TTGCCTCAGG 
TCGCCAGTCG 
ATGCAATGGA 
CTAGCCCAAG 
CCATGACAGG 
CGATGGGCCA 
ATGATTTTCA 



21 
I 

AAAAAAAGCA 
GAATGGTCTA 
TGATATTAGT 
TATTATGCTG 
CGTCAGTATC 
TTCAGAAAAA 
TGAAGGAAAA 
TTCTGTTGGG 
TGATGAGTCC 
AGCTGCAACT 
CAGATGTGGC 
GGTTTTTAAA 
CCTCTTAGGA 
TGGCTGGTTA 
AGCAGCAATT 
GAGAATGGTG 
CTGTAATGTG 
TCACATATTT 
TGGGGAAGTG 
AATTGTTGAG 
GAAGCCAACA 
ACAAGACTAC 
TAAGTGTGTA 
CGAACAAGTA 
TCAGCAGCAG 
AGTTCTTGCT 
AATCATTGAT 
AGTATCAATA 
TCTGGGATTG 
TGTTCAGCAG 
GCACAAGATG 
AGATGGAGTA 
GACTGGTACA 
AACCATAATG 



31 
I 

AGTGAATTAC 
AACAAATGTG 
GAAGATGAGC 
CTTCTGGCTT 
ACTGTGGCAA 
TCTCTTGAAG 
TTGGAGCATA 
GATAGAGTTC 
AGCTTGACAG 
AATGGAGATC 
AAAGCAAAGG 
ATGATGCAAG 
AAACAACTTT 
CTGGGAAAAG 
CCTGAAGGTC 
AAGAAAAGGG 
ATTTGTTCAG 
ACTTCAGATG 
ATTGTTGATG 
GCGGGCTGTG 
GAAGGGGCCT 
ATCAGAAAAG 
CACCGAACAC 
ATTAAGTACT 
AGAGATGTGT 
TTGGCTTCTG 
CCACCTAGAA 
AAAATGATTA 
TATTCCAAAA 
CTTTCACAAA 
AAAATTATTA 
AATGATGCAG 
GATGTTTGCA 
TCTGCAATCG 



41 
I 

CAGTCAGTGA 
AAGTTAGTCA 
CACTGTGGAA 
CTGCAGTCAT 
TACTTATCGT 
AATTGAGTAA 
CACTTGCCCG 
CTGCTGACTT 
GTGAGACAAC 
TTGCATCGAG 
GTGTTGTCAT 
CAGAAGAGGC 
CCTTTTACTC 
ATATCCTGGA 
TCCCCATTGT 
CCATTGTGAA 
ATAAAACTGG 
GTCTGCATGC 
GTGATGTTGT 
TGTGCAATGA 
TAATTGCTCT 
CTGAATACCC 
AGCAGGACAG 
GTACTACATA 
ACCAACAAGA 
GTCCTGAACT 
CTGGTGTGAA 
CTGGAGATTC 
CTTCCCAGTC 
TAGTACCAAA 
AGTCGCTACA 
TTGCTCTGAA 
AAGAGGCAGC 
AAGAGGGTAA 



51 
I 

AGTTGCAAGC 
TAGGCGAGCC 
GAAGTATATT 
CAGTGTTTTA 
TGTTACAGTT 
ACTTGTGCCA 
AGACTTGGTT 
ACGCTTGTTT 
GCCTTGTTCT 
AAGTAACATT 
TGGAACAGGA 
ACCAAAAACC 
CTTTGGTATA 
AATGTTTACT 
GGTCACAGTG 
AAAGCTGCCT 
AACACTGACG 
TGAGGTTACT 
TCATGGATTC 
TGCTGTAATT 
TGCAATGAAG 
TTTTAGCTCT 
ACCAGAGATT 
CCAGAGCAAA 
GAAGGCACGC 
GGGACAGCTG 
AGAAGCTGTT 
ACAGGAGACT 
AGTCTCAGGA 
GGTTGCAGTA 
GAAGAACGGT 
GGCTGCAGAC 
AGACATGATC 
AGGGATTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 






AATAACATTA 
ATCTCATTGG 
ATCAATATTA 
GATGTCATTC 
CTTAAAATAC 
GAGCTACGAG 
TTTTTTGACA 
GGACTCTGCA 
CTAGTTATTT 
GATCTGTTGT 
AAGGTTGAAA 
CTTGAAGTAT 



AAAATTTCGT 
CTACATTAAT 
TTATGGATGG 
GTAAACCTCC 
TTGTTTCATC 
ACAATGTGAT 
TGTTCAATGC 
GTAATAGAAT 
ACTTTCCTCC 
TTCTTTTGGG 
GGAGCAGGGA 
GA 



TAGATTCCAG 
GAACTTTCCT 
ACCCCCAGCT 
TCGCAACTGG 
AATAATCATT 
TACACCTCGA 
ACTAAGTTCC 
GTTTTGCTAT 
GCTTCAGAAG 
TCTCACCTCA 
AAAGATCCAG 



CTGAGCACGA 
AATCCTCTCA 
CAGAGCCTTG 
AAAGACAGCA 
GTTTGTGGGA 
GACACAACAA 
AGATCCCAGA 
GCAGTTCTTG 
GTTTTTCAGA 
TCAGTGTGCA 
AAGCATGTTA 



GTATAGCAGC 
ATGCCATGCA 
GAGTAGAACC 
TTTTGACTAA 
CTTTGTTTGT 
TGACCTTCAC 
CCAAGTCTGT 
GATCCATCAT 
CTGAGAGCCT 
TAGTGGCAGA 
GTTCGACATC 



ATTAACTTTA 
GATTTTGTGG 
AGTGGATAAA 
AAACTTGATA 
CTTCTGGCGT 
ATGCTTTGTG 
GTTTGAGATT 
GGGACAATTA 
AAGCATACTG 
AATTATAAAG 
ATCATCTTTT 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



A188 Protein sequence: 
Gene name: ATPase, Ca++ transporting, type 2C, member 1 
Unigene number: Hs. 106778 
Probeset Accession #: N51919 
Protein Accession #: AAP27813 
Signal sequence: none found 
Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878 
Pfam domains: Cation ATPase N [6-83], E1-E2 ATPase [89-324], Hydrolase [328-649], Cation ATPase C [744-889] 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MIPVLTSKKA SELPVSEVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI 60 
SQFKNPLIML LLASAVISVL MHQFDDAVSI TVAILIVVTV AFVQEYRSEK SLEELSKLVP 120 
PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDES SLTGETTPCS 180 
KVTAPQPAAT NGDLASRSNI AFMGTLVRCG KAKGVVIGTG ENSEFGEVFK MMQAEEAPKT 240 
PLQKSMDLLG KQLSPYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIVVTV 300 
TLALGVMRMV KKRAIVKKLP IVETLGCCNV ICSDKTGTLT KNEMTVTHIF TSDGLHAEVT 360 
GVGYNQFGEV IVDGDVVHGF YNPAVSRIVE AGCVCNDAVI RNNTLMGKPT EGALIALAMK 420 
MGLDGLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV IKYCTTYQSK 480 
GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVGIID PPRTGVKEAV 540 
TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVPKVAV 600 
FYRASPRHKM KIIKSLQKNG SVVAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660 
LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMNFP NPLNAMQILW 720 
INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKNLI LKILVSSIII VCGTLFVFWR 780 
ELRDNVITPR DTTMTFTCPV FFDMFKALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840 
LVIYFPPLQK VFQTESLSIL DLLFLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSSSF 900 
LEV 



A189 DNA SEQUENCE 
Gene name: ESTs 
Unigene number: Hs. 293185 
Probeset Accession #: N62096 
Nucleic Acid Accession #: N62096 
Coding sequence: 1-1284 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 



ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 
AAGCAAGCTG GGTTTCCTTT GGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 
TTTTCCCTTG TTTTATTGAT AAAAGGAGGG GCCCTCTCTG GAACAGATAC CTACCAGTCT 
TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAGTTTTTG 
TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 
TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 
GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT 
GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 
AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 
CCCAATGCCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT TTATTTGCCA CCATAACTCC 
TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT 
ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA 
TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 
ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 
GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 
ATTGTTGTAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 
CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 
CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATG 
TCTTGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 
ACAAATACTC AAGACTGCAC CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 
TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 
ATTAGTATCT TTCAACTCGA GTAA 

A190 Protein sequence: 
Gene name: ESTs 
Unigene number: Hs. 293185 
Probeset Accession #: N62096 
Protein Accession #: none found 
Signal sequence: none found 
Transmembrane domains: 28-50, 66-88, 112-134, 142-164, 217-239, 260-282, 298-320, 327-348, 359-381 
Cellular Localization: plasma membrane 



1 11 21 31 41 51

| | | | | |

MGYQRQEPVI PPQRGLPYSM KQAGFPLGIL LLFWVSYVTD FSLVLLIKGG ALSGTDTYQS 60

LVNKTFGFPG YLLLSVLQFL YPFIAMISYN IIAGDTLSKV FQRIPGVDPE NVFIGRHFII 120

GLSTVTFTLP LSLYRNIAKL GKVSLISTGL TTLILGIVMA RAISLGPHIP KTEDAWVFAK 180

PNAIQAVGVM SFAFICHHNS FLVYSSLEEP TVAKWSRLIH MSIVISVFIC IFFATCGYLT 240

FTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH 300

IVVTVMVITV ATLVSLLIDC LGIVLELNGV LCATPLIFII PSACYLKLSE EPRTHSDKIM 360

SCVMLPIGAV VMVFGFVMAI TNTQDCTHGQ EMFYCFPDNF SLTNTSESHV QQTTQLSTLN 420
ISIFQLE



A191 DNA SEQUENCE

Gene name: ESTs

Unigene number: Hs.293185

Probeset Accession #: N62096

Nucleic Acid Accession #: N62096

Coding sequence: 1-1203 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGT TTTCCCTTGT TTTATTGATA 60 

AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 

TTTCCAGGGT ATCTGCTCCT CTCTGTTCTT CAGTTTTTGT ATCCTTTTAT AGCAATGATA 180 

AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 240 

GATCCTGAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 300 

ACTCTGCCTT TATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 360 

ACAGGTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 420 

CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 480 

GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540 

GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600 

TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACCCAAGGG 660

GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720 

GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA GGTAATTGCC 780 

AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900 

AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080 

CATGGGCAGG AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 

TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200 
TAA 
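
The relationship between a DNA record and the protein record that follows it can be checked mechanically. The sketch below is illustrative only and is not part of the listing: the function names `clean_sequence`, `extract_cds` and `translate` are hypothetical, the standard genetic code is assumed, and the "Coding sequence: start-end" field is assumed to be 1-based and inclusive. Any OCR-damaged character in a row (for example a letter O in place of C) is silently dropped by `clean_sequence` and would shift the reading frame, so rows should be proofread first.

```python
import re
from itertools import product

# Standard genetic code, enumerated in T, C, A, G order at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(listing_rows):
    """Drop spaces, line breaks and position numbers; keep only A, C, G, T."""
    return re.sub(r"[^ACGT]", "", listing_rows.upper())


def extract_cds(sequence, start, end):
    """Slice a 1-based, inclusive 'Coding sequence: start-end' range."""
    return sequence[start - 1:end]


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the A191 record; the trailing "30" mimics a position number.
rows = "ATGGGCTACC AGAGGCAGGA GCCTGTCATC 30"
print(translate(clean_sequence(rows)))  # MGYQRQEPVI
```

For a complete record, a coding sequence of 1-1203 should give 401 codons, that is, 400 residues plus the stop codon.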



A192 Protein sequence:
Gene name: ESTs
Unigene number: Hs.293185
Probeset Accession #: N62096
Protein Accession #: none found
Signal sequence: 1-26
Transmembrane domains: 45-61, 92-108, 117-132, 191-207, 274-290, 297-313, 335-351
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MGYQRQEPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLLSVL QFLYPFIAMI 60

SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 120

TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAFICH HNSFLVYSSL 180

EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240

GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIVVTVMV ITVATLVSLL IDCLGIVLEL 300

NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAVVMVFGFV MAITNTQDCT 360
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE



A193 DNA SEQUENCE
Gene name: ESTs
Unigene number: Hs.293185
Probeset Accession #: N62096
Nucleic Acid Accession #: N62096
Coding sequence: 1-1140 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 






GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480

GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540 

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 



A194 Protein sequence:
Gene name: ESTs
Unigene number: Hs.293185
Probeset Accession #: N62096
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 24-40, 70-86, 95-111, 171-186, 253-269, 276-292, 314-330
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD 60

PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH 120

IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKWSRL IHMSIVISVF 180

ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240

VFFGGNLSSV FHIVVTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKL 300

SEEPRTHSDK IMSCVMLPIG AVVMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES 360
HVQQTTQLST LNISIFQLE



A195 DNA SEQUENCE:
Gene name: ESTs
Unigene number: Hs.293185
Probeset Accession #: N62096
Nucleic Acid Accession #: N62096
Coding sequence: 1-1389 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TGACAGAGAA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120

GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180

GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACAGA CTTTTCCCTT 240 

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300

AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 360 

ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 420 

ATCCCAGGAG TTGATCCTGA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540 

TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATGGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720

TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 780 

GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 840 

TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 900 

AGATTTTGTT ATGGTGTCAC TGTCATTTTG ACATACCCTA TGGAATGCTT TGTGACAAGA 960 

GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020 

ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080 

GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140 

TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200 

ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260 

CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 1320 

AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380 
TTTCAATGA 



A196 Protein sequence:
Gene name: ESTs
Unigene number: Hs.293185
Probeset Accession #: N62096
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 37-53, 66-82, 109-125, 155-172, 180-196, 255-271, 338-354, 361-377, 399-415
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MGYQRQEPVI PPQRDLDDRE TLVSEHEYKE KTCQSAALFN VVNSIIGSGI IGLPYSMKQA 60

GFPLGILLLF WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFPGYLL LSVLQFLYPF 120

IAMISYNIIA GDTLSKVFQR IPGVDPENVF IGRHFIIGLS TVTFTLPLSL YRNIAKLGKV 180

SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA FICHHNSFLV 240

YSSLEEPTVA KWSRLIHMSI VISVFICIFF ATCGYLTFTG FTQGDLFENY CRNDDLVTFG 300

RFCYGVTVIL TYPMECFVTR EVIANVFFGG NLSSVFHIVV TVMVITVATL VSLLIDCLGI 360

VLELNGVLCA TPLIFIIPSA CYLKLSEEPR THSDKIMSCV MLPIGAVVMV FGFVMAITNT 420
QDCTHGQEMF YCFPDNFSLT NTSESHVQQT TQLSTLNISI FQ



A197 DNA SEQUENCE
Gene name:
Unigene number: Hs.31608
Probeset Accession #: H18836
Nucleic Acid Accession #: NM_017636
Coding sequence: 1-3501 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGGAGGATG CCTTCGGGGC AGCCGTGGTG ACCGTGTGGG ACAGCGATGC ACACACCACG 60
GAGAAGCCCA CCGATGCCTA CGGAGAGCTG GACTTCACGG GGGCCGGCCG CAAGCACAGC 120
AATTTCCTCC GGCTCTCTGA CCGAACGGAT CCAGCTGCAG TTTATAGTCT GGTCACACGC 180
ACATGGGGCT TCCGTGCCCC GAACCTGGTG GTGTCAGTGC TGGGGGGATC GGGGGGCCCC 240
GTCCTCCAGA CCTGGCTGCA GGACCTGCTG CGTCGTGGGC TGGTGCGGGC TGCCCAGAGC 300
ACAGGAGCCT GGATTGTCAC TGGGGGTCTG CACACGGGCA TCGGCCGGCA TGTTGGTGTG 360
GCTGTACGGG ACCATCAGAT GGCCAGCACT GGGGGCACCA AGGTGGTGGC CATGGGTGTG 420
GCCCCCTGGG GTGTGGTCCG GAATAGAGAC ACCCTCATCA ACCCCAAGGG CTCGTTCCCT 480
GCGAGGTACC GGTGGCGCGG TGACCCGGAG GACGGGGTCC AGTTTCCCCT GGACTACAAC 540
TACTCGGCCT TCTTCCTGGT GGACGACGGC ACACACGGCT GCCTGGGGGG CGAGAACCGC 600
TTCCGCTTGC GCCTGGAGTC CTACATCTCA CAGCAGAAGA CGGGCGTGGG AGGGACTGGA 660
ATTGACATCC CTGTCCTGCT CCTCCTGATT GATGGTGATG AGAAGATGTT GACGCGAATA 720
GAGAACGCCA CCCAGGCTCA GCTCCCATGT CTCCTCGTGG CTGGCTCAGG GGGAGCTGCG 780
GACTGCCTGG CGGAGACCCT GGAAGACACT CTGGCCCCAG GGAGTGGGGG AGCCAGGCAA 840
GGCGAAGCCC GAGATCGAAT CAGGCGTTTC TTTCCCAAAG GGGACCTTGA GGTCCTGCAG 900
GCCCAGGTGG AGAGGATTAT GACCCGGAAG GAGCTCCTGA CAGTCTATTC TTCTGAGGAT 960
GGGTCTGAGG AATTCGAGAC CATAGTTTTG AAGGCCCTTG TGAAGGCCTG TGGGAGCTCG 1020
GAGGCCTCAG CCTACCTGGA TGAGCTGCGT TTGGCTGTGG CTTGGAACCG CGTGGACATT 1080
GCCCAGAGTG AACTCTTTCG GGGGGACATC CAATGGCGGT CCTTCCATCT CGAAGCTTCC 1140
CTCATGGACG CCCTGCTGAA TGACCGGCCT GAGTTCGTGC GCTTGCTCAT TTCCCACGGC 1200
CTCAGCCTGG GCCACTTCCT GACCCCGATG CGCCTGGCCC AACTCTACAG CGCGGCGCCC 1260
TCCAACTCGC TCATCCGCAA CCTTTTGGAC CAGGCGTCCC ACAGCGCAGG CACCAAAGCC 1320
CCAGCCCTAA AAGGGGGAGC TGCGGAGCTC CGGCCCCCTG ACGTGGGGCA TGTGCTGAGG 1380
ATGCTGCTGG GGAAGATGTG CGCGCCGAGG TACCCCTCCG GGGGCGCCTG GGACCCTCAC 1440
CCAGGCCAGG GCTTCGGGGA GAGCATGTAT CTGCTCTCGG ACAAGGCCAC CTCGCCGCTC 1500
TCGCTGGATG CTGGCCTCGG GCAGGCCCCC TGGAGCGACC TGCTTCTTTG GGCACTGTTG 1560
CTGAACAGGG CACAGATGGC CATGTACTTC TGGGAGATGG GTTCCAATGC AGTTTCCTCA 1620
GCTCTTGGGG CCTGTTTGCT GCTCCGGGTG ATGGCACGCC TGGAGCCTGA CGCTGAGGAG 1680
GCAGCACGGA GGAAAGACCT GGCGTTCAAG TTTGAGGGGA TGGGCGTTGA CCTCTTTGGC 1740
GAGTGCTATC GCAGCAGTGA GGTGAGGGCT GCCCGCCTCC TCCTCCGTCG CTGCCCGCTC 1800
TGGGGGGATG CCACTTGCCT CCAGCTGGCC ATGCAAGCTG ACGCCCGTGC CTTCTTTGCC 1860
CAGGATGGGG TACAGTCTCT GCTGACACAG AAGTGGTGGG GAGATATGGC CAGCACTACA 1920
CCCATCTGGG CCCTGGTTCT CGCCTTCTTT TGCCCTCCAC TCATCTACAC CCGCCTCATC 1980
ACCTTCAGGA AATCAGAAGA GGAGCCCACA CGGGAGGAGC TAGAGTTTGA CATGGATAGT 2040
GTCATTAATG GGGAAGGGCC TGTCGGGACG GCGGACCCAG CCGAGAAGAC GCCGCTGGGG 2100
GTCCCGCGCC AGTCGGGCCG TCCGGGTTGC TGCGGGGGCC GCTGCGGGGG GCGCCGGTGC 2160
CTACGCCGCT GGTTCCACTT CTGGGGCGCG CCGGTGACCA TCTTCATGGG CAACGTGGTC 2220
AGCTACCTGC TGTTCCTGCT GCTTTTCTCG CGGGTGCTGC TCGTGGATTT CCAGCCGGCG 2280
CCGCCCGGCT CCCTGGAGCT GCTGCTCTAT TTCTGGGCTT TCACGCTGCT GTGCGAGGAA 2340
CTGCGCCAGG GCCTGAGCGG AGGCGGGGGC AGCCTCGCCA GCGGGGGCCC CGGGCCTGGC 2400
CATGCCTCAC TGAGCCAGCG CCTGCGCCTC TACCTCGCCG ACAGCTGGAA CCAGTGCGAC 2460
CTAGTGGCTC TCACCTGCTT CCTCCTGGGC GTGGGCTGCC GGCTGACCCC GGGTTTGTAC 2520
CACCTGGGCC GCACTGTCCT CTGCATCGAC TTCATGGTTT TCACGGTGCG GCTGCTTCAC 2580
ATCTTCACGG TCAACAAACA GCTGGGGCCC AAGATCGTCA TCGTGAGCAA GATGATGAAG 2640
GACGTGTTCT TCTTCCTCTT CTTCCTCGGC GTGTGGCTGG TAGCCTATGG CGTGGCCACG 2700
GAGGGGCTCC TGAGGCCACG GGACAGTGAC TTCCCAAGTA TCCTGCGCCG CGTCTTCTAC 2760
CGTCCCTACC TGCAGATCTT CGGGCAGATT CCCCAGGAGG ACATGGACGT GGCCCTCATG 2820
GAGCACAGCA ACTGCTCGTC GGAGCCCGGC TTCTGGGCAC ACCCTCCTGG GGCCCAGGCG 2880
GGCACCTGCG TCTCCCAGTA TGCCAACTGG CTGGTGGTGC TGCTCCTCGT CATCTTCCTG 2940
CTCGTGGCCA ACATCCTGCT GGTCAACTTG CTCATTGCCA TGTTCAGTTA CACATTCGGC 3000
AAAGTACAGG GCAACAGCGA TCTCTACTGG AAGGCGCAGC GTTACCGCCT CATCCGGGAA 3060
TTCCACTCTC GGCCCGCGCT GGCCCCGCCC TTTATCGTCA TCTCCCACTT GCGCCTCCTG 3120
CTCAGGCAAT TGTGCAGGCG ACCCCGGAGC CCCCAGCCGT CCTCCCCGGC CCTCGAGCAT 3180
TTCCGGGTTT ACCTTTCTAA GGAAGCCGAG CGGAAGCTGC TAACGTGGGA ATCGGTGCAT 3240
AAGGAGAACT TTCTGCTGGC ACGCGCTAGG GACAAGCGGG AGAGCGACTC CGAGCGTCTG 3300
AAGCGCACGT CCCAGAAGGT GGACTTGGCA CTGAAACAGC TGGGACACAT CCGCGAGTAC 3360
GAACAGCGCC TGAAAGTGCT GGAGCGGGAG GTCCAGCAGT GTAGCCGCGT CCTGGGGTGG 3420
GTGGCCGAGG CCCTGAGCCG CTCTGCCTTG CTGCCCCCAG GTGGGCCGCC ACCCCCTGAC 3480
CTGCCTGGGT CCAAAGACTG A



A198 Protein sequence:
Gene name: ESTs
Unigene number: Hs.31608
Probeset Accession #: H18836
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 214-230, 537-556, 642-662, 730-752, 760-782, 815-837, 842-864, 877-899, 973
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MEDAFGAAVV TVWDSDAHTT EKPTDAYGEL DFTGAGRKHS NFLRLSDRTD PAAVYSLVTR 60

TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL HTGIGRHVGV 120

AVRDHQMAST GGTKVVAMGV APWGVVRNRD TLINPKGSFP ARYRWRGDPE DGVQFPLDYN 180

YSAFFLVDDG THGCLGGENR FRLRLESYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 240

ENATQAQLPC LLVAGSGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 300

AQVERIMTRK ELLTVYSSED GSEEFETIVL KALVKACGSS EASAYLDELR LAVAWNRVDI 360

AQSELFRGDI QWRSFHLEAS LMDALLNDRP EFVRLLISHG LSLGHFLTPM RLAQLYSAAP 420

SNSLIRNLLD QASHSAGTKA PALKGGAAEL RPPDVGHVLR MLLGKMCAPR YPSGGAWDPH 480

PGQGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLWALL LNRAQMAMYF WEMGSNAVSS 540

ALGACLLLRV MARLEPDAEE AARRKDLAFK FEGMGVDLFG ECYRSSEVRA ARLLLRRCPL 600

WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF CPPLIYTRLI 660

TFRKSEEEPT REELEFDMDS VINGEGPVGT ADPAEKTPLG VPRQSGRPGC CGGRCGGRRC 720

LRRWFHFWGA PVTIFMGNVV SYLLFLLLFS RVLLVDFQPA PPGSLELLLY FWAFTLLCEE 780

LRQGLSGGGG SLASGGPGPG HASLSQRLRL YLADSWNQCD LVALTCFLLG VGCRLTPGLY 840

HLGRTVLCID FMVFTVRLLH IFTVNKQLGP KIVIVSKMMK DVFFFLFFLG VWLVAYGVAT 900

EGLLRPRDSD FPSILRRVFY RPYLQIFGQI PQEDMDVALM EHSNCSSEPG FWAHPPGAQA 960

GTCVSQYANW LVVLLLVIFL LVANILLVNL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020

FHSRPALAPP FIVISHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTWESVH 1080

KENFLLARAR DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSRVLGW 1140
VAEALSRSAL LPPGGPPPPD LPGSKD



A199 DNA SEQUENCE 
Gene name: ESTs 
Unigene number: Hs.40808
Probeset Accession #: AA054237
Nucleic Acid Accession #: AA054237
Coding sequence: 1-894 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

ATGGAGCCGC GGGCGCTCGT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 60 

CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCGCG GCGCCACAAG 120 

GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 180 

CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCCCGCTGG GGCGCCGGCT GCTCCCGGGC 240 

GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTCGCTCC TGGGGCTCGG CGGGCTGGAC 300 

GCCGAGTGCG GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 360 

CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 420

GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 480 

AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 540 

CTCGGCATGG CCGTAGCCGT CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC 600 

TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT 660 

TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 720 

AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC 780 

GCCTGGTGCA GTTTAGGCTT TATTGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT 840 
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 



A200 Protein sequence:
Gene name: ESTs
Unigene number: Hs.40808
Probeset Accession #: AA054237
Protein Accession #: none found
Signal sequence: 1-18
Transmembrane domains: 179-201, 209-231, 257-279
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MEPRALVTAL SLGLSLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM 60
PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF 120
LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDEWHL LHLRRITAGF 180
LGMAVAVLLC GCIVATVSFF WEESLTQHVA GLLFLMTGIF CTISLCTYAA SISYDLNRLP 240
KLIYSLPADV EHGYSWSIFC AWCSLGFIVA AGGLCIAYPF ISRTKIAQLK SGRDSTV
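
The "Signal sequence" and "Transmembrane domains" fields give residue ranges that index into the protein sequence of the same record. As a hedged illustration (the helper names `parse_ranges` and `segments` are hypothetical, and the ranges are assumed to be 1-based and inclusive), the following sketch turns such a field into residue segments:

```python
def parse_ranges(field):
    """Parse a field such as '179-201, 209-231' into [(179, 201), (209, 231)]."""
    ranges = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start, end = part.split("-")
        ranges.append((int(start), int(end)))
    return ranges


def segments(protein, ranges):
    """Return the residues covered by each 1-based, inclusive range."""
    return [protein[start - 1:end] for start, end in ranges]


# First 30 residues of the A200 protein, with the spacing removed.
protein_start = "MEPRALVTALSLGLSLCSLGLLVTAIFTDH"
print(segments(protein_start, parse_ranges("1-18")))  # ['MEPRALVTALSLGLSLCS']
```

Applied to a full-length sequence, the same call with the transmembrane field yields one segment per predicted domain.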



A201 DNA SEQUENCE 

Gene name: CDA14 

Unigene number: Hs.26813 

Probeset Accession #: N32912 

Nucleic Acid Accession #: NM_016570 

Coding sequence: 1-1134 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGAGGCGAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 60 
AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 120 






TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480 

GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540 

GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 

CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 

GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



A202 Protein sequence: 

Gene name: CDA14 

Unigene number: Hs.26813 

Probeset Accession #: N32912 

Protein Accession #: NP_057654

Signal sequence: none found 

Transmembrane domains: none found 

Cellular Localization: nuclear 



1 11 21 31 41 51

| | | | | |

MRRLNRKKTL SLVKELDAFP KVPESYVETS ASGGTVSLIA FTTMALLTIM EFSVYQDTWM 60
KYEYEVDKDF SSKLRINIDI TVAMKCQYVG ADVLDLAETM VASADGLVYE PTVFDLSPQQ 120
KEWQRMLQLI QSRLQEEHSL QDVIFKSAFK STSTALPPRE DDSSQSPNAC RIHGHLYVNK 180
VAGNFHITVG KAIPHPRGHA HLAALVNHES YNFSHRIDHL SFGELVPAII NPLDGTEKIA 240
IDHNQMFQYF ITVVPTKLHT YKISADTHQF SVTERERIIN HAAGSHGVSG IFMKYDLSSL 300
MVTVTEEHMP FWQFFVRLCG IVGGIFSTTG MLHGIGKFIV EIICCRFRLG SYKPVNSVPF 360
EDGHTDNHLP LLENNTH



A203 DNA SEQUENCE

Gene name: dipeptidylpeptidase IV (CD26, adenosine deaminase complexing protein 2)

Unigene number: Hs.44926

Probeset Accession #: S79876

Nucleic Acid Accession #: NM_001935.1

Coding sequence: 76-2301 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

CGCGCGTCTC CGCCGCCCGC GTGACTTCTG CCTGCGCTCC TTCTCTGAAC GCTCACTTCC 60 

GAGGAGACGC CGACGATGAA GACACCGTGG AAGATTCTTC TGGGACTGCT GGGTGCTGCT 120 

GCGCTTGTCA CCATCATCAC CGTGCCCGTG GTTCTGCTGA ACAAAGGCAC AGATGATGCT 180 

ACAGCTGACA GTCGCAAAAC TTACACTCTA ACTGATTACT TAAAAAATAC TTATAGACTG 240 

AAGTTATACT CCTTAAGATG GATTTCAGAT CATGAATATC TCTACAAACA AGAAAATAAT 300 

ATCTTGGTAT TCAATGCTGA ATATGGAAAC AGCTCAGTTT TCTTGGAGAA CAGTACATTT 360 

GATGAGTTTG GACATTCTAT CAATGATTAT TCAATATCTC CTGATGGGCA GTTTATTCTC 420 

TTAGAATACA ACTACGTGAA GCAATGGAGG CATTCCTACA CAGCTTCATA TGACATTTAT 480 

GATTTAAATA AAAGGCAGCT GATTACAGAA GAGAGGATTC CAAACAACAC ACAGTGGGTC 540 

ACATGGTCAC CAGTGGGTCA TAAATTGGCA TATGTTTGGA ACAATGACAT TTATGTTAAA 600 

ATTGAACCAA ATTTACCAAG TTACAGAATC ACATGGACGG GGAAAGAAGA TATAATATAT 660 

AATGGAATAA CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTGTGG 720 

TGGTCTCCAA ACGGCACTTT TTTAGCATAT GCCCAATTTA ACGACACAGA AGTCCCACTT 780 

ATTGAATACT CCTTCTACTC TGATGAGTCA CTGCAGTACC CAAAGACTGT ACGGGTTCCA 840 

TATCCAAAGG CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT 900 

CTCAGCTCAG TCACCAATGC AACTTCCATA CAAATCACTG CTCCTGCTTC TATGTTGATA 960 

GGGGATCACT ACTTGTGTGA TGTGACATGG GCAACACAAG AAAGAATTTC TTTGCAGTGG 1020 

CTCAGGAGGA TTCAGAACTA TTCGGTCATG GATATTTGTG ACTATGATGA ATCCAGTGGA 1080 

AGATGGAACT GCTTAGTGGC ACGGCAACAC ATTGAAATGA GTACTACTGG CTGGGTTGGA 1140 

AGATTTAGGC CTTCAGAACC TCATTTTACC CTTGATGGTA ATAGCTTCTA CAAGATCATC 1200 

AGCAATGAAG AAGGTTACAG ACACATTTGC TATTTCCAAA TAGATAAAAA AGACTGCACA 1260 

TTTATTACAA AAGGCACCTG GGAAGTCATC GGGATAGAAG CTCTAACCAG TGATTATCTA 1320 

TACTACATTA GTAATGAATA TAAAGGAATG CCAGGAGGAA GGAATCTTTA TAAAATCCAA 1380 

CTTATTGACT ATACAAAAGT GACATGCCTC AGTTGTGAGC TGAATCCGGA AAGGTGTCAG 1440 

TACTATTCTG TGTCATTCAG TAAAGAGGCG AAGTATTATC AGCTGAGATG TTCCGGTCCT 1500 

GGTCTGCCCC TCTATACTCT ACACAGCAGC GTGAATGATA AAGGGCTGAG AGTCCTGGAA 1560 

GACAATTCAG CTTTGGATAA AATGCTGCAG AATGTCCAGA TGCCCTCCAA AAAACTGGAC 1620 

TTCATTATTT TGAATGAAAC AAAATTTTGG TATCAGATGA TCTTGCCTCC TCATTTTGAT 1680 

AAATCCAAGA AATATCCTCT ACTATTAGAT GTGTATGCAG GCCCATGTAG TCAAAAAGCA 1740 

GACACTGTCT TCAGACTGAA CTGGGCCACT TACCTTGCAA GCACAGAAAA CATTATAGTA 1800 

GCTAGCTTTG ATGGCAGAGG AAGTGGTTAC CAAGGAGATA AGATCATGCA TGCAATCAAC 1860 

AGAAGACTGG GAACATTTGA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA 1920 

ATGGGATTTG TGGACAACAA ACGAATTGCA ATTTGGGGCT GGTCATATGG AGGGTACGTA 1980

ACCTCAATGG TCCTGGGATC GGGAAGTGGC GTGTTCAAGT GTGGAATAGC CGTGGCGCCT 2040 



GTATCCCGGT GGGAGTACTA TGACTCAGTG TACACAGAAC GTTACATGGG TCTCCCAACT 2100

CCAGAAGACA ACCTTGACCA TTACAGAAAT TCAACAGTCA TGAGCAGAGC TGAAAATTTT 2160 

AAACAAGTTG AGTACCTCCT TATTCATGGA ACAGCAGATG ATAACGTTCA CTTTCAGCAG 2220 

TCAGCTCAGA TCTCCAAAGC CCTGGTCGAT GTTGGAGTGG ATTTCCAGGC AATGTGGTAT 2280 

ACTGATGAAG ACCATGGAAT AGCTAGCAGC ACAGCACACC AACATATATA TACCCACATG 2340

AGCCACTTCA TAAAACAATG TTTCTCTTTA CCTTAGCACC TCAAAATACC ATGCCATTTA 2400

AAGCTTATTA AAACTCATTT TTGTTTTCAT TATCTCAAAA CTGCACTGTC AAGATGATGA 2460 

TGATCTTTAA AATACACACT CAAATCAAGA AACTTAAGGT TACCTTTGTT CCCAAATTTC 2520 

ATACCTATCA TCTTAAGTAG GGACTTCTGT CTTCACAACA GATTATTACC TTACAGAAGT 2580 

TTGAATTATC CGGTCGGGTT TTATTGTTTA AAATCATTTC TGCATCAGCT GCTGAAACAA 2640

CAAATAGGAA TTGTTTTTAT GGAGGCTTTG CATAGATTCC CTGAGCAGGA TTTTAATCTT 2700 

TTTCTAACTG GACTGGTTCA AATGTTGTTC TCTTCTTTAA AGGGATGGCA AGATGTGGGC 2760 

AGTGATGTCA CTAGGGCAGG GACAGGATAA GAGGGATTAG GGAGAGAAGA TAGCAGGGCA 2820 

TGGCTGGGAA CCCAAGTCCA AGCATACCAA CACGAGCAGG CTACTGTCAG CTCCCCTCGG 2880 

AGAAGAGCTG TTCACCACGA GACTGGCACA GTTTTCTGAG AAAGACTATT CAAACAGTCT 2940 

CAGGAAATCA AATATCGAAA GCACTGACTT CTAAGTAAAC CACAGCAGTT GAAAGACTCC 3000 

AAAGAAATGT AAGGGAAACT GCCAGCAACG CAGCCCCCAG GTGCCAGTTA TGGCTATAGG 3060 

TGCTACAAAA ACACAGCAAG GGTGATGGGA AAGCATTGTA AATGTGCTTT TAAAAAAAAA 3120 

TACTGATGTT CCTAGTGAAA GAGGCAGCTT GAAACTGAGA TGTGAACACA TCAGCTTGCC 3180 

CTGTTAAAAG ATGAAAATAT TTGTATCACA AATCTTAACT TGAAGGAGTC CTTGCATCAA 3240 

TTTTTCTTAT TTCATTTCTT TGAGTGTCTT AATTAAAAGA ATATTTTAAC TTCCTTGGAC 3300 

TCATTTTAAA AAATGGAACA TAAAATACAA TGTTATGTAT TATTATTCCC ATTCTACATA 3360 
CTATGGAATT TCTCCCAGTC ATTTAATAAA TGTGCCTTCA TTTTTTC 



A204 Protein sequence:
Gene name: dipeptidylpeptidase IV (CD26, adenosine deaminase complexing protein 2)
Unigene number: Hs.44926
Probeset Accession #: S79876
Protein Accession #: NP_001926.1
Signal sequence: none found
Transmembrane domains: 6-28
DPPIV_N_term domain: 43-557
Peptidase_S9 domain: 558-635
Cellular Localization: plasma membrane



1 11 21 31 41 51

MKTPWKILLG LLGAAALVTI ITVPVVLLNK GTDDATAESR KTYTLTDYLK NTYRLKLYSL 60

RWISDHEYLY KQENNILVFN AEYGNSSVFL ENSTFDEFGH SINDYSISPD GQFILLEYNY 120

VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAYVWNN DIYVKIEPNL 180

PSYRITWTGK EDIIYNGITD WVYEEEVFSA YSALWWSPNG TFLAYAQFND TEVPLIEYSF 240

YSDESLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT NATSIQITAP ASMLIGDHYL 300

CDVTWATQER ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGRFRPS 360

EPHFTLDGNS FYKIISNEEG YRHICYFQID KKDCTFITKG TWEVIGIEAL TSDYLYYISN 420

EYKGMPGGRN LYKIQLIDYT KVTCLSCELN PERCQYYSVS FSKEAKYYQL RCSGPGLPLY 480

TLHSSVNDKG LRVLEDNSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY 540

PLLLDVYAGP CSQKADTVFR LNWATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT 600

FEVEDQIEAA RQFSKMGFVD NKRIAIWGWS YGGYVTSMVL GSGSGVFKCG IAVAPVSRWE 660

YYDSVYTERY MGLPTPEDNL DHYRNSTVMS RAENFKQVEY LLIHGTADDN VHFQQSAQIS 720
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP



A205 DNA SEQUENCE

Gene name: predicted exon
Unigene number: none found
Probeset Accession #: none found
Nucleic Acid Accession #: none found
Coding sequence: 1-261 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

ATGGCTCTGG CGAAGGTGAG GGAGCCAAAC GCAAATGACA ATGCCATCAG AGTTGACAAC 60 

AGAAGTGTGA TTAAAGTGCG TGCTAACCAG TGTTCCCTGC ATGAGGCAGA AAGTGAATCC 120 

AGAAACCCTC AGGAGCTCTG GATGGGCCTG CTCCTCTTGA TGGGGGTCCT AGAAGCATGT 180 

GTGGAAATGA GGCCTCTGTC AGTCTGGTCC CTGAGAGATG ACAAGGAGCA GAGCCCCCAC 240 
CAGCCCACAC TGGATGTCTA A



A206 Protein sequence:
Gene name: predicted exon
Unigene number: none found
Probeset Accession #: none found
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 46-68
Cellular Localization: not determined



1 11 21 31 41 51

MALAKVREPN ANDNAIRVDN RSVIKVRANQ CSLHEAESES RNPQELWMGL LLLMGVLEAC
VEMRPLSVWS LRDDKEQSPH QPTLDV




A207 DNA SEQUENCE
Gene name: ESTs
Unigene number: Hs.222886
Probeset Accession #: AI672225
Nucleic Acid Accession #: none found
Coding sequence: 1-462 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGCCAAATG CTGAGTTAGA AGCAAAGAGC CTTGGAAGCA GTAAATGTTT AAAAACTGCT 60 

CTCATACTTG CTGTATGTTG TGGATCAGCA AATATAGTCA GCCCTCTACT TGAGCAAAAT 120 

ATTGATGTAT CTTCTCAAGA TCTGGACAGA CGGCCAGAGA GTATGCTGTT TCTAGTCATC 180 

ATCATGTGGA CCAGTTTTGT GGAAGACAAT CTTTCCATGG GCTGGGGGAA GCTAGAAGAT 240 

TTTATGGCTA TTGAAGAAGA AATGAAGAAG CACGGAAGTA CTCATGTGGG ATTCCCAGAA 300 

AACCTGACTA ATGGTGCCGC TGCTGGCAAT GGTGATGATG GATTAATTCC TCCAAGGAAG 360 

AGCAGAACAC CTGAAAGCCA GCAATTTCCT GACACTGAGA ATGAAGAGTA TCACAGGTTT 420 
GTCAAAGATC AGATAGTTGT AGATATGCGG CGTTATTTCT GA



A208 Protein sequence:
Gene name: ESTs
Unigene number: Hs.222886
Probeset Accession #: AI672225
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 16-38
Cellular Localization: not determined



1 11 21 31 41 51

| | | | | |

MPNAELEAKS LGSSKCLKTA LILAVCCGSA NIVSPLLEQN IDVSSQDLDR RPESMLFLVI 60
IMWTSFVEDN LSMGWGKLED FMAIEEEMKK HGSTHVGFPE NLTNGAAAGN GDDGLIPPRK 120
SRTPESQQFP DTENEEYHRF VKDQIVVDMR RYF



A209 DNA SEQUENCE

Gene name: odz (odd Oz/ten-m, Drosophila) homolog 1

Unigene number: Hs.23796

Probeset Accession #: NM_014253

Nucleic Acid Accession #: NM_014253

Coding sequence: 65-8242 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

GACTGCTTGC ATTAAAGGAC TTCCTCATCC TTTTTTTCAT GAAACTGAGC TTGCTTAATC 60
AGAGATGGAG CAAACTGACT GCAAACCCTA CCAGCCTCTA CCAAAAGTCA AGCATGAAAT 120
GGATCTAGCT TACACCAGTT CTTCTGATGA GAGTGAAGAT GGAAGAAAAC CAAGACAGTC 180
ATACAACTCC AGGGAGACCC TGCACGAGTA TAACCAGGAG CTGAGGATGA ATTACAATAG 240
CCAGAGTAGA AAGAGGAAAG AAGTAGAAAA ATCTACTCAA GAGATGGAAT TCTGTGAAAC 300
CTCTCACACT CTGTGCTCTG GCTACCAAAC AGACATGCAC AGCGTTTCTC GGCATGGCTA 360
CCAGCTAGAG ATGGGATCTG ATGTGGACAC AGAGACAGAA GGTGCTGCCT CACCTGACCA 420
TGCACTAAGA ATGTGGATAA GGGGAATGAA ATCAGAGCAT AGTTCCTGTT TGTCCAGCCG 480
GGCCAACTCT GCATTATCCT TGACTGACAC TGACCATGAA AGGAAGTCTG ATGGGGAAAA 540
TGGTTTCAAA TTCTCTCCTG TTTGTTGTGA CATGGAGGCT CAAGCTGGGT CTACTCAAGA 600
TGTGCAGAGC AGCCCACACA ACCAGTTCAC CTTCAGACCC CTCCCACCGC CACCTCCGCC 660
TCCTCATGCC TGCACCTGTG CCAGGAAGCC ACCCCCTGCA GCGGACTCTC TTCAGAGGAG 720
ATCAATGACT ACCCGCAGCC AGCCCAGCCC AGCTGCTCCA GCTCCCCCAA CCAGCACGCA 780
GGATTCAGTC CATCTGCATA ACAGCTGGGT CCTGAACAGC AACATACCAT TGGAGACCAG 840
GCATTCCCTG TTCAAACATG GATCTGGTTC CTCTGCGATC TTCAGTGCAG CCAGTCAGAA 900
CTACCCTCTG ACATCCAATA CCGTGTACTC GCCCCCTCCC AGGCCTCTTC CTCGAAGCAC 960
CTTTTCCCGA CCTGCCTTTA CCTTTAACAA ACCTTACAGG TGCTGCAACT GGAAGTGCAC 1020
AGCATTGAGC GCCACTGCAA TCACAGTGAC TTTGGCCTTG TTACTAGCCT ATGTGATTGC 1080
AGTGCATTTG TTCGGCCTGA CTTGGCAGTT GCAACCAGTT GAAGGAGAGC TGTATGCAAA 1140
TGGAGTTAGC AAAGGGAACA GGGGGACCGA GTCCATGGAC ACTACTTACT CTCCAATTGG 1200
AGGAAAAGTT TCTGATAAAT CAGAGAAAAA AGTGTTTCAG AAGGGACGGG CGATAGACAC 1260
TGGAGAAGTT GACATTGGTG CACAGGTCAT GCAGACCATT CCACCTGGTT TATTCTGGCG 1320
TTTCCAGATT ACTATCCACC ATCCAATATA TCTGAAGTTC AATATTTCTT TAGCCAAGGA 1380
CTCTCTGCTG GGAATTTATG GCAGAAGAAA CATTCCACCT ACACATACTC AGTTTGATTT 1440
TGTAAAACTA ATGGATGGCA AACAGCTGGT CAAGCAGGAC TCCAAGGGCT CTGATGATAC 1500
ACAGCACTCC CCTCGGAACC TGATCTTAAC TTCGCTTCAG GAGACAGGTT TCATAGAGTA 1560
TATGGATCAA GGACCTTGGT ATCTGGCGTT TTACAATGAT GGAAAAAAGA TGGAGCAAGT 1620
ATTCGTGTTA ACTACAGCAA TTGAAATAAT GGATGACTGT TCAACCAATT GCAATGGAAA 1680
TGGAGAGTGT ATCTCTGGCC ATTGTCATTG TTTCCCAGGA TTCCTTGGAC CTGACTGTGC 1740
TAGAGATTCC TGCCCTGTGC TGTGTGGTGG GAATGGAGAA TACGAGAAAG GACACTGTGT 1800
CTGCCGGCAT GGCTGGAAGG GGCCAGAGTG TGACGTTCCG GAAGAACAAT GCATTGATCC 1860
AACATGCTTT GGCCACGGCA CCTGCATCAT GGGAGTCTGC ATCTGTGTGC CAGGATACAA 1920
AGGAGAAATA TGCGAGGAAG AGGACTGCCT AGACCCAATG TGTTCCAACC ATGGCATCTG 1980
TGTAAAAGGA GAATGTCACT GTTCTACTGG CTGGGGAGGA GTTAACTGTG AAACACCACT 2040
TCCTGTATGT CAAGAGCAGT GCTCAGGACA CGGAACTTTT CTTCTGGACG CTGGAGTATG 2100
CAGCTGTGAT CCCAAGTGGA CAGGATCTGA CTGCTCAACA GAGCTGTGTA CCATGGAGTG 2160
TGGTAGCCAT GGAGTCTGCT CAAGAGGAAT TTGCCAGTGT GAAGAAGGCT GGGTAGGACC 2220
AACATGTGAG GAACGCTCCT GTCATTCTCA TTGTACTGAG CATGGCCAAT GCAAAGATGG 2280
AAAATGTGAG TGTAGCCCTG GATGGGAGGG CGACCACTGC ACAATTGCTC ACTACTTAGA
TGCTGTCCGA GATGGCTGCC CAGGGCTCTG CTTTGGAAAT GGACGATGTA CCCTGGATCA 
AAATGGTTGG CACTGTGTGT GTCAGGTGGG TTGGAGTGGG ACAGGCTGCA ATGTTGTCAT 
GGAAATGCTT TGTGGAGATA ACTTGGACAA TGATGGAGAT GGTTTAACCG ACTGTGTGGA 
TCCTGACTGT TGTCAACAAA GCAACTGTTA TATAAGTCCT CTCTGCCAGG GCTCACCAGA
TCCTCTTGAC CTCATTCAGC AAAGCCAAAC TCTCTTCTCT CAGCACACTT CAAGACTTTT 
TTATGATCGA ATCAAATTCC TCATTGGCAA GGACAGTACT CATGTCATTC CTCCTGAGGT 
GTCATTTGAC AGCAGGCGTG CCTGTGTGAT TCGAGGCCAA GTGGTGGCCA TAGATGGAAC 
TCCTCTAGTC GGAGTGAATG TCAGTTTCTT GCACCACAGT GATTATGGGT TTACCATCAG
CCGGCAAGAT GGAAGCTTTG ACCTCGTGGC CATCGGTGGC ATCTCTGTCA TCTTAATCTT
CGACCGATCC CCTTTCCTGC CTGAGAAGAG AACACTCTGG TTGCCTTGGA ATCAGTTTAT 
TGTGGTAGAG AAAGTCACCA TGCAGAGAGT TGTATCAGAC CCGCCATCCT GCGATATCTC
CAACTTTATC AGCCCAAACC CTATTGTGCT TCCTTCACCG CTCACATCAT TTGGAGGGTC 
CTGTCCAGAG AGGGGAACTA TTGTTCCTGA GCTGCAGGTT GTACAGGAGG AAATTCCCAT 
TCCCTCCAGC TTTGTGAGGC TGAGTTACCT GAGCAGCCGC ACCCCTGGGT ATAAAACCCT
GCTACGGATC CTTCTGACAC ATTCAACGAT TCCCGTAGGC ATGATAAAAG TACACCTCAC 
AGTAGCTGTG GAAGGGCGAC TCACACAGAA GTGGTTTCCC GCCGCAATTA ATCTTGTCTA 
CACATTTGCT TGGAACAAGA CCGATATCTA TGGACAGAAG GTTTGGGGCC TGGCAGAGGC 
TTTGGTATCT GTGGGATATG AATATGAAAC GTGCCCTGAC TTTATTCTCT GGGAGCAAAG 
GACAGTCGTT TTACAAGGTT TTGAGATGGA TGCTTCTAAC CTAGGAGACT GGTCTTTGAA
TAAGCATCAC ATTTTGAATC CTCAAAGTGG AATCATACAT AAAGGGAATG GAGAAAATAT 
GTTCATTTCC CAGCAGCCCC CAGTCATATC AACCATAATG GGTAATGGAC ACCAAAGGAG
TGTAGCCTGC ACCAACTGCA ATGGCCCAGC CCACAACAAC AAACTCTTTG CTCCTGTCGC 
CTTAGCTTCT GGCCCTGATG GCAGTGTGTA TGTTGGCGAC TTCAATTTTG TAAGGAGAAT 
ATTTCCCTCG GGAAACTCCG TTAGTATTTT GGAATTAAGC ACAAGTCCTG CTCACAAATA 
CTATCTGGCT ATGGACCCTG TGTCTGAATC ACTCTATCTA TCAGACACCA ATACTCGCAA 
AGTCTACAAG TTGAAATCTC TTGTGGAGAC GAAAGATCTG TCCAAGAATT TTGAAGTGGT 
GGCAGGAACT GGTGATCAGT GCCTTCCCTT TGACCAGAGT CATTGTGGAG ATGGTGGGAG 
AGCATCGGAA GCTTCACTGA ATAGCCCTCG AGGCATCACA GTTGATAGGC ATGGATTTAT 
TTACTTTGTG GATGGGACTA TGATTCGCAA AATTGATGAG AATGCTGTGA TCACAACTGT 
AATCGGCTCA AATGGTCTGA CTTCCACACA ACCACTGAGC TGTGACTCAG GAATGGACAT 
CACTCAGGTG CGATTAGAGT GGCCAACAGA CCTTGCAGTA AATCCTATGG ACAATTCATT 
GTATGTCTTG GATAACAACA TTGTGCTGCA AATTTCTGAG AACAGGCGTG TTCGGATCAT 
CGCAGGACGC CCCATTCACT GCCAGGTGCC AGGCATCGAT CATTTCCTGG TCAGCAAGGT 
AGCAATTCAC TCCACTCTAG AGTCAGCGAG GGCCATCAGT GTCTCCCACA GCGGGCTGCT 
CTTCATAGCT GAAACAGACG AGAGGAAAGT AAACCGCATT CAGCAAGTAA CCACCAATGG 
GGAGATCTAC ATCATCGCTG GTGCCCCCAC TGACTGTGAC TGCAAAATTG ATCCAAACTG 
TGACTGTTTT TCAGGTGATG GTGGCTATGC CAAAGATGCA AAGATGAAAG CCCCTTCCTC 
CTTAGCAGTG TCGCCTGATG GAACCCTCTA TGTGGCAGAC CTCGGAAATG TTCGAATTCG 
TACCATCAGC AGGAACCAAG CCCACCTGAA TGACATGAAC ATTTATGAGA TTGCTTCACC 
CGCTGATCAG GAACTGTACC AGTTCACTGT AAATGGAACC CACCTACACA CCCTGAACTT 
GATAACAAGG GACTATGTTT ATAACTTCAC CTACAATTCT GAAGGTGACT TGGGCGCGAT 
TACCAGCAGC AATGGCAATT CAGTGCACAT TCGCCGTGAT GCAGGCGGAA TGCCGCTATG 
GCTTGTGGTG CCTGGCGGAC AAGTATACTG GCTGACTATA AGCAGCAATG GAGTCCTGAA 
AAGAGTGTCA GCCCAAGGCT ATAATCCGGC CTTAATGACC TATCCAGGAA ACACAGGGCT 
TCTGGCTACC AAAAGTAACG AAAATGGATG GACAACCGTT TATGAGTATG ACCCCGAGGG 
ACACCTGACC AATGCAACGT TTCCCACTGG AGAGGTCAGC AGCTTCCACA GTGACCTGGA 
GAAGCTGACA AAAGTGGAGC TAGATACTTC CAACCGTGAA AATGTCCTCA TGTCAACCAA 
CTTGACGGCA ACTAGTACCA TATATATTTT AAAACAAGAA AATACTCAAA GTACCTATCG 
GGTGAATCCA GATGGTTCCC TGCGTGTCAC TTTTGCCAGC GGGATGGAGA TCGGCCTCAG 
CTCAGAGCCC CACATCCTGG CAGGGGCAGT CAACCCTACC CTGGGCAAAT GCAACATCTC 
ATTGCCCGGA GAGCACAATG CAAACCTCAT CGAGTGGCGG CAGAGGAAGG AGCAAAACAA 
AGGCAATGTT TCGGCTTTTG AAAGGAGGCT GAGGGCCCAC AACAGAAACC TACTCTCCAT 
AGATTTTGAT CATATAACCC GCACAGGAAA GATCTATGAT GACCATCGAA AATTCACCCT 
TCGAATTCTT TATGACCAGA CTGGGCGACC CATTCTGTGG TCTCCTGTAA GCAGATATAA 
TGAAGTGAAC ATCACATATT CACCTTCGGG ATTGGTGACG TTTATTCAAA GAGGAACGTG 
GAATGAAAAA ATGGAATATG ACCAGAGTGG GAAAATTATT TCAAGAACTT GGGCTGATGG 
GAAAATTTGG AGCTATACCT ACTTAGAAAA ATCTGTGATG CTTCTCCTAC ACAGCCAGCG 
GCGTTACATC TTTGAGTATG ACCAATCAGA TTGCCTGCTG TCAGTTACCA TGCCTAGCAT 
GGTGGGCCAC AGCTTACAAA GCATGCTTTC AGTGGGCTAC TACCGTAATA TCTACACCCC 
ACCGGACAGT AGCACTTCTT TTATCCAAGA CTATAGTCGA GATGGCCGAT TGCTACAGAC 
CCTGCATCTG GGGACAGGGC GCAGAGTCTT ATACAAGTAC ACCAAGCAAG CAAGGCTTTC 
TGAGGTTCTC TATGATACCA CTCAGGTCAC ATTAACATAT GAAGAGTCTT CTGGAGTGAT 
TAAGACAATA CACCTGATGC ATGACGGATT CATCTGCACA ATCAGATACA GGCAAACAGG 
ACCTCTTATT GGACGCCAGA TTTTCAGATT CAGTGAAGAA GGCCTTGTGA ATGCACGGTT 
CGACTACAGC TACAACAATT TCCGAGTCAC AAGCATGCAA GCTGTAATCA ATGAAACCCC 
TTTGCCTATA GATCTTTACC GATATGTTGA TGTCTCTGGC AGAACAGAGC AGTTTGGAAA 
ATTCAGTGTA ATTAATTACG ATTTAAATCA GGTCATAACT ACTACAGTGA TGAAACACAC 
CAAAATCTTC AGTGCCAATG GACAAGTCAT TGAAGTCCAA TATGAAATCC TAAAGGCAAT 
TGCCTACTGG ATGACCATTC AATATGATAA TGTGGGCCGA CATGGTAATA TGTGCATAAG 
GGTAGGAGTA GATGCCAATA TAACAAGGTA CTTCTATGAA TACGATGCTG ATGGGCAACT 
TCAGACTGTT TCTGTAAATG ACAAAACCCA GTGGCGTTAT AGTTACGATC TGAATGGAGA 
CATCAACCTC TTAAGCCATG GGAAGAGTGC TCGTCTTACT CCTCTCCGAT ATGACCTCCG 
AGACCGCATC ACCAGATTAG GAGAAATTCA GTATAAAATG GATGAAGATG GCTTTCTGAG 
GCAGAGGGGA AATGATATTT TTGAATATAA TTCTAATGGC CTGCTGCAGA AAGCCTACAA 
TAAGGCTTCT GGCTGGACTG TGCAGTATTA CTATGATGGG CTTGGGCGAC GTGTCGCGAG 
TAAGTCCAGC CTAGGGCAGC ACCTTCAGTT CTTTGTCGAC GCGACGGCGA ACCCCATAAG 
AGTTACTCAT TTGTACAACC ACACAAGCTC GGAGATTACA TCTCTGTATT ATGATCTCCA 
AGGTCACCTT ATTGCCATGG AGTTAAGCAG TGGTGAAGAA TATTATGTAG CCTGTCATAA 
TACAGGTACC CCACTAGCTG TGTTCAGCAG CCGAGGTCAG GTCATAAAGG AGATACTATA 
CACACCTTAT GGCGATATCT ATCATGACAC TTACCCTGAC TTTCAGGTCA TAATTGGTTT 
TCATGGAGGA CTCTATGATT TCCTTACTAA ATTAGTGCAC CTGGGGCAAA GGGATTATGA 
TGTTGTTGCT GGCAGATGGA CAACGGCCTA TCATCACATA TGGAAACAGT TGAACCTCCT 
TCCTAAACCA TTCAACCTCT ACTCCTTTGA AAATAACTAC CCAGTTGGCA AAATTCAAGA 
TGTTGCAAAG TATACCACAG ACATCAGAAG TTGGTTGGAG CTATTTGGTT TCCAATTACA 
CAATGTACTA CCTGGATTTC CCAAACCTGA ATTAGAAAAT TTAGAATTAA CTTACGAGCT 
TCTACGGCTT CAGACAAAAA CTCAAGAGTG GGATCCTGGA AAGACTATCC TGGGCATTCA 
GTGTGAACTC CAGAAACAGC TCAGGAATTT CATTTCCTTG GACCAACTAC CTATGACTCC 
CCGATACAAT GATGGACGGT GCCTTGAAGG AGGGAAGCAA CCAAGGTTTG CTGCTGTCCC 
TTCTGTTTTT GGGAAAGGTA TAAAATTTGC CATCAAGGAT GGCATAGTAA CAGCTGATAT 
TATAGGAGTA GCCAATGAAG ATAGCAGGCG GCTTGCTGCC ATTCTCAATA ATGCCCATTA 
CCTGGAAAAC CTACATTTTA CCATAGAGGG GAGGGACACT CACTACTTCA TTAAGCTTGG 
GTCTCTGGAG GAAGACCTGG TGCTCATCGG TAACACTGGG GGGAGGCGGA TTCTGGAGAA 
TGGTGTCAAT GTCACTGTGT CCCAGATGAC TTCTCTGTTG AATGGGAGGA CTAGACGGTT 
TGCAGATATT CAGCTCCAGC ATGGAGCCCT GTGCTTCAAC ATCCGGTATG GGACAACTGT 
CGAAGAGGAA AAGAATCACG TGTTGGAGAT TGCCAGACAG CGCGCAGTGG CCCAGGCCTG 
GACTAAGGAA CAAAGAAGGC TGCAAGAGGG GGAAGAGGGG ATTAGGGCAT GGACAGAAGG 
GGAAAAGCAG CAGCTTTTGA GCACTGGGCG GGTACAAGGT TACGATGGGT ATTTTGTTTT 
GTCTGTTGAG CAGTATTTAG AACTTTCTGA CAGTGCCAAT AATATTCACT TTATGAGACA 
GAGCGAAATA GGCAGGAGGT AACAAAAATA TCTCTGCCTT TGCGTCACCA AAGACTGCCT 
GTTTTTAAAA CATAAAATGG TTTATTGTAT TGGTTTTCTA GATCAGAACT CTGTATATGT 
AAATATGGAG GAAAAACATA TCCAACTGCC TTTCAATGTG ACGGAAGATG GTATTTTAAT 
ATTGTTTGTT TAAACTCTTT AAGAAATGAC AGAGATTTTT AGTTCTTGTG TGGCAGTATT 
CAAAATAACA CAAGTAGAAC TCAAACAGCT AAAAACAGTT TTCAGAAAGC ACCACTTTCA 
ATTTGCCGAG CCATGCATAT GTTCCAATAT CCAGAAAGAA CCCAAGGTTC TCTATCTCTA 
TTGTGAGAAG CAGTTTCATC CTTAACTGTT GGCAGAACTT ACGGGCTATT TGAATAGGTG 
GTGCAATAGT ATCTGAAACT TGCCTTTCGA AAGACTGCCA GCCCTTTGAC GTTTTCCAGA 
TCTGTTATAG GAAACTTAAA AACAGGTGTA AAATGTCTTC AGCCACCATC TCCTAGAGTG 
AGGACCCAAT TGCCCTTCCT TCTTGATTAT TCCTCCTTGC TTGTTAAAGT AAATGCCATA 
TTGTTGTGCT GTGTTTTGGC GTGTGGTGGC TGGGTTCTGT CTACCATGCT TCCCTGTGGG 
TGTGGTAACC AGACTGTATA GCCGCTATTT GCTCGTGTGT ACATGATACC AAAGCAGCTG 
GCCAGCGTGA CCTCTCTCAC ACGACCTGTT TTGACTCAAT TTTTTACTAA AAGTTGTTCA 
GCTGTATTGG TATCATGTAA ACATAGCTTT TATTAACCTG GGTAGGAATT TCTCATTTAT 
ATATAGGATG TGTTTTGGTC ATAGTTTCAC ATTAGTGATT CAGTATCTAT ACACTGACCC 
AATGGTTTTG TGCACATGAA CGGTAATTTA CTTAAAAGTA TGATTCTGGT ACAAAAACAA 
ACAAAGGCTT TAGCAGGCAT ACGTGTCTGG GATGCCGATA CATACATTAA CTACTACTGC 
AGAAATTCAT AAGAGCCAAA ACCTTAAAAA AATAGACCTG GTACTTAAGT GAAAGTACTA 
AAGGGAAGAC CAGACCAAAC ATCACAGCAG TTGCTGCCAC ATTGTTTCAG CCCACTTAGA 
TTTATCTTTC AAATGTACAA TTCTGTATTG AACATCTCCC AGCCATCTTC AGGAAATCGA 
ATCAAGTAAA TCCTTTCCAA CGGAAAACAT TTCAACTAAC TATAGAGAGG CAGACTCATT 
TTTACTAAAA TAATTTATAC AGTTAGTTAT TTTCGTTCTC CGTACTTACC CATTTATCTT 
TATTTAATCG TCTCTACTGC CTAGGAAAAT AACTATTTTC CAGGACGGGT TATTTGTTCT 
GCGATCATTT AAAATTTGGA GAAAGGTCAG GATTAGTGTT AATATCAGCT GCAGTTTCTC 
AATCTCTAGG AATCCTGCAG TAAAACAAGC CCCTTGGTGA GCTGGAAGAT TTGTGCCCAG 
TGACAAAGAG ATAGTTTGTA AAATGCTGTG TAATTGTAAG TTACCACAAA TGAAAATACA 
TGACAGCACA ATGTGGCCCG TAGAAAATTC CCCTGAGCCA GCTTCTGCAC TTTCATCACC 
GAATCTGAAC ATTTGCTATG TCTGAAGGCA AATTTATGAT GGAATGTTAG TTTGGATTCT 
TTCCAGATGC TACCTAAATG CAGTGTGGGG TCATTGCCTT GCTTTGCGAT GACAGTTTCT 
TTGAAAATAT GCAAAGTCAT AAGCTCATGT TAAGGTTTTT CAAGAGTCTG CCTCCTACTA 10020 
CACAAAGGAA AGCAAGGGAA AGGAAATGAC CCTGGCAAAC AGTAGGGAAG GGTGTATTCA 10080 
AACATTTCAT TTTCAAAACC TTCGGGTTAG AATACCACTT ACACATGTAT TCTGAGAGAC 10140 
AGAATTCATG AGGAACTCAT CTCTCTTTAT AACTGGAAAC ACACCAGCTT GATATATTGC 10200 
TAATCCATAC TAAAATCATA TTATTGGGTT TTTTCTGAAT CAGGCCTGTA TTAATGGTAC 10260 
AGTATTTATT CAGAATGGAA TTCTAAAATT ACTAACAAAC TTGTTGAAAA TTTGAATACC 10320 
TCCACACCAA CCTAAAAATG GACCTTAAGT TCCTAGAACC TCTGATGTTC TTTTAAATTA 10380 
ATGGAAAAAT AATTTGTGAA CTGTATATAG AGAGTGCATT CATAAATGTG ATTATGTATT 10440 
TTATCACAAA TCCAAAATGT CAATATTAGA GTCTATTTTG CTTATATTTT AAGCAATTAT 10500 
ACGTTTTTGC AATTCATTGA TGATGTATCA TTTTCAAACT GCTTTAAATA TCCATTAGAA 10560 
ACAAATATTT GAAGCTTTTA CTTAATAGTG ATTACCTTGA ACTGTGCATT TCTAGTTTGT 10620 
AATACGTATT TGGTTGGTTC GTGCCTTTAG TTTGTTAAAG TTACATTTGT ATTATATTCA 10680 
GGAAATGCAC TTTTTATTAC TTACAGCTGT GGTTTTAATA CTGCCTTGAA CTATTATTAT 10740 
TCTTTTTACA ACTCCTAAAG CTTGAGGGAG GAAAGAAAAA AAAAACAAAA CTACTAATCA 10800 
GTAGTAAATC GAAGAGAAAC ATTTTGGCAT TTCTTAAGAA GAAGATGGAG ATATTGAGTA 10860 
TATCACTTCC TATTCAGCTG AATAGAAAGA ATGCCTTCAT TGACTTGCAG TTCTGCAGTT 10920 
TAAATTATTG AAAGAACAAT TCGTTTGCAT TTCCTGATGA AAGTAAAAGC ATTTTTCAGA 10980 
GAAACATATG AATTTCTCAT ACCCAGCAGA CAGATGGCTG ACACTGCACA GCCACACACC 11040 
ATTCGAGTAA GTTAAAGTGA GAGCATAGTA GTTGGACTCT CCTATGAAGA ACATTCTGGG 11100 
CTGGAGGCAG GGAATACTCC ATGGTTGTTT CTTTTTCCTA CTTAAGCCCA TTTTGTTTGT 11160 
GCTTTTCTGT TTTGTTTTGT TTTCACTCTT GCACTACAGT CTAGAGATCC AAATGAACTG 11220 
AAAAGTTCAA AGTTTAACAC ATTTAAATAT GTTTACTTTT AGTTGTCATT CTAATCGTTA 11280 
TTGATTAGAA GCATGACTCC TGAAGGAAAG GGAAATAAAT CTCAATTCAT ACTAACTTGC 11340 
AACAAAACAC TTTTACCATA TAAATAAGTA TATGATTTAT TTTTAACCCA AAAAATGTAT 11400 
AAAATAAGTG TGTCCTTTAC TGTCAATTTA TCGAGAAGAT CTATAATATA TAGACTACAT 11460 
ATATATAATA TATACAACAT AGCCAAATGT ATGAAAACTT GACAATGTAT AATTTGGAAT 11520 
TCACATGCTA CCTATGTAGA CAGGTATGAA ATTAAGTTAT AATTTTCATG AGACATTTTC 11580 
ATCACTGTTG ACACAGTTTC AAGGCATTCC ATCATGTTAT TTTGACTCTT TTTCTTTTTT 11640 
TTTTCTTTAA AAATATATTT TTAACTAGAC CAGGCCCCAC TATAATATCA CTTAAGAGAG 11700 
TCAGGGCAAA GTTTTTGCAT TTATGAAGAT GTGTTCATGT AAGGGTGATT GTAATGGAGT 11760 
TCATTGGTAA TAGAAGCAAA AGTACAGTAA CGAAGTATTG AAAAGAAAAT TTTGGAGACA 11820 
TTGGAGCATA TTATATATAG CTTGTGGAAA GACATAAGGC TACAGATGGA ATGGAACATT 11880 
CCTGTTTTCT TGAAGAAATT CACATACACA TAGCTGACCT GACTAGTACT TCAGCTCTTC 11940 
CACAGCCTTC TATAAAGGTT CTTTCTTCTG CAAAGAAAAC AAAACAAAAC AAAACAAAAC 12000 
AAAAAAAAAC AAAAAAAGCG CAAAAAACAA AAAAACAAAA AAAAGCAAAG TAAAATTTAA 12060 
AAATACAGAA AACAAACAAC AAAAAAGAAT TCAACCATAA ATAGTGACTA TTATTTTCAG 12120 
TGTGTCCTTC ATGTGAAAGC TATTAAGGAC CAAATATACT ACTGTTCATA AGAAGAAATT 12180 
ACTTTCTAAA CAGTAACTGA AAATACTTAG AGTTAAACTT GCTGTGGATT TTGTCTTGGC 12240 
AGTTGTCATC TTACATTATT TGTCAAAGGA AATGTGTTTG GCAGTTAAAA ATCTTTCCTT 12300 
AGATTTAGTG GTGGACTTTA ACCTCTTAAA TAAATGTTAG TATATCAGAT TGTGTCCTTG 12360 
AAAAATATTT TACTTGTATG AATCATGACA ACGTCTAAAT CTTTACTATT CTTCTGGCAA 
AAGCATCAGT AAGAAAGAAG GCGAAAAAGA GAAGTATAGC CTTTATGTCA GAAAAACATT 
CTTTTTAGCT GCTTACTTTC TCATGAAAAG TAAAGATGTT TACAGTGTAT GCCAAGTTTT 
CAGTTTCTGT ATAACAACAG GTAGAGGTTC TAATCATATT GAAAATTGTG TTATAATGGT 
CTGAGCCATG TTGCTAGGAA ACAATAGGTT CCAATTTTGT ATTCCTGCTC TCCTGTGCTG 
AAAAGTGACT GGATACTGTA CAGGTTCATG TTCTCTGGCT GCAGTTAAAT GGTCTTTTGC 
ATTTTGCTCT GGCTTTCAGG CCAGAAGCAT GCATTTTTCT ACAAGAGCAT CACAACAACA 
TGCTGTAAAT ATTTAAAGTT AAACATTATG TGTTGATATT TGAAAGAAAA GTACTTTGAA 
TATTTCATTT TTAAAAAATA AAATTGCCAA TGAAAAAAAA 

A210 Protein sequence: 
Gene name: odz (odd Oz/ten-m, Drosophila) homolog 1 
Unigene number: Hs.23796 
Probeset Accession #: NM_014253 
Protein Accession #: NP_055068 
Signal sequence: none found 
Transmembrane domains: 318-340 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

| | | | | | 

MEQTDCKPYQ PLPKVKHEMD LAYTSSSDES EDGRKPRQSY NSRETLHEYN QELRMNYNSQ 60 

SRKRKEVEKS TQEMEFCETS HTLCSGYQTD MHSVSRHGYQ LEMGSDVDTE TEGAASPDHA 120 

LRMWIRGMKS EHSSCLSSRA NSALSLTDTD HERKSDGENG FKFSPVCCDM EAQAGSTQDV 180 

QSSPHNQPTF RPLPPPPPPP HACTCARKPP PAADSLQRRS MTTRSQPSPA APAPPTSTQD 240 

SVHLHNSWVL NSNIPLETRH SLFKHGSGSS AIFSAASQNY PLTSNTVYSP PPRPLPRSTF 300 

SRPAFTFNKP YRCCNWKCTA LSATAITVTL ALLLAYVIAV HLFGLTWQLQ PVEGELYANG 360 

VSKGNRGTES MDTTYSPIGG KVSDKSEKKV FQKGRAIDTG EVDIGAQVMQ TIPPGLFWRF 420 

QITIHHPIYL KFNISLAKDS LLGIYGRRNI PPTHTQFDFV KLMDGKQLVK QDSKGSDDTQ 480 

HSPRNLILTS LQETGFIEYM DQGPWYLAFY NDGKKMEQVF VLTTAIEIMD DCSTNCNGNG 540 

ECISGHCHCF PGFLGPDCAR DSCPVLCGGN GEYEKGHCVC RHGWKGPECD VPEEQCIDPT 600 

CFGHGTCIMG VCICVPGYKG EICEEEDCLD PMCSNHGICV KGECHCSTGW GGVNCETPLP 660 

VCQEQCSGHG TFLLDAGVCS CDPKWTGSDC STELCTMECG SHGVCSRGIC QCEEGWVGPT 720 

CEERSCHSHC TEHGQCKDGK CECSPGWEGD HCTIAHYLDA VRDGCPGLCF GNGRCTLDQN 780 

GWHCVCQVGW SGTGCNVVME MLCGDHLDND GDGLTDCVDP DCGQQSNCYI SPLCQGSPDP 840 

LDLIQQSQTL FSQHTSRLFY DRIKFLIGKD STHVIPPEVS FDSRRACVIR GQVVAIDGTP 900 

LVGVNVSFLH HSDYGFTISR QDGSFDLVAI GGISVILIFD RSPFLPEKRT LWLPWNQPIV 960 

VEKVTMQRVV SDPPSCDISN FISPNPIVLP SPLTSFGGSC PERGTIVPEL QVVQEEIPIP 1020 

SSFVRLSYLS SRTPGYKTLL RILLTHSTIP VGMIKVHLTV AVEGRLTQKW FPAAINLVYT 1080 

FAWNKTDIYG QKVWGLAEAL VSVGYEYETC PDFILWEQRT VVLQGFEMDA SNLGDWSLNK 1140 

HHILNPQSGI IHKGNGENMF ISQQPPVIST IMGNGHQRSV ACTNCNGPAH NNKLFAPVAL 1200 

ASGPDGSVYV GDFKFVRRIF PSGNSVSILE LSTSPAHKYY LAMDPVSESL YLSDTNTRKV 1260 

YKLKSLVETK DLSKNFEVVA GTGDQCLPFD QSHCGDGGRA SEASLNSPRG ITVDRHGFIY 1320 

FVDGTMIRKI DENAVITTVI GSNGLTSTQP LSCDSGMDIT QVRLEWPTDL AVNPMDNSLY 1380 

VLDNNIVLQI SENRRVRIIA GRPIHCQVPG IDHFLVSKVA IHSTLESARA ISVSHSGLLF 1440 

IAETDERKVN RIQQVTTNGE IYIIAGAPTD CDCKIDPNCD CFSGDGGYAK DAKMKAPSSL 1500 

AVSPDGTLYV ADLGNVRIRT ISRNQAHLND MNIYEIASPA DQELYQFTVN GTHLHTLNLI 1560 

TRDYVYNFTY NSEGDLGAIT SSNGNSVHIR RDAGGMPLWL VVPGGQVYWL TISSNGVLKR 1620 

VSAQGYNPAL MTYPGNTGLL ATKSNENGWT TVYEYDPEGH LTNATFPTGE VSSFHSDLEK 1680 

LTKVELDTSN RENVLMSTNL TATSTIYILK QENTQSTYRV NPDGSLRVTF ASGMEIGLSS 1740 

EPHILAGAVN PTLGKCNISL PGEHNANLIE WRQRKEQNKG NVSAFERRLR AHNRNLLSID 1800 

FDHITRTGKI YDDHRKFTLR ILYDQTGRPI LWSPVSRYNE VNITYSPSGL VTFIQRGTWN 1860 

EKMEYDQSGK IISRTWADGK IWSYTYLEKS VMLLLHSQRR YIFEYDQSDC LLSVTMPSMV 1920 

RHSLQTMLSV GYYRNIYTPP DSSTSFIQDY SRDGRLLQTL HLGTGRRVLY KYTKQARLSE 1980 

VLYDTTQVTL TYEESSGVIK TIHLKHDGFI CTIRYRQTGP LIGRQIFRFS EEGLVNARFD 2040 

YSYNNFRVTS MQAVINETPL PIDLYRYVDV SGRTEQFGKF SVINYDLNQV ITTTVMKHTK 2100 

IFSANGQVIE VQYEILKAIA YWMTIQYDNV GRHGNMCIRV GVDANITRYF YEYDADGQLQ 2160 

TVSVNDKTQW RYSYDLNGDI KLLSHGKSAR LTPLRYDLRD RITRLGEIQY KMDEDGFLRQ 2220 

RGNDIFEYNS NGLLQKAYNK ASGHTVQYYY DGLGRRVASK SSLGQHLQFF VDATANPIRV 2280 

THLYNHTSSE ITSLYYDLQG HLIAMELSSG EEYYVACDNT GTPLAVFSSR GCVIKEILYT 2340 

PYGDIYHDTY PDFQVIIGFH GGLYDFLTKL VHLGQRDYDV YAGRWTTAYH HIWKQLNLLP 2400 

KPFNLYSFEN NYPVGKIQDV AKYTTDIRSW LELFGFQLHN VLPGFPKPEL ENLELTYELL 2460 

RLQTKTQEWD PGKTILGIQC ELQKQLRNFI SLDQLPMTPR YNDGRCLEGG KQPRFAAVPS 2520 

VFGKGIKFAI KDGIVTADII GVANEDSRRL AAILNNAHYL ENLHFTIEGR DTHYFIKLGS 2580 

LEEDLVLIGN TGGRRILENG VNVTVSQMTS LLNGRTRRFA DIQLQHGALC FNIRYGTTVE 2640 

EEKNHVLEIA RQRAVAQAWT KEQRRLQEGE EGIRAWTEGE KQQLLSTGRV QGYDGYFVLS 2700 
VEQYLELSDS ANNIHFMRQS EIGRR 



A211 DNA SEQUENCE: 
Gene name: Solute carrier family 26, member 4 
Unigene number: Hs.159275 
Probeset Accession #: AF030880 
Nucleic Acid Accession #: NM_000441 
Coding sequence: 225-2567 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | | 

CTCAGCCTTC CCGGTTCGGG AAAGGGGAAG AATGCAGGAG GGGTAGGATT TCTTTCCTGA 60 

TAGGATCGGT TGGGAAAGAC CGCAGCCTGT GTGTGTCTTT CCCTTCGACC AAGGTGTCTG 120 

TTGCTCCGTA AATAAAACGT CCCACTGCCT TCTGAGAGCG CTATAAAGGC AGCGGAAGGG 180 

TAGTCCGCGG GGCATTCCGG GCGGGGCGCG AGCAGAGACA GGTCATGGCA GCGCCAGGCG 240 

GCAGGTCGGA GCCGCCGCAG CTCCCCGAGT ACAGCTGCAG CTACATGGTG TCGCGGCCGG 300 

TCTACAGCGA GCTCGCTTTC CAGCAACAGC ACGAGCGGCG CCTGCAGGAG CGCAAGACGC 360 
TGCGGGAGAG CCTGGCCAAG TGCTGCAGTT GTTCAAGAAA GAGAGCCTTT GGTGTGCTAA 420 
AGACTCTTGT GCCCATCTTG GAGTGGCTCC CCAAATACCG AGTCAAGGAA TGGCTGCTTA 480 
GTGACGTCAT TTCGGGAGTT AGTACTGGGC TAGTGGCCAC GCTGCAAGGG ATGGCATATG 540 
CCCTACTAGC TGCAGTTCCT GTCGGATATG GTCTCTACTC TGCTTTTTTC CCTATCCTGA 600 
CATACTTTAT CTTTGGAACA TCAAGACATA TCTCAGTTGG ACCTTTTCCA GTGGTGAGTT 660 
TAATGGTGGG ATCTGTTGTT CTGAGCATGG CCCCCGACGA ACACTTTCTC GTATCCAGCA 720 
GCAATGGAAC TGTATTAAAT ACTACTATGA TAGACACTGC AGCTAGAGAT ACAGCTAGAG 780 
TCCTGATTGC CAGTGCCCTG ACTCTGCTGG TTGGAATTAT ACAGTTGATA TTTGGTGGCT 840 
TGCAGATTGG ATTCATAGTG AGGTACTTGG CAGATCCTTT GGTTGGTGGC TTCACAACAG 900 
CTGCTGCCTT CCAAGTGCTG GTCTCACAGC TAAAGATTGT CCTCAATGTT TCAACCAAAA 960 
ACTACAATGG AGTTCTCTCT ATTATCTATA CGCTGGTTGA GATTTTTCAA AATATTGGTG 1020 
ATACCAATCT TGCTGATTTC ACTGCTGGAT TGCTCACCAT TGTCGTCTGT ATGGCAGTTA 1080 
AGGAATTAAA TGATCGGTTT AGACACAAAA TCCCAGTCCC TATTCCTATA GAAGTAATTG 1140 
TGACGATAAT TGCTACTGCC ATTTCATATG GAGCCAACCT GGAAAAAAAT TACAATGCTG 1200 
GCATTGTTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTGA ACTTCCACCT GTGAGCTTGT 1260 
TCTCGGAGAT GCTGGCTGCA TCATTTTCCA TCGCTGTGGT GGCTTATGCT ATTGCAGTGT 1320 
CAGTAGGAAA AGTATATGCC ACCAAGTATG ATTACACCAT CGATGGGAAC CAGGAATTCA 1380 
TTGCCTTTGG GATCAGCAAC ATCTTCTCAG GATTCTTCTC TTGTTTTGTG GCCACCACTG 1440 
CTCTTTCCCG CACGGCCGTC CAGGAGAGCA CTGGAGGAAA GACACAGGTT GCTGGCATCA 1500 
TCTCTGCTGC GATTGTGATG ATCGCCATTC TTGCCCTGGG GAAGCTTCTG GAACCCTTGC 1560 
AGAAGTCGGT CTTGGCAGCT GTTGTAATTG CCAACCTGAA AGGGATGTTT ATGCAGCTGT 1620 
GTGACATTCC TCGTCTGTGG AGACAGAATA AGATTGATGC TGTTATCTGG GTGTTTACGT 1680 
GTATAGTGTC CATCATTCTG GGGCTGGATC TCGGTTTACT AGCTGGCCTT ATATTTGGAC 1740 
TGTTGACTGT GGTCCTGAGA GTTCAGTTTC CTTCTTGGAA TGGCCTTGGA AGCATCCCTA 1800 
GCACAGATAT CTACAAAAGT ACCAAGAATT ACAAAAACAT TGAAGAACCT CAAGGAGTGA 1860 
AGATTCTTAG ATTTTCCAGT CCTATTTTCT ATGGCAATGT CGATGGTTTT AAAAAATGTA 1920 
TCAAGTCCAC AGTTGGATTT GATGCCATTA GAGTATATAA TAAGAGGCTG AAAGCGCTGA 1980 
GGAAAATACA GAAACTAATA AAAAGTGGAC AATTAAGAGC AACAAAGAAT GGCATCATAA 2040 
GTGATGCTGT TTCAACAAAT AATGCTTTTG AGCCTGATGA GGATATTGAA GATCTGGAGG 2100 
AACTTGATAT CCCAACCAAG GAAATAGAGA TTCAAGTGGA TTGGAACTCT GAGCTTCCAG 2160 
TCAAAGTGAA CGTTCCCAAA GTGCCAATCC ATAGCCTTGT GCTTGACTGT GGAGCTATAT 2220 
CTTTCCTGGA CGTTGTTGGA GTGAGATCAC TGCGGGTGAT TGTCAAAGAA TTCCAAAGAA 2280 
TTGATGTGAA TGTGTATTTT GCATCACTTC AAGATTATGT GATAGAAAAG CTGGAGCAAT 2340 
GCGGGTTCTT TGACGACAAC ATTAGAAAGG ACACATTCTT TTTGACGGTC CATGATGCTA 2400 
TACTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT TTAGAAACGA 2460 
TCACTCTCAT TCAGGATTGT AAAGATACCC TTGAATTAAT AGAAACAGAG CTGACGGAAG 2520 
AAGAACTTGA TGTCCAGGAT GAGGCTATGC GTACACTTGC ATCCTGAAAG TGGGTTCGGG 2580 
AGGTCTCTAT GAGCAAGGAA TACAAGACAA AACTTCCTCA ATGCATTGAC TATTTCTTCA 2640 
GACTCAAAAC ACTCATTCTT TTTTCTATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 2700 
TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TGGGCAAAAT GGCTAGAATT 2760 
ATTCAGACGA TTTGGCAGCG TCCAGGGTAA GCTGGTGTTA TAATACGCTG CTGATCTACA 2820 
TCACAGATTT GCTAATAATG TTCACGTGGG CCCTGGCATA TCTCTGTTCA GTTAGAGTGA 2880 
GTGCTGACCC AACAGCCTCT GTGGTCAAGC GAGTCACGAA TGATTAATCA TAAAGAAAAA 2940 
TCAGTTTTTG ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGGTCACATT 3000 
TAGTAAATGC TGAAATAAAA TGATTAATGC ATTTATCAAT AAAAGCCTTT GAAAATACTT 3060 
TGGATAATAA ATTGGAGTTT TAAAAATGCA AATTTGCTTA GTATCTAATA ATGAAGTGTT 3120 
ATTACATATA GCCGGAATTG AGGATCTCTT TGATCCTGGA AATGGTTTAC CTAAAAGCTA 3180 
CAGAACCAGG CCAATATATT TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT 3240 
TTCATGGTTT ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3300 
AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT AGTCTATAGA 3360 
AATGATCATT GCATGGAGGC ATGTATAGGT ATGATCTGTG TAAAATCTGA CATAAAAACA 3420 
GTGCTATTCT GAGTGAAAAT TTTTTTGATG TGCTTACATA ACCATGGTGA TTAAAATGAG 3480 
TTTATATTTT TTCTCAAAAA TTTTAGCAGT GTGTAAAGTA AGTAATCTTT AACTGAACTC 3540 
TGACCACTTA AAAAAAAATC TAAAAATTGA ACTACCTATA GTAGTCTGTG TTTAAAGTGA 3600 
ATTTTTAAAG ACAAAGCATT CTAAATGAAC TCAATATAAA AACATTCATT TGGAATGTAC 3660 
ATACTGAAAA ATACAGGTTT TTTTGACCAA AAGTTTTTAT ATCTTTTCTT TTTATTTATT 3720 
TTTTTCCTAA GTGCCAACAA TTTTCTAGAT ATTATATACA ACACAGGCTT TGATCTTGGG 3780 
GACTTTTCCC ATATATTTCA CACTGGAGTG AATGAAGTTG TACTTCATTT CTAGAGAAAA 3840 
GTTATACCCA GGTCCCCAAT TGAGAATGTC TTGCTTGATT GAAAACGACA TCATCCCTTG 3900 
GTATACTCCA GGGATTGGTT TCAGGACCCC TGCATTTACC AAAATTTGTG CACACTCAAG 3960 
TCCTGCAGTC ACCCCTGCCT AAAGATAGAA TGGCTTCTCT GTTTTTCTTC TGAAATACAA 4020 
CCAGAAACAA TGTGTCTATT TCTGAAAGAA TAGGATTAAT GATCATACAA ATGGGTTAAT 4080 
CCTGAATTCT GGTTGTAAAT CTGGTTACAG CATAACTAGG ATTATAATGC TGCCTCATTT 4140 
TCACAGCACT ACTTGCTTAT ATTGACAACA AATCATCTCG CTAAAGAGTG AATGTAGGCC 4200 
AGGCGCGGTG GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGTGGATCAC 4260 
GAGGTCAGGA GATCGAGACC ATCCTGGCTA ACATGGTAAA ACCCCGTCTC TACTAAAAAT 4320 
AGAAAAAAAG AAATTAGCCT AGCGTGGTGG CTGGCGGGCG CCTGTAGTCC CAGCTATTTG 4380 
GGAGGCTAAG GCAGGAGAAT GGCGTGAACC CGGGAGGCGG AGCTTGCAGT GAGCCGAGGT 4440 
CGTGCCACTG CACTCCAGCC TGGGCGACAG AGCAAGACTC CGTCTCAAAA AAAAAAAAAA 4500 
AAAAAAAAAA AGAGTGAATG TAATAGTCTT GCAGAAAATG AATGAATACC TTTGTTCAAT 4560 
AAAGGAAATA TGCACTGCTC ACTTTTTTGA AGGAAATGCC AAAGTTACGT TTTACAACAA 4620 
GGCTAGAGTT TGTAAATTCT GGGTTCATTT GTGATGACAT AAGTCAGCAA ACTGCGGGAA 4680 
TACTGTCTCT TCTATGTATT TTGTGAATAG TAAGCATAAT TTTAGTTTTG TATTATCAAT 4740 
GAAAATTTCA CTTGAAATTA AAGCTGCCTT TTGTTATATT TTTAACCTAT AGGATAAGAT 4800 
TCCAGTATTG TATATGAGTT TTAACAAATT AAAAAATCAA ATCATGTACA TTTGAAAATA 4860 
TTTGCACACA TTTAAAAATA AATGTAAAGT TGTCTTTTAA ACTACTCGGA TGTGTCCTTT 4920 
CTGAACAAAA 
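
The coding-sequence coordinates in these entries can be checked mechanically. In the entry above, counting the listing from position 1 puts ATG at 225-227 and TGA at 2565-2567, so the range reads as 1-based and inclusive of the stop codon. The following Python sketch is illustrative only and is not part of the original listing: `parse_listing` and `check_cds` are hypothetical helper names, and the short toy input stands in for a real entry.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_listing(text):
    """Join the blocks of a numbered sequence listing into one string.

    Ruler lines (only digits or single characters) are skipped, and a
    trailing running position count such as "60" is dropped from each row.
    Caveat: a final row holding a single 1-letter block would also be skipped.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines such as "1 11 21" or "| | |".
        if all(len(t) == 1 or t.isdigit() for t in tokens):
            continue
        # Drop the running position count at the end of the row.
        if tokens[-1].isdigit():
            tokens = tokens[:-1]
        # Keep only purely alphabetic blocks.
        residues.extend(t for t in tokens if t.isalpha())
    return "".join(residues).upper()


def check_cds(seq, start, end):
    """Return (first codon, last codon, length divisible by 3) for a
    1-based, inclusive coding range."""
    cds = seq[start - 1:end]
    return cds[:3], cds[-3:], len(cds) % 3 == 0


# Toy entry (assumed data, not taken from the listing).
toy = "1 11\n| |\nCCATGGCTAA ATGAGG 16\n"
seq = parse_listing(toy)
first, last, in_frame = check_cds(seq, 3, 14)
print(seq)                                   # CCATGGCTAAATGAGG
print(first, last, in_frame)                 # ATG TGA True
print(first == "ATG" and last in STOP_CODONS)  # True
```

Applied to a real entry, the same check would flag a coordinate range whose first codon is not ATG or whose length is not a multiple of three.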



A212 Protein sequence: 
Gene name: Solute carrier family 26, 
Unigene number: Hs.159275 
Probeset Accession #: AF030880 
Protein Accession #: O43511 
Signal sequence: none found 



932 



WO 03/042661 



PCT/US02/36810 



Transmembrane domains: 81-103, 109-131, 136-158, 185-207, 221-243, 245-267, 270-291, 295-317, 347-369, 386-408, 420-442, 446-470, 486-508 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 
| | | | | | 
MAAPGGRSEP PQLPEYSCSY MVSRPVYSEL AFQQQHERRL QERKTLRESL AKCCSCSRKR 60 
AFGVLKTLVP ILEWLPKYRV KEWLLSDVIS GVSTGLVATL QGMAYALLAA VPVGYGLYSA 120 
FFPILTYFIF GTSRHISVGP FPVVSLMVGS VVLSMAPDEH FLVSSSNGTV LNTTMIDTAA 180 
RDTARVLIAS ALTLLVGIIQ LIFGGLQIGF IVRYLADPLV GGFTTAAAFQ VLVSQLKIVL 240 
NVSTKNYNGV LSIIYTLVEI FQNIGDTNLA DFTAGLLTIV VCMAVKELND RFRHKIPVPI 300 
PIEVIVTIIA TAISYGANLE KNYNAGIVKS IPRGFLPPEL PPVSLFSEML AASFSIAVVA 360 
YAIAVSVGKV YATKYDYTID GNQEFIAFGI SNIFSGFFSC FVATTALSRT AVQESTGGKT 420 
QVAGIISAAI VMIAILALGX LLEPLQKSVL AAVVIANLKG MFMQLCDIPR LWRQNKIDAV 480 
IWVFTCIVSI ILGLDLGLLA GLIFGLLTVV LRVQFPSWNG LGSIPSTDIY KSTKKYKNIE 540 
EPQGVKILRF SSPIFYGNVD GFKKCIKSTV GFDAIRVYNK RLKALRKIQK LIKSGQLRAT 600 
KNGIISDAVS TNNAFEPDED IEDLEELDIP TKEIEIQVDW NSELPVKVNV PKVPIHSLVL 660 
DCGAISFLDV VGVRSLRVIV KEFQRIDVNV YFASLQDYVI EKLEQCGFFD DNIRKDTFFL 720 
TVHDAILYLQ NQVKSQEGQG SILETITLIQ DCKDTLELIE TELTEEELDV QDEAMRTLAS 780 
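
The transmembrane domain ranges given for a protein entry can be mapped back onto its sequence once the rows are joined. This is a minimal sketch, assuming the ranges are 1-based, inclusive residue positions (the listing does not state this explicitly); `parse_ranges` and `extract_segments` are hypothetical names, and the example sequence is a toy string rather than an entry from the listing.

```python
def parse_ranges(text):
    """Parse a range list such as '81-103, 109-131' into (start, end) tuples."""
    ranges = []
    for part in text.split(","):
        part = part.strip()
        if not part:
            continue
        start, end = part.split("-")
        ranges.append((int(start), int(end)))
    return ranges


def extract_segments(protein, ranges):
    """Return the residues covered by each 1-based, inclusive (start, end) range."""
    return [protein[start - 1:end] for start, end in ranges]


# Toy data (assumed for illustration only).
toy_protein = "MKTAYIAK"
toy_ranges = parse_ranges("2-4, 6-7")
print(toy_ranges)                                  # [(2, 4), (6, 7)]
print(extract_segments(toy_protein, toy_ranges))   # ['KTA', 'IA']
```

Each predicted domain in the listing spans roughly 21 to 25 residues, so a quick length check on the extracted segments is a cheap way to spot a mis-transcribed range.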



A213 DNA SEQUENCE: 
Gene name: ESTs, Highly similar to calcium-activated potassium channel rSK2 [R.norvegicus] 
Unigene number: Hs.98280 
Probeset Accession #: AA418000 
Nucleic Acid Accession #: NM_021614 
Coding sequence: 458-2197 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | | 

CGGCGGCAGC AGCCCATGCC TCCGGTGCAA CAGCTGCGCC TCCTCCGGTG CCCCGGCGGC 60 

GGGGGCGGGA GATAACCTGT CCCTGCTGCT CCGCACCTCC TCGCCCGGCG GCGCCTTCCG 120 

GACCCGCACC TCCTCGCCGC TGTCGGGCTC GTCCTGCTGC TGCTGCTGCT GCTCGTCGCG 180 

CCGGGGCAGC CAGCTCAATG TGAGCGAGCT GACGCCGTCC AGCCATGCCA GTGCGCTCCG 240 

GCAGCAGTAC GCGCAGCAGT CCGCGCAGCA GTCGGCGTCC GCCTCCCAGT ACCACCAGTG 300 

CCACAGCCTG CAGCCCGCCG CCAGCCCCAC GGGCAGCCTC GGCAGTCTGG GCTCCGCGCC 360 

CCCGCTCTCG CACCACCACC ACCACCCGCA CCCGGCGCAC CACCAGCACC ACCAGCCCCA 420 

GGCGCGCCGC GAGAGCAACC CCTTCACCGA AATAGCCATG AGCAGCTGCA GGTACAACGG 480 

GGGCGTCATG CGGCCGCTCA GCAACTTGAG CGCGTCCCGC CGGAACCTCC ACGAGATGGA 540 

CTCAGAGGCG CAGCCCCTGC AGCCCCCCGC GTCTGTCGGA GGAGGTGGCG GCGCGTCCTC 600 

CCCGTCTGCA GACGCTGCCG CCGCCGCCGC TGTTTCGTCC TCAGCCCCCG AGATCGTGGT 660 

GTCTAAGCCC GAGCACAACA ACTCCAACAA CCTGGCGCTC TATGGAACCG GCGGCGGAGG 720 

CAGCACTGGA GGAGGCGGCG GCGGTGGAGG GAGCGGGCAC GGCAGCAGCA GTGGCACCAA 780 

GTCCAGCAAA AAGAAAAACC AGAACATCGG CTACAAGCTG GGCCACCGGC GCGCCCTGTT 840 

CGAAAAGCGC AAGCGGCTCA GCGACTACGC GCTCATCTTC GGCATGTTCG GCATCGTGGT 900 

CATGGTCATC GAGACCGAGC TGTCGTGGGG CGCCTACGAC AAGGCGTCGC TGTATTCCTT 960 

AGCTCTGAAA TGCCTTATCA GTCTCTCCAC GATCATCCTG CTCGGTCTGA TCATCGTGTA 1020 

CCACGCCAGG GAAATACAGT TGTTCATGGT GGACAATGGA GCAGATGACT GGAGAATAGC 1080 

CATGACTTAT GAGCGTATTT TCTTCATCTG CTTGGAAATA CTGGTGTGTG CTATTCATCC 1140 

CATACCTGGG AATTATACAT TCACATGGAC GGCCCGGCTT GCCTTCTCCT ATGCCCCATC 1200 

CACAACCACC GCTGATGTGG ATATTATTTT ATCTATACCA ATGTTCTTAA GACTCTATCT 1260 

GATTGCCAGA GTCATGCTTT TACATAGCAA ACTTTTCACT GATGCCTCCT CTAGAAGCAT 1320 

TGGAGCACTT AATAAGATAA ACTTCAATAC ACGTTTTGTT ATGAAGACTT TAATGACTAT 1380 

ATGCCCAGGA ACTGTACTCT TGGTTTTTAG TATCTCATTA TGGATAATTG CCGCATGGAC 1440 

TGTCCGAGCT TGTGAAAGGT ACCATGATCA ACAGGATGTT ACTAGCAACT TCCTTGGAGC 1500 

GATGTGGTTG ATATCAATAA CTTTTCTCTC CATTGGTTAT GGTGACATGG TACCTAACAC 1560 

ATACTGTGGA AAAGGAGTCT GCTTACTTAC TGGAATTATG GGTGCTGGTT GCACAGCCCT 1620 

GGTGGTAGCT GTAGTGGCAA GGAAGCTAGA ACTTACCAAA GCAGAAAAAC ACGTGCACAA 1680 

TTTCATGATG GATACTCAGC TGACTAAAAG AGTAAAAAAT GCAGCTGCCA ATGTACTCAG 1740 

GGAAACATGG CTAATTTACA AAAATACAAA GCTAGTGAAA AAGATAGATC ATGCAAAAGT 1800 

AAGAAAACAT CAACGAAAAT TCCTGCAAGC TATTCATCAA TTAAGAAGTG TAAAAATGGA 1860 

GCAGAGGAAA CTGAATGACC AAGCAAACAC TTTGGTGGAC TTGGCAAAGA CCCAGAACAT 1920 

CATGTATGAT ATGATTTCTG ACTTAAACGA AAGGAGTGAA GACTTCGAGA AGAGGATTGT 1980 

TACCCTGGAA ACAAAACTAG AGACTTTGAT TGGTAGCATC CACGCCCTCC CTGGGCTCAT 2040 

AAGCCAGACC ATCAGGCAGC AGCAGAGAGA TTTCATTGAG GCTCAGATGG AGAGCTACGA 2100 

CAAGCACGTC ACTTACAATG CTGAGCGGTC CCGGTCCTCG TCCAGGAGGC GGCGGTCCTC 2160 

TTCCACAGCA CCACCAACTT CATCAGAGAG TAGCTAGAAG AGAATAAGTT AACCACAAAA 2220 

TAAGACTTTT TGCCATCATA TGGTCAATAT TTTAGCTTTT ATTGTAAAGC CCCTATGGTT 2280 

CTAATCAGCG TTATCCGGGT TCTGATGTCA GAATCCTGGG AACCTGAACA CTAAGTTTTA 2340 

GGCCAAAATG AGTGAAAACT CrriT T'l TTT CTTTCAGATG CACAGGGAAT GCACCTATTA 2400 

TTGCTATATA GATTGTTCCT CCTGTAATTT CACTAACTTT TTATTCATGC ACTTCAAACA 2460 
AACTTTACTA CTACATTATA TGATATATAA TAAAAAAAGT TAATTTCGGA 



A214 Protein sequence: 
Gene name: ESTs, Highly similar to calcium-activated potassium channel rSK2 [R.norvegicus] 
Unigene number: Hs.98280 
Probeset Accession #: AA418000 
Protein Accession #: NP_067627 
Signal sequence: none found 
Transmembrane domains: 135-157, 168-190, 208-230, 254-276, 306-328, 342-364, 373-394 






Calmodulin binding domain: 412-488 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

| | | | | | 

MSSCRYNGGV MRPLSNLSAS RRNLHEMDSE AQPLQPPASV GGGGGASSPS ADAAAAAAVS 60 

SSAPEIVVSK PEHNNSNNLA LYGTGGGGST GGGGGGGGSG HGSSSGTKSS KKKNQNIGYK 120 

LGHRRALFEK RKRLSDYALI FGMFGIVVMV IETELSWGAY DKASLYSLAL KCLISLSTII 180 

LLGLIIVYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCAIHPIP GNYTFTWTAR 240 

LAFSYAPSTT TADVDIILSI PMFLRLYLIA RVMLLHSKLF TDASSRSIGA LNKINFNTRF 300 

VMKTLMTICP GTVLLVFSIS LWIIAAWTVR ACERYHDQQD VTSNFLGAMH LISITFLSIG 360 

YGDMVPNTYC GKGVCLLTGI MGAGCTALVV AVVARKLELT KAEKHVHNFM MDTQLTKRVK 420 

NAAANVLRET WLIYKNTKLV KKIDHAKVRK HQRKFLQAIH QLRSVKMEQR KLNDQANTLV 480 

DLAKTQNIKY DMISDLNERS EDPEKRIVTL ETKLETLIGS IHALPGLISQ TIRQQQRDFI 540 
EAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSESS 

A215 DNA SEQUENCE: 
Gene name: CGI-86 protein 
Unigene number: Hs.109201 
Probeset Accession #: AW161450 
Nucleic Acid Accession #: NM_016029 
Coding sequence: 228-1097 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | | 

CTGCGATCCC GCAGGGCAGC GACGCGACTC TGGTGCGGGC CGTCTTCTTC CCCCCGAGCT 60 

GGGCGTGCGC GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTGTGCGCGC 120 

TGCTCCTGCT CTTGGTGCAG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 180 

TATGGGCCGA GTGGCAGGGA CGACGCCCAG AATGGGAGCT GACTGATATG GTGGTGTGGG 240 

TGACTGGAGC CTCGAGTGGA ATTGGTGAGG AGCTGGCTTA CCAGTTGTCT AAACTAGGAG 300 

TTTCTCTTGT GCTGTCAGCC AGAAGAGTGC ATGAGCTGGA AAGGGTGAAA AGAAGATGCC 360 

TAGAGAATGG CAATTTAAAA GAAAAAGATA TACTTGTTTT GCCCCTTGAC CTGACCGACA 420 

CTGGTTCCCA TGAAGCGGCT ACCAAAGCTG TTCTCCAGGA GTTTGGTAGA ATCGACATTC 480 

TGGTCAACAA TGGTGGAATG TCCCAGCGTT CTCTGTGCAT GGATACCAGC TTGGATGTCT 540 

ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 600 

CTCACATGAT CGAGAGGAAG CAAGGAAAGA TTGTTACTGT GAATAGCATC CTGGGTATCA 660 

TATCTGTACC TCTTTCCATT GGATACTGTG CTAGCAAGCA TGCTCTCCGG GGTTTTTTTA 720 

ATGGCCTTCG AACAGAACTT GCCACATACC CAGGTATAAT AGTTTCTAAC ATTTGCCCAG 780 

GACCTGTGCA ATCAAATATT GTGGAGAATT CCCTAGCTGG AGAAGTCACA AAGACTATAG 840 

GCAATAATGG AGACCAGTCC CACAAGATGA CAACCAGTCG TTGTGTGCGG CTGATGTTAA 900 

TCAGCATGGC CAATGATTTG AAAGAAGTTT GGATCTCAGA ACAACCTTTC TTGTTAGTAA 960 

CATATTTGTG GCAATACATG CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020 

AAAGGATTGA GAACTTTAAG AGTGGTGTGG ATGCAGACTC TTCTTATTTT AAAATCTTTA 1080 

AGACAAAACA TGACTGAAAA GAGCACCTGT ACTTTTCAAG CCACTGGAGG GAGAAATGGA 1140 

AAACATGAAA ACAGCAATCT TCTTATGCTT CTGAATAATC AAAGACTAAT TTGTGATTTT 1200 

ACTTTTTAAT AGATATGACT TTGCTTCCAA CATGGAATGA AATAAAAAAT AAATAATAAA 1260 
AGATTGCCAT GAATCTTGCA AA 



A216 Protein sequence: 
Gene name: CGI-86 protein 
Unigene number: Hs.109201 
Probeset Accession #: AW161450 
Protein Accession #: NP_057113 
Signal sequence: 1-26 
Transmembrane domains: 183-206, 221-243 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 

| | | | | | 

MNWELLLWLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWELTD MVVWVTGASS 60 
GIGEELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGNL KEKDILVLPL DLTDTGSHEA 120 
ATKAVLQEFG RIDILVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMIER 180 
KQGKIVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN 240 
IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 300 
MPTWAWWITN KMGKKRIENF KSGVDADSSY FKIFKTKHD 



A217 DNA SEQUENCE: 
Gene name: Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763) 
Unigene number: Hs.27373 
Probeset Accession #: F13036 
Nucleic Acid Accession #: AC012478 
Coding sequence: 1-1896 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | | 

ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGGCC CGAACCCGAG 120 

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 






ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 

CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 1080 

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 

GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260 

ACCCATCCTG TCAGGTTGGC TCGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GGTCAGCGGG TACCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500 

GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1680 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 

GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 

TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT 1860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCCTGA 



A218 Protein sequence: 
Gene name: Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DKFZp56401763) 
Unigene number: Hs.27373 
Probeset Accession #: F13036 
Protein Accession #: FGENESH predicted 
Signal sequence: 1-27 
Transmembrane domains: 94-115, 448-469 
Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I I 

MRAVPLPAPL LPLIdiLALLA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRFGSG 60 

AAGGSGSSSS NSSGDALVTR ISILLRDLPT LKAAVIVAPA FTTLLIACLL LRVPRSGKRL 120 

KKTRKYDIIT TPAERVEMAP LNEEDDEDED STVFDIKYRV SLPAALRRQL PGCQTLLTVP 180 

VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHPVESW SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 360 

STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPLS 420 

THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480 

ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540 

ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKMVLM SEEGPPSLTG 600 
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P 

A219 DNA SEQUENCE 

Gene name: selectin E (endothelial adhesion molecule 1) 

Unigene number: Hs. 89546 

Probeset Accession #: M24736 

Nucleic Acid Accession #: NM_000450 

Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | | 

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 60 

GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 120 

CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 180 

TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT CAACAATGTG 240 

TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 300 

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 360 

GATGTGGGCA TGTGGAATGA TGAGAGGTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 420 

GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 

ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 600 

AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 660 

ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTGA ACTAATGGGA 840 

GCCCAGAGCC TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900 

GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960 

CCTGCTGGAG AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGGCTTCATG 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

GTTTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 

CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 

GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260 






WO 03/042661 



PCT/US02/36810 






GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 
GTGAGGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 
TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 
TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 
AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 
CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 
TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 
CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 
TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 






A220 Protein sequence: 
Gene name: Selectin E (endothelial adhesion molecule 1) 
Unigene number: Hs.89546 
Probeset Accession #: M24736 
Protein Accession #: NP_000441 
Signal sequence: 1-22 
Transmembrane domains: 555-573 
C-lectin domain: 23-139 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 
| | | | | | 
MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 
SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 
DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180 
TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNVV 240 
ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300 
AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 
VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420 
EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480 
WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540 
SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 






Taxol Prostate 
A221 DNA SEQUENCE 

Gene name: ESTs; Liprin A2 

Unigene number: Hs. 306480 

Probeset Accession #: N51002 

Nucleic Acid Accession #: N51002 

Coding sequence: 1-3793 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
| | | | | | 
ATGATGTGTG AAGTGATGCC CACGATTAAT GAGGACACCC CAATGAGCCA AAGGGGGTCC 60 
CAAAGCAGTG GCTCGGACTC AGACTCCCAT TTTGAGCAGC TGATGGTGAA TATGCTAGAT 120 
GAAAGGGATC GTCTTCTAGA CACCCTTCGG GAGACCCAGG AAAGCCTCTC ACTTGCCCAG 180 
CAAAGACTTC AGGATGTCAT CTATGACCGA GACTCACTCC AGAGACAGCT CAATTCAGCC 240 
CTGCCACAGG ATATCGAATC CCTAACAGGA GGGCTGGCTG GTTCTAAGGG GGCTGATCCA 300 
CCGGAATTTG CTGCACTGAC AAAAGAATTA AATGCCTGCA GGGAACAACT TCTAGAAAAG 360 
GAAGAAGAAA TCTCTGAACT TAAAGCTGAA AGAAACAACA CAAGACTATT ACTGGAGCAT 420 
TTGGAGTGCC TTGTGTCACG ACATGAAAGA TCACTAAGAA TGACGGTGGT AAAACGGCAA 480 
GCCCAGTCTC CCTCAGGAGT ATCCAGTGAA GTTGAAGTTC TCAAGGCACT GAAATCTTTG 540 
TTTGAGCACC ACAAGGCCTT GGATGAAAAG GTAAGGGAGC GACTGAGGGT TTCTTTAGAA 600 
AGAGTCTCTG CACTGGAAGA AGAACTAGCT GCTGCTAATC AGGAGATTGT TGCCTTGCGT 660 
GAACAAAATG TTCATATACA AAGAAAAATG GCATCAAGCG AGGGATCCAC AGAGTCAGAA 720 
CATCTTGAAG GGATGGAACC TGGACAGAAA GTCCATGAGA AGCGTTTGTC CAATGGTTCT 780 
ATAGACTCAA CCGATGAAAC TAGTCAAATA GTTGAACTAC AAGAATTGCT TGAAAAGCAA 840 
AACTATGAAA TGGCCCAGAT GAAAGAACGT TTAGCAGCCC TTTCTTCCCG AGTGGGAGAG 900 
GTGGAACAGG AAGCAGAGAC AGCAAGAAAG GATCTCATTA AAACAGAAGA AATGAACACC 960 
AAGTATCAAA GGGACATTAG GGAGGCCATG GCACAAAAGG AAGATATGGA AGAAAGAATT 1020 
ACAACCCTTG AAAAGCGTTA CCTCAGTGCT CAGAGAGAAT CTACCTCCAT ACATGACATG 1080 
AATGATAAAC TAGAAAATGA GTTAGCAAAT AAAGAAGCTA TCCTACGGCA GATGGAAGAG 1140 
AAAAACAGAC AGTTACAAGA ACGTCTTGAG CTAGCTGAAC AAAAGTTGCA GCAGACCATG 1200 
AGAAAGGCTG AAACCTTGCC TGAAGTAGAG GCTGAACTGG CTCAGAGAAT TGCAGCCCTA 1260 
ACCAAGGCTG AAGAGAGACA TGGAAATATT GAAGAACGTA TGAGACATTT AGAGGGTCAA 1320 
CTTGAAGAGA AGAATCAAGA ACTTCAAAGA GCTAGGCAAA GAGAGAAAAT GAATGAGGAG 1380 
CATAACAAGA GATTATCGGA TACGGTTGAT AGACTTCTGA CTGAATCCAA TGAACGCCTA 1440 
CAACTACACT TAAAGGAAAG AATGGCTGCT CTAGAAGAAA AGAATGTTTT AATTCAAGAA 1500 
TCAGAAACTT TCAGAAAGAA TCTTGAAGAA TCTTTACATG ATAAGGAAAG ATTAGCAGAA 1560 
GAAATTGAAA AGCTGAGATC TGAACTTGAC CAATTGAAAA TGAGAACTGG CTCTTTAATT 1620 
GAACCCACAA TACCAAGAAC TCATCTAGAC ACCTCAGCTG AGTTGCGGTA CTCAGTGGGA 1680 
TCCCTAGTGG ACAGCCAGTC TGATTACAGA ACAACTAAAG TAATAAGAAG ACCAAGGAGA 1740 
GGCCGCATGG GTGTGCGAAG AGATGAGCCA AAGGTGAAAT CTCTTGGGGA TCACGAGTGG 1800 
AATAGAACTC AACAGATTGG AGTACTAAGC AGCCACCCTT TTGAAAGTGA CACTGAAATG 1860 
TCTGATATTG ATGATGATGA CAGAGAAACA ATTTTTAGCT CAATGGATCT TCTCTCTCCA 1920 
AGTGGTCATT CCGATGCCCA GACGCTAGCC ATGATGCTTC AGGAACAATT GGATGCCATC 1980 
AACAAAGAAA TCAGGCTAAT TCAGGAAGAA AAAGAATCTA CAGAGTTGCG TGCTGAAGAA 2040 
ATTGAAAATA GAGTGGCTAG TGTGAGCCTC GAAGGCCTGA ATTTGGCAAG GGTCCACCCA 2100 



GGTACCTCCA TTACTGCCTC TGTTACAGCT TCATCGCTGG CCAGTTCATC TCCCCCCAGT 2160 
GGACACTCAA CTCCAAAGCT CACCCCTCGA AGCCCTGCCA GGGAAATGGA TCGGATGGGA 2220 
GTCATGACAC TGCCAAGTGA TCTGAGGAAA CATCGGAGAA AGATTGCAGT TGTGGAAGAA 2280 
GATGGTCGAG AGGACAAAGC AACAATTAAA TGTGAAACTT CTCCTCCTCC TACCCCTAGA 2340 
GCCCTCAGAA TGACTCACAC TCTCCCTTCT TCCTACCACA ATGATGCTCG AAGTAGTTTA 2400 
TCTGTCTCTC TTGAGCCAGA AAGCCTCGGG CTTGGTAGTG CCAACAGCAG CCAAGACTCT 2460 
CTTCACAAAG CCCCCAAGAA GAAAGGAATC AAGTCTTCAA TAGGACGTTT GTTTGGTAAA 2520 
AAAGAAAAAG CTCGACTTGG GCAGCTCCGA GGCTTTATGG AGACTGAAGC TGCAGCTCAG 2580 
GAGTCCCTGG GGTTAGGCAA ACTCGGAACT CAAGCTGAGA AGGATCGAAG ACTAAAGAAA 2640 
AAGCATGAAC TTCTTGAAGA AGCTCGGAGA AAGGGATTAC CTTTTGCCCA GTGGGATGGG 2700 
CCAACTGTGG TCGCATGGCT AGAGCTTTGG TTGGGAATGC CTGCGTGGTA CGTGGCAGCC 2760 
TGCCGAGCCA ACGTGAAGAG TGGTGCCATC ATGTCTGCTT TATCTGACAC TGAGATCCAG 2820 
AGAGAAATTG GAATCAGCAA TCCACTGCAT CGCTTAAAAC TTCGATTAGC AATCCAGGAG 2880 
ATGGTTTCCC TAACAAGTCC TTCAGCTCCT CCAACATCTC GAACTCCTTC AGGCAACGTT 2940 
TGGGTGACTC ATGAAGAAAT GGAAAATCTT GCAGCTCCAG CAAAAACGAA AGAATCTGAG 3000 
GAAGGAAGCT GGGCCCAGTG TCCGGTTTTT CTACAGACCC TGGCTTATGG AGATATGAAT 3060 
CATGAGTGGA TTGGAAATGA ATGGCTTCCC AGCTTGGGGT TACCTCAGTA CAGAAGTTAC 3120 
TTTATGGAAT GCTTGGTAGA TGCAAGAATG TTAGATCACC TAACAAAAAA AGATCTCCGT 3180 
GTCCATTTAA AAATGGTGGA TAGTTTCCAT CGAACAAGTT TACAATATGG AATTATGTGC 3240 
TTAAAGAGGT TGAATTATGA CAGAAAAGAA CTAGAAAGAA GACGGGAAGC AAGCCAACAT 3300 
GAAATAAAAG ACGTGTTGGT GTGGAGCAAT GACCGAATTA TTCGCTGGAT ACAAGCAATT 3360 
GGACTTCGAG AATATGCAAA TAATATACTT GAGAGCGGTG TGCATGGCTC ACTTATAGCC 3420 
CTGGATGAAA ACTTTGACTA CAGCAGCTTA ACTTTATTAT TACAGATTCC AACACAGAAC 3480 
ACCCAGGCAA GGCAGATTCT TGAAAGAGAA TACAATAACC TCTTGGCCCT GGGAACTGAA 3540 
AGGCGACTGG ATGAAAGTGA TGACAAGAAC TTCAGACGTG GATCAACCTG GAGAAGGCAG 3600 
TTTCCTCCTC GTGAAGTACA TGGAATCAGC ATGATGCCTG GGTCCTCAGA AACATTACCA 3660 
GCTGGATTTA GGTTAACCAC AACCTCTGGG CAATCAAGAA AAATGACAAC AGATGTTGCT 3720 
TCATCAAGAC TGCAGAGGTT AGACAACTCC ACTGTTCGCA CATACTCATG TCTCGAGTAA 3780 
GCGGCCGCTT TAA 



A222 Protein sequence: 
Gene name: ESTs; Liprin A2 
Unigene number: Hs.306480 
Probeset Accession #: N51002 
Protein Accession #: none found 
Signal sequence: none found 
Transmembrane domains: none found 
AAA domain: 286-539 
SAM domains: 895-964, 1017-1084, 1105-1177 
Cellular Localization: not determined 



1 11 21 31 41 51 
| | | | | | 
MMCEVMPTIN EDTPMSQRGS QSSGSDSDSH FEQLMVNMLD ERDRLLDTLR ETQESLSLAQ 60 
QRLQDVIYDR DSLQRQLNSA LPQDIESLTG GLAGSKGADP PEFAALTKEL NACREQLLEK 120 
EEEISELKAE RNNTRLLLEH LECLVSRHER SLRMTVVKRQ AQSPSGVSSE VEVLKALKSL 180 
FEHHKALDEK VRERLRVSLE RVSALEEELA AANQEIVALR EQNVHIQRKM ASSEGSTESE 240 
HLEGMEPGQK VHEKRLSNGS IDSTDETSQI VELQELLEKQ NYEMAQMKER LAALSSRVGE 300 
VEQEAETARK DLIKTEEMNT KYQRDIREAM AQKEDMEERI TTLEKRYLSA QRESTSIHDM 360 
NDKLENELAN KEAILRQMEE KNRQLQERLE LAEQKLQQTM RKAETLPEVE AELAQRIAAL 420 
TKAEERHGNI EERMRHLEGQ LEEKNQELQR ARQREKMNEE HNKRLSDTVD RLLTESNERL 480 
QLHLKERMAA LEEKNVLIQE SETFRKNLEE SLHDKERLAE EIEKLRSELD QLKMRTGSLI 540 
EPTIPRTHLD TSAELRYSVG SLVDSQSDYR TTKVIRRPRR GRMGVRRDEP KVKSLGDHEW 600 
NRTQQIGVLS SHPFESDTEM SDIDDDDRET IFSSMDLLSP SGHSDAQTLA MMLQEQLDAI 660 
NKEIRLIQEE KESTELRAEE IENRVASVSL EGLNLARVHP GTSITASVTA SSLASSSPPS 720 
GHSTPKLTPR SPAREMDRMG VMTLPSDLRK HRRKIAVVEE DGREDKATIK CETSPPPTPR 780 
ALRMTHTLPS SYHNDARSSL SVSLEPESLG LGSANSSQDS LHKAPKKKGI KSSIGRLFGK 840 
KEKARLGQLR GFMETEAAAQ ESLGLGKLGT QAEKDRRLKK KHELLEEARR KGLPFAQWDG 900 
PTVVAWLELW LGMPAWYVAA CRANVKSGAI MSALSDTEIQ REIGISNPLH RLKLRLAIQE 960 
MVSLTSPSAP PTSRTPSGNV WVTHEEMENL AAPAKTKESE EGSWAQCPVF LQTLAYGDMN 1020 
HEWIGNEWLP SLGLPQYRSY FMECLVDARM LDHLTKKDLR VHLKMVDSFH RTSLQYGIMC 1080 
LKRLNYDRKE LERRREASQH EIKDVLVWSN DRIIRWIQAI GLREYANNIL ESGVHGSLIA 1140 
LDENFDYSSL TLLLQIPTQN TQARQILERE YNNLLALGTE RRLDESDDKN FRRGSTWRRQ 1200 
FPPREVHGIS MMPGSSETLP AGFRLTTTSG QSRKMTTDVA SSRLQRLDNS TVRTYSCLE 



A223 DNA SEQUENCE 
Gene name: CDA14 
Unigene number: Hs. 26813 

Probeset Accession #: N32912 

Nucleic Acid Accession #: NM_016570 

Coding sequence: 1-1134 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | | 

ATGAGGCGAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 60 

AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 120 

TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480 






GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540 

GTAGCAGGGA ATTTTGACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 
CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 

GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



A224 Protein sequence: 

Gene name: CDA14 

Unigene number: Hs.26813 

Probeset Accession #: N32912 

Protein Accession #: NP_057654 

Signal sequence: none found 

Transmembrane domains: none found 

Cellular Localization: nuclear 



1 11 21 31 41 51 

| | | | | | 

MRRLNRKKTL SLVKELDAFP KVPESYVETS ASGGTVSLIA FTTMALLTIM EFSVYQDTWM 60 

KYEYEVDKDF SSKLRINIDI TVAMKCQYVG ADVLDLAETM VASADGLVYE PTVFDLSPQQ 120 

KEWQRMLQLI QSRLQEEHSL QDVIFKSAFK STSTALPPRE DDSSQSPNAC RIHGHLYVNK 180 

VAGNFHITVG KAIPHPRGHA HLAALVNHES YNFSHRIDHL SFGELVPAII NPLDGTEKIA 240 

IDHNQMFQYF ITVVPTKLHT YKISADTHQF SVTERERIIN HAAGSHGVSG IFMKYDLSSL 300 

MVTVTEEHMP FWQFFVRLCG IVGGIFSTTG MLHGIGKFIV EIICCRFRLG SYKPVNSVPF 360 
EDGHTDNHLP LLENNTH 



Uterine 

A225 DNA SEQUENCE: 

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Unigene number: Hs.100686 

Probeset Accession #: AA487468 
Nucleic Acid Accession #: AA487468 

Coding sequence: 55-555 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| | | | | | 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



A226 Protein sequence: 
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens] 
Unigene number: Hs.100686 
Probeset Accession #: AA487468 
Protein Accession #: none found 
Signal sequence: 1-23 
Transmembrane domains: none found 
Cellular Localization: secreted 



1 11 21 31 41 51 
| | | | | | 



MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60 
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 
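
The stated coding sequence range (55-555 for A225) can be checked against the protein listing by translating the nucleotide rows. The following is a minimal sketch, assuming the standard genetic code and 1-based inclusive coordinates; the names `clean_sequence` and `translate_cds` are illustrative and are not part of the original listing.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(listing):
    """Keep only nucleotide letters, dropping spaces and row-end position numbers."""
    return "".join(ch for ch in listing.upper() if ch in "ACGT")


def translate_cds(listing, start, end):
    """Translate positions start..end (1-based, inclusive) up to the first stop codon."""
    cds = clean_sequence(listing)[start - 1:end]
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First two rows of the A225 listing; translation starts at position 55.
fragment = """
CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60
CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120
"""
print(translate_cds(fragment, 55, 120))
```

The printed residues can be compared with the start of the A226 protein rows; the same check applies to any DNA/protein pair in this listing once the rows are concatenated.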



A227 DNA SEQUENCE 

Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 
Unigene number: Hs. 285529 

Probeset Accession #: AA460530 
Nucleic Acid Accession #: NM_003667 

Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons) 






1 11 21 31 41 51 

I I I I I I 

GTGGCGGCAA CCGGCACCTC AGTCCCCGCC GCGCTTCTCC TCGCCGCCCA CGCCGTGGGG 60 

TCAGGAACGC GGCGTCTGGC GCTGCAGACG CCCGCTGAGT TGCAGAAGCC CACGGAGCGG 120 

CGCCCGGCGC GCCACGGCCC GTAGCAGTCC GGTGCTGCTC TCCGCCCGCG TCCGGCTCGT 180 

GGCCCCCTAC TTCGGGCACC ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG 240 

TGCTGCTGCA GCTGGCGACC GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT 300 

GCCCCACACA CTGTCATTGC GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG 360 

ACCTGGGGCT CTCGGAGCTG CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA 420 

GTATGAACAA CATCAGTCAG CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG 480 

AGTTACGTCT TGCGGGAAAC GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT 540 

ACAGTCTTAA AGTTCTTATG CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC 600 

TGCAGAATTT GCGAAGCCTT CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC 660 

CCCCAAGCTG TTTCAGTGGC CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT 720 

TAACAGAAAT CCCCGTCCAG GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG 780 

CCCTGAACAA AATACACCAC ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG 840 

TTCTACATCT CCATAACAAT AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC 900 

ACAGCCTAGA GACTTTAGAT TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA 960 

GGACACTCTC CAACCTTAAA GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG 1020 

AGAAAGCATT TGTAGGCAAC CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC 1080 

AATTTGTTGG GAGATCTGCT TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG 1140 

GTGCCTCACA AATAACTGAA TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA 1200 

CTTTAACTGG AGCACAGATC TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC 1260 

TCCAAGTGCT AGATCTGTCT TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC 1320 

AAAAGCTTCA GAAAATTGAC CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT 1380 

TCCAGCAGTT GCTTAGCCTC CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC 1440 

ACCCCAATGC ATTTTCCACT TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC 1500 

TGTCGTCTTT TCCTATAACT GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC 1560 

ATGCCTTACA GAGCTTGATA TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC 1620 

CTTATGCTTA CCAGTGCTGT GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC 1680 

AATGGAATAA AGGTGACAAC AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT 1740 

TTCAGGCTCA AGATGAACGT GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA 1800 

AAGCCCTTCA TTCAGTGCAG TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC 1860 

TGCTTGATGG CTGGCTGATC AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT 1920 

GTAATGCTTT GGTGACTTCA ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC 1980 

TGTTAATTGG GGTCATCGCA GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG 2040 

CTGGTGTGGA TGCGTTCACT TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG 2100 

GGGTTGGTTG CCATGTCATT GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC 2160 

TGCTTACTCT GGCAGCCCTG GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA 2220 

CGAAAGCTCC ATTTTCTAGC CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA 2280 

CCATGGCCGC AGTTCCCCTG CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC 2340 

CTTTGCCTTT TGGGGAGCCC AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT 2400 

CCCTTTGCTT CCTCATGATG ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG 2460 

GAGACCTGGA GAATATTTGG GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA 2520 

CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAG TCCTGAAGTA ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700 

TGAGAAAGCA AACCTACGTC TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT 2760 

CTGATGATGT CGAAAAACAG TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820 

CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG 2880 

AGAGCTGCCA TCTTTCCTCT GTGGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940 

AATGTTTTCA AAGGTTGAGA ACCTGAAAAT GTGAGATTGA GTATATCAGA GCAGTAATTA 3000 
ATAAGAAGAG CTGAGGTGAA ACTCGGTTTA AA 



A228 Protein sequence: 
Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5) 
Unigene number: Hs.285529 
Protein Accession #: NP_003658.1 
Signal sequence: 1-22 
Transmembrane domains: 557-579, 596-618, 641-663, 687-709, 724-746, 770-792, 803-825 
Cellular Localization: plasma membrane 



1 11 21 31 41 51 
| | | | | | 
MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 
PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 
LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 
AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLVVLHLHNN RIHSLGKKCF DGLHSLETLD 240 
LNYNNLDEFP TAIRTLSNLK ELGFHSNNIR SIPEKAFVGN PSLITIHFYD NPIQFVGRSA 300 
FQHLPELRTL TLNGASQITE FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS 360 
YNLLEDLPSF SVCQKLQKID LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 420 
LPSLIKLDLS SNLLSSFPIT GLHGLTHLKL TGNHALQSLI SSENFPELKV IEMPYAYQCC 480 
AFGVCENAYK ISNQWNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 540 
CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTCNALVTS TVFRSPLYIS PIKLLIGVIA 600 
AVNMLTGVSS AVLAGVDAFT FGSFARHGAW WENGVGCHVI GFLSIFASES SVFLLTLAAL 660 
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 720 
STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC 780 
PVAFLSFSSL INLTFISPEV IKFILLVVVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 840 
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS 900 
VAFVPCL 



Table 75: See Table 1 
Table 76A depicts Seq ID No; UnigeneID; Unigene Title; Pkey; Predicted Cellular Localization; and Exemplar Accession for each of the sequences in Table 78. The information in Table 76A is linked by Seq ID No to Table 78. 

Seq ID No: Sequence ID No for sequences in table 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

Pred Subcell Loc: Predicted sub-cellular localization 
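
Because Table 76A is linked to Table 78 by Seq ID No, it can help to read each table row into a record keyed on that field. The following is a minimal sketch, assuming rows laid out as `Seq ID n & m`, then Pkey, then optional ExAccn and UnigeneID, separated by whitespace; the names `Row`, `ROW_RE` and `parse_row` are illustrative and are not part of the original document.

```python
import re
from typing import NamedTuple, Optional, Tuple


class Row(NamedTuple):
    seq_ids: Tuple[int, ...]      # one or two sequence ID numbers
    pkey: str                     # Eos probeset identifier
    ex_accn: Optional[str]        # exemplar accession, if listed
    unigene_id: Optional[str]     # Unigene number, if listed


# Assumed row layout: "Seq ID 139 & 140 409041 AB033025 Hs.50081"
ROW_RE = re.compile(
    r"Seq ID (\d+)(?: & (\d+))?\s+(\d+)(?:\s+(\S+))?(?:\s+(Hs\.\d+))?"
)


def parse_row(line):
    """Split one table row into its fields; raise ValueError if it does not fit."""
    match = ROW_RE.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    first, second, pkey, accn, unigene = match.groups()
    seq_ids = (int(first),) if second is None else (int(first), int(second))
    return Row(seq_ids, pkey, accn, unigene)


# Build a lookup from each Seq ID number to its row.
rows = [
    parse_row("Seq ID 139 & 140 409041 AB033025 Hs.50081"),
    parse_row("Seq ID 165 402230"),
]
by_seq_id = {seq_id: row for row in rows for seq_id in row.seq_ids}
print(by_seq_id[140].unigene_id)
```

Rows that were garbled in extraction will not match the pattern and raise an error, which makes them easy to list for manual review.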



Seq ID No 

Seq ID 1 & 2 
Seq ID 3 & 4 
Seq ID 5 & 6 
Seq ID 7 & 8 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19 & 20 
Seq ID 21 & 22 
Seq ID 23 & 24 
Seq ID 25 & 26 
Seq ID 27 & 28 
Seq ID 29 & 30 
Seq ID 31 & 32 
Seq ID 33 & 34 
Seq ID 35 & 36 
Seq ID 37 & 38 
Seq ID 39 & 40 
Seq ID 41 & 42 
Seq ID 43 & 44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 49 & 50 
Seq ID 51 & 52 
Seq ID 53 & 54 
Seq ID 55 & 56 
Seq ID 57 & 58 
Seq ID 59 & 60 
Seq ID 61 & 62 
Seq ID 63 & 64 
Seq ID 65 & 66 
Seq ID 67 & 68 
Seq ID 69 & 70 
Seq ID 71 & 72 
Seq ID 73 & 74 
Seq ID 75 & 76 
Seq ID 77 & 78 
Seq ID 79 & 80 
Seq ID 81 & 82 
Seq ID 83 & 84 
Seq ID 85 & 86 
Seq ID 87 & 88 
Seq ID 89 & 90 
Seq ID 91 & 92 
Seq ID 93 & 94 
Seq ID 95 & 96 
Seq ID 97 & 98 
Seq ID 99 & 100 
Seq ID 101 & 102 
Seq ID 103 & 104 
Seq ID 105 & 106 
Seq ID 107 & 108 
Seq ID 109 & 110 
Seq ID 111 & 112 
Seq ID 113 & 114 
Seq ID 115 & 116 
Seq ID 117 & 118 
Seq ID 119 & 120 
Seq ID 121 & 122 
Seq ID 123 & 124 
Seq ID 125 & 126 
Seq ID 127 & 128 
Seq ID 129 & 130 
Seq ID 131 & 132 
Seq ID 133 & 134 
Seq ID 135 & 136 
Seq ID 137 & 138 



Pkey ExAccn UnigeneID Unigene Title 



425023 

424503 

429423 

400289 

419172 

418007 

418007 

407836 

414577 

418738 

428368 

415138 

429276 

418994 

407975 

429113 

418506 

423961 

414812 

417433 

417433 

424399 

422867 

428227 

444381 

439569 

411558 

400303 

411789 

428598 

450098 

421552 

452747 

415539 

416636 

416636 

409079 

442082 

400297 

451398 

429220 

421524 

423242 

423242 

423242 

452190 

452190 

325372 

450375 

426215 

425247 

429353 

429353 

432201 

427585 

446163 

442117 

428179 

431211 

447033 

447033 

447033 

115522 

452679 

446051 

422048 

410418 

446342 

422260 



AW956889 
NM_002205 
AI016712 
X07820 
AW338625 
M13509 
M13509 
T79340 
AJ056548 
AW388633 
BE440042 
C18356 
AF056085 
AA296520 
X89426 
D28235 
AA084248 
D13666 
X72755 
BE270266 
BE270266 
AI905687 
L32137 
AA321649 
BE387335 
AW602166 
AA102670 
AA242758 
AF245505 
AA852773 
W27249 
AF026692 
BE153855 
AI733881 
N32536 
N32536 
W87707 
R41823 
AI127076 
AI793124 
AW207206 
AA312082 
AL039402 
AL039402 
AL039402 
H26735 
H26735 

AA009647 

AW983419 

NM_005940 

AL117406 

AL117406 

AI538613 

D31152 



AW664964 

AI127772 

M86849 

AI357412 

AI357412 

AI357412 

BE614387 

242387 

BE048061 

NM_012445 

D31382 

BE298665 

AA315993 



Hs.154210 
Hs.149609 
Hs.287797 
Hs.2258 
Hs.22120 
Hs.83169 
Hs.83169 
Hs.200272 
Hs.72116 
Hs.6682 
Hs.83326 
Hs.295944 
Hs.198612 
Hs.89546 
Hs.41716 
Hs.196384 
Hs.85339 
Hs.136348 
Hs.77367 
Hs.82128 
Hs.82128 

Hs.1584 
Hs.2248 
Hs.283713 
Hs.222399 
Hs.70725 
Hs.79136 
Hs.72157 
Hs.334838 
Hs.8109 
Hs.105700 
Hs.61460 
Hs.72472 
Hs.42645 
Hs.42645 
Hs.82065 
Hs.7413 
Hs.3062D1 
Hs.144479 

Hs.105445 
Hs.125783 
Hs.125783 
Hs.125783 
Hs.91668 
Hs.91658 



Hs.155223 

Hs.155324 

Hs.335891 

Hs.335891 

Hs.298241 

Hs.179729 

Hs.75252 

Hs.128899 

Hs^79696 

Hs.323733 

Hs.157601 

Hs.157601 

Hs.157601 

Hs.333893 

Hs.83883 

HsJ37054 

Hs^8B126 

Hs.63325 

Hs.14846 

Hs.105484 



endothelial differentiation, sphingolipi 
integrin, alpha 5 (fibronectin receptor, 
integrin, beta 1 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
ESTs; similar to TRANSMEMBRANE 4 SUPERF 
matrix metalloproteinase 1 (interstitial 
matrix metalloproteinase 1 (interstitial 
B-cell CLL/lymphoma 6, member B, zinc fi 
hypothetical protein FLJ20992 similar to 
solute carrier family 7, (cationic amino 
matrix metalloproteinase 3 (stromelysin 
tissue factor pathway inhibitor 2 
G protein-coupled receptor 51 
selectin E (endothelial adhesion molecul 
endothelial cell-specific molecule 1 
prostaglandin-endoperoxide synthase 2 (p 
G protein-coupled receptor 39 
periostin (OSF-2os) 
monokine induced by gamma interferon 
5T4 oncofetal trophoblast glycoprotein 
5T4 oncofetal trophoblast glycoprotein 
AI905687:IL-BT095-19019W)19 BT095 Homo 
cartilage oligomeric matrix protein (pse 
small inducible cytokine subfamily B (Cy 
ESTs, Weakly similar to S64054 hypotheti 
CEGP1 protein 

gamma-aminobutyric acid (GABA) A recepto 
LIV-1 protein, estrogen regulated 
Adlican 

KIAA1866 protein 
hypothetical protein FLJ21080 
secreted frizzled-related protein 4 
Ig superfamily receptor LNIR 
BMP-R1B 

solute carrier family 16 (monocarboxylic 
solute carrier family 16 (monocarboxylic 
interleukin 6 signal transducer (gp130, 
ESTs 

hypothetical protein DKFZp56401278 

ESTs 

ESTs 

GDNF family receptor alpha 1 
DEME-6 protein 
DEME-6 protein 
DEME-6 protein 

Homo sapiens clone PP1498 unknown mRNA 
Homo sapiens clone PP1498 unknown mRNA 
Phase 2 & 3 Exons 

a disintegrin and metalloproteinase doma 
stanniocalcin 2 

matrix metalloproteinase 11 (stromelysin 
ATP-binding cassette transporter MRP8 
ATP-binding cassette transporter MRP8 
Transmembrane protease, serine 3 
collagen, type X, alpha 1 (Schmid metaph 
prolactin receptor 

ESTs; hypothetical protein for IMAGE447 

serum/glucocorticoid regulated kinase-li 

gap junction protein, beta 2, 26kD (conn 

ESTs 

ESTs 

ESTs 

c-Myc target JP01 

transmembrane, prostate androgen induced 
ephrin-A3 

spondin 2, extracellular matrix protein 
transmembrane protease, serine 4 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
regenerating gene type IV 




Pred Subcell Loc 

plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 



intracell 



plasma membrane 
secreted 



plasma membrane 



secreted 
intracell 
plasma membrane 
secreted 



plasma membrane 
plasma membrane 
secreted 



secreted 



plasma membrane 
plasma membrane 
secreted 



intracell 
secreted 

plasma membrane 
plasma membrane 



plasma membrane 



plasma membrane 
plasma membrane 
intracell 



plasma membrane 
plasma membrane 
intracell 
intracell 
intracell 
intracell 
cytoplasmic 
plasma membrane 



plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
intracell 

plasma membrane 

secreted 

secreted 

secreted 

intracell 

plasma membrane 
plasma membrane 



plasma membrane 



Seq ID 139 & 140 409041 AB033025 Hs.50081 
Seq ID 141 & 142 409041 AB033025 Hs.50081 
Seq ID 143 & 144 111929 AF027208 Hs.112360 
Seq ID 145 & 146 111929 AF027208 Hs.112360 
Seq ID 147 & 148 111929 AF027208 Hs.112360 
Seq ID 149 & 150 104888 AW939591 Hs.5940 
Seq ID 151 & 152 420159 AI572490 Hs.59785 
Seq ID 153 & 154 422330 D30783 Hs.115263 
Seq ID 155 & 156 452461 N78223 Hs.108106 
Seq ID 157 & 158 413324 V00571 Hs.75294 
Seq ID 159 & 160 412420 AL035668 Hs.73853 
Seq ID 161 & 162 416658 U03272 Hs.79432 
Seq ID 163 & 164 407811 AW190902 Hs.40098 
Seq ID 165 402230 
Seq ID 166 & 167 402230 
Seq ID 168 & 169 432829 W60377 Hs.57772 
Seq ID 170 & 171 425721 AC002115 Hs.159309 
Seq ID 172 & 173 420370 Y13645 Hs.97234 
Seq ID 174 & 175 437852 BE001836 Hs.256897 
Seq ID 176 & 177 402075 
Seq ID 178 & 179 421110 AJ250717 Hs.1355 
Seq ID 180 & 181 451668 Z43948 Hs.326444 
Seq ID 182 & 183 451668 Z43948 Hs.326444 
Seq ID 184 & 185 451668 Z43948 Hs.326444 
Seq ID 186 & 187 408243 Y00787 Hs.624 
Seq ID 188 & 189 422282 AF019225 Hs.114309 
Seq ID 190 & 191 425852 AK001504 Hs.159651 
Seq ID 192 & 193 439738 BE246502 Hs.9598 
Seq ID 194 & 195 404875 
Seq ID 196 & 197 425863 AL137708 Hs.161031 
Seq ID 198 & 199 404977 
Seq ID 200 & 201 420876 AA918425 Hs.177744 
Seq ID 202 & 203 427747 AW411425 Hs.180655 
Seq ID 204 & 205 420281 AI623693 Hs.323494 
Seq ID 206 & 207 446673 NM_016361 Hs.15871 
Seq ID 208 437553 AI829935 Hs.130497 
Seq ID 209 & 210 437553 AI829935 Hs.130497 
Seq ID 211 & 212 437553 AI829935 Hs.130497 
Seq ID 213 & 214 426900 AW163564 Hs.142375 
Seq ID 215 & 216 426900 AW163564 Hs.142375 
Seq ID 217 & 218 426900 AW163564 Hs.142375 
Seq ID 219 & 220 405932 
Seq ID 221 & 222 405932 
Seq ID 223 & 224 424008 R02740 Hs.137555 
Seq ID 225 & 226 424008 R02740 Hs.137555 
Seq ID 227 & 228 424008 R02740 Hs.137555 
Seq ID 229 & 230 444342 NM_014398 Hs.10887 
Seq ID 231 & 232 421379 Y15221 Hs.103982 
Seq ID 233 & 234 417079 U65590 Hs.81134 
Seq ID 235 & 236 412986 X81120 Hs.75110 
Seq ID 237 & 238 412986 X81120 Hs.75110 
Seq ID 239 & 240 412986 X81120 Hs.75110 
Seq ID 241 & 242 447072 D61594 Hs.17279 
Seq ID 243 & 244 419723 AL120193 Hs.339810 
Seq ID 245 & 246 419723 AL120193 Hs.339810 
Seq ID 247 & 248 430890 X54232 Hs.2699 
Seq ID 249 & 250 456759 BE259150 Hs.127792 
Seq ID 251 429466 M85835 Hs.12827 
Seq ID 252 429466 M85835 Hs.12827 
Seq ID 253 & 254 419721 NM_001650 Hs^8B650 
Seq ID 255 & 256 407034 U84540 
Seq ID 257 & 258 413472 BE242870 Hs.75379 
Seq ID 259 & 260 438380 T06430 Hs.6194 
Seq ID 261 & 262 426271 AF026547 Hs.169047 
Seq ID 263 & 264 419704 AA429104 Hs.45057 
Seq ID 265 & 266 444471 AB020684 Hs.11217 
Seq ID 267 & 268 409395 U46745 Hs.336678 
Seq ID 269 & 270 413063 AL035737 Hs.75184 
Seq ID 271 & 272 433800 AI034361 Hs.135150 
Seq ID 273 458435 AI418718 Hs.144121 
Seq ID 274 458435 AI418718 Hs.144121 
Seq ID 275 & 276 424343 AW956360 Hs.4748 
Seq ID 277 & 278 424998 U58515 Hs.154138 
Seq ID 279 & 280 412709 AL022327 Hs.74518 
Seq ID 281 & 282 435615 Y15065 Hs.4975 
Seq ID 283 & 284 404049 
Seq ID 285 & 286 418932 L34059 Hs.89484 
Seq ID 287 & 288 404029 
Seq ID 289 & 290 436480 AJ271643 Hs.87469 
Seq ID 291 & 292 452401 NM_007115 Hs.29352 
Seq ID 293 & 294 452401 NM_007115 Hs.29352 
Seq ID 295 & 296 436895 AF037335 Hs.5338 
Seq ID 297 & 298 421471 U90545 Hs.327179 



Hypothetical protein, XP_051860 (KIAA119 secreted

Hypothetical protein, XP_051860 (KIAA119 secreted

prominin (mouse)-like 1 plasma membrane

prominin (mouse)-like 1 plasma membrane 

prominin (mouse)-like 1 plasma membrane 

mucin 13, epithelial transmembrane plasma membrane 

Homo sapiens cDNA: FLJ21245 fis, clone C plasma membrane

epiregulin plasma membrane

transcription factor intracell

corticotropin releasing hormone secreted
bone morphogenetic protein 2
fibrillin 2 (congenital contractural ara

cysteine knot superfamily 1, BMP antagon secreted

Fgenesh predicted: CYTOCHROME P450 4F5 ( intracell

Fgenesh predicted: CYTOCHROME P450 4F5 ( intracell 

ESTs intracell 

uroplakin 1A plasma membrane 

uroplakin 2 plasma membrane 

ESTs, Weakly similar to <U36501 2.1 [H.sa plasma membrane 

ENSP00000251056*:Plasma membrane calcium secreted

cathepsin E secreted 

cartilage acidic protein 1 plasma membrane 

cartilage acidic protein 1 secreted 

cartilage acidic protein 1 intracell 

interleukin 8 secreted 

apolipoprotein L secreted

death receptor 6, TNF superfamily member plasma membrane 

sema domain, immunoglobulin domain (Ig), plasma membrane 

NM_022819*:Homo sapiens phospholipase A2 intracell 

Homo sapiens mRNA; cDNA DKFZp434K0322 {f plasma membrane 

Insulin-like growth factor 2 (somatomedi secreted 

ESTs plasma membrane 

serine/threonine kinase 12 intracell

Predicted cation efflux pump plasma membrane 

LPAP for lysophosphatidic acid phosphate intracell 

ESTs, Weakly similar to MAT8_HUMAN CHLOR plasma membrane 

ESTs, Weakly similar to MAT8_HUMAN CHLOR plasma membrane 

ESTs, Weakly similar to MAT8_HUMAN CHLOR plasma membrane 

ESTs plasma membrane 

ESTs plasma membrane 

ESTs plasma membrane 

C15000305:gi|3806122|gb|AAC69198.1| (AFO intracell

C15000305:gi|3806122|gb|AAC69198.1| (AFO intracell

putative chemokine receptor; GTP-binding plasma membrane

putative chemokine receptor; GTP-binding plasma membrane 

putative chemokine receptor; GTP-binding plasma membrane 
similar to lysosome-associated membrane 
small inducible cytokine subfamily B (Cy
interleukin 1 receptor antagonist
cannabinoid receptor 1 (brain)
cannabinoid receptor 1 (brain)
cannabinoid receptor 1 (brain)
tyrosylprotein sulfotransferase 1
longevity assurance (LAG1, S. cerevisiae
longevity assurance (LAG1, S. cerevisiae
glypican 1

delta (Drosophila)-like 3
ESTs 
ESTs 

aquaporin 4

gb:Human dystrobrevin isoform DTN-3 (DTN
solute carrier family 1 (glial high affi
chondroitin sulfate proteoglycan BEHAB/b
chondroitin sulfate proteoglycan 3 (neur

ESTs intracell

KIAA0877 protein plasma membrane 

dystrobrevin, alpha secreted 
chitinase 3-like 1 (cartilage glycoprote
lung type-I cell membrane-associated gly
ESTs, Weakly similar to T46916 hypotheti
ESTs, Weakly similar to T46916 hypotheti

adenylate cyclase activating polypeptide plasma membrane

chitinase 3-like 2 secreted

KIAA0027 protein plasma membrane

potassium voltage-gated channel, KQT-lik plasma membrane

NM_018937*:Homo sapiens protocadherin be plasma membrane

cadherin 4, type 1, R-cadherin (retinal) plasma membrane

NM_018938*:Homo sapiens protocadherin be plasma membrane 

putative acid-sensing ion channel intracell 

tumor necrosis factor, alpha-induced pro secreted 

tumor necrosis factor, alpha-induced pro secreted

carbonic anhydrase XII plasma membrane 

solute carrier family 17 (sodium phospha plasma membrane 
secreted

plasma membrane 
plasma membrane 
plasma membrane 
intracell 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 



plasma membrane 
secreted & plasma membrane 
Seq ID 299 & 300
Seq ID 301 & 302
Seq ID 303 & 304
Seq ID 305 & 306
Seq ID 307 & 308
Seq ID 309 & 310
Seq ID 311 & 312
Seq ID 313 & 314
Seq ID 315 & 316
Seq ID 317 & 318
Seq ID 319 & 320
Seq ID 321 & 322
Seq ID 323 & 324
Seq ID 325 & 326
Seq ID 327 & 328
Seq ID 329 & 330
Seq ID 331 & 332
Seq ID 333 & 334
Seq ID 335 & 336
Seq ID 337 & 338
Seq ID 339 & 340
Seq ID 341 & 342
Seq ID 343 & 344
Seq ID 345 & 346
Seq ID 347 & 348
Seq ID 349 & 350
Seq ID 351 & 352
Seq ID 353 & 354
Seq ID 355 & 356
Seq ID 357 & 358
Seq ID 359 & 360
Seq ID 361 & 362
Seq ID 363 & 364
Seq ID 365 & 366
Seq ID 367 & 368
Seq ID 369 & 370
Seq ID 371 & 372
Seq ID 373 & 374
Seq ID 375 & 376
Seq ID 377 & 378
Seq ID 379 & 380
Seq ID 381 & 382
Seq ID 383 & 384
Seq ID 385 & 386
Seq ID 387 & 388
Seq ID 389 & 390
Seq ID 391 & 392
Seq ID 393 & 394
Seq ID 395 & 396
Seq ID 397 & 398
Seq ID 399 & 400
Seq ID 401 & 402
Seq ID 403 & 404
Seq ID 405 & 406
Seq ID 407 & 408
Seq ID 409 & 410
Seq ID 411 & 412
Seq ID 413 & 414
Seq ID 415 & 416
Seq ID 417 & 418
Seq ID 419 & 420
Seq ID 421 & 422
Seq ID 423 & 424
Seq ID 425 & 426
Seq ID 427 & 428
Seq ID 429 & 430
Seq ID 431 & 432
Seq ID 433 & 434
Seq ID 435 & 436
Seq ID 437 & 438
Seq ID 439 & 440
Seq ID 441 & 442
Seq ID 443 & 444
Seq ID 445 & 446
Seq ID 447 & 448
Seq ID 449 & 450
Seq ID 451 & 452
Seq ID 453 & 454
Seq ID 455 & 456
Seq ID 457 & 458
Seq ID 459 & 460
Seq ID 461 & 462
Seq ID 463 & 464



428296 NM_003058 Hs.183572 solute carrier family 22 (organic cation
423508 AW604297 Hs.129711 hepatitis A virus cellular receptor 1
450001 NM_001044 Hs.406 solute carrier family 6 (neurotransmitte
410407 X66839 Hs.63287 carbonic anhydrase IX
453496 AA442103 Hs.33084 solute carrier family 2 (facilitated glu
420737 U08096 Hs.99899 CD70; tumor necrosis factor (ligand) s
309931 AW341683 gb:hd13d01.x1 Soares_NFL_T_GBC_S1 Homo s

412719 AW016610 Hs.816 ESTs 
417034 NM_006183 Hs.80962 neurotensin 
430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 
413753 U17760 Hs.75517 laminin, beta 3 (nicein (125kD), kalinin
425650 NM_001944 Hs.1925 desmoglein 3 (pemphigus vulgaris antigen
423673 BE003054 Hs.1695 matrix metalloproteinase 12 (macrophage
418663 AK001100 Hs.41690 desmocollin 3
418663 AK001100 Hs.41690 desmocollin 3
429610 AB024937 Hs.211092 LUNX protein; PLUNC (palate lung and nas
406690 M29540 Hs.220529 carcinoembryonic antigen-related cell ad
431846 BE019924 Hs.271580 uroplakin 1B
422158 L10343 Hs.112341 protease inhibitor 3, skin-derived (SKAL
431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta
437044 AL035864 Hs.69517 differentially expressed in Fanconi's an
428484 AF104032 Hs.184601 solute carrier family 7 (cationic amino 
429211 AF052693 Hs.198249 gap junction protein, beta 5 (connexin 3 
417389 BE260964 Hs.82045 midkine (neurite growth-promoting factor
431009 BE149762 Hs.48956 gap junction protein, beta 6 (connexin 3
417542 J04129 Hs.82269 progestagen-associated endometrial prote
449230 BE613348 Hs.211579 melanoma cell adhesion molecule
410555 U92649 Hs.64311 a disintegrin and metalloproteinase doma
410555 U92649 Hs.64311 a disintegrin and metalloproteinase doma
424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B
418462 BE001596 Hs.85266 integrin, beta 4
410274 AA381807 Hs.61762 hypoxia-inducible protein 2
439606 W79123 Hs.58561 G protein-coupled receptor 87
404877 NM_005365:Homo sapiens melanoma antigen,

444781 NM_014400 Hs.11950 GPI-anchored metastasis-associated prote
109424 NM_005329 Hs.85962 hyaluronan synthase 3
415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t
415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t
415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t
415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t
415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t
415817 U88967 Hs.78867 protein tyrosine phosphatase, receptor-t
421817 AF146074 Hs.108660 ATP-binding cassette, sub-family C (CFTR
418678 NM_001327 Hs.87225 cancer/testis antigen (NY-ESO-1)
418678 NM_001327 Hs.87225 cancer/testis antigen (NY-ESO-1)
409420 Z15008 Hs.54451 laminin, gamma 2 (nicein (100kD), kalini
332180 AF134160 Hs.7327 claudin 1
408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, 
408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, 
439223 AW238299 Hs.250618 UL16 binding protein 2
409757 NM_001898 Hs.123114 cystatin SN
428969 AF120274 Hs.194689 artemin
AF120274 Hs.194689 artemin 
AF120274 Hs.194689 artemin 
AF120274 Hs.194689 artemin 

H39960 Hs.288467 hypothetical protein XP_098151 (leucine-
H39960 Hs.288467 hypothetical protein XP_098151 (leucine-
X02419 Hs.77274 plasminogen activator, urokinase
R34008 Hs.239727 desmocollin 2
R34008 Hs.239727 desmocollin 2
AW583497 Hs.184604 pancreatic polypeptide
AI693815 Hs.127179 cryptic gene
W94322 Hs.279651 melanoma inhibitory activity
AW391342 Hs.199460 DPCR1 protein 
AW391342 Hs.199460 DPCR1 protein 

C9001188*:gi|12738842|ref|NP_073725.1| p
Hs.99376 ESTs 

Hs.162211 solute carrier family 6 (neurotransmitte

gb:Human nonspecific crossreacting antig
gb:Human nonspecific crossreacting antig
gb:Human nonspecific crossreacting antig
matrilin 3

type I transmembrane protein Fn14
uveal autoantigen with coiled coil domai
secretory granule, neuroendocrine protei
a disintegrin and metalloproteinase doma
membrane-spanning 4-domains, subfamily A
gastrin-releasing peptide



450701 
450701 
414774 
407944 
407944 
428486 
457489 
432874 
445891 
445891 
404682 
429547 
425921 
407242 
407242 
407242 
432596 
444006 
423685 
428392 
429597 
448030 
422109 
419235 
449048 
427333 
417931 
419216 



AW009166 
NM_007231
M18728
M18728 
M18728 
AJ224741 
BE395085 
BE350494 
H10233 
NM_003816
N30714 
S73265 
AW470411 
Z45051 
AF067797 
W95642 
AU076718 



Hs.278461 
Hs.10086 
Hs.49753 
Hs.2265 
Hs.2442 
Hs.325960 
Hs.1473 
Hs.288433 
Hs.22920 
Hs.176658 
Hs.82961 
Hs.164021



plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
intracell 
secreted 

plasma membrane 
secreted 

plasma membrane 



plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
secreted 

plasma membrane 
intracell 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
intracell 
intracell 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
secreted 
secreted 
secreted 
secreted 



secreted 



secreted 

plasma membrane 
plasma membrane 
secreted 



secreted 

plasma membrane 
plasma membrane 
secreted 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
intracell



similar to S68401 (cattle) glucose indue 
aquaporin 8 

trefoil factor 3 (intestinal) 

small inducible cytokine subfamily B (Cy
plasma membrane
plasma membrane 
secreted 

plasma membrane 
plasma membrane 



secreted 
secreted 
Seq ID 465 & 466
Seq ID 467 & 468
Seq ID 469 & 470
Seq ID 471 & 472
Seq ID 473 & 474
Seq ID 475 & 476
Seq ID 477 & 478
Seq ID 479 & 480
Seq ID 481 & 482
Seq ID 483 & 484
Seq ID 485 & 486
Seq ID 487 & 488
Seq ID 489 & 490
Seq ID 491 & 492
Seq ID 493 & 494
Seq ID 495 & 496
Seq ID 497 & 498
Seq ID 499 & 500
Seq ID 501 & 502
Seq ID 503 & 504
Seq ID 505 & 506
Seq ID 507 & 508
Seq ID 509 & 510
Seq ID 511 & 512
Seq ID 513
Seq ID 514
Seq ID 515 & 516
Seq ID 517 & 518
Seq ID 519 & 520
Seq ID 521 & 522
Seq ID 523 & 524
Seq ID 525 & 526
Seq ID 527 & 528
Seq ID 529 & 530
Seq ID 531 & 532
Seq ID 533 & 534
Seq ID 535 & 536
Seq ID 537 & 538
Seq ID 539 & 540
Seq ID 541 & 542
Seq ID 543 & 544
Seq ID 545 & 546
Seq ID 547 & 548
Seq ID 549 & 550
Seq ID 551 & 552
Seq ID 553 & 554
Seq ID 555 & 556
Seq ID 557 & 558
Seq ID 559 & 560
Seq ID 561 & 562
Seq ID 563 & 564
Seq ID 565 & 566
Seq ID 567 & 568
Seq ID 569 & 570
Seq ID 571 & 572
Seq ID 573 & 574
Seq ID 575 & 576
Seq ID 577 & 578
Seq ID 579 & 580
Seq ID 581 & 582
Seq ID 583 & 584
Seq ID 585 & 586
Seq ID 587 & 588
Seq ID 589 & 590
Seq ID 591 & 592
Seq ID 593 & 594
Seq ID 595 & 596
Seq ID 597 & 598
Seq ID 599 & 600
Seq ID 601 & 602
Seq ID 603 & 604
Seq ID 605 & 606
Seq ID 607 & 608
Seq ID 609 & 610
Seq ID 611 & 612
Seq ID 613 & 614
Seq ID 615 & 616
Seq ID 617 & 618
Seq ID 619 & 620
Seq ID 621 & 622
Seq ID 623 & 624
Seq ID 625 & 626
Seq ID 627 & 628



431629 AU077025 Hs.265827 interferon, alpha-inducible protein (clo
413554 AA319146 Hs.75426 secretogranin II (chromogranin C)
452194 AI694413 Hs.332649 Ubiquitin-like protein FAT10???
452194 AI694413 Hs.332649 Ubiquitin-like protein FAT10???
426322 J05068 Hs.2012 transcobalamin I (vitamin B12 binding pr
429010 Y18198 Hs.194725 one cut domain, family member 2
431462 AW583672 Hs.256311 granin-like neuroendocrine peptide precu
448243 AW369771 Hs.52620 integrin, beta 8
426427 M86699 Hs.159840 TTK protein kinase
428187 AI687303 Hs.285529 G protein-coupled receptor 49
428579 NM_005756 Hs.184942 G protein-coupled receptor 64
428579 NM_005756 Hs.184942 G protein-coupled receptor 64
428579 NM_005756 Hs.184942 G protein-coupled receptor 64
428579 NM_005756 Hs.184942 G protein-coupled receptor 64
445537 AJ245671 Hs.12844 EGF-like-domain, multiple 6
422278 AF072873 Hs.114218 frizzled (Drosophila) homolog 6
424620 AA101043 Hs.151254 kallikrein 7 (chymotryptic, stratum corn
406400 kallikrein 8 (neuropsin/ovasin) (KLK8)

431130 NM_006103 Hs.2719 HE4; epididymis-specific, whey-acidic pr
420440 NM_002407 Hs.97644 mammaglobin 2
428450 NM_014791 Hs.184339 KIAA0175 gene product
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin,
436982 AB018305 Hs.5378 spondin 1, (f-spondin) extracellular mat
453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 
431989 AW972870 Hs.291069 ESTs 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN
409178 BE393948 Hs.50915 kallikrein 5

426514 BE616633 Hs.170195 bone morphogenetic protein 7 (osteogenic 
421478 AI683243 Hs.97258 ESTs, Moderately similar to S29539 ribos 
421478 AI683243 Hs.97258 ESTs, Moderately similar to S29539 ribos 
425776 U25128 Hs.159499 parathyroid hormone receptor 2
425776 U25128 Hs.159499 parathyroid hormone receptor 2
452097 AB002364 Hs.27916 a disintegrin-like and metalloprotease ( 
416530 U62801 Hs.79361 kallikrein 6 (neurosin, zyme) 
431515 NM_012152 Hs.258583 endothelial differentiation, lysophospha
419452 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 
412078 X69699 Hs.73149 paired box gene 8 
412078 X69699 Hs.73149 paired box gene 8 
411773 NM_006799 Hs.72026 protease, serine, 21 (testisin)
407792 AI077715 Hs.39384 putative secreted ligand homologous to f 
431616 AA508552 Hs.195839 ESTs, Weakly similar to 138022 hypotheti 
452792 AB037765 Hs.30652 KIAA1344 protein
400294 N95796 Hs.278695 Homo sapiens prostein mRNA, complete cds
432653 N62096 Hs.293185 ESTs, Weakly similar to JC7328 amino aci
432653 N62096 Hs.293185 ESTs, Weakly similar to JC7328 amino aci
432653 N62096 Hs.293185 ESTs, Weakly similar to JC7328 amino aci
432653 N62096 Hs.293185 ESTs, Weakly similar to JC7328 amino aci
400290 H18836 Hs.31608 hypothetical protein FLJ20041
410001 AB041036 Hs.57771 kallikrein 11 
418396 AI765805 Hs.26691 ESTs 
451027 AW519204 Hs.40808 ESTs 

446057 AI420227 Hs.149358 ESTs, Weakly similar to A46010 X-linked
433466 AA508353 Hs.105314 relaxin 1 (H1)
453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR
453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR
414569 AF109298 Hs.118258 prostate cancer associated protein 1
413435 X51405 Hs.75360 carboxypeptidase E
426501 AW043782 Hs.293616 ESTs 

AF179274 Hs.22791 transmembrane protein with EGF-like and
R38438 Hs.182575 solute carrier family 15 (H??? transport
AI972402 Hs.306051 hypothetical protein MGC2648

NM_005656*:Homo sapiens transmembrane pr
NM_005656*:Homo sapiens transmembrane pr
dipeptidylpeptidase IV (CD26, adenosine
CGM47 protein

Homo sapiens mRNA; cDNA DKFZp56401763 (f
potassium intermediate/small conductance
CGI-86 protein

deiodinase, iodothyronine, type II
EphA3
tetraspan 1
cholecystokinin
epithelial membrane protein 1
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto



secreted 



plasma membrane 
plasma membrane 



412628 
403047 
403047 
408430 
445413 
451982 
427958 
421887 
425071 
432101 
407786 
416836 
416539 
131083 
131083 
131083 
131083 
422424 
428970 
428330 
439018 
420610 
425723 



S79876 
AA151342 
F13036 
AA418000 
AW161450 
NM_013989
AI918950 
AA687538 
D54745
Y07909 
Y09763 
Y09763 
Y09763 
Y09763 
AI186431 
BE276891 
L22524 
AW300887 
AI683183
NM_014420



Hs.44926 
Hs.12677 
Hs.27373 
Hs.98280 
Hs.109201 
Hs.154424 
Hs.123642 
Hs.38972 
Hs.80247 
Hs.79368 
Hs.22785 
Hs.22785 
Hs.22785 
Hs.22785 
Hs.298638 
Hs.194691 
Hs.2256 
Hs.26638 
Hs.99348 
Hs.159311



retinoic acid induced 3
matrix metalloproteinase 7 (matrilysin,
membrane-spanning 4-domains, subfamily A
distal-less homeo box 5
dickkopf (Xenopus laevis) homolog 4
intracell
secreted

plasma membrane
intracell

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 



secreted 
intracell 
secreted 



secreted 



plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
secreted 



plasma membrane 
plasma membrane 
intracell 



secreted 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 
plasma membrane 
plasma membrane 



intracell



plasma membrane 
secreted 

plasma membrane 
plasma membrane 



plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
secreted 

plasma membrane 

intracell 

secreted 
Seq ID 629 & 630 456662 NM_002448 Hs.1494 msh (Drosophila) homeo box homolog 1 (fo intracell

Seq ID 631 & 632 418281 U09550 Hs.1154 oviductal glycoprotein 1, 120kD (mucin 9

Seq ID 633 & 634 429903 AL134197 Hs.93597 cyclin-dependent kinase 5, regulatory su

Table 76B:

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers 



Pkey    CAT Number    Accession

424399    238961_1    AI905687 AI905624 AI905837 AI905623 AA340069 R75793 W72837 BE074512 AI905633 W72838 BE092421 AI127172 BE186013 AW070916 AI139456 AW176044 AW291950

429220    301384_1    AW207206 AW341473 AA448195 AI951341

450375    83327_1    AA009647 AA131254 AA374293 AW954405 H04410 AW606284 AA151166 BE157467 BE157501 H04384 W46291 AW663674 H04021 H01532 AA190993 H03231 H59605 H01642 AA852876 AA113758 AA626915 AA746952 AI161014 AA099554 R69067



Table 76C:

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref    Strand    Nt_position

402075    8117407    Plus    121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
402230    9966312    Minus    29782-29932
403047    3540153    Minus    59793-59968
404029    7671252    Plus    108716-111112
404049    3688074    Minus    75765-78155
404682    9797231    Minus    40977-41150
404875    9801324    Plus    96588-96732,97722-97831
404877    1519284    Plus    1095-2107
404977    3738341    Minus    43081-43229
405932    7767812    Minus    123525-123713
406400    9256298    Plus    1553-1712,1878-2140,4252-4385,5922-6077
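
The Nt_position entries in the table above are comma-separated start-end ranges. As a minimal sketch (not part of the original disclosure), such a field could be parsed into exon intervals as shown below. The function names are hypothetical, and treating both coordinates as inclusive is an assumption that the source does not state.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Parse an Nt_position field such as '96588-96732,97722-97831'
    into a list of (start, end) integer pairs, one per predicted exon."""
    exons = []
    # Tolerate surrounding whitespace and a stray trailing period.
    for chunk in field.strip().rstrip(".").split(","):
        chunk = chunk.strip()
        if not chunk:
            continue
        start_text, end_text = chunk.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"range start exceeds end: {chunk!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    # Row for Pkey 404875 (Plus strand) from the table above.
    exons = parse_nt_position("96588-96732,97722-97831")
    print(exons)
    print(total_exon_length(exons))
```

A parser of this kind also flags fields where a hyphen was lost in extraction, because a chunk without a hyphen fails to split into two values.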
Table 77 provides Pkey; Seq ID No; Disease Indications; and Preferred Utility for sequences in Table 78. Seq ID No links the information in Table 77 to Table 78.

Seq ID No: Sequence ID No for sequences in table

Pkey: Unique Eos probeset identifier number 

Disease Indications: Diseases designated for coverage as described in Table 1 

Preferred Utility: Preference of utility, based partly upon predicted localization (Ab is antibody; sm is small molecule target; CTL is vaccine target)

Seq ID No    Pkey    Disease Indications    Preferred Utility



Seq ID 1 & 2 425023 
Seq ID 3 & 4 424503
Seq ID 5 & 6 429423
Seq ID 7 & 8 400289 
Seq ID 9 & 10 419172 
Seq ID 11 & 12 418007 
Seq ID 13 & 14 418007 
Seq ID 15 & 16 407836 
Seq ID 17 & 18 414577 
Seq ID 19 & 20 418738 
Seq ID 21 & 22 428368 
Seq ID 23 & 24 415138 
Seq ID 25 & 26 429276 
Seq ID 27 & 28 418994 
Seq ID 29 & 30 407975 
Seq ID 31 & 32 429113 
Seq ID 33 & 34 418506 
Seq ID 35 & 36 423961 
Seq ID 37 & 38 414812 
Seq ID 39 & 40 417433 
Seq ID 41 & 42 417433 
Seq ID 43 & 44 424399 
Seq ID 45 & 46 422867 
Seq ID 47 & 48 428227 
Seq ID 49 & 50 444381 
Seq ID 51 & 52 439569 
Seq ID 53 & 54 411558 
Seq ID 55 & 56 400303 
Seq ID 57 & 58 411789 
Seq ID 59 & 60 428698 
Seq ID 61 & 62 450098 
Seq ID 63 & 64 421552 
Seq ID 65 & 66 452747
Seq ID 67 & 68 415539
Seq ID 69 & 70 416636 
Seq ID 71 & 72 416636 
Seq ID 73 & 74 409079
Seq ID 75 & 76 442082
Seq ID 77 & 78 400297
Seq ID 79 & 80 451398 
Seq ID 81 & 82 429220
Seq ID 83 & 84 421524
Seq ID 85 & 86 423242
Seq ID 87 & 88 423242 
Seq ID 89 & 90 423242 
Seq ID 91 & 92 452190 
Seq ID 93 & 94 452190 
Seq ID 95 & 96 325372
Seq ID 97 & 98 450375 
Seq ID 99 & 100 426215 
Seq ID 101 & 102 425247 
Seq ID 103 & 104 429353 
Seq ID 105 & 106 429353 
Seq ID 107 & 108 432201 
Seq ID 109 & 110 427585 
Seq ID 111 & 112 446163 
Seq ID 113 & 114 442117 
Seq ID 115 & 116 428179 
Seq ID 117 & 118 431211 
Seq ID 119 & 120 447033 
Seq ID 121 & 122 447033 
Seq ID 123 & 124 447033 
Seq ID 125 & 126 115522 
Seq ID 127 & 128 452679 
Seq ID 129 & 130 446051 
Seq ID 131 & 132 422048 
Seq ID 133 & 134 410418 
Seq ID 135 & 136 446342 
Seq ID 137 & 138 422260 
Seq ID 139 & 140 409041
Seq ID 141 & 142 409041
Seq ID 143 & 144 111929 
Seq ID 145 & 146 111929 
Seq ID 147 & 148 111929 



angiogenesis 

pancreas, prostate, angiogenesis, bladder, lung. 



angiogenesis, bladder, lung, cervical, ovarian, head & neck 
angiogenesis, renal 

angiogenesis, lung, bladder, fibrosis, head & neck, pancreas, stomach, colon, ovarian 
angiogenesis, lung, bladder, fibrosis, head & neck, pancreas, stomach, colon, ovarian
angiogenesis 
angiogenesis 

angiogenesis, lung, ovarian, bladder & stomach, pancreas, uterine 
angiogenesis, head & neck, stomach 
angiogenesis, pancreas, stomach, lung, uterine 
angiogenesis, bladder, glioblastoma 
prostate, angiogenesis 
angiogenesis, renal 
angiogenesis, bladder and stomach 

angiogenesis, ovarian, glioblastoma, uterine, lung, bladder, pancreas
breast, colon, bladder, lung, fibrosis, pancreas, head and neck, ovarian
breast, bladder, lung, fibrosis, pancreas, colon, head and neck, cervical, stomach, renal, ovarian
pancreas, breast, bladder, lung, head & neck, cervical, uterine, ovarian, stomach, renal
pancreas, breast, bladder, lung, head & neck, cervical, uterine, ovarian, stomach, renal
breast, uterine, head & neck
breast, ovarian, prostate, pancreas, lung, colon, uterine
breast, lung, bladder, ovarian, head & neck, fibrosis, colon, stomach, cervical
breast, colon, bladder, lung, pancreas, head & neck, ovarian, stomach, uterine, renal, angiogenesis
breast, androgen withdrawal prostate, prostate, bladder
pancreas, prostate, stomach, breast, uterine, cervical, ovarian
breast, ovarian, pros, stomach, uterine, bladder, lung, head & neck
pancreas, lung, breast, stomach, head & neck, ovarian, uterine
breast, colon, lung, pancreas, stomach, head & neck, ovarian
breast, lung, stomach, uterine

breast, ovarian, pancreas, cervical, uterine, prostate, lung, stomach, head & neck
breast, bladder, lung, head & neck, ovarian, stomach, uterine, pancreas
breast, prostate, ovarian, cervical, uterine
breast, pancreas, uterine
breast, pancreas, uterine
breast, prostate
breast, prostate, ovarian
breast, bladder, colon, prostate
breast, ovarian

breast, prostate, benign prostatic hyperplasia



breast, renal, ovarian, prostate, colon
breast, renal, ovarian, prostate, colon
breast, renal, ovarian, prostate, colon
breast, stomach, pancreas
breast, stomach, pancreas
breast

breast, ovarian, head & neck, pancreas, lung, colon
breast, lung, renal, colon, ovarian, uterine

breast, ovarian, lung, colon, pancreas, head & neck, stomach, uterine, cervical, bladder
breast, prostate
breast, prostate

breast, colon, lung enema, ovarian, stomach, pancreas, uterine, cervical
breast, lung, head & neck, pancreas, stomach, colon, ovarian, cervical
breast, cervical, uterine

breast, lung, bladder, pancreas, head & neck, stomach, ovarian, prostate
breast

colon, bladder, lung, pancreas, head & neck
colon, prostate, fibrosis, breast, lung, pancreas
colon, prostate, fibrosis, breast, lung, pancreas
colon, prostate, fibrosis, breast, lung, pancreas
colon, lung, bladder, pancreas
prostate, colon, pancreas, Taxol prostate
colon, breast
colon, pancreas, prostate

colon, bladder, lung, ovarian, pancreas, head & neck
uterine, colon, prostate
colon, ovarian mucinous

uterine, ovarian, lung, colon, stomach, head & neck, breast, pancreas
uterine, ovarian, lung, colon, stomach, head & neck, breast, pancreas
colon, breast, fibrosis
colon, breast, fibrosis
colon, breast, fibrosis
Ab, sm, CTL, imaging
Ab, sm, imaging
Ab, sm

Ab, sm, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, diagnostic
Ab, sm, diagnostic
CTL

Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, diagnostic
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, CTL, imaging
Ab, CTL, diagnostic
sm, CTL
Ab, sm, imaging
Ab, sm, diagnostic
Ab, CTL, diagnostic
Ab, CTL, imaging
Ab, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, imaging
CTL

Ab, sm, CTL, diagnostic
Ab, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, imaging
Ab, sm, CTL, imaging
CTL

Ab, CTL, imaging

Ab, sm, CTL, imaging

Ab, CTL, imaging

CTL

CTL

CTL

CTL

CTL

Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
CTL

Ab, sm, imaging
Ab, CTL, imaging
sm, CTL
Ab, sm, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
CTL

Ab, CTL, imaging
Ab, sm, CTL, imaging
diagnostic

Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, sm, imaging
Ab, sm, imaging
Seq ID 149 & 150 104888
Seq ID 151 & 152 420159 
Seq ID 153 & 154 422330 
Seq ID 155 & 156 452461 
Seq ID 157 & 158 413324 
Seq ID 159 & 160 412420 
Seq ID 161 & 162 416658 
Seq ID 163 & 164 407811 
Seq ID 165 402230 
Seq ID 166 & 167 402230 
Seq ID 168 & 169 432829 
Seq ID 170 & 171 425721 
Seq ID 172 & 173 420370 
Seq ID 174 & 175 437652 
Seq ID 176 & 177 402075 
Seq ID 178 & 179 421110 
Seq ID 180 & 181 451668 
Seq ID 182 & 183 451668 
Seq ID 184 & 185 451668 
Seq ID 186 & 187 408243 
Seq ID 188 & 189 422282 
Seq ID 190 & 191 425852 
Seq ID 192 & 193 439738 
Seq ID 194 & 195 404875 
Seq ID 196 & 197 425883 
Seq ID 198 & 199 404977 
Seq ID 200 & 201 420876 
Seq ID 202 & 203 427747 
Seq ID 204 & 205 420281 
Seq ID 206 & 207 446673 
Seq ID 208 437553 
Seq ID 209 & 210 437553 
Seq ID 211 & 212 437553
Seq ID 213 & 214 426900 
Seq ID 215 & 216 426900 
Seq ID 217 & 218 426900 
Seq ID 219 & 220 405932 
Seq ID 221 & 222 405932 
Seq ID 223 & 224 424008 
Seq ID 225 & 226 424008 
Seq ID 227 & 228 424008 
Seq ID 229 & 230 444342 
Seq ID 231 & 232 421379 
Seq ID 233 & 234 417079 
Seq ID 235 & 236 412986 
Seq ID 237 & 238 412986 
Seq ID 239 & 240 412986 
Seq ID 241 & 242 447072 
Seq ID 243 & 244 419723 
Seq ID 245 & 246 419723 
Seq ID 247 & 248 430890 
Seq ID 249 & 250 456759 
Seq ID 251 429466 
Seq ID 252 429466 
Seq ID 253 & 254 419721 
Seq ID 255 & 256 407034 
Seq ID 257 & 258 413472 
Seq ID 259 & 260 438380 
Seq ID 261 & 262 426271 
Seq ID 263 & 264 419704 
Seq ID 265 & 266 444471 
Seq ID 267 & 268 409395 
Seq ID 269 & 270 413063 
Seq ID 271 & 272 433800 
Seq ID 273 458435 
Seq ID 274 458435 
Seq ID 275 & 276 424343 
Seq ID 277 & 278 424998 
Seq ID 279 & 280 412709 
Seq ID 281 & 282 435615
Seq ID 283 & 284 404049 
Seq ID 285 & 286 418932 
Seq ID 287 & 288 404029 
Seq ID 289 & 290 436480 
Seq ID 291 & 292 452401 
Seq ID 293 & 294 452401 
Seq ID 295 & 296 436895 
Seq ID 297 & 298 421471 
Seq ID 299 & 300 428296 
Seq ID 301 & 302 423508 
Seq ID 303 & 304 450001 
Seq ID 305 & 306 410407 
Seq ID 307 & 308 453496 



colon, stomach, uterine 
bladder, stomach 
pancreas, colon, bladder 

bladder, lung, head & neck, ovarian, glioblastoma, stomach, colon, cervical 
bladder 

bladder, glioblastoma, lung, stomach 
lung, ovarian, uterine, bladder 
bladder, pancreas, stomach, uterine, lung
bladder 
bladder 
bladder 
bladder 
bladder 
bladder, lung 

bladder, lung, head & neck, cervical 
bladder, pancreas, stomach, ovarian, lung 
bladder, ovarian, lung 
bladder, ovarian, lung
bladder, ovarian, lung 
bladder, stomach, head & neck, cervical 
bladder, lung, head & neck 
bladder, lung, head & neck 
bladder, lung, cervical 
bladder 

bladder, pancreas 
bladder, ovarian 
pancreas, bladder 
bladder, lung, ovarian, stomach 
lung, bladder, ovarian, pancreas 
bladder 
bladder 
bladder 
bladder 

bladder, prostate 
bladder, prostate 
bladder, prostate 

bladder, lung, head & neck, cervical 
bladder, lung, head & neck, cervical 
bladder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
hepatitis C lung, fibrosis, bladder 

breast, pancreas, head & neck, lung, stomach, bladder, cervical, colon
bladder, lung, head & neck, cervical
glioblastoma 
glioblastoma 
glioblastoma 
glioblastoma, pancreas 
glioblastoma 
glioblastoma 

glioblastoma, lung, cervical, bladder 
glioblastoma 
glioblastoma, uterine 
glioblastoma, uterine 
glioblastoma, lung 
glioblastoma 
glioblastoma 
glioblastoma 
glioblastoma 
glioblastoma 
glioblastoma, lung, colon 
glioblastoma 

glioblastoma, ovarian, bladder, lung 
glioblastoma, lung



glioblastoma 

glioblastoma, ovarian, uterine 
glioblastoma 



glioblastoma 
glioblastoma 
glioblastoma 
glioblastoma 



bladder, breast, pancreas, head & neck, stomach, lung, arthritis, renal

bladder, breast, pancreas, head & neck, stomach, lung, arthritis, renal

breast, renal, ovarian, glioblastoma

renal 

renal 

renal, colon 
renal, lung 

renal, lung, colon, stomach, ovarian, uterine
renal, prostate 
Ab, imaging

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging, diagnostic

CTL

Ab, sm, CTL, diagnostic

Ab, diagnostic

Ab, CTL, diagnostic

Ab, sm, diagnostic

sm, CTL

sm, CTL

CTL

Ab, imaging

Ab, CTL, imaging

Ab, sm, CTL, imaging

diagnostic

Ab, sm, diagnostic

Ab, sm, CTL, diagnostic

Ab, sm, CTL, diagnostic

Ab, sm, CTL, diagnostic

Ab, diagnostic

CTL, diagnostic

Ab, sm, CTL, imaging

Ab, sm, CTL, imaging

sm, CTL

Ab, CTL, imaging

Ab, sm, CTL, diagnostic

Ab, sm, CTL, imaging

sm, CTL

Ab, sm, imaging

sm, CTL

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

Ab, CTL, imaging

sm

sm

Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
sm, CTL

Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, CTL, imaging, diagnostic
Ab, sm, CTL, imaging



Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, CTL, diagnostic, imaging
Ab, CTL, diagnostic
sm, CTL

Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, imaging



Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
sm, CTL

Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Seq ID 309 & 310 420737
Seq ID 311 & 312 309931
Seq ID 313 & 314 412719 
Seq ID 315 & 316 417034 
Seq ID 317 & 318 430486
Seq ID 319 & 320 413753 
Seq ID 321 & 322 425650
Seq ID 323 & 324 423673 
Seq ID 325 & 326 418663 
Seq ID 327 & 328 418663 
Seq ID 329 & 330 429610 
Seq ID 331 & 332 406690 
Seq ID 333 & 334 431846 
Seq ID 335 & 336 422158 
Seq ID 337 & 338 431958 
Seq ID 339 & 340 437044 
Seq ID 341 & 342 428484 
Seq ID 343 & 344 429211 
Seq ID 345 & 346 417389 
Seq ID 347 & 348 431009 
Seq ID 349 & 350 417542 
Seq ID 351 & 352 449230 
Seq ID 353 & 354 410555 
Seq ID 355 & 356 410555 
Seq ID 357 & 358 424687 
Seq ID 359 & 360 418462 
Seq ID 361 & 362 410274 
Seq ID 363 & 364 439606 
Seq ID 365 & 366 404877 
Seq ID 367 & 368 444781 
Seq ID 369 & 370 109424 
Seq ID 371 & 372 415817 
Seq ID 373 & 374 415817 
Seq ID 375 & 376 415817 
Seq ID 377 & 378 415817 
Seq ID 379 & 380 415817 
Seq ID 381 & 382 415817
Seq ID 383 & 384 421817 
Seq ID 385 & 386 418678 
Seq ID 387 & 388 418678
Seq ID 389 & 390 409420 
Seq ID 391 & 392 332180 
Seq ID 393 & 394 408790 
Seq ID 395 & 396 408790 
Seq ID 397 & 398 439223 
Seq ID 399 & 400 409757 
Seq ID 401 & 402 428969 
Seq ID 403 & 404 428969 
Seq ID 405 & 406 428969 
Seq ID 407 & 408 428969 
Seq ID 409 & 410 450701 
Seq ID 411 & 412 450701
Seq ID 413 & 414 414774 
Seq ID 415 & 416 407944 
Seq ID 417 & 418 407944
Seq ID 419 & 420 428486 
Seq ID 421 & 422 457489
Seq ID 423 & 424 432874 
Seq ID 425 & 426 445891 
Seq ID 427 & 428 445891 
Seq ID 429 & 430 404682 
Seq ID 431 & 432 429547 
Seq ID 433 & 434 425921 
Seq ID 435 & 436 407242 
Seq ID 437 & 438 407242 
Seq ID 439 & 440 407242 
Seq ID 441 & 442 432596 
Seq ID 443 & 444 444006 
Seq ID 445 & 446 423685 
Seq ID 447 & 448 428392 
Seq ID 449 & 450 429597 
Seq ID 451 & 452 448030 
Seq ID 453 & 454 422109 
Seq ID 455 & 456 419235 
Seq ID 457 & 458 449048 
Seq ID 459 & 460 427333 
Seq ID 461 & 462 417931 
Seq ID 463 & 464 419216 
Seq ID 465 & 466 431629 
Seq ID 467 & 468 413554 
Seq ID 469 & 470 452194 
Seq ID 471 & 472 452194 
Seq ID 473 & 474 426322 



renal 
lung 

lung, head & neck, bladder, glioblastoma, cervical 

lung, head & neck, a couple cervical 

lung, bladder, head & neck, cervical 

lung, bladder, head & neck, pancreas, cervical, colon 

lung, head & neck, cervical, bladder 

bladder, lung, head & neck, ovarian, pancreas, colon, stomach, uterine, cervical
lung, bladder, head & neck, cervical
lung, bladder, head & neck, cervical
lung 

lung, head & neck, pancreas, stomach, bladder, colon, cervical 
lung, bladder, head & neck, uterine, cervical, stomach, ovarian 
head & neck, bladder, lung, cervical, stomach

lung, bladder, cervical, head & neck, ovarian, colon, prostate, pancreas, breast 

head & neck, cervical, lung, bladder, breast, prostate, ovarian, stomach

lung, glioblastoma, bladder, head & neck, colon, cervical

lung, bladder, head & neck, cervical, stomach

ovarian, lung, bladder, uterine, cervical, pancreas, stomach 

lung, bladder, head & neck 

lung

lung, cervical, head & neck, bladder, ovarian, colon 

lung 

lung 

head & neck, pancreas, lung, uterine, cervical, colon, stomach 
lung, bladder 
lung, renal 

lung, bladder, head & neck, cervical
lung, bladder 

lung, bladder, head & neck, cervical 
bladder, lung 

lung, glioblastoma, head & neck, cervical, fibrosis 

lung, glioblastoma, head & neck, cervical, fibrosis 

lung, glioblastoma, head & neck, cervical, fibrosis 

lung, glioblastoma, head & neck, cervical, fibrosis 

lung, glioblastoma, head & neck, cervical, fibrosis 

lung, glioblastoma, head & neck, cervical, fibrosis

lung, cervical, head & neck & bladder

lung, bladder, stomach, ovarian, pancreas & cervical

lung, bladder, stomach, ovarian, pancreas & cervical 

lung, head & neck, pancreas, stomach, cervical, bladder 

lung 

lung 

lung 

lung, head & neck, cervical, bladder & colon 
pancreas, stomach, lung, bladder, stomach 
lung, cervical 
lung, cervical 
lung, cervical 
lung, cervical 

lung, head & neck, pancreas, breast, stomach, ovarian, colon
lung, head & neck, pancreas, breast, stomach, ovarian, colon 
lung, bladder, head & neck, pancreas, stomach, ovarian 
lung, head & neck
lung, head & neck 



pancreas, prostate, lung 
pancreas, stomach 
stomach, pancreas, ovarian 
stomach, pancreas, ovarian 



pancreas, head & neck, lung, ovarian 
stomach, pancreas 

pancreas, colon, bladder, head & neck, stomach, lung, ovarian, cervical 
pancreas, colon, bladder, head & neck, stomach, lung, ovarian, cervical 
pancreas, colon, bladder, head & neck, stomach, lung, ovarian, cervical 



pancreas, colon, lung, ovarian & cervical
pancreas, uterine, colon 



pancreas, colon, stomach, lung 

pancreas, renal and stomach

pancreas, lung, colon 

pancreas, fibrosis, head & neck & lung 

pancreas, ovarian, uterine, glioblastoma, head & neck & lung

pancreas, colon 

ovarian, pancreas, stomach, colon, uterine, prostate 

pancreas, lung, stomach, cervical, prostate, head & neck 

pancreas, uterine, cervical, stomach 

pancreas, glioblastoma 

stomach, pancreas, renal, colon 

stomach, pancreas, renal, colon 

pancreas, bladder, stomach 



Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
CTL

Ab, CTL, diagnostic
Ab, sm, imaging
CTL, diagnostic
Ab, imaging
Ab, CTL, diagnostic
Ab, imaging
Ab, imaging
CTL, diagnostic
Ab, imaging
Ab, imaging
diagnostic
Ab, CTL, imaging
Ab, imaging
Ab, sm, imaging
Ab, imaging
Ab, diagnostic
Ab, sm, imaging
CTL, diagnostic
Ab, imaging
Ab, sm, imaging
Ab, sm, imaging
Ab, sm, diagnostic
Ab, imaging
diagnostic
Ab, sm, imaging
CTL

Ab, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging

ca 

CTL 

CTL, diagnostic
Ab, sm, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, diagnostic
Ab, sm, imaging
Ab, sm, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, imaging
Ab, CTL, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
CTL

Ab, CTL, imaging
CTL

Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, imaging
Ab, CTL, diagnostic
Ab, CTL, imaging
Ab, CTL, imaging
Ab, sm, imaging
Ab, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, diagnostic
Seq ID 475 & 476 429010
Seq ID 477 & 478 431462 
Seq ID 479 & 480 448243 
Seq ID 481 & 482 426427
Seq ID 483 & 484 428187
Seq ID 485 & 486 428579
Seq ID 487 & 488 428579 
Seq ID 489 & 490 428579 
Seq ID 491 & 492 428579 
Seq ID 493 & 494 445537 
Seq ID 495 & 496 422278 
Seq ID 497 & 498 424620 
Seq ID 499 & 500 406400 
Seq ID 501 & 502 431130 
Seq ID 503 & 504 420440 
Seq ID 505 & 506 428450 
Seq ID 507 & 508 446619 
Seq ID 509 & 510 436982 
Seq ID 511 & 512 453392 
Seq ID 513 431989 
Seq ID 514 439820 
Seq ID 515 & 516 409178 
Seq ID 517 & 518 426514 
Seq ID 519 & 520 421476 
Seq ID 521 & 522 421478 
Seq ID 523 & 524 425776 
Seq ID 525 & 526 425776 
Seq ID 527 & 528 452097 
Seq ID 529 & 530 416530 
Seq ID 531 & 532 431515 
Seq ID 533 & 534 419452 
Seq ID 535 & 536 412078 
Seq ID 537 & 538 412078 
Seq ID 539 & 540 411773 
Seq ID 541 & 542 407792 
Seq ID 543 & 544 431616 
Seq ID 545 & 546 452792 
Seq ID 547 & 548 400294 
Seq ID 549 & 550 432653 
Seq ID 551 & 552 432653 
Seq ID 553 & 554 432653 
Seq ID 555 & 556 432653 
Seq ID 557 & 558 400290 
Seq ID 559 & 560 410001 
Seq ID 561 & 562 418396 
Seq ID 563 & 564 451027 
Seq ID 565 & 566 446057 
Seq ID 567 & 568 433466 
Seq ID 569 & 570 453370
Seq ID 571 & 572 453370 
Seq ID 573 & 574 414569 
Seq ID 575 & 576 413435 
Seq ID 577 & 578 426501 
Seq ID 579 & 580 448999 
Seq ID 581 & 582 408369 
Seq ID 583 & 584 412628 
Seq ID 585 & 586 403047
Seq ID 587 & 588 403047 
Seq ID 589 & 590 408430 
Seq ID 591 & 592 445413 
Seq ID 593 & 594 451982 
Seq ID 595 & 596 427958 
Seq ID 597 & 598 421887 
Seq ID 599 & 600 425071
Seq ID 601 & 602 432101 
Seq ID 603 & 604 407786 
Seq ID 605 & 606 416836 
Seq ID 607 & 608 416539 
Seq ID 609 & 610 131083 
Seq ID 611 & 612 131083 
Seq ID 613 & 614 131083 
Seq ID 615 & 616 131083 
Seq ID 617 & 618 422424 
Seq ID 619 & 620 428970 
Seq ID 621 & 622 426330 
Seq ID 623 & 624 439018 
Seq ID 625 & 626 420610 
Seq ID 627 & 628 425723 
Seq ID 629 & 630 456662 
Seq ID 631 & 632 418281 
Seq ID 633 & 634 429903 



pancreas, lung, glioblastoma 

ovarian, uterine, lung, stomach, head & neck, glioblastoma, p 
ovarian, lung, head & neck, cervical, colon, uterine, stomach 
ovarian, uterine, colon, stomach 
ovarian, Ewing, uterine, pancreas 
ovarian, Ewing, uterine, pancreas 
ovarian, Ewing, uterine, pancreas 
ovarian, Ewing, uterine, pancreas 

ovarian, bladder, uterine, breast, lung, head & neck, renal, fibrosis, pancreas, cervical 
ovarian, head & neck, bladder, cervical, lung 



ovarian, uterine 
ovarian, uterine 
ovarian, uterine, cervical 
ovarian, cervical, pancreas, lung 
ovarian, fibrosis, pancreas, head & neck, lung, colon
ovarian, fibrosis 
ovarian, lung, glioblastoma 
ovarian 

ovarian, uterine, cervical, breast, prostate 
ovarian, breast

ovarian, colon, bladder, lung, cervical 
ovarian, bladder 
ovarian, bladder 
ovarian, uterine, lung 
ovarian, uterine, lung 
ovarian 

ovarian, uterine 
ovarian, prostate, lung 
ovarian, prostate, lung, breast, uterine
ovarian 
ovarian 
ovarian 

ovarian, uterine, cervical, pancreas 
prostate, pancreas, colon 
prostate, uterine, breast 
prostate, taxol prostate 
prostate, lung 
prostate, lung 
prostate, lung
prostate, lung 
prostate, colon 

ovarian, prostate, uterine, cervical, lung 
prostate 

prostate, uterine, glioblastoma 



prostate 
prostate 
prostate 
prostate 

prostate, glioblastoma, pancreas 
prostate, breast, glioblastoma, lung 
prostate, glioblastoma 

prostate, lung, fibrosis, uterine, glioblastoma, cervical, ovarian 



prostate, bladder, colon 
prostate, bladder, colon 



prostate, colon, uterine, ovarian, lung, pancreas 
prostate, bladder 
prostate, glioblastoma 



prostate, colon, stomach, uterine, cervical, head & neck, pancreas 



prostate, colon, uterine, stomach, inflammatory bowel disease, ovarian

prostate, Ewing, glioblastoma

ZD1839 resistant cancers, head & neck 

androgen withdrawal prostate 

androgen withdrawal prostate 

androgen withdrawal prostate 

androgen withdrawal prostate 

bladder, pancreas, prostate, angiogenesis, colon, stomach, lung 
stomach, pancreas, colon 

uterine, ovarian, fibrosis, prostate, pancreas, lung, bladder, head & neck 

uterine, stomach, prostate 

uterine, ovarian endometrioid, lung 

ovarian endometrioid, uterine, colon 

uterine, ovarian 

uterine, ovarian 

lung 



sm, CTL 
Ab, diagnostic 
Ab, sm, imaging
sm, CTL

Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, diagnostic
sm 

Ab, diagnostic 
Ab, CTL, diagnostic 
CTL 



Ab, CTL, diagnostic
Ab, CTL, diagnostic
Ab, CTL, imaging
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, diagnostic
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
CTL 
CTL 

Ab, sm, CTL, diagnostic
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, diagnostic
Ab, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging



CTL 

Ab, sm, CTL, imaging
Ab, CTL, imaging
Ab, diagnostic
Ab, sm, imaging
Ab, sm, imaging
Ab, CTL, diagnostic
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, sm, CTL, imaging
Ab, CTL, diagnostic
Ab, sm, imaging
Ab, sm, CTL, diagnostic
Ab, sm, CTL, imaging
CTL 

Ab, CTL, diagnostic
CTL 

Ab, CTL, diagnostic
sm 
Table 78






Seq ID NO: 1 DNA sequence
Nucleic Acid Accession #: NM_001400
Coding sequence: 251..1399

1          11         21         31         41         51
|          |          |          |          |          |
TCTAAAGGTC GGGGGCAGCA GCAAGATGCG AAGCGAGCCG TACAGATCCC GGGCTCTCCG 60
AACGCAACTT CGCCCTGCTT GAGCGAGGCT GCGGTTTCCG AGGCCCTCTC CAGCCAAGGA 120
AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180
CTCGCCTCGC CCTCTAGCGT TCGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 240
GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCACCGCA GCTCGGTCTC 300
TGACTACGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 360
CAGCGCGGAC AAGGAGAACA GCATTAAACT GACCTCGGTG GTGTTCATTC TCATCTGCTG 420
CTTTATCATC CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 480
CCGACCCATG TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 540
CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600
GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAGCC TCCGTGTTCA GTCTCCTCGC 660
CATCGCCATT GAGCGCTATA TCACAATGCT GAAAATGAAA CTCCACAACG GGAGCAATAA 720
CTTCCGCCTC TTCCTGCTAA TCAGCGCCTG CTGGGTCATC TCCCTCATCC TGGGTGGCCT 780
GCCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT 840
CTACCACAAG CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 900
CGTCATTCTG TACTGCAGAA TCTACTCCTT GGTCAGGACT CGGAGCCGCC GCCTGACGTT 960
CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAG TCGCTGGCGC TGCTCAAGAC 1020
CGTAATTATC GTCCTGAGCG TCTTCATCGC CTGCTGGGCA CCGCTCTTCA TCCTGCTCCT 1080
GCTGGATGTG GGCTGCAAGG TGAAGACCTG TGACATCCTC TTCAGAGCGG AGTACTTCCT 1140
GGTGTTAGCT GTGCTCAACT CCGGCACCAA CCCCATCATT TACACTCTGA CCAACAAGGA 1200
GATGCGTCGG GCCTTCATCC GGATCATGTC CTGCTGCAAG TGCCCGAGCG GAGACTCTGC 1260
TGGCAAATTC AAGCGACCCA TCATCGCCGG CATGGAATTC AGCCGCAGCA AATCGGACAA 1320
TTCCTCCCAC CCCCAGAAAG ACGAAGGGGA CAACCCAGAG ACCATTATGT CTTCTGGAAA 1380
CGTCAACTCT TCTTCCTAGA ACTGGAAGCT GTCCACCCAC CGGAAGCGCT CTTTACTTGG 1440
TCGCTGGCCA CCCCAGTGTT TGGAAAAAAA TCTCTGGGCT TCGACTGCTG CCAGGGAGGA 1500
GCTGCTGCAA GCCAGAGGGA GGAAGGGGGA GAATACGAAC AGCCTGGTGG TGTCGGGTGT 1560
TGGTGGGTAG AGTTAGTTCC TGTGAACAAT GCACTGGGAA GGGTGGAGAT CAGGTCCCGG 1620
CCTGGAATAT ATATTCTACC CCCCTGGAGC TTTGATTTTG CACTGAGCCA AAGGTCTAGC 1680
ATTGTCAAGC TCCTAAAGGG TTCATTTGGC CCCTCCTCAA AGACTAATGT CCCCATGTGA 1740
AAGCGTCTCT TTGTCTGGAG CTTTGAGGAG ATGTTTTCCT TCACTTTAGT TTCAAACCCA 1800
AGTGAGTGTG TGCACTTCTG CTTCTTTAGG GATGCCCTGT ACATCCCACA CCCCACCCTC 1860
CCTTCCCTTC ATACCCCTCC TCAACGTTCT TTTACTTTAT ACTTTAACTA CCTGAGAGTT 1920
ATCAGAGCTG GGGTTGTGGA ATGATCGATC ATCTATAGCA AATAGGCTAT GTTGAGTACG 1980
TAGGCTGTGG GAAGATGAAG ATGGTTTGGA GGTGTAAAAC AATGTOCTTC GCTGAGGCCA 2040
AAGTTTCCAT GTAAGCGGGA TCCGTTTTTT GGAATTTGGT TGAAGTCACT TTGATTTCTT 2100
TAAAAAACAT CTTTTCAATG AAATGTGTTA CCATTTCATA TCCATTGAAG CCGAAATCTG 2160
CATAAGGAAG CCCACTTTAT CTAAATGATA TTAGCCAGGA TCCTTGGTGT CCTAGGAGAA 2220
ACAGACAAGC AAAACAAAGT GAAAACCGAA TGGATTAACT TTTGCAAACC AAGGGAGATT 2280
TCTTAGCAAA TGAGTCTAAC AAATATGACA TCCGTCTTTC CCACTTTTGT TGATGTTTAT 2340
TTCAGAATCT TGTGTGATTC ATTTCAAGCA ACAACATGTT GTATTTTGTT GTGTTAAAAG 2400
TACTTTTCTT GATTTTTGAA TGTATTTGTT TCAGGAAGAA GTCATTTTAT GGATTTTTCT 2460
AACCCGTGTT AACTTTTCTA GAATCCACCC TCTTGTGCCC TTAAGCATTA CTTTAACTGG 2520
TAGGGAACGC CAGAACTTTT AAGTCCAGCT ATTCATTAGA TAGTAATTGA AGATATGTAT 2580
AAATATTACA AAGAATAAAA ATATATTACT GTCTCTTTAG TATGGTTTTC AGTGCAATTA 2640
AACCGAGAGA TGTCTTGTTT TTTTAAAAAG AATAGTATTT AATAGGTTTC TGACTTTTGT 2700
GGATCATTTT GCACATAGCT TTATCAACTT TTAAACATTA ATAAACTGAT TTTTTTAAAG



Seq ID NO: 2 Protein sequence
Protein Accession #: NP_001391

1          11         21         31         41         51
|          |          |          |          |          |
MGPTSVPLVK AHRSSVSDYV NYDIIVRHYN YTGKLNISAD KENSIKLTSV VFILICCFII 60
LENIFVLLTI WKTKKFHRPM YYFIGNLALS DLLAGVAYTA NLLLSGATTY KLTPAQWFLR 120
EGSMFVALSA SVFSLLAIAI ERYITMLKMK LHNGSNNFRL FLLISACWVI SLILGGLPIM 180
GWNCISALSS CSTVLPLYHK HYILFCTTVF TLLLLSIVIL YCRIYSLVRT RSRRLTFRKN 240
ISKASRSSEK SLALLKTVII VLSVFIACWA PLFILLLLDV GCKVKTCDIL FRAEYFLVLA 300
VLNSGTNPII YTLTNKEMRR AFIRIMSCCK CPSGDSAGKF KRPIIAGMEF SRSKSDNSSH 360
PQKDEGDNPE TIMSSGNVNS SS






Seq ID NO: 3 DNA sequence
Nucleic Acid Accession #: NM_002205.1
Coding sequence: 1..3149

1          11         21         31         41         51
|          |          |          |          |          |
ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60
CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120
GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180
GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240
CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300
TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360
CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCXG TGGAGTACAA GTCCTTGCAG 420
TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480
AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540
GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600






WO 03/042661 PCT/US02/36810






GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660
TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720
ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780
CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840
TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900
GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960
CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020
GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080
GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140
CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200
GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260
CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320
CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380
CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440
GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500
ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560
GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620
GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680
CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740
CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800
CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860
CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920
ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980
GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040
CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100
GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160
TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220
GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280
ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340
TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400
GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460
GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520
AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580
GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640
GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700
TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760
TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820
TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880
TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940
CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000
ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060
TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120
CTCAAGCCTC CAGCCACCTC TGATGCCTGA



Seq ID NO: 4 Protein sequence
Protein Accession #: NP_002196.1

1          11         21         31         41         51
|          |          |          |          |          |
MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60
GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120
LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180
DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ 240
IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300
GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360
EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420
QQGVVFVFPG GPGGLGSKPS QVLQPLHAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480
VDKAVVYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540
GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600
LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660
GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720
FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDVV 780
SFRLSVEAQA QVTLKGVSKP EAVLFPVSDW HPRDQPCKEE DLGPAVHHVY ELINQGPSSI 840
SQGVLELSCP QALEGQQLLY VTRVTGLNCT TKHPINPKGL ELDPEGSLHH QQKREAPSRS 900
SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960
YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA



Seq ID NO: 5 DNA sequence
Nucleic Acid Accession #: NM_002211.1
Coding sequence: 1..2397

1          11         21         31         41         51
|          |          |          |          |          |
ATGAATTTAC AACCAATTTT CTGGATTGGA CTGATCAGTT CAGTTTGCTG TGTGTTTGCT 60
CAAACAGATG AAAATAGATG TTTAAAAGCA AATGCCAAAT CATGTGGAGA ATGTATACAA 120
GCAGGGCCAA ATTGTGGGTG GTGCACAAAT TCAACATTTT TACAGGAAGG AATGCCTACT 180
TCTGCACGAT GTGATGATTT AGAAGCCTTA [...] GTTGCCCTCC AGATGACATA 240
GAAAATCCCA GAGGCTCCAA AGATATAAAG AAAAATAAAA ATGTAACCAA CCGTAGCAAA 300
GGAACAGCAG AGAAGCTCAA GCCAGAGGAT ATTACTCAGA TCCAACCACA GCAGTTGGTT 360
TTGCGATTAA GATCAGGGGA GCCACAGACA TTTACATTAA AATTCAAGAG AGCTGAAGAC 420
TATCCCATTG ACCTCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAGA CGATTTGGAG 480
AATGTAAAAA GTCTTGGAAC AGATCTGATG AATGAAATGA GGAGGATTAC TTCGGACTTC 540






AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC CTTACATTAG CACAACACCA 600 

GCTAAGCTCA GGAACCCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT TAGCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACTTGTTGG AAAACAGGGC 720

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG GAGGAATGTT ACACGGCTGC TGGTGTTTTC CACAGATGCC 840

GGGTTTCACT TTGCTGGAGA TGGGAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 900 

TGTCACCTGG AAAATAATAT GTACACAATG AGCCATTATT ATGATTATCC TTCTATTGCT 960 

CACCTTGTCC AGAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTGCAGT TACTGAAGAA 1020 

TTTCAGCCTG TTTACAAGGA GCTGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080 

TCTGCAAATT CTAGCAATGT AATTCAGTTG ATCATTGATG CATACAATTC CCTTTCCTCA 1140

GAAGTCATTT TGGAAAACGG CAAATTGTCA GAAGGAGTAA CAATAAGTTA CAAATCTTAC 1200 

TGCAAGAACG GGGTGAATGG AACAGGGGAA AATGGAAGAA AATGTTCCAA TATTTCCATT 1260 

GGAGATGAGG TTCAATTTGA AATTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATGCCA AAGCGAAGGC ATCCCTGAAA GTCCCAAGTG TCATGAAGGA 1440

AATGGGACAT TTGAGTGTGG CGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 1500 

GAATGCAGCA CAGATGAAGT TAACAGTGAA GACATGGATG CTTACTGCAG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAATGGAGAG TGCGTCTGCG GACAGTGTGT TTGTAGGAAG 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTGATAA TTTCAACTGT 1680 

GATAGATCCA ATGGCTTAAT TTGTGGAGGA AATGGTGTTT GCAAGTGTCG TGTGTGTGAG 1740

TGCAACCCCA ACTACACTGG CAGTGCATGT GACTGTTCTT TGGATACTAG TACTTGTGAA 1800 

GCCAGCAACG GACAGATCTG CAATGGCCGG GGCATCTGCG AGTGTGGTGT CTGTAAGTGT 1860 

ACAGATCCGA AGTTTCAAGG GCAAACGTGT GAGATGTGTC AGACCTGCCT TGGTGTCTGT 1920 

GCTGAGCATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAGAAAA GAAAGACACA 1980 

TGCACACAGG AATGTTCCTA TTTTAACATT ACCAAGGTAG AAAGTCGGGA CAAATTACCC 2040

CAGCCGGTCC AACCTGATCC TGTGTCCCAT TGTAAGGAGA AGGATGTTGA CGACTGTTGG 2100

TTCTATTTTA CGTATTCAGT GAATGGGAAC AACGAGGTCA TGGTTCATGT TGTGGAGAAT 2160 

CCAGAGTGTC CCACTGGTCC AGACATCATT CCAATTGTAG CTGGTGTGGT TGCTGGAATT 2220 

GTTCTTATTG GCCTTGCATT ACTGCTGATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 

AGGGAGTTTG CTAAATTTGA AAAGGAGAAA ATGAATGCCA AATGGGACAC GGGTGAAAAT 2340
CCTATTTATA AGAGTGCCGT AACAACTGTG GTCAATCCGA AGTATGAGGG AAAATGA 

Seq ID NO: 6 Protein sequence 
Protein Accession #: NP_002202.1

1          11         21         31         41         51

|          |          |          |          |          |

MNLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT 60

SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTAEKLKPED ITQIQPQQLV 120

LRLRSGEPQT FTLKFKRAED YPIDLYYLMD LSYSMKDDLE NVKSLGTDLM NEMRRITSDF 180

RIGFGSFVEK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNELVGKQR 240

ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GFHFAGDGKL GGIVLPNDGQ 300

CHLENNMYTM SHYYDYPSIA HLVQKLSENN IQTIFAVTEE FQPVYKELKN LIPKSAVGTL 360

SANSSNVIQL IIDAYNSLSS EVILENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 420

GDEVQFEISI TSNKCPKKDS DSFKIRPLGF TEEVEVILQY ICECECQSEG IPESPKCHEG 480

NGTFECGACR CNEGRVGRHC ECSTDEVNSE DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 540

RDNTNEIYSG KFCECDNFNC DRSNGLICGG NGVCKCRVCE CNPNYTGSAC DCSLDTSTCE 600

ASNGQICNGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT 660

CTQECSYFNI TKVESRDKLP QPVQPDPVSH CKEKDVDDCW FYFTYSVNGN NEVMVHVVEN 720

PECPTGPDII PIVAGVVAGI VLIGLALLLI WKLLMIIHDR REFAKFEKEK MNAKWDTGEN 780
PIYKSAVTTV VNPKYEGK 
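
Each DNA entry above is paired with a protein entry through its "Coding sequence" coordinates. The following is a minimal sketch, not part of the original listing, of how a reader might check that pairing by translating the annotated coding region with the standard genetic code. The helper names `clean` and `translate_cds` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the example input is the first three ten-base blocks of Seq ID NO: 5.

```python
# Minimal sketch (an assumption-laden illustration, not the applicants' method):
# translate the annotated coding region of a listed DNA sequence.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(listing: str) -> str:
    """Keep only letters, dropping spaces, digits and line breaks."""
    return "".join(ch for ch in listing.upper() if ch.isalpha())


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate positions start..end (1-based, inclusive) up to the first stop."""
    cds = clean(dna)[start - 1:end]
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # "X" marks a codon that contains an unreadable character.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of Seq ID NO: 5, as listed.
first_blocks = "ATGAATTTAC AACCAATTTT CTGGATTGGA"
print(translate_cds(first_blocks, 1, 30))  # MNLQPIFWIG
```

The printed residues match the first block of Seq ID NO: 6. Running the same check over a full entry is one way to spot character-level transcription errors in either listing.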

Seq ID NO: 7 DNA sequence 
Nucleic Acid Accession #: NM_002425
Coding sequence: 26..1453

1          11         21         31         41         51

|          |          |          |          |          |

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480

AGGAGAGGCT GATATAATGA TCTCTTTGGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 






GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740
CTT 

Seq ID NO: 8 Protein sequence
Protein Accession #: NP_002416

1          11         21         31         41         51
|          |          |          |          |          |
MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60
KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240
FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300
RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC



Seq ID NO: 9 DNA sequence
Nucleic Acid Accession #: XM_058189.2
Coding sequence: 169..774

1          11         21         31         41         51
|          |          |          |          |          |
GAAGACCAGC TCAGCTCTTC AGTTGTTGAT CATTGTCTAT TGTTCTCCAA ACAGTAAACC 60
AGTATTTCAC ACTGAGATTG TCGGCTGCGG GTATATTCCA ATTCCCCGTC TCCTCATGAA 120
TATGAAGTGA AGGGCTCTGA CCCTGGAAGT GGTTCTAAGC AGGGCAAAAT GGGGTCTCGG 180
AAGTGTGGAG GCTGCCTAAG TTGTTTGCTG ATTCCGCTTG CACTTTGGAG TATAATCGTG 240
AACATATTAT TGTATTTCCC GAATGGGCAA ACTTCCTATG CATCCAGCAA TAAACTCACC 300
AACTACGTGT GGTATTTTGA AGGAATCTGT TTCTCAGGCA TCATGATGCT TATAGTAACA 360
ACAGTTCTTC TGGTACTGGA GAATAATAAC AACTATAAAT GTTGCCAGAG TGAAAACTGC 420
AGCAAAAAAT ATGTGACACT GCTGTCAATT ATCTTTTCTT CCCTCGGAAT TGCTTTTTCT 480
GGATACTGCC TGGTCATCTC TGCCTTGGGT CTTGTCCAAG GGCCATATTG CCGCACCCTT 540
GATGGCTGGG AGTATGCTTT TGAAGGCACT GCTGGACGTT TCCTTACAGA TTCTAGCATA 600
TGGATTCAGT GCCTGGAACC TGCACATGTT GTGGAGTGGA ACATCATTTT ATTTTCCATT 660
CTCATAACCC TCAGTGGGCT TCAAGTGATC ATCTGCCTCA TCAGAGTAGT CATGCAACTA 720
TCCAAGATAC TGTGTGGAAG CTATTCAGTG ATCTTCCAGC CTGGAATCAT TTGAATAAGG 780
ACAAAATGTT TTCCATTATC AAGACATGGC CATCTATCTA AATATTATAT CAACTGTGTA 840
GACTTGAGGG CAATATTGAA ATGATGGTGC TTTCTGCATT TGGTGTTTAT TTGTAAAAAA 900
TTTGCAGTCC TCACTGCACA TGCAAGTATA CCACCCTTCC ATTTAGTATG TTTTTTAAGT 960
AATATGCATC AGAAACTTCA GAAATACTTC TGCCCTTTGA TCAAACAAAT CCATTTCCAA 1020
GAATCTGTAC TAGGGAAGTA AATAAGAATA TGAGAGAAAC CTTTATGCAA ATATGTATAT 1080
TGCAACATTA TTTAATATTC TGGAAAATTG GAAACACCCC AAAATTCTAA ACTCAGAGGA 1140
AGGATTAAGT AAAGAGTGGT ACATACTGTA AATGTTTTCT GATATTAAAA AAAAAATTAA 1200
ATAAAAAATA AAGAGTACTA CATGGTTGTA AAA



Seq ID NO: 10 Protein sequence 
Protein Accession #: XP_058189.1

1 11 21 31 41 51

| | | | | |

MGSRKCGGCL SCLLIPLALW SIIVNILLYF PNGQTSYASS NKLTNYVWYF EGICFSGIMM
LIVTTVLLVL ENNNNYKCCQ SENCSKKYVT LLSIIFSSLG IAFSGYCLVI SALGLVQGPY
CRTLDGWEYA FEGTAGRFLT DSSIWIQCLE PAHVVEWNII LFSILITLSG LQVIICLIRV
VMQLSKILCG SYSVIFQPGI I

Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



1 11 21 31 41 51

| | | | | |

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60
CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120
TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180
GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240
GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300
GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 660
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840
AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900
TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 12 Protein sequence 
Protein Accession #: NP_002412.1 

1 11 21 31 41 51

| | | | | |

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN

Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 

1 11 21 31 41 51 

I I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 



Seq ID NO: 14 Protein sequence 
Protein Accession #: NP_002412.1 

1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60
VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120
YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 
LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 240 
TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 
FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 
PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 141.. 1580 

1 11 21 31 41 51 

| | I I I I 

TCTGCGTGTG CCGGGGCTAG GGGCTGGAAG TCCTGGCTCT AGTTGCACCT CGGAAGGAAA 60 

AGGCAAACAG AGGAGGGAAG GCGTCTTAGG ACTGCCTGGA TCCAGAGCAC TTTCCTCGGC 120 

CTCTACAGGC CTGTGTCGCT ATGGGTTCCC CCGCCGCCCC GGAGGGAGCG CTGGGCTACG 180 

TCCGCGAGTT CACTCGCCAC TCCTCCGACG TGCTGGGCAA CCTCAACGAG CTGCGCCTGC 240 

GCGGGATCCT CACTGACGTC ACGCTGCTGG TTGGCGGGCA ACCCCTCAGA GCACACAAGG 300 

CAGTTCTCAT CGCCTGCAGT GGCTTCTTCT ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 360 

GGGTGGACGT GCTCTCTCTG CCCGGGGGTC CCGAAGCGAG AGGCTTCGCC CCTCTATTGG 420 

ACTTCATGTA CACTTCGCGC CTGCGCCTCT CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 480 

CCGCCACCTA TTTGCAGATG GAGCACGTGG TCCAGGCATG CCACCGCTTC ATCCAGGCCA 540

GCTATGAACC TCTGGGCATC TCCCTGCGCC CCCTGGAAGC AGAACCCCCA ACACCCCCAA 600 

CGGCCCCTCC ACCAGGTAGT CCCAGGCGCT CCGAAGGACA CCCAGACCCA CCTACTGAAT 660 

CTCGAAGCTG CAGTCAAGGC CCCCCCAGTC CAGCCAGCCC TGACCCCAAG GCCTGCAACT 720 

GGAAAAAGTA CAAGTACATC GTGCTAAACT CTCAGGCCTC CCAAGCAGGG AGCCTGGTCG 780 






GGGAGAGAAG TTCTGGTCAA CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 840 
CCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GTGAAGAAGG ACCCATTCCT GGTCCCCAGA 900 
GCAGGCTCTC TCCAACTGCT GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCAGTACCC 960 
CCTACCTCCT CACATCCCAG GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTCGTC 1020 
CACTACCGGG AAGTGAATTT TTCAGCTGCC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT 1080 
CGGGGCTGGA CTCCTTGGTT CCTGGGGACG AAGACAAACC CTATAAGTGT CAGCTGTGCC 1140
GGTCTTCGTT CCGCTACAAG GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 1200 
AGCCTTACCA CTGCTCAATC TGCGGAGCCC GTTTTAACCG GCCAGCAAAC CTGAAAACGC 1260 
ACAGCCGCAT CCATTCGGGA GAGAAGCCGT ATAAGTGTGA GACGTGCGGC TCGCGCTTTG 1320 
TACAGGTGGC ACATCTGCGG GCGCACGTGC TGATCCACAC CGGGGAGAAG CCCTACCCTT 1380 
GCCCTACCTG CGGAACCCGC TTCCGCCACC TGCAGACCCT CAAGAGCCAC GTTCGCATCC 1440
ACACCGGAGA GAAGCCTTAC CACTGCGACC CCTGTGGCCT GCATTTCCGG CACAAGAGTC 1500 
AACTGCGGCT GCATCTGCGC CAGAAACACG GAGCTGCTAC CAACACCAAA GTGCACTACC 1560 
ACATTCTCGG GGGGCCCTAG CTGAGCGCAG GCCCAGGCCC CACTTGCTTC CTGCGGGTGG 1620 
GAAAGCTGCA GGCCCAGGCC TTGCTTCCCT ATCAGGCTTG GGCATAGGGG TGTGCCAGGC 1680 
CACTTTGGTA TCAGAAATTG CCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGTGGCA 1740 
GATCCTGGCT AGATCTGCCT CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 1800 
GTTTCTGAGG AGAGAGCTAG CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 1860
CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATTGCATT CAGTTTATCT GTAAATATAA 1920 
TTTATTGAGG CCTTTGGGTG GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 1980 
TTCCACAAGT GTGATTAAAA GTGACCAGAA ACACAGAAGG TGAGATCACA GCTCTGCTGG 2040 
CAGAGATTAC TAGCCCTTGG CTCTCTCGTT TGGCTTGGGT ATTTTATATT ATTTCTGTCA 2100 
TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 2160 
TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 
TCTAGGGCAG CTCTGGGAAC ATGCGGGATT GTGGAATTGG GTCAGGAACC CTCTCTGGTA 2280 
TTCTGGATGT TGTAGGTTCT CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 2340 
CAAGGGTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 2400 
GAGGCTGTGG GTGTGGGGGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 2460 
GGGTGTGGGG GATTCTGTAT CTGGATTCCG TATCACTCCA AGTGGAGGCT GGCAGGTTTT 2520 
TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCT 2580 
GCTGTATCCA TCTATAGTGG TAGAGACCCA CCAGGGCTCA AGTGGAGTCC ATCATCCTCC 2640 
CACGGGGGCC TGTTCTTAGC ACTGAGTTGA TCGCTCCATG GGGGAGAGAT CAGACATTCC 2700 
TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 2760
GGGAAGAATC ATGAAACTCT TTAGCTTGAT TAGATGGTAA ACAGTGTTAA CCCATCCTTT 2820 
ACTACAGAGG CATATGGGTT TGAATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCCCCT 2880 
TCTTCCTTTA GTGGGTTTTG GACATCTTCT GGCAAGTGTC CAGATGCCAG AACCTTCTTT 2940 
TCCTCTAGAA GGGATGGTGC TTGGTAACCT TACCTTTTAA AAGCTGGGTC TGTGACCTGG 3000 
TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAGGAA 3060 
AAGAAAAGGG GCTGAGTTCC ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 3120 
ATTACAGATG TAAAAGATTG ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 3180 
TTCAAGTAGG ATTAAGAGGT TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 3240 
GAAAGTGAGG AACAGGGTTG CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 3300 
GCTGAAGCCT TGATTGATAG TTCTGCCCCT TGTTGCCCTG GGGCTTATCT GATTATGGGA 3360 
CGAGGGTAGA AAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 3420 
GTTCTAGTGG CTGTCGCCTG GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 3480 
TCTCCCCATG GCCCCACTGC AGAATTAAAG AAGGAAGAAG GGAAGGCGGA GGAGTCTATA 3540 
AGAAGGAATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGGGGT 3600 
AGAAATGTTA GATCTTGCAA CATCAGATCC TTGGAATAAA GAAGCCTCTC TGYGCWRAAA 3660 
AAAAAAAAAA AAAAAA 

Seq ID NO: 16 Protein sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

| | | | | |

MGSPAAPEGA LGYVREFTRH SSDVLGNLNE LRLRGILTDV TLLVGGQPLR AHKAVLIACS 60

GFFYSIFRGR AGVGVDVLSL PGGPEARGFA PLLDFMYTSR LRLSPATAPA VLAAATYLQM 120

EHVVQACHRF IQASYEPLGI SLRPLEAEPP TPPTAPPPGS PRRSEGHPDP PTESRSCSQG 180

PPSPASPDPK ACNWKKYKYI VLNSQASQAG SLVGERSSGQ PCPQARLPSG DEASSSSSSS 240

SSSSEEGPIP GPQSRLSPTA ATVQFKCGAP ASTPYLLTSQ AQDTSGSPSE RARPLPGSEF 300

FSCQNCEAVA GCSSGLDSLV PGDEDKPYKC QLCRSSFRYK GNLASHRTVH TGEKPYHCSI 360

CGARFNRPAN LKTHSRIHSG EKPYKCETCG SRFVQVAHLR AHVLIHTGEK PYPCPTCGTR 420
FRHLQTLKSH VRIHTGEKPY HCDPCGLHFR HKSQLRLHLR QKHGAATNTK VHYHILGGP

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: XM_039209

Coding sequence: 1..2049 

1 11 21 31 41 51 

1 I I I I 1 

ATGCTGAAGA TGCTCTCCTT TAAGCTGCTG CTGCTGGCCG TGGCTCTGGG CTTCTTTGAA 60 

GGAGATGCTA AGTTTGGGGA AAGAAACGAA GGGAGCGGAG CAAGGAGGAG AAGGTGCCTG 120

AATGGGAACC CCCCGAAGCG CCTGAAAAGG AGAGACAGGA GGATGATGTC CCAGCTGGAG 180

CTGCTGAGTG GGGGAGAGAT GCTGTGCGGT GGCTTCTACC CTCGGCTGTC CTGCTGCCTG 240

CGGAGTGACA GCCCGGGGCT AGGGCGCCTG GAGAATAAGA TATTTTCTGT TACCAACAAC 300 

ACAGAATGTG GGAAGTTACT GGAGGAAATC AAATGTGCAC TTTGCTCTCC ACATTCTCAA 360 

AGCCTGTTCC ACTCACCTGA GAGAGAAGTC TTGGAAAGAG ACCTAGTACT TCCTCTGCTC 420

TGCAAAGACT ATTGCAAAGA ATTCTTTTAC ACTTGCCGAG GCCATATTCC AGGTTTCCTT 480

CAAACAACTG CGGATGAGTT TTGCTTTTAC TATGCAAGAA AAGATGGTGG GTTGTGCTTT 540 

CCAGATTTTC CAAGAAAACA AGTCAGAGGA CCAGCATCTA ACTACTTGGA CCAGATGGAA 600 

GAATATGACA AAGTGGAAGA GATCAGCAGA AAGCACAAAC ACAACTGCTT CTGTATTCAG 660 

GAGGTTGTGA GTGGGCTGCG GCAGCCCGTT GGTGCCCTGC ATAGTGGGGA TGGCTCGCAA 720 

CGTCTCTTCA TTCTGGAAAA AGAAGGTTAT GTGAAGATAC TTACCCCTGA AGGAGAAATT 780 

TTCAAGGAGC CTTATTTGGA CATTCACAAA CTTGTTCAAA GTGGAATAAA GGGAGGAGAT 840 



GAAAGAGGAC TGCTAAGCCT CGCATTCCAT CCCAATTACA AGAAAAATGG AAAGTTGTAT 900

GTGTCCTATA CCACCAACCA AGAACGGTGG GCTATCGGGC CTCATGACCA CATTCTTAGG 960 

GTTGTGGAAT ACACAGTATC CAGAAAAAAT CCACACCAAG TTGATTTGAG AACAGCCAGA 1020 

GTCTTTCTTG AAGTTGCAGA ACTCCACAGA AAGCATCTGG GAGGACAACT GCTCTTTGGC 1080 

CCTGACGGCT TTTTGTACAT CATTCTTGGT GATGGGATGA TTACACTGGA TGATATGGAA 1140 

GAAATGGATG GGTTAAGTGA TTTCACAGGC TCAGTGCTAC GGCTGGATGT GGACACAGAC 1200 

ATGTGTAACG TGCCTTATTC CATACCAAGG AGCAACCCAC ACTTCAACAG CACCAACCAG 1260 

CCCCCCGAAG TGTTTGCTCA TGGGCTCCAC GATCCAGGCA GATGTGCTGT GGATAGACAT 1320 

CCCACTGATA TAAACATCAA TTTAACGATA CTGTGTTCAG ACTCCAATGG AAAAAACAGA 1380 

TCATCAGCCA GAATTCTACA GATAATAAAG GGGAAAGATT ATGAAAGTGA GCCATCACTT 1440 

TTAGAATTCA AGCCATTCAG TAATGGTCCT TTGGTTGGTG GATTTGTATA CCGGGGCTGC 1500 

CAGTCAGAAA GATTGTATGG AAGCTACGTG TTTGGAGATC GTAATGGGAA TTTCCTAACT 1560 

CTCCAGCAAA GTCCTGTGAC AAAGCAGTGG CAAGAAAAAC CACTCTGTCT CGGCACTAGT 1620 

GGGTCCTGTA GAGGCTACTT TTCCGGTCAC ATCTTGGGAT TTGGAGAAGA TGAACTAGGT 1680

GAAGTTTACA TTTTATCAAG CAGTAAAAGT ATGACCCAGA CTCACAATGG AAAACTCTAC 1740 

AAAATTGTAG ATCCCAAAAG ACCTTTAATG CCTGAGGAAT GCAGAGCCAC GGTACAACCT 1800 

GCACAGACAC TGACTTCAGA GTGCTCCAGG CTCTGTCGAA ACGGCTACTG CACCCCCACG 1860 

GGAAAGTGCT GCTGCAGTCC AGGCTGGGAG GGGGACTTCT GCAGAACTGC AAAATGTGAG 1920 

CCAGCATGTC GTCATGGAGG TGTCTGTGTT AGACCGAACA AGTGCCTCTG TAAAAAAGGA 1980 

TATCTTGGTC CTCAATGTGA ACAAGTGGAC AGAAACATCC GCAGAGTGAC CAGGGCAGAC 2040 
ATCACCTAG 



Seq ID NO: 18 Protein sequence 
Protein Accession #: XP_039209 



1 11 21 31 41 51 

I I I I I I 

MLKMLSFKLL LLAVALGFFE GDAKFGERNE GSGARRRRCL NGNPPKRLKR RDRRMMSQLE 60 

LLSGGEMLCG GFYPRLSCCL RSDSPGLGRL ENKIFSVTNN TECGKLLEEI KCALCSPHSQ 120 

SLFHSPEREV LERDLVLPLL CKDYCKEFFY TCRGHIPGFL QTTADEFCFY YARKDGGLCF 180 

PDFPRKQVRG PASNYLDQME EYDKVEEISR KHKHNCFCIQ EVVSGLRQPV GALHSGDGSQ 240

RLFILEKEGY VKILTPEGEI FKEPYLDIHK LVQSGIKGGD ERGLLSLAFH PNYKKNGKLY 300

VSYTTNQERW AIGPHDHILR VVEYTVSRKN PHQVDLRTAR VFLEVAELHR KHLGGQLLFG 360

PDGFLYIILG DGMITLDDME EMDGLSDFTG SVLRLDVDTD MCNVPYSIPR SNPHFNSTNQ 420

PPEVFAHGLH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARILQIIK GKDYESEPSL 480

LEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFLT LQQSPVTKQW QEKPLCLGTS 540

GSCRGYFSGH ILGFGEDELG EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PEECRATVQP 600

AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 660 
YLGPQCEQVD RNIRRVTRAD IT 

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: NM_014331.2

Coding sequence: 1..1506

1 11 21 31 41 51

| | | | | |

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 
TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 
TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 
AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 
TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900
AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 






TTATCTGTCA [illegible] TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280
AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340
TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400
TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460
CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520
TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580
CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640
AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700
ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760
CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880
GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940
CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000
GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060
GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120
CCATCTCCAA AAAAAAAAAA AAAA

Seq ID NO: 20 Protein sequence
Protein Accession #: NP_055146.1

1 11 21 31 41 51

| | | | | |

MVRKPVVSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60
GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120
LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITVVMVLN 180
SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240
YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS 300
NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360
RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420
FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480
SEKITRTLQI ILEVVPEEDK L






Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NM_002422.2 

Coding sequence: 64..1497

1 11 21 31 41 51

| | | | | |

ACAAGGAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60
GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120
TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180
AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240
GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300
GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCTGA TGTTGGTCAC 360
TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420
AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480
GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540
ATGATCTCTT TTGCAGTTAG AGAACATGGA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600
GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660
GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720
ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780
TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840
CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTGGTACC CACGGAACCT 900
GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960
AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020
AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080
GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140
TTCTGGGCCA TCAGAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200
GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260
ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320
CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380
GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440
CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500
AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560
CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620
GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680
ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740
AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800
AGACGATTTG TCAGTTGTTT T

Seq ID NO: 22 Protein sequence
Protein Accession #: NP_002413

1 11 21 31 41 51

| | | | | |

MKSLPILLLL CVAVCSAYPL DGAARGEDTS MNLVQKYLEN YYDLEKDVKQ FVRRKDSGPV 60
VKKIREMQKF LGLEVTGKLD SDTLEVMRKP RCGVPDVGHF RTFPGIPKWR KTHLTYRIVN 120
YTPDLPKDAV DSAVEKALKV WEEVTPLTFS RLYEGEADIM ISFAVREHGD FYPFDGPGNV 180
LAHAYAPGPG INGDAHFDDD EQWTKDTTGT NLFLVAAHEI GHSLGLFHSA NTEALMYPLY 240
HSLTDLTRFR LSQDDINGIQ SLYGPPPDSP ETPLVPTEPV PPEPGTPANC DPALSFDAVS 300
TLRGEILIFK DRHFWRKSLR KLEPELHLIS SFWPSLPSGV DAAYEVTSKD LVFIFKGNQF 360
WAIRGNEVRA GYPRGIHTLG FPPTVRKIDA AISDKEKNKT YFFVEDKYWR FDEKRNSMEP 420
GFPKQIAEDF PGIDSKIDAV FEEFGFFYFF TGSSQLEFDP NAKKVTHTLK SNSWLNC

Seq ID NO: 23 DNA sequence

Nucleic Acid Accession #: NM_006528

Coding sequence: 57..764



1 11 21 31 41 51

| | | | | |

GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60
ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120
GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180
ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240
GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300
GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360
TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420
GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG 480
AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540
AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600
CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660
AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 720
GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 780
ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840
GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900
TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960
TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020
AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080
AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140
CC



Seq ID NO: 24 Protein sequence 
Protein Accession #: NP_006519 



1 11 21 31 41 51

| | | | | |



MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 
CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 
TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 

Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: NM_005458.1

Coding sequence: 1..2825 



1 11 21 31 41 51

| | | | | |

ATGGCTTCCC CGCGGAGGTC CGGGCAGCCA GGGCGGCCGC CGCCGCCGCC ACCGCCGCCC 60
GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG 120
GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180
CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240
GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300
CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360
ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420
ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480
CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540
GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600
CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660
GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720
AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 780
GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840
ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900
CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960
CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 1020
GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT 1080
GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140
CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200
AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 1260
GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320
GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380
TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440
CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500
TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560
AACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620
GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680
ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740
ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800
GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860
CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920
ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980
TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040
GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG 2100
ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160
TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220
GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280
CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 2340
AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 2400






AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 

AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 

CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 
CTGTAA 

Seq ID NO: 26 Protein sequence
Protein Accession #: NP_005449.1

1 11 21 31 41 51

| | | | | |

MASPRRSGQP GRPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPRPP PSSPPLSIMG 60 

LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 120 

IKYGPNHLMV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFFRTVPSDN 180 

AVNPAILKLL KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY GEDIEISDTE SFSNDPCTSV 240

KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IPGWYEPSWW EQVHTEANSS 300 

RCLRKNLLAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER EYNNKRSGVG PSKFHGYAYD 360

GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQVVFRNG 420

ERMGTIKFTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 480

LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 540

GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 600

VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV 660

YAYKGLLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 720 

FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 780 

NQASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQNH YQELNDILNL 840

GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 900 
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: NM_000450.1

Coding sequence: 117..1949

1 11 21 31 41 51

| | | | | |

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTCATGA 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180 

GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAAGGTACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 

TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGGGCT CCAGGTGAAC 420 

CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 480 

TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TGCCCTATGC TACACAGCTG 540 

CCTGTACCAA TACATCCTGC AGTGGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 600 

CTTGCAAGTG TGACCCTGGC TTCAGTGGAC TCAAGTGTGA GCAAATTGTG AACTGTACAG 660 

CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 720 

ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 780 

TGCAGTGTAT GTCCTCTGGA GAATGGAGTG CTCCTATTCC AGCCTGCAAT GTGGTTGAGT 840 

GTGATGCTGT GACAAATCCA GCCAATGGGT TCGTGGAATG TTTCCAAAAC CCTGGAAGCT 900 

TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 960 

AGAGCCTTCA GTGTACCTCA TCTGGGAATT GGGACAACGA GAAGCCAACG TGTAAAGCTG 1020 

TGACATGCAG GGCCGTCCGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTG 1080 

CTGGAGAGTT CACCTTCAAA TCATCCTGCA ACTTCACCTG TGAGGAAGGC TTCATGTTGC 1140 

AGGGACCAGC CCAGGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAGTTT 1200 

GTGAAGCTTT CCAGTGCACA GCCTTGTCCA ACCCCGAGCG AGGCTACATG AATTGTCTTC 1260 

CTAGTGCTTC TGGCAGTTTC CGTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGCAGGGTT 1320 

TTGTGTTGAA GGGATCCAAA AGGCTCCAAT GTGGCCCCAC AGGGGAGTGG GACAACGAGA 1380 

AGCCCACATG TGAAGCTGTG AGATGCGATG CTGTCCACCA GCCCCCGAAG GGTTTGGTGA 1440 

GGTGTGCTCA TTCCCCTATT GGAGAATTCA CCTACAAGTC CTCTTGTGCC TTCAGCTGTG 1500

AGGAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG CACATCTCAG GGACAATGGA 1560 

CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAG CCTGGCAGTT CCGGGAAAGA 1620 

TCAACATGAG CTGCAGTGGG GAGCCCGTGT TTGGCACTGT GTGCAAGTTC GCCTGTCCTG 1680 

AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG 1740 

GCCTGCTACC TACCTGTGAA GCTCCCACTG AGTCCAACAT TCCCTTGGTA GCTGGACTTT 1800 

CTGCTGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT CGGAAATGCT 1860 

TACGGAAAGC AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACGGAA 1920 

GCTACCAAAA GCCTTCTTAC ATCCTTTAAG TTCAAAAGAA TCAGAAACAG GTGCATCTGG 1980

GGAACTAGAG GGATACACTG AAGTTAACAG AGACAGATAA CTCTCCTCGG GTCTCTGGCC 2040 

CTTCTTGCCT ACTATGCCAG ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA AGTCCAGCAG GCAAGGACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 2220 

CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2280 

CGCTGTAAAA TCTTGGCACA GAAACACAAT ATTTTGTGGC TTTCTTTCTT TTGCCCTTCA 2340 

CAGTGTTTCG ACAGCTGATT ACACAGTTGC TGTCATAAGA ATGAATAATA ATTATCCAGA 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGCATGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 2520

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATGG GATTCAGTGC TTCTTAAAGA 2580

GTTCTTAAGG ATTGTGATAT TTTTACTTGC ATTGAATATA TTATAATCTT CCATACTTCT 2640 

TCATTCAATA CAAGTCTGGT AGGGACTTAA AAAACTTGTA AATGCTGTCA ACTATGATAT 2700 

GGTAAAAGTT ACTTATTCTA GATTACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 2760 
TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCCCT AGCAAGGCAT GATGTTAACC 2820 






WO 03/042661 



AGAATAAAGT TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATGGT AGAATTGGAG 2880
AGTAAAAACT GAATGGAAGG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 2940
CCACGATGAA AAACTTCCAT GAGGCCAAAC GTTTTGAACT AATAAAAGCA TAAATGCAAA 3000
CACACAAAGG TATAATTTTA TGAATGTCTT TGTTGGAAAA GAATACAGAA AGATGGATGT 3060
GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120
TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 3180
TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240
GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300
AACAATTCCA AAGGAATCTC CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360
AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAACTC TCCTACACTT 3420
CCATTAACTT AGCATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGGCT GAACACTGGC 3480
AACGACAAAG CCAACAGTCA AAACAGAGAT GTGATAAGGA TCAGAACAGC AGAGGTTCTT 3540
TTAAAGGGGC AGAAAAACTC TGGGAAATAA GAGAGAACAA CTACTGTGAT CAGGCTATGT 3600
ATGGAATACA GTGTTATTTT CTTTGAAATT GTTTAAGTGT TGTAAATATT TATGTAAACT 3660
GCATTAGAAA TTAGCTGTGT GAAATACCAG TGTGGTTTGT GTTTGAGTTT TATTGAGAAT 3720
TTTAAATTAT AACTTAAAAT ATTTTATAAT TTTTAAAGTA TATATTTATT TAAGCTTATG 3780
TCAGACCTAT TTGACATAAC ACTATAAAGG TTGACAATAA ATGTGCTTAT GTTT

Seq ID NO: 28 Protein sequence
Protein Accession #: NP_000441.1

1          11         21         31         41         51
|          |          |          |          |          |
MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60
SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120
DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180
TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNVV 240
ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300
AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360
VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420
EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480
WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 540
SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600
GSYQKPSYIL



Seq ID NO: 29 DNA sequence
Nucleic Acid Accession #: NM_007036
Coding sequence: 56..610

1          11         21         31         41         51
|          |          |          |          |          |
CTTCCCACCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 60
GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120
TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAAAA GCAGCCCGCG 180
CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 240
AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 300
GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360
TCCCTACGGC ACCTTCGGGA TGGATTGCAG AGAGACCTGC AACTGCCAGT CAGGCATCTG 420
TGACAGGGGG ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC 480
TTCCAACAGA TTTGTTTCTC TCACGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 540
GAGAGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 600
TCCACGCTGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTCGTGA TTGTTCAACA 660
CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 720
CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780
ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCATG TAGATTTGTT 840
AAATGTGTGT GTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900
AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT 960
TCTTTGACTT TGATGTACAT TAATGTTGGG ATATGGAATG AAGACTTAAG AGCAGGAGAA 1020
GATGGGGAGG GGGTGGGAGT GGGAAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 1080
TCTAGAATTT AATTGTGCTT TTTTTTTTTT TTTGGCTTTG GGAAAAGTCA AAATAAAACA 1140
ACCAGAAAAC CCCTGAAGGA AGTAAGATGT TTGAAGCTTA TGGAAATTTG AGTAACAAAC 1200
AGCTTTGAAC TGAGAGCAAT TTCAAAAGGC TGCTGATGTA GTTCCCGGGT TACCTGTATC 1260
TGAAGGACGG TTCTGGGGCA TAGGAAACAC ATACACTTCC ATAAATAGCT TTAACGTATG 1320
CCACCTCAGA GATAAATCTA AGAAGTATTT TACCCACTGG TGGTTTGTGT GTGTATGAAG 1380
GTAAATATTT ATATATTTTT ATAAATAAAT GTGTTAGTGC AAGTCATCTT CCCTACCCAT 1440
ATTTATCATC CTCTTGAGGA AAGAAATCTA GTATTATTTG TTGAAAATGG TTAGAATAAA 1500
AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCATG ATAAATTTAT TATCCATAAT 1560
TATAGGAGTC ACTCTGGATT TCAAAAAATG TCAAAAAATG AGCAACAGAG GGACCTTATT 1620
TAAACATAAG TGCTGTGACT TCGGTGAATT TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680
GGAGGTTTGT AAAAGAAGAA TCAATTTTCA GCAGAAAACA TGTCAACTTT AAAATATAGG 1740
TGGAATTAGG AGTATATTTG AAAGAATCTT AGCACAAACA GGACTGTTGT ACTAGATGTT 1800
CTTAGGAAAT ATCTCAGAAG TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860
AGTATTTACC TGTATTTTAT TCTTGAAGTT GGCCAACAGA GTTGTGAATG TGTGTGGAAG 1920
GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTGTT TTAAAAGGAC ATGTTTATTA 1980
TTGTTCAATA AAAAAGAACA AGATAC



Seq ID NO: 30 Protein sequence 
Protein Accession #: NP_008967.1

1 11 21 31 41 51 

| | I I I I 

MKSVLLLTTL LVPAHLVAAW SNNYAVDCPQ HCDSSECKSS PRCKRTVLDD CGCCRVCAAG 60 
RGETCYRTVS GMDGMKCGPG LRCQPSNGED PFGEEFGICK DCPYGTFGMD CRETCNCQSG 120 
ICDRGTGKCL KFPFFQYSVT KSSNRFVSLT EHDMASGDGN IVREEVVKEN AAGSPVMRKW 180
LNPR

PCT/US02/36810

Seq ID NO: 31 DNA sequence 
Nucleic Acid Accession #: NM_000963 
Coding sequence: 135..1949

1 11 21 31 41 51 

I I I I I 1 

CAATTGTCAT ACGACTTGCA GTGAGCGTCA GGAGCACGTC CAGGAACTCC TCAGCAGCGC 60 
CTCCTTCAGC TCCACAGCCA GACGCCCTCA GACAGCAAAG CCTACCCCCG CGCCGCGCCC 120
TGCCCGCCGC TCGGATGCTC GCCCGCGCCC TGCTGCTGTG CGCGGTCCTG GCGCTCAGCC 180
ATACAGCAAA TCCTTGCTGT TCCCACCCAT GTCAAAACCG AGGTGTATGT ATGAGTGTGG 240 
GATTTGACCA GTATAAGTGC GATTGTACCC GGACAGGATT CTATGGAGAA AACTGCTCAA 300 
CACCGGAATT TTTGACAAGA ATAAAATTAT TTCTGAAACC CACTCCAAAC ACAGTGCACT 360 
ACATACTTAC CCACTTCAAG GGATTTTGGA ACGTTGTGAA TAACATTCCC TTCCTTCGAA 420 
ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATTGACAGT CCACCAACTT 480
ACAATGCTGA CTATGGCTAC AAAAGCTGGG AAGCCTTCTC TAACCTCTCC TATTATACTA 540 
GAGCCCTTCC TCCTGTGCCT GATGATTGCC CGACTCCCTT GGGTGTCAAA GGTAAAAAGC 600 
AGCTTCCTGA TTCAAATGAG ATTGTGGAAA AATTGCTTCT AAGAAGAAAG TTCATCCCTG 660 
ATCCCCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAGTTTT 720 
TCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACCAACGG GCTGGGCCAT GGGGTGGACT 780
TAAATCATAT TTACGGTGAA ACTCTGGCTA GACAGCGTAA ACTGCGCCTT TTCAAGGATG 840 
GAAAAATGAA ATATCAGATA ATTGATGGAG AGATGTATCC TCCCACAGTC AAAGATACTC 900 
AGGCAGAGAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 
AGGTCTTTGG TCTGGTGCCT GGTCTGATGA TGTATGCCAC AATCTGGCTG CGGGAACACA 1020 
ACAGAGTATG CGATGTGCTT AAACAGGAGC ATCCTGAATG GGGTGATGAG CAGTTGTTCC 1080
AGACAAGCAG GCTAATACTG ATAGGAGAGA CTATTAAGAT TGTGATTGAA GATTATGTGC 1140 
AACACTTGAG TGGCTATCAC TTCAAACTGA AATTTGACCC AGAACTACTT TTCAACAAAC 1200 
AATTCCAGTA CCAAAATCGT ATTGCTGCTG AATTTAACAC CCTCTATCAC TGGCATCCCC 1260 
TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 
ACAACTCTAT ATTGCTGGAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAGGCAAA 1380
TTGCTGGCAG GGTTGCTGGT GGTAGGAATG TTCCACCCGC AGTACAGAAA GTATCACAGG 1440 
CTTCCATTGA CCAGAGCAGG CAGATGAAAT ACCAGTCTTT TAATGAGTAC CGCAAACGCT 1500
TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAAAGGAA ATGTCTGCAG 1560
AGTTGGAAGC ACTCTATGGT GACATCGATG CTGTGGAGCT GTATCCTGCC CTTCTGGTAG 1620 
AAAAGCCTCG GCCAGATGCC ATCTTTGGTG AAACCATGGT AGAAGTTGGA GCACCATTCT 1680
CCTTGAAAGG ACTTATGGGT AATGTTATAT GTTCTCCTGC CTACTGGAAG CCAAGCACTT 1740 
TTGGTGGAGA AGTGGGTTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTGCA 1800 
ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCAGTGTTCC AGATCCAGAG CTCATTAAAA 1860 
CAGTCACCAT CAATGCAAGT TCTTCCCGCT CCGGACTAGA TGATATCAAT CCCACAGTAC 1920 
TACTAAAAGA ACGTTCGACT GAACTGTAGA AGTCTAATGA TCATATTTAT TTATTTATAT 1980
GAACCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACTCCT TATGTTACTT 2040 
AACATCTTCT GTAACAGAAG TCAGTACTCC TGTTGCGGAG AAAGGAGTCA TACTTGTGAA 2100 
GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 
TATTCTGTTT TATAAACCAG AGAGAAATGA GTTTTGACGT CTTTTTACTT GAATTTCAAC 2220 
TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATGGCA 2280
AAATGCTGAA AGTTTTTACA CTGTCGATGT TTCCAATGCA TCTTCCATGA TGCATTAGAA 2340 
GTAACTAATG TTTGAAATTT TAAAGTACTT TTGGTTATTT TTCTGTCATC AAACAAAAAC 2400 
AGGTATCAGT GCATTATTAA ATGAATATTT AAATTAGACA TTACCAGTAA TTTCATGTCT 2460 
ACTTTTTAAA ATCAGCAATG AAACAATAAT TTGAAATTTC TAAATTCATA GGGTAGAATC 2520 
ACCTGTAAAA GCTTGTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA 2580
GCTGTCTTGG ATTTAAATCT GTAAAATCAG ATGAAATTTT ACTACAATTG CTTGTTAAAA 2640 
TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT GTGACTGTTA 2700 
AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTGGAGCCAC TGCAGTGTTA 2760 
TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GTTTATATGG CTGGTAACAT 2820 
GTAAAATCTA TATCAGCAAA AGGGTCTACC TTTAAAATAA GCAATAACAA AGAAGAAAAC 2880
CAAATTATTG TTCAAATTTA GGTTTAAACT TTTGAAGCAA ACTTTTTTTT ATCCTTGTGC 2940 
ACTGCAGGCC TGGTACTCAG ATTTTGCTAT GAGGTTAATG AAGTACCAAG CTGTGCTTGA 3000 
ATAACGATAT GTTTTCTCAG ATTTTCTGTT GTACAGTTTA ATTTAGCAGT CCATATCACA 3060 
TTGCAAAAGT AGCAATGACC TCATAAAATA CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120
CATTAATTTT ATCTCAGTCT TGAAGCCAAT TCAGTAGGTG CATTGGAATC AAGCCTGGCT 3180
ACCTGCATGC TGTTCCTTTT CTTTTCTTCT TTTAGCCATT TTGCTAAGAG ACACAGTCTT 3240 
CTCATCACTT CGTTTCTCCT ATTTTGTTTT ACTAGTTTTA AGATCAGAGT TCACTTTCTT 3300 
TGGACTCTGC CTATATTTTC TTACCTGAAC TTTTGCAAGT TTTCAGGTAA ACCTCAGCTC 3360 
AGGACTGCTA TTTAGCTCCT CTTAAGAAGA TTAAAAGAGA AAAAAAAAGG CCCTTTTAAA 3420 
AATAGTATAC ACTTATTTTA AGTGAAAAGC AGAGAATTTT ATTTATAGCT AATTTTAGCT 3480
ATCTGTAACC AAGATGGATG CAAAGAGGCT AGTGCCTCAG AGAGAACTGT ACGGGGTTTG 3540 
TGACTGGAAA AAGTTACGTT CCCATTCTAA TTAATGCCCT TTCTTATTTA AAAACAAAAC 3600 
CAAATGATAT CTAAGTAGTT CTCAGCAATA ATAATAATGA CGATAATACT TCTTTTCCAC 3660 
ATCTCATTGT CACTGACATT TAATGGTACT GTATATTACT TAATTTATTG AAGATTATTA 3720 
TTTATGTCTT ATTAGGACAC TATGGTTATA AACTGTGTTT AAGCCTACAA TCATTGATTT 3780
TTTTTTGTTA TGTCACAATC AGTATATTTT CTTTGGGGTT ACCTCTCTGA ATATTATGTA 3840 
AACAATCCAA AGAAATGATT GTATTAAGAT TTGTGAATAA ATTTTTAGAA ATCTGATTGG 3900 
CATATTGAGA TATTTAAGGT TGAATGTTTG TCCTTAGGAT AGGCCTATGT GCTAGCCCAC 3960 
AAAGAATATT GTCTCATTAG CCTGAATGTG CCATAAGACT GACCTTTTAA AATGTTTTGA 4020 
GGGATCTGTG GATGCTTCGT TAATTTGTTC AGCCACAATT TATTGAGAAA ATATTCTGTG 4080
TCAAGCACTG TGGGTTTTAA TATTTTTAAA TCAAACGCTG ATTACAGATA ATAGTATTTA 4140 
TATAAATAAT TGAAAAAAAT TTTCTTTTGG GAAGAGGGAG AAAATGAAAT AAATATCATT 4200 
AAAGATAACT CAGGAGAATC TTCTTTACAA TTTTACGTTT AGAATGTTTA AGGTTAAGAA 4260 
AGAAATAGTC AATATGCTTG TATAAAACAC TGTTCACTGT TTTTTTTAAA AAAAAAACTT 4320 
GATTTGTTAT TAACATTGAT CTGCTGACAA AACCTGGGAA TTTGGGTTGT GTATGCGAAT 4380
GTTTCAGTGC CTCAGACAAA TGTGTATTTA ACTTATGTAA AAGATAAGTC TGGAAATAAA 4440 
TGTCTGTTTA TTTTTGTACT ATTTA



Seq ID NO: 32 Protein sequence
Protein Accession #: NP_000954

1          11         21         31         41         51
|          |          |          |          |          |
MLARALLLCA VLALSHTANP CCSHPCQNRG VCMSVGFDQY KCDCTRTGFY GENCSTPEFL 60
TRIKLFLKPT PNTVHYILTH FKGFWNVVNN IPFLRNAIMS YVLTSRSHLI DSPPTYNADY 120
GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NEIVEKLLLR RKFIPDPQGS 180
NMMFAFFAQH FTHQFFKTDH KRGPAFTNGL GHGVDLNHIY GETLARQRKL RLFKDGKMKY 240
QIIDGEMYPP TVKDTQAEMI YPPQVPEHLR FAVGQEVFGL VPGLMMYATI WLREHNRVCD 300
VLKQEHPEWG DEQLFQTSRL ILIGETIKIV IEDYVQHLSG YHFKLKFDPE LLFNKQFQYQ 360
NRIAAEFNTL YHWHPLLPDT FQIHDQKYNY QQFIYNNSIL LEHGITQFVE SFTRQIAGRV 420
AGGRNVPPAV QKVSQASIDQ SRQMKYQSFN EYRKRFMLKP YESFEELTGE KEMSAELEAL 480
YGDIDAVELY PALLVEKPRP DAIFGETMVE VGAPFSLKGL MGNVICSPAY WKPSTFGGEV 540
GFQIINTASI QSLICNNVKG CPFTSFSVPD PELIKTVTIN ASSSRSGLDD INPTVLLKER 600
STEL



Seq ID NO: 33 DNA sequence
Nucleic Acid Accession #: NM_001508.1
Coding sequence: 1..1361

1          11         21         31         41         51
|          |          |          |          |          |
ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120
TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180
AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240
TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360
GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420
TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480
GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540
GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600
CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660
CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720
ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780
ACGGGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840
ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900
ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960
GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020
CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080
[missing]  CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140
ACCACCGACA [missing]  TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200
TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260
TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA

Seq ID NO: 34 Protein sequence
Protein Accession #: NP_001499.1

1          11         21         31         41         51
|          |          |          |          |          |
MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSV TIRVTQVLQK 60
KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF 240
MCWNMMQVLM KSQKGSLAGG TGPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ 300
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360
CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV

Seq ID NO: 35 DNA sequence
Nucleic Acid Accession #: NM_006475.1
Coding sequence: 28..2538

1          11         21         31         41         51
|          |          |          |          |          |
AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60
TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540
AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960
TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020



GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 
GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 36 Protein sequence
Protein Accession #: NP_006466.1

1          11         21         31         41         51
|          |          |          |          |          |
MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180
NGMIIPSMYN NLGLFINHYP NGVVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF 240
IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300
MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660
FKEIPVTVYT TKIITKVVEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRLIKEGETI 720
TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ

Seq ID NO: 37 DNA sequence

Nucleic Acid Accession #: NM_002416
Coding sequence: 40..417

1          11         21         31         41         51

|          |          |          |          |          |

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTA TGAAGAAAAG TGGTGTTCTT 60

TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 

AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180 

GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 

AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 

AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360

AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420 

ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 

TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 

ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 

TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660

TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 

TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 

TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900

CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960

ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 

TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 

AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1140 

CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 



AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380
AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440
CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620
GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1680
GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800
CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920
AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980
TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160
CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220
TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280
AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460
AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520
TATCAATAAA TAGACCATTA ATCAG



Seq ID NO: 38 Protein sequence
Protein Accession #: NP_002407

1          11         21         31         41         51
|          |          |          |          |          |
MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120
QKKTT

Seq ID NO: 39 DNA sequence
Nucleic Acid Accession #: NM_006670
Coding sequence: 85..1347

1          11         21         31         41         51
|          |          |          |          |          |
CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60
AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120
GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180
TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240
TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300
CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360
GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420
GGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480
GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540
CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600
AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660
CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720
CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780
CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840
TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900
CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960
AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020
GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080
GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140
CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200
GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260
AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320
AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380
CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440
TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500
TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560
TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620
TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680
ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740
TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800
CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860
AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920
TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980
TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040
ATTCTTAAAA GAA

Seq ID NO: 40 Protein sequence
Protein Accession #: NP_006661



1          11         21         31         41         51
|          |          |          |          |          |
MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA FARRPPLAEL 120
AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPF AFSGSNASVS APSPLVELIL 180
NHIVPPEDER QNRSFEGMVV AALLAGRALQ GLRRLELASN HFLYLPRDVL AQLPSLRHLD 240
LSNNSLVSLT YVSFRNLTHL ESLHLEDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300
HMADMVTWLK ETEVVQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360
IVLALIGAIF LLVLYLNRKG IKKWMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV

Seq ID NO: 41 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..927

1          11         21         31         41         51
|          |          |          |          |          |
ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900
AGTTCTAACT CGGATGTCCT CGAGTGA



Seq ID NO: 42 Protein sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I 1 I 1 I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 

SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180

PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 

SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300



Seq ID NO: 43 DNA sequence

Nucleic Acid Accession #: NM_058173
Coding sequence: 68..340

1 11 21 31 41 51 

|          |          |          |          |          |

AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 

CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 

TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 

TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 

TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300

ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 

TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 

CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 



Seq ID NO: 44 Protein sequence
Protein Accession #: NP_477521

1          11         21         31         41         51
|          |          |          |          |          |
MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP

Seq ID NO: 45 DNA sequence
Nucleic Acid Accession #: NM_000095.1
Coding sequence: 26..2299

1 11 21 31 41 51 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 



Seq ID NO: 46 Protein sequence 
Protein Accession #: NP_000086.1

1 11 21 31 41 51

| | | | | |

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60

FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180

INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 

EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480

RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540

PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600

FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660

ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG 720
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: NM_001565.1 

Coding sequence: 67..363

1 11 21 31 41 51

| | | | | |

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 

CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 

TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 

TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 

ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 

ACTTCATGGA GTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 

TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 



Seq ID NO: 48 Protein sequence 
Protein Accession #: NP_001556.1 

1 11 21 31 41 51 

| | | | | |

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: XM_057014 

Coding sequence: 143.. 874 




1 11 21 31 41 51

| | | | | |

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCOG 120
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260
CAACCTTAAA AAAAAAAAAA AAAA
1260 



Seq ID NO: 50 Protein sequence
Protein Accession #: XP_057014

1 11 21 31 41 51

| | | | | |

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240
LPK

Seq ID NO: 51 DNA sequence 
Nucleic Acid Accession #: NM_020974
Coding sequence: 81..3080

1 11 21 31 41 51

| | | | | |

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 

CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 

GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 

GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 

GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 

AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 

ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 

ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080

GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 

GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 

CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680

GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 

AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980

CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040

GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 

GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280

GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 

GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640
TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240
GAACTTGGTT TTTCTTTCCC AGCATOGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480
COGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720
AGCACTTCTG GAGACAT



Seq ID NO: 52 Protein sequence
Protein Accession #: NP_066025

1 11 21 31 41 51

| | | | | |

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK



Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NM_014211

Coding sequence: 157..1479



1 11 21 31 41 51

| | | | | |

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120
CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 180
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300
AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420
CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720
GAGTTCACCT GGCTGAGAGG GAACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780
CAGTACACCA TAGAGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840
TACACTAGAT TGGTCTTACA GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGGAA 900
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACGACCG TGTTATCAAT GACCACACTG 1020
ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1140
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320
GACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1380
CAAAACCCCA GTAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTGAG TCAATGTTAA ATTTCTTGCA 1500
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620
ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1800
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 1860
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920
GAGTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATGAAA TATGGGCTTA TGTCAATTCA 1980




TTGGAAGTCA ATGCACTAAC TCAATACCAA GATGAGTTTT TAAATAATGA ATATTATTTA 2040 

ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100 

AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 

GGATTCCCCA TACTGGAAGG ACTCTGAGGC TTTATTCCCC CACTATGCAT ATCTTATCAT 2220 

TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 

GGAAATGGAA GATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 

AGGCTTGCAG AATTGAGTCC ATTTTCTAGC TGCCTTTATT CACATAGTGA TGGGGTACTA 2400 

AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 

GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 

TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 

GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 

TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 

AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 

CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 

AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 

TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940

ACCAAGGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AGAGCCACAG GTTCTCATTC 3000 

CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 

ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 

TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 

AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 
AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 54 Protein sequence 
Protein Accession #: NP_055026

1 11 21 31 41 51

| | | | | |

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120
YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 180
WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300
CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420
LFPLIFMLAN VFYWAYYMYF

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: XM_084007

Coding sequence: 138..2405

1 11 21 31 41 51

| | | | | |

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 

GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 

AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCCCAC CATAATCATG 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 

GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 

GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 

CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 

AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 




AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700
TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G



Seq ID NO: 56 Protein sequence
Protein Accession #: XP_084007

1 11 21 31 41 51

| | | | | |

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF



Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM_015419.1 

Coding sequence: 1..8487 

1 11 21 31 41 51 

| | | | | |

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60
CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120
TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180
AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240
TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360
CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420
TTTATCCACC CTCAAGCTTT CAACGGCTTA AOGTCTCTGA GGCTACTCCA TTTGGAAGGA 480
AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540
AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660
TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720
TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780
AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840
GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960
AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020
GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080
GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140
ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 1200
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260
GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320
CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380
CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440
CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560
CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620
AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680
GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740
GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1800
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860
GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920
GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980
CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100
GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160
GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220
AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280
GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400
GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460




TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 
TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 
GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 
ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760
TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 
ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880
GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 
GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 
CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060
TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120
ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 
AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 
GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360
CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 
ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAAOGGGA GAAGGAGATT ACGCCCCAAC 3480 
AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 
GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660
AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720
CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780
CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 
AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960
TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 
ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 
ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 
AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260
GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 
ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 
GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 
CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560
GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 
GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 
CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 
TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860
CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 
CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 
CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160
AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 
CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 
GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 
GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460
TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 
CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 
CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 
CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760
CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 
GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 
ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940
CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000
CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060

GGCGTCTATA AGTGOGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 
CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 
CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG OGCCCCTGCC CAGOGTGCGC 6240 

TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300
GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG OGGGCGCTAT 6360
GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420
CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGAGGGA CGTCAGGTAC 6480 
GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 
70 TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 
TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 
AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780
CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 
GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900
75 TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 
GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 
GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 
GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200

80 GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 
AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 
AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 
CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 
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GGTTCCCTGG ACATCAGGAG TTTGAGGAAG 
CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
AAACCCATCT TCCACGACCC GATCAGCGAG 
AGCCTCAACT GCTCTGCCGC GGGGACCCCG 
5 GGCACCGATC TGCAGAGTGG ACAGCAGCTG 
CTACACATTA GCGGTCTCTC CTCGGTGGAC 
GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
AAGCAGTATC ATAACCTGGT CAGCATCATC 
CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC 
10 GGCCCCCAAA CCCTGGGACG CGTTTCTCTT 
GCCTCGGTGT TTGACAGGGG TACCTATGTA 
GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
CCGGTCATCT ACACCCGGCC CGGGAACACC 
CCCAAAGCTG ACATCACGTG GGAGTTACCG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC
ACACAGAGAG ATGCCGGCTT CTACAAGTGC 
AAAACAACTT ACATCCACGT CTTCTGAAAT 
ACAACAAAGC GGGGTTTGTA AGGGAAGCCA 
GTCACAGTGC ATGGTGGCCT CTGGTGGGTT 
20 GTTGGGAAAA GGAAGCAATG CAGACACGAG 
CTTTTGTGTT TACATCATGC CAGGGGCTTC 
TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
ACATTCATCA AAAATAAGCC ATAGACATGA 
TCACCTAGTT AACCTGCTGC AGTTTTTACA 
25 TCTTTCAGTT ATTTCCTCTG TCACTTCAAA 
CAGAGTGACT GATATATATA TATATATTTT 
TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TATATATTTT TTCCTTTCAA ATCAGACGAT 
ATTAAAATTA ATAAATTATT GGTCTTTACA 
ATATAATTTT AAAAAATTTC TCTCCAACCT
CCTTCTCCAG GAACCCTCCA GTGGGGAAGG 
TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
AACTTTACAG AATTGAATCT AGAGTCTTCC 
CTGGCTTGTC CATCTGGTCT AAGGTGGCTG 
35 CCATGAATAA TACACGACCT GTTATTTCCA 

ATACTGTACA TTTGATAATA AAATAATATT 



AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560

GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAGATCACGG CCATGGCGGG CCACACCATC 7680 

ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

GCTGGGGCCT ACOGCTGCGT GGCCCGCAAT 7860 

CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340

CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820

ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
CTCCCAAAAA AAAAA 
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I I I I 1 I 

MPKRAHWGAL SWIiILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60 

NU3FNSIQAL, SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 120 

45 QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180 

RliSTIRHLYL AENMVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 240 

CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 300 

DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNQT DPPDIDINAT 360 

VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 420 

50 VRAQIIiAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 480 

PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 540 

RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 600 

ALAIPEAHLS WJXPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660 

HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 720 

55 EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADIIiAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 900 

TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 1020 

60 STIGEPGVPG QSHLOGLTDN IHLVKSSLST QDTLLIKKGM KEMSOTLOGG NMLEGDPTHS 1080 

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TFRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 1200 

VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 1260 

PEMKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

65 SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 1380 

NPSRTAQPGR LOTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEEAGSS 1440 

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 1500 

MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE ATPVNNEGTQ HMSGPNELST 1560 

PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA KPILPTATVR 1620 

70 LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 1680 

PSIPSKFTDR RTDQFNGYSK VFGNNNIPEA RNFVGKPPSP RIPHYSNGRL PFFTNKTLSF 1740 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DPGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 1920 

75 QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM EOAKGTPAP 1980 

QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 2040 

HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFliHGNLF 2100 

VFPNGTLYIR NIiAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLWKS VTDKDAGDYL 2220 

80 CVARNKVGDD YWLKVDWM KPAKIEHKEE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 2280 

DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMRVRVK 2340 

WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400 

GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KIKGNPNPIT TVREIAAGGS 2460 

RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 2520 
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RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 
GTDLQSGQQL QRFYHKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 
KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 
ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 
5 PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 
KTTYIHVF 

Seq ID NO: 59 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..5001

1 11 21 31 41 51 

| | | | | |

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60
ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120
CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 
TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 
CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 
GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 
20 ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 
AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 
GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 
TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 
CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 
25 TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 
ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 
CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 
ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 
GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020
CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 
CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 
GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 
TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 
35 TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 
GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 
TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 
GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 
40 CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 
CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 
GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 
TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 
AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 
45 GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980
GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 
TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 
ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 
50 GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 
GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 
TCCTCCAGGC AGCCCATCTC COGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 
GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 
CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 
55 GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 
TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 
ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 
CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 
CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 
60 ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGAOGA GGATGCCCAG 2820 
GGCAGCTACG ACGAOGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 
GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 
ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 
AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 
65 GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 
GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 
TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 
GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 
CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 
70 GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 
CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 
TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 
AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 
GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 
75 GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 
CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 
GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 
CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900 
TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 
80 ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 
ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 
CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 
ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 
GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 
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GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320 
TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 
ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 
GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500
GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 
GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 
AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 
GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 
ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 
AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 
ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 
CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 
GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 
GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 
TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 
GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 
AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 
TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 
CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 
CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 
CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 
TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 
CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640 
AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700 
TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 
TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 
ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880 
TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940 
TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000 
AAAAAAA 

Seq ID NO: 60 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

! I I I I I 

MPGTKLTRTG APADYRVILK TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKVVA 60
SRQYTVRYRE KGELARWDYK QIANRRVLIE NLIPDTVYEF AVRISQGERD GKWSTSVFQR 120 
TPESAPTTAP ENLNVWPVNG KPTWAASWD ALPETEGKVK VCLLDTGLFS VSSPQPSAKS 180 
FQNTFFHTPR LSNHLEQSPS PILETLLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 240 
SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF MLDIGGFSFI 300 
MCYEDPVSSL TGNSLKSVAA SKADVQQNTE DNGKPEKPEP SSPSPRAPAS SQHPSVPASP 360 
QGRNAKDLLL DLKNKILANG GAPRKPQLRA KKAEELDLQS TEITGEEELG SREDSPMSPS 420 
DTQDQKRTLR PPSRHGHSW APGRTAVRAR MPALPRREGV DKPGFSLATQ PRPGAPPSAS 480 
ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDERAVGSIt HPKGAFAQPR PALSPSRQSP 540 
SSVLRDRSSV HPGAKPASPA RRTPHSGAAE EDSSASAPPS RLSPPHGGSS RIiLPTQPHLS 600 
SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHLSSRTQV SEGAEASDGE SHGDGDREDG 660 
GRQAEATAQT LRARPASGHT HLLRHKPFAA NGRSPSRFSI GRGPRLQPSS SPQSTVPSRA 720 
HPRVPSHSDS HPKLSSGIHG DEEDEKPLPA TWNDHVPSS SRQPISRGWE DLRRSPQRGA 780 
SLHRKEPIPE NPKSTGADTH POGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS 840 
PARPPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSRDAGRS PSQPRLSLTQ 900 
AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDDSTEVE AQDVRAPAHA 960
ARAKEAAASL PKHQQVESPT GAGAGGDHRS QRGHAASPAR PSRPGGPQSR ARVPSRAAPG 1020 
KSEPPSKRPL SSKSQQSVSA EDEEEEDAGF FKGGKEDLLS SSVPKWPSSS TPRGGKDADG 1080 
SLAKEEREPA IALAPRGGSL APVKRPLPPP PGSSPRASHV PSRPPPRSAA TVSPVAGTHP 1140 
WPRYTTRAPP GHFSTTPMLS LRQRMMHARF RNPLSRQPAR PSYRQGYNGR PNVEGKVLPG 1200 
SNGKPNGQRI INGPQGTKWV VDLDRGLVLN AEGRYLQDSH GNPLRIKLGG DGRTIVDLEG 1260 
TPVVSPDGLP LFGQGRHGTP LANAQDKPIL SLGGKPLVGL EVIKKTTHPP TTTMQPTTTT 1320
TPLPTTTTPR PTTATTMQPT TTTTPLPTTT PRPTTATTRR TTTRRPTTTV RTTTRTTTTT 1380 
TPKPTTPIPT CPPGTLERHD DDGNLIMSSN GIPECYAEED EFSGLETDTA VPTEEAYVIY 1440 
DEDYEFETSR PPTTTEPSTT ATTPRVIPEE GAISSFPEEE FDLAGRKRFV APYVTYLNKD 1500 
PSAPCSLTDA LDHFQVDSLD EIIPNDLKKS DLPPQHAPRN ITVVAVEGCH SFVIVDWDKA 1560
TPGDLVTGYL VYSASYEDPI RNKFSTQASS VTHLPIENLK PNTRYYFKVQ AQNPHGYGPI 1620 
SPSVSFVTES DNPLLWRPP GGELSGSHSL SNMIPATRTA MDGNM 

Seq ID NO: 61 DNA sequence 

Nucleic Acid Accession #: NM_022743

Coding sequence: 128..1237

1 11 21 31 41 51 

| | | | | |

GTGGATTTTA GAGATACCTC CCCTCCTTCT GCTCAGCTGC CTTGCAGTAA TTAAACTCTT 60 

TCTCTGCTGC AACACCCCTA CTGTTCTCCG TGTATTGGCT TTTCTGGGCA GCAGGAAGGA 120 

AAAGCTGATG CGATGCTCTC AGTGCCGCGT CGCCAAATAC TGTAGTGCTA AGTGTCAGAA 180 

AAAAGCTTGG CCAGACCACA AGCGGGAATG CAAATGCCTT AAAAGCTGCA AACCCAGATA 240 

TCCTCCAGAC TCCGTTCGAC TTCTTGGCAG AGTTGTCTTC AAACTTATGG ATGGAGCACC 300 

TTCAGAATCA GAGAAGCTTT ACTCATTTTA TGATCTGGAG TCAAATATTA ACAAACTGAC 360 

TGAAGATAAG AAAGAGGGCC TCAGGCAACT CGTAATGACA TTTCAACATT TCATGAGAGA 420 

AGAAATACAG GATGCCTCTC AGCTGCCACC TGCCTTTGAC CTTTTTGAAG CCTTTGCAAA 480 

AGTGATCTGC AACTCTTTCA CCATCTGTAA TGCGGAGATG CAGGAAGTTG GTGTTGGCCT 540 

ATATCCCAGT ATCTCTTTGC TCAATCACAG CTGTGACCCC AACTGTTCGA TTGTGTTCAA 600 

TGGGCCCCAC CTCTTACTGC GAGCAGTCCG AGACATCGAG GTGGGAGAGG AGCTCACCAT 660 

CTGCTACCTG GATATGCTGA TGACCAGTGA GGAGCGCCGG AAGCAGCTGA GGGACCAGTA 720 

CTGCTTTGAA TGTGACTGTT TCCGTTGCCA AACCCAGGAC AAGGATGCTG ATATGCTAAC 780 






WO 03/042661 



TGGTGATGAG 
GGCACACTGG 
TGAACGGCTT 
CTGCATCAAC 
ATACAGGATT 
CAAACTGCAG 
TGATATTATG 
TTTAGAAGAA 
CGGCGTGTGT 
GTGAACCTCT 
GGTTTGCAAA 
ATTTGGTTGA 



CAAGTATGGA 
AAGTGGGAGC 
CCCGATATCA 
CTCGGCCTGT 
TTTTTCCCAG 
CTACATCAAG 
AGAGTGACAC 
TGCGACGCCA 
CTTTGTTGAA 
CTTATTGGAA 
CCACAAGAAT 
GGATGCCAAA 



AGGAAGTTCA 
AGGTTCTGGC 
ACATCTACCA 
TGGAGGAAGC 
GAAGCCATCC 
GCATGTTTCC 
ATGGCAGAGA 
ACATCAGAGC 
TGCCTTATTG 
ATTCTGTTCC 
CATTAGTTGT 
AAAAAAAAAA 



AGAATCCCTG 
CATGTGCCAG 
GCTGAAGGTG 
CTTGTTCTAT 
CGTCAGAGGG 
CCAAGCAATG 
ACACAGCCTG 
ATCCTAAGGG 
AGGTCACACA 
GTGTTTGTGT 
AGAGAAGCAC 



AAAAAAATTG 
GCGATCATAA 
CTCGACTGCG 
GGTACTCGGA 
GTTCAAGTGA 
AAGAATCTGA 
ATTGAAGATT 
AACGCAGTCA 
CTCTATGCTT 
AGGTAAATAA 
GATTATAATA 



AAGAACTGAA 
GCAGCAATTC 
CCATGGATGC 
CCATGGAGCC 
TGAAAGTTGG 
GACTGGCTTT 
TGATTCTACT 
GAGGGAAATA 
TGTTAGCTGT 
AGGCAGACAT 
AATTCAAAAC 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO: 62 Protein sequence 
Protein Accession #: NP_073580 



1 
I 

MRCSQCRVAK 
SEKLYSFYDL 
CNSFTICNAE 
LDMLMTSEER 
WKWEQVLAMC 
IFFPGSHPVR 
ECDANIRAS 



11 21 31 41 51 

I I I I I 

YCSAKCQKKA WPDHKRECKC LKSCKPRYPP DSVRLLGRW FKLMDGAPSE 
ESNINKLTED KKEGLRQLVM TFQHFMREEI QDASQLPPAF DLFEAFAKVI
MQEVGVGLYP SISLLNHSCD PNCSIVFNGP HLLLRAVRDI EVGEELTICY 
RKQLRDQYCF ECDCFRCQTQ DKDADMLTGD EQVWKEVQES LKKIEELKAH 
QAIISSNSER LPDINIYQLK VLDCAMDACI NLGLLEEALF YGTRTMEPYR 
GVQVMKVGKL QLHQGMFPQA MKNLRLAFDI MRVTHGREHS LIEDLILLLE 
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120 
180 
240 

300 
360 



Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: NM_003014.2

Coding sequence: 238.. 648 

1 11 21 31 41 51

| | | | | |
GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 
GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 
AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 
ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 
AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 
AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 
GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 
TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 
CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 
AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 
AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 
GCICCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 
AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 
GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 
ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 
GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT
AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 

Seq ID NO: 64 Protein sequence 
Protein Accession #: NP_003005.1
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I I I I I I 

MFLSILVALC LWLHLALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAILAIEQ 
YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP 
ESLACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRLSPDRCKC 
KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTWDV KEIFKSSSPI PRTQVPLITN 
SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQIi SKRSIQWEER LQEQRRTVQD 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 

Seq ID NO: 65 DNA sequence 

Nucleic Acid Accession #: BC010423 

Coding sequence: 248..1780



60 
120 
180 
240 
300 



80 



ii 



21 



CACAGCGTGG GAAGCAGCTC TGGGGGAGCT 
AGCTACGGCT GGQTGTGTAG AACGGGGCCG 
CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT 
TCTGCAGCCG GCTCCCAGGG AGATCTCGGT 
TTCAACCATG CCCCTGTCCC TGGGAGCCGA 
GCTGCTACTG CTGGCATCAT TTACAGGCCG 
CGTGGTAACT GTGGTGCTGG GCCAGGACGC 
CGGCGAGCAA GTGGGGCAAG TGGCATGGGC 
ACTAGCGCTA CTGCACTCCA AATACGGGCT 
GGAGCAGCCG CCGCCCCCAC GCAACCCCCT 
GCAGGCGGAT GAGGGCGAGT ACGAGTGCCG 
GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC 
ACTAGAAGAG GGCCAGGGCC TGACCCTGGC 
CCCCAGCGTG ACCTGGGACA CGGAGGTCAA 
CTCCCGCTCT GCTGCCGTCA CCTCAGAGTT 
GCAGCCACTG ACTTGTGTGG TGTCCCATCC 
CATCCTCCAC GTGTCCTTCC TTGCTGAGGC 
GTGGCACATT GGCAGAGAAG GAGCTATGCT 
CTCATACAAC TGGACACGGC TGGATGGGCC 
CACTTTGGGC TTTCCCCCAC TGACCACTGA 
CAATGAGTTC TCCTCAAGGG ATTCTCAGGT 
CTCTGGGAAG CAGGTGGACC TAGTGTCAGC 
ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT 
GGCCCAGCAG ATGACCCAGA AATATGAGGA 
CCGGAGGCTG CATTCCCATC ACACGGACCC 
GAGAGCCGAG GGCCACCCTG ATAGTCTCAA 
AGAGCCCGAG GGCCGCAGTT ACTCCACGCT 
TGAACTGCTG TCTCCAGGCT CTGGGCGGGC 
CAAACAGGCC ATGAACCATT TTGTTCAGGA 
CAATGGCATC TACATCAATG GGCGGGGACA 
CTAGGCCTGG CTCCTTCTGT TGACATGGGA 
ACACCCCCAT TTCTTGCX3GA AGATGCTCCC 
AACCCTTCTG TTCATCGGGA GGGCTCCACC 
CACTGTGTGT GTGCATGTGT GCCTGTGTGA 
TGACTGTCCG TGGAGGGGTG ACTGTGTCCG 
AAGTGAACTG TGGTGTATGT GCCACGGGAT 
GTTTGGCGTG TGTGTCATGT GGCTGTGTGT 
CAGACCCCAG AGCAGTATTA ATGATGCAGA 
CAGACCCAGG TGTGCGGGCA TAGCTGGAGC
TCTCCTACCA CTTCGGAGCC ATGGGGGCAA 
GAGGCTTGAA CTGTTACAGA AGCCCTCTGC 
ACATATTTTC TGTAAATATA CATGCGCCGG 
ACTTTTAATT TTTTTCTTTT TTTTTTCTTG 
TTTTATTTTT ATTTTTTTTT AGAGTTTGAG 
CTGTAAAAAA ACCAAAACCC AAAAAAAAAA 

Seq ID NO: 66 Protein sequence 
Protein Accession #: AAH10423 



31 
I 

CGGAGCTCCC 
GGGCTGGGGC 
CCTGCCTTCT 
GGAACTTCAG 
GATGTGGGGG 
GTGCCCCGOG 
AAAACTGCCC 
TCGGGTGGAC 
TCATGTGAGC 
GGACGGCTCA 
GGTCAGCACC 
TCCCCTGCCC 
AGCCTCCTGC 
AGGCACAACG 
CCACTTGGTG 
TGGCCTGCTC 
CTCTGTGAGG 
CAAGTGCCTG 
TCTGCCCAGT 
GCACAGCGGC 
CACTGTGGAT 
CTCGGTGGTG 
GGTGCTCATG 
GGAGCTGACC 
CAGGAGCCAG 
GGACAACAGT 
GACCACGGTG 
CGAGGAGGAG 
GAATGGGACC 
CCTGGTCTGA 
GATTTTAGCT 
CATCCCACTG 
AATTGAGTCT 
GTGTTGACTG 
TGGTGTGTAT 
TTGAGTGGTT 
GACCTCTGCC 
GGTTGGAGGA 
TGGAATCTGC 
GTGTGAAGCA 
CCTCTGGTGG 
GAGCTTCTTG 
CCCTTTCCAT 
TCCAGCCTGG 
AAAAAAAAAA 



41 



51 



GATCACGGCT 
TGGGTCCCCT 
GGGTCAGTTC 
AAACGCTGGG 
CCTGAGGCCT 
GGTGAGCTGG 
TGCTTCTACC 
GCGGGCGAAG 
CCGGCTTACG 
GTGCTCCTGC 
TTCCCCGCCG 
TCACTGAATC 
ACAGCTGAGG 
TCCAGCCGTT 
CCTAGCCGCA 
CAGGACCAAA 
GGCCTTGAAG 
AGTGAAGGGC 
GGGGTACGAG 
ATCTACGTCT 
GTTCTTGACC 
GTGGTGGGTG 
TCCCGATACC 
CTGACCAGGG 
CCGGAGGAGA 
AGCTGCTCTG 
AGGGAGATAG 
GAAGATCAGG 
CTACGGGCCA 
CCCAGGCCTG 
CATCTTGGGG 
ACTGCTTGAC 
CTCCCACCAT 
ACTGTGTGTG 
TATGCTGTCA 
GCGTGGGCAA 
TGAAAAAGCA 
GAGAGGTGGA 
CTCCGGTGTG 
GCCAGTCCCT 
CCTCTGGGCC 
CAGGAATACT 
TAGTTGTATT 
ACGATATAGC 



TCTTGGGGGT 
AGTGGAGACC 
CTTATTCAAG 
CAGTCTGCCT 
GGCTGCTGCT 
AGACCTCAGA 
GAGGGGACTC 
GCGCCCAGGA 
AGGGCCGCGT 
GCAACGCAGT 
GCAGCTTCCA 
CTGGTCCAGC 
GCAGCCCAGC 
CCTTCAAGCA 
GCATGAATGG 
GGATCACCCA 
ACCAAAATCT 
AGCCCCCTCC 
TGGATGGGGA 
GCCATGTCAG 
CCCAGGAAGA 
TGATCGCCGC 
ATCGGCGCAA 
AGAACTCCAT 
GTGTAGGGCT 
TGATGAGTGA 
AAACACAGAC 
ATGAAGGCAT 
AGCCCACGGG 
CCTCCCTTCC 
GCCTCCTTAA 
CTTTACCTCC 
GCATGCAGGT 
TGTGGAGGGG 
TATCAGAGTC 
CACTGTCAGG 
GGTATTTTCT 
GACTGTGGCT 
AGGGAACCTG 
GGGTCAGCCA 
TGCTGCATGT 
GCTCCGAATC 
TTTTATTTAT 
CAGACCCTGT 



1 
I 

MPLSLGAEMW 
QVGQVAWARV 
DEGEYECRVS 
VTWDTEVKGT 
HVSFLAEASV 
GFPPLTTEHS 
FCLLVWWL 



11 
I 

GPEAWLLLLL 
DAGEGAQELA 
TFPAGSFQAR 
TSSRSFKHSR 
RGLEDQNLWH 
GIYVCHVSNE 
MSRYHRRKAQ 



AMNHFVQENG TLRAKPTGNG 



21 
I 

LLiASFTGRCP 
LLHSKYGLHV 
LRLRVLVPPL, 
SAAVTSEFHL 
IGREGAMLKC 
FSSRDSQVTV 
QMTQKYEEEL 
EGRSYSTLTT 
IYINGRGHLV 



31 
I 

AGELETSDW 
SPAYEGRVEQ 
PSLNPGPALE 
VPSRSMNGQP 
LSEGQPPPSY 
DVLDPQEDSG 
TLTRENSIRR 
VREIETQTEL 



41 
I 

TWLGQDAKL 
PPPPRNPLDG 
EGQGLTLAAS 
LTCWSHPGL 
NWTRUX3PLP 
KQVDLVSASV 
LHSHHTDPRS 
LSPGSGRAEE 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



51 

I 

PCFYRGDSGE 
SVLLRNAVQA 
CTAEGSPAPS 
LQDQRITHIL 
SGVRVDGDTL 
WVGVIAALL 
QPEESVGLRA 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM_001203

Coding sequence: 274..1782

1 11 21 31 41 51 

I I I I I I 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 




60 
120 
180 






PCT/US02/36810 



CATAACCATT TGGCTCTGAG CTATGACAAG 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG 
TTGCGTTGTA AATGCCACCA CCATTGTCCA 
5 GACGGATATT GTTTCACGAT GATAGAAGAG 
GGTTGCCTAG GACTAGAAGG CTCAGATTTT 
AGAAGATCAA TTGAATGCTG CACAGAAAGG 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG 
10 TATAAAAGAC AAGAAACCAG ACCTCGATAC 
ATTCCTCCTG GAGAATCCCT GAGAGACTTA 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC 
15 ACAGTGTTGA TGAGGCATGA AAACATTTTG 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA 
TATCTGAAGT CCACCACCCT AGACGCTAAA 
AGTGGCTTAT GTCATTTACA CACAGAAATC 
CATCGAGATC TGAAAAGTAA AAACATTCTG 
20 GACCTGGGCC TGGCTGTTAA ATTTATTAGT 
ACTCGAGTTG GCACCAAACG CTATATGCCT 
AATCACTTCC AGTCTTACAT CATGGCTGAC 
GTTGCTAGGA GATGTGTATC AGGAGGTATA 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC 
25 CGCCCCTCAT TCCCAAACCG GTGGAGCAGT 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT 
TAAGCATCCA CAGTACAAGC CTTGAACATC 
30 CTTTCAGGGA GCGACCTGGG CAAAGACAGA 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA 



AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 

AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 

AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

GAAGACTCAG TCAACAATAT TTGCAGCACA 420 

GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 

CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 

AACGAATGTA ATAAAGACCT ACACCCTACA 600 

GATGGACCTA TACACCACAG GGCTTTACTT 660 

GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 

GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 

TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 

GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 

GATACAAATG AAGTTGACAT ACCACCTAAC 1380 

CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 

ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 

GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 

ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 

GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800

CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
ACTTGTTCAA GATATGATGC AT 



Seq ID NO: 68 Protein sequence 
Protein Accession #: NP_001194

35 

1 11 21 31 41 51

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCPTMIEED 60 

DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDIiHPTL PPLKNRDFVD 120 

40 GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180 

EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT KVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 

45 EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 



Seq ID NO: 69 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 166..1737

1 11 21 31 41 51

TTGGGGGTTT ATTCTCTTCC CTTCTAACTT GACAGGGTCT TGCTCTGTCA TTCAGGCAAG 60 

55 AGTGCAGTAG tgtgatcact tcttactgcc gcctcaagct TCCAGCCTCA ACTCAAGCAA 120 

TCCTCCCACC TCAGCCACCC AAGTGGCTGG GACTACAGAT TAAGAATGAC CCAAAATAAA 180 

TTAAAGCTTT GTTCCAAAGC CAATGTGTAT ACTGAAGTGC CTGATGGAGG ATGGGGCTGG 240 

GCGGTAGCTG TTTCATTTTT CTTCGTTGAA GTCTTCACCT ACGGCATCAT CAAGACATTT 300 

GGTGTCTTCT TTAATGACTT AATGGACAGT TTTAATGAAT CCAATAGCAG GATCTCATGG 360 

60 ATAATCTCAA TCTGTGTGTT TGTCTTAACA TTTTCAGCTC CCCTCGCCAC AGTCCTGAGC 420 

AATCGTTTCG GACACCGTCT GGTAGTGATG TTGGGGGGGC TACTTGTCAG CACCGGGATG 480 

GTGGCCGCCT CCTTCTCACA AGAGGTTTCT CATATGTACG TCGCCATCGG CATCATCTCT 540 

GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA TCCTATCACA ATATTTTGGC 600 

AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC TGTGTTTGCT 660 

TTCGCACCAG CAATCATGGC TCTGAAGGAG CGCATTGGCT GGAGATACAG CCTCCTCTTC 720

GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAGACC CATCTTTATC 780 

AGAGGACCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG 840 

CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG ACTCAGGAGT AGAACTAACT 900 

ACCTCACCTA AAAATGTGCC TACTCACACT AACCTGGAAC TGGAGCCGAA GGCGGACATG 960 

CAGCAGGTCC TGGTGAAGAC CAGCCCCAGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC 1020

TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATGCAT TATTTGGTCT CTTTGCAACA 1080 

CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT GGGCATTGAC 1140 

CAGGACCGCG CTGCTTTTTT ATTATCTACG ATGGCCATTG CAGAAGTTTT CGGAAGGATC 1200 

GGAGCTGGTT TTGTCCTCAA CAGGGAGCCC ATTCGTAAGA TTTACATTGA GCTCATCTGC 1260 

GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG CTACTGAATT CTGGGGTCTA 1320

ATGTCATGCA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGGAC TCACATTCCA 1380 

CTGCTTGCTG AGGATGATGT CGTGGGCATT GAGAAGATGT CTTCTGCAGC TGGGGTCTAC 1440 

ATCTTCATTC AGAGCATAGC AGGACTGGCT GGACCGCCCC TTGCAGGTTT GTTGGTGGAC 1500 

CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGCGCAG CTGGCATGGC CCTGGCTGCT 1560 

GTGTGCCTCG CCCTGGTGAG ACCGTGTAAG ATGGGACTGT GCCAGCATCA TCACTCAGGT 1620

GAAACAAAGG TAGTGAGCCA TCGTGGGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 1680 

GAAATGGATC TTGCAAAAAA TGAGCACAGA GTTCACGTGC AAATGGAGCC GGTATGACAC 1740 

ACTTTCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGGGGTGGG CCCAACGGGG 1800 

ACACAAGGAG GCAGAGGAGC TAACCCCTCT ACTCCACTTT CAAAACTACA TTTTAAAGGG 1860 
AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT TGTTTTGTTT TGTTTTAAGC 1920

TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT CTTCCATATA 1980

TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 2040 

TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT TCACGTGGGA 2100 

TAAATGATTT TCTTTTTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 2160
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT C 



Seq ID NO: 70 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60

SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLVVMLGGLL VSTGMVAASF SQEVSHMYVA 120

IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 180

YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240

GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300

GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360

IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DVVGIEKMSS 420

AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480

HHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV



Seq ID NO: 71 DNA sequence 
Nucleic Acid Accession #: NM_004694
Coding sequence: 166..1737

1 11 21 31 41 51 

TTGGGGGTTT ATTCTCTTCC CTTCTAACTT GACAGGGTCT TGCTCTGTCA TTCAGGCAAG 60 

AGTGCAGTAG TGTGATCACT TCTTACTGCC GCCTCAAGCT TCCAGCCTCA ACTCAAGCAA 120

TCCTCCCACC TCAGCCACCC AAGTGGCTGG GACTACAGAT TAAGAATGAC CCAAAATAAA 180 

TTAAAGCTTT GTTCCAAAGC CAATGTGTAT ACTGAAGTGC CTGATGGAGG ATGGGGCTGG 240 

GCGGTAGCTG TTTCATTTTT CTTCGTTGAA GTCTTCACCT ACGGCATCAT CAAGACATTT 300 

GGTGTCTTCT TTAATGACTT AATGGACAGT TTTAATGAAT CCAATAGCAG GATCTCATGG 360 

ATAATCTCAA TCTGTGTGTT TGTCTTAACA TTTTCAGCTC CCCTCGCCAC AGTCCTGAGC 420

AATCGTTTCG GACACCGTCT GGTAGTGATG TTGGGGGGGC TACTTGTCAG CACCGGGATG 480 

GTGGCCGCCT CCTTCTCACA AGAGGTTTCT CATATGTACG TCGCCATCGG CATCATCTCT 540 

GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA TCCTATCACA ATATTTTGGC 600 

AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC TGTGTTTGCT 660 

TTCGCACCAG CAATCATGGC TCTGAAGGAG CGCATTGGCT GGAGATACAG CCTCCTCTTC 720

GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAGACC CATCATTATC 780 

AGAGGACCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG 840 

CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG ACTCAGGAGT AGAACTAACT 900 

ACCTCACCTA AAAATGTGCC TACTCACACT AACCTGGAAC TGGAGCCGAA GGCCGACATG 960 

CAGCAGGTCC TGGTGAAGAC CAGCCCCAGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC 1020

TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATGCAT TATTTGGTCT CTTTGCAACA 1080 

CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT GGGCATTGAC 1140 

CAGGACCGCG CTGCTTTTTT ATTATCTACG ATGGCCATTG CAGAAGTTTT CGGAAGGATC 1200 

GGAGCTGGTT TTGTCCTCAA CAGGGAGCCC ATTCGTAAGA TTTACATTGA GCTCATCTGC 1260 

GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG CTACTGAATT CTGGGGTCTA 1320

ATGTCATGCA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGACT CACATTCCAC 1380 

TGCTTGCTGA AGATGATGTC GTGGGCATTG CAGAAGATGT CTTCTGCAGC TGGGGTCTAC 1440 

ATCTTCATTC AGAGCATAGC AGGACTGGCT GGACCGCCCC TTGCAGGTTT GTTGGTGGAC 1500 

CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGCGCAG CTGGCATGGC CCTGGCTGCT 1560 

GTGTGCCTCG CCCTGGTGAG ACCGTGTAAG ATGGGACTGT GCCAGCGTCA TCACTCAGGT 1620

GAAACAAAGG TAGTGAGCCA TCGTGGGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 1680 

GAAATGGATC TTGCAAAAAA TGAGCACAGA GTTCACGTGC AAATGGAGCC GGTATGACAC 1740 

ACTTTCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGGGGTGGG CCCAACGGGG 1800

ACACAAGGAG GCAGAGGAGC TAACCCCTCT ACTCCACTTT CAAAACTACA TTTTAAAGGG 1860 

AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT TGTTTTGTTT TGTTTTAAGC 1920

TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT CTTCCATATA 1980

TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 2040 

TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT TCACGTGGGA 2100 

TAAATGATTT TCTTTTTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 2160
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT C



Seq ID NO: 72 Protein sequence 
Protein Accession #: NP_004685

1 11 21 31 41 51

| | | | | |

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 

SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLVVMLGGLL VSTGMVAASF SQEVSHMYVA 120

IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 180 

YSLLFVGLLQ LNIVIFGALL RPIIIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240

GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 

GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360 

IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GLTFHCLLKM MSWALQKMSS 420 

AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 

RHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV



Seq ID NO: 73 DNA sequence 

Nucleic Acid Accession #: NM_002184.1

Coding sequence: 256..3012


1 11 21 31 41 51 

| | | | | |

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 

CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 

AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 

TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 

CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420

GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 

AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 

TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 

TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 

ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 

CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 

GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 

CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 

CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 

AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 

GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1140 

GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 

GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 

ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 

CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 

CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 

TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 

ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560

GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 

GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 

ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 

GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 

CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 

GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 

ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 

GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040

ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 

GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 

CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 

TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 

ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 

AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 

TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 

ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 

GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 

CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 

AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760

CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2820 

ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 

GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 

GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 

ATGCCTCAGT GAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

Seq ID NO: 74 Protein sequence 
Protein Accession #: NP_002175.1



1 11 21 31 41 51

| | | | | |

MLTLQTWVVQ ALFIFLTTES TGELLDPCGY ISPESPVVQL HSNFTAVCVL KEKCMDYFHV 60
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 180
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240
KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFRIR 300
CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480
YLRGNLAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540
QLPVDVQNGF IRNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600
KDGPEFTFTT PKFAQGEIEA IVVPVCLAFL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSVVEIEAND KKPFPEDLKS LDLFKKEKIN 720
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TVVHSGYRHQ VPSVQVFSRS 780
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 840
NEEDFVRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900
MPKSYLPQTV RQGGYMPQ



Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM_022131 

Coding sequence: 11..2878

1 11 21 31 41 51

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 180 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960
GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1140 
TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 
CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 
TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT GATATCAAGA TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160
CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCGACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760
TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACTAGTG 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT AGAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360
TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660
CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960
AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260
ACGAAGCTCT GAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA A 

Seq ID NO: 76 Protein sequence
Protein Accession #: NP_071414

1 11 21 31 41 51 

| | | | | |

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG VITENNDTVI 60
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAVVLNKT SGEGRLRAKS PIDCELQKEY 120
TFIIQAYDCG AGPHETAWKK SHKAVVHIQV KDVNEFAPTF KEPAYKAVVT EGKIYDSILQ 180
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360

RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 

CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPVVTLYM DGATYEPYLV 480

TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 

CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 

LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 660

DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720

HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 

NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSVVP SIATVVIIIS 840

VCMLVFVVAM GVYRVRIAEQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY

Seq ID NO: 77 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 482..3007

1 11 21 31 41 51 

| | | | | |

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60

TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 

TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180 

ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 

CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 

GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 

ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 

CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 

AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 

TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 

CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 

TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 

TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 

CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 

CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 

CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 

TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 

TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 

ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 

AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 

ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 

TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 

AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 

ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860

ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 

GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 

AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 

TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 

GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 

GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 

GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700

GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 

AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 

TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 

GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 

ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 

AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 

Seq ID NO: 78 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51 

| | | | | |

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 
HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 180 
RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240
IIGDVVCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300
TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 360 
PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420

NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 

GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 

TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 

LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660

YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 

LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780

QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 

Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NM_016640.2

Coding sequence: 39..1358

1 11 21 31 41 51 

| | | | | |

GCTTAAGTTG ACCTCTGGGT CCGGAATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60 

GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 

CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 180 

TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240 

CGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 

TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCGCTGAA TGCCGACCGC TGGTACCAGT 360 

ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 

AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 

ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540

AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 

TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 

TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 

ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 

CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 

CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 

CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 

GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 

CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 

AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1140 

CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 

CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 

ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 

GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 

TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAAAAA AAAAAAAAAA AA 

Seq ID NO: 80 Protein sequence 
Protein Accession #: NP_057724.1

1 11 21 31 41 51 

| | | | | |

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 
PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180 
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EVVFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360
ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 
IVHFLLNRPK EEKSQLLEN 

Seq ID NO: 81 DNA sequence

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..2070 

1 11 21 31 41 51 

| | | | | |

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 

CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120

CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 

CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240

GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 

GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 

CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 

CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 

GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 

GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 

GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620
GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800
ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040
AAACGGCGCC TGCATCGCTC AGTGCTTTGA

Seq ID NO: 82 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

| | | | | |

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 

QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 

GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180

GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 

MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 

AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 

LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 

GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 

SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 

EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600

TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL*

Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #: NM_005264.1

Coding sequence: 557..1954

1 11 21 31 41 51

| | | | | |

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 

ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 120 

CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 180 

TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 

CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 

TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 

ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 

CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 

CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 

GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 

GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 

GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720

CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 

CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840

GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 

GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 

CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 

GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 

CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1140 

CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200 

CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 

TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 

CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 

CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 

AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 

CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 

TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 

GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 

CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 

TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 

TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 

AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 

GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 

ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 2040 

GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100 

AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 

AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 

AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 

AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 

TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 

CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 

ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 
1 11 21 31 41 51

| | | | | |

MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120

YEPVNSRLSD IFRVVPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 180

SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240

PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300

YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360

QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL VVTALSTLLS LTETS



Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: XM_027172.1 

Coding sequence: 143.. 1405 

1 11 21 31 41 51 

I I I I I I 

GGTGGAGACA CCGCCTCAGG GCTCGGTGCA CAGTGGACAT TTGGGGAGCG TTGTGGGTGA 60
CCCCCACACA GGCACTGGGA ATGCAGGGGA GAGGGGGCCA AGGGGGAAAG GGGCCAGAGT 120
GTTGGCTTTG GATTCAGGAG GGATGGATTC CAGTCCTAGC TTGCCACTTA TTAGGACTCC 180
TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC GCCCTGGACC TCTTCCTCAC 240
CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 300
CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG ATGACCTTTG ACCCTCAGGA 360
CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG CTGTGTCAGA GGCACCGGAG 420
GAAGTCTTCT GTAACAGATT CCTTCAGCAG CCTGGTGAAC CGCCCCACGC TGGGCCAATT 480
CACTGAAGAG GAAATCCACG CTGAGGTCTG CTATGCAGAG TGCCTGCTGC AGCGAGCAGC 540
CCTGACCTTC CTGCAGGGTT CCTCACACGG AGGGGCAGTC AGGCCCAGAG CCTTGCATGA 600
TCCCTCTCAC GCCTGCAGCT GCCCACCTGG GCCAGGCCGT CAGCATCTTT TCCTCCTGCA 660
GGACGAGAAC ATGGTGAGCT TCATCAAAGG CGGCATCAAA GTTCGAAACA GCTACCAGAC 720
CTACAAGGAG CTGGACAGCC TTGTTCAGTC CTCACAATAC TGCAAGGGTG AGAACCACCC 780
GCACTTTGAA GGAGGAGTGA AGCTTGGTGT AGGGGCCTTC AACCTGACAC TGTCCATGCT 840
TCCTACTAGG ATCCTGAGGC TGTTGGAGTT TGTGGGGTTT TCAGGAAACA AGGACTATGG 900
GCTGCTGCAG CTGGAGGAGG GAGCGTCAGG GCACAGCTTC CGCTCTGTGC TCTGTGTCAT 960
GCTCCTGCTG TGCTACCACA CCTTCCTCAC CTTCGTGCTC GGTACTGGGA ACGTCAACAT 1020
CGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG TACCCTAAGG GTGCCATCTT 1080
CCTGTTCTTT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC ATTGATGCAG TGAGTGATGG 1140
GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGTCTCCCAG ACCAGCAGGA AGTCAGGCAC 1200
ATGTGACATA CTCAGGGACA GGATAGACTG GGGGCGGGGG GGGGGCCAAG AGAGAACCAA 1260
CCAGAGAGCA GGGGCAGGAG AGGCCCTTCT GGCAGAGCAG CCTGGGAAGA CAAGGGAGGA 1320
GGAGGCATTT GTGGTGCCTG GGATTTTGAC TGGGAGATAT AGGACTGCAG CATTGCAGTG 1380
GAGGGAGGTG GAGGGAGGTG CTTGAGGGAG GCAGAGGTTA GGAAAGCCCA TCTGTTTAGG 1440
GCATGACGAT TAGGCTGGAG TCTGGTACCT CCCCTCCATT ATAGCTCTCT CCTGCTCTTT 1500
CATTTTGTTA CTAAAAACCA GAGTCCTAGG CGGGGGCTGT ATTTGAGCCC AACGTCATGT 1560
AAGACTTAGG AGGTAAAACC AGGACTGCAG GCCAGATCTC CTGGCTCCTG GGGCCCCACC 1620
TGAGCCTAGC ACAGGGCTGG ACCACTATGC CCTGGAGGAG TCCCGGTCTG CTGTGGTGTT 1680
GGGAGGTTCG GAGGATGCAG AGGGGTTGGG GCTGGGTGGG CACCCGTCAG GCTGACCAGA 1740
AGGTGCCTGC AGGCCATCCG GCGTTTCGAG GAGTGCTGTG AGGCCCAGCA GCACTGGAAG 1800
CAGTTCCACC ACATGTGCTA CTGGGAGCTG ATGTGGTGCT TCACCTACAA GGGCCAGTGG 1860
AAGATGTCCT ACTTCTACGC CGACCTGCTC AGCAAGGAGA ACTGCTGGTC CAAGGTGGGC 1920
TGATGCCACG TGTTAGGGGC ATTGGGTGAC CAGGGCTGAC TGTGTGCCTC CAGACCACGG 1980
GCCAAATCCC TAACTGAACA CAGATGTCTC AGCTGGAATC TAAACATAAC CTTAAATTCT 2040
AATAGGACTC AGGCTTGGAA GGAACTAAAG ACCACAAGAG AAACTTCTGA ACCACAATGT 2100
GTCACAAAGA GATTTCTAGT CACAAAGGAC AGAAACATGG CTCCCTCTGT CCAGTAGAAC 2160
TGTTTCTGTG GTAGAAATGT TCTACTTATG CACTGCCCAA ATATGGTAGT CACCAGCCAC 2220
ATGTGCACAA TGAGCACATG AAATGTGCCT AGTGCAACTG GGAAACTGAT TGTTTTCAGT 2280
TTTATTTAAT TTTAATTAAC TAAATGTTAA ATTTAAATAG CCATGTAGGG CTTGTGGCCA 2340
CTATATTGGA CTATGCAGGT CCAAAACACA AAAGGCTCAT ATAACTGAAC ATTCTTGGCA 2400
CATCCGACTT CAGGTAGGGC TGGATCCAGG AATTCAAATG ATGTCGTCTG GCTTGGTCTT 2460
TCCATTTGTG GCTGTGCTCT CTCCTATGAC ATCTTTGTTT CTGCTGCATC TTTGCAGGAA 2520
GGTTCTCTCC ATGTGACAGG CAAGGTGGCC ACGGGCTGCT TCTACTCATA TCCTCCCCTT 2580
GGTTTCAACC AGAGTCCCAC AGTTTGAGCC TCACTGGTCT GACTTGTCAC CTGCCAATGC 2640
CTGGAACAGA GGGGTGGGAA GATTCCTCAT GGGCTGAGAG TAGGAGAGGG GTGGTTCCAC 2700
AGAAGAAAAT GATGCACCCA GAAATAGTGG AGGGATTAAC AAGATGCCAT ACAGGCAAAA 2760
CAAAGCCAAA CAGATGCCCG CCTACCAAAG ATGAAATTTA TCATGGTAAG TATTGAAATA 2820
AGTGTTAGCT TGTACCATCA TAGTAATGAT AGTGCAGAAA TTGGAACCAA GAGTCTTACA 2880
ACCACCTAGC TCAGCAAACG TCTAATCTGT TATTTGTAAA TACACAGGAC ATGTGTCTTC 2940
ATGGCTTCAT CCCCTGCCCA TAGCAGACAT TGCTAATCAA TCCTCTGCCA TGAGCCTAGC 3000
TGTGACCTTA GATACCTTCC CTGCAAAGCT CCAGGCAGCC GTTAGAACTG ACCCACATTG 3060
GCACCTGTTA GGAGACCTGT CATCTGCATC TGGTCCCTCC CCCTCCTCTG CATGGAGGTC 3120
TCTTCCGTAG CCCTGCCTGG GGACCAAGCT GAAGGGGAAT TCTCTGGGTC TGGGGCAGGT 3180
GGGGCTGGGG AAGGGAGCAG GCTTCTACTG AGCTCTCAAC ATGTCTGGCA CCTGTCATGT 3240
CCTCCACACC ACACCCAGTG AGGAGAAGTT ATTATCTCCA TTGGGCACAT GAGGGCCACT 3300
GATGCTAAGC GAGGACTGTG ACTTTCCCAC CCACATGGCT GTTGGTGACA GACACCGGAT 3360
TTGAGGACAG TCCTGTGTGA CTCAGAGGCC TCTGCTTCTC TCTTGCTCCT AGAGTCCTCC 3420
TGAGGAGTCG GGGCTTGCCC TGAGCCCACC CTGCTGTTGA AGGTGCTTCC TCAGGCCCAG 3480
CTCCCATGGC CCCCACACCC CCTCCTCATC ACCTCCTACT CCCAAAAAGG ACAAAGCCTC 3540
AGGGAACCTT TTTTCTTTTT TTAGAGACAG GGTCTTGCTA TGTTGGTCAG GCTAGTCTTG 3600
AACTCCTGGG CCCAAGCAAT CTTCCCACCT CTGCCTCCTA AAGTGCTGGG ATTCTGGGCC 3660
TTAGGGAACC TTTTTGAACT GAAAGTGACT CTCGAAGCCT TCCTGTAGAG TGAGGTGGGT 3720
GGCTGGGGCT AACCATATGG GGAAGGAGAG ACCTGGTGGG GGCACACAGC TGCTATATAG 3780
AGGAACAGAA GGTGGGCCAG GCCTCCCAGC TGTCAGGAAA TCTGGCTCAG TCCCCAGACC 3840
GCTGTGTGGC TTTGGGGTGG CCAGTCCCTC TCTCTCTGCA ACACTCTCCT CATGAGGACT 3900
TCTGTGAAAA ATGGGGGTTG TAACCCTCTG AGTTCTGGGG CCCCACCCAT TTCTAATCTC 3960
CAGAACTTGG CCACCACAGA CTCCACCAAC TTCTAGTCCT GGGGCCTGGG CCTCTGGCCA 4020 
TTGCCATAGG CACCACCTGC TCTGTGCAGG CAGCGCCCCC CTCTGCCAGG ATCCTCCGAG 4080 
GTCAGCTGCT GGGTCTGACC CGCAGACCCT GGCTGAGCGA CGGATGAACG GAGTATGCAG 4140 
ACACAGGCTT GCCTGTCAGC AGATGGGGGA CCCCTGCCAG AGTCAGCAGC GGCCCCCATA 4200 
AGCCTGCCAC GCTTGCATTT ATTTAGTACA GATGTAATGA CAAAGGCCTA AAGCAAACTC 4260 
CATTTGTGGG TAATTAACAT TGTCGCCCCC CCAGAAAGAG CAGTCCTCCG CATGATGATT 4320 
AAAGGCCAGG TTCCGAGGCC TAAGTAAACC AACTTATCTA GATCAATTCC CTTACTTCTT 4380 
GTTATCTACT CTGAGAGAAT TCAGCTGCCT TCAGCCAAAT CCTTTCCCGA AGCTTTTGCA 4440 
AAACCTCCGA GCCTTCCAAG GTTTGCTTCT TTCTGTAATT TTTCTCACCA CCCTGACCTA 4500 
TCTCCTGCAG TCAGCCCTGT GGAGGCCTTT GTGTTTCCCC CAGTGCTGGC AGCCTAGAGG 4560 
CTGAGATGGC CAGAAACAAG GTGGTGACAG TGGCGTGCTC AGGGCTTGGG AAACCCAAGG 4620 
AGCTAAAGGC ATGCCCAGGC AACCAAAGAG GACAGGAAGG CTTCTGAGGA GAGACCTCTG 4680 
AGGTGGGTCT TGGAGAGGAA GGACTTAGGG AGGCAGAGTG GAGGAAGTGA GAGGACACCC 4740 
CAAGCCAAGA GGGCGGCAGG ACCAAAGGCT CAGAAGCCAG GGCGCTGCAG AGGGGCTGTG 4800 
TGCCACAGGG TGAAGAGTTT GTGTGGCAGA AGGGCAGGGG GCTTGCATCA GGGGTGACAG 4860 
CTGCTCTTTT GTCCCAGCAT AGCCCCTGTA CATCCCTGGA GAGCTGGGGC GTCCACAACT 4920 
CTAAGTCACA GCCCCCATCC TAACCCTGGT GGTGCAGTGA GGGTGAGCTG TCTGTGGGCA 4980 
GGAGGGAAGA CTCTTGGAGA TGAGCCTGGT GAAGGGATAA TGGCATCCCG GGCCGAGGAG 5040 
CAGCACAGGC AGAGGCCTGG GGAGAGTTTA AGGAGTGTAG GGGAGGAAAT GGCAGAAGAT 5100 
GAGCCAGAAA AAGAAAGGTT AGGGCAGGTC CTGGAGGACA TGAGTGGCTG TTTGGGCTTT 5160 
ATCCAGCAGT GGGGGAGCCT TGGCAGGCTT GTGGCTTAGA TAGGTGCTTT AGAAAGCCCA 5220 
CCAGCAGTTG CTGGGCCACC CCGCTGGCTG GGTCCTGTTC TAAGGCAGGA AATACAAGCA 5280 
TGAGCAGGAA AAGACCCCCT CAAGGCTCAC GTCCTAGTGG GGAGACAAGA AACACAGATG 5340 
GGCAATATAA CACGATGTCT GGTTCCAGTA AGTGCAGTGA AGAACAAGCG AGGCTGGATG 5400 
CAGGGAGTGA TGGGAGGGGC TTTGTAAGGG GAGGTCGGGG GAAGCCTGTC TCAGAGGACA 5460 
CCAGAATGGA GCGCAGGAGC AGCACGTGGC AGTCACATGG CAGGCCGTTA GGGCAGAGGG 5520 
AGCTGGGCAG GGCACAGCAG GGCAGGAGTG TGTTTGATGT GTCCTGGGAA CCGCCCTGAG 5580 
GCCGTCGTGT GGCTGGAGTG CTGCAGGTGT CAAGGAAATT GTAGGAGATG TCTCCTGAGT 5640 
GTGATGGAAT ATAACCAGAT TTCCAGAAGG AACTGACATG ATCTGACTTA AAAAGGTCAG 5700 
TGTGCGAAAT GGCTTGCAGG GGACAGGAGT GGGAGCAGGG AGATAGGAGA CAATGTGTAC 5760 
CAGGACAGCA GAAAGACATC CCGGGTAGCC TGGAACAGGG AGACGGTGTG GAGATGGTGG 5820 
CAGTCCGATA ATGAGAGCCG TAGGGCAAGG CCAGCAGGAT CCTAGAGTGA GACGGGAGGT 5880 
AAAGTCACCG GGACTTGGTG TCTCCACGTC AGGGGCAGGG GAAAGGGAGA GGACAAGGGT 5940 
GACCCGGGAG GTTAAAGATG GGACCGGGGC CAGACGCAGT GGCTCATGCC TGTAATCCTA 6000 
GCACTTTGGG AGGCTGAGGC GGGCGGATAG CTTGAGGTCA GGAGTTTGAA ACCAGCCTGG 6060 
CCAACATGGT GAAACCCCGT CTCTACTAAA ATATACAAAA ATTAGCCTGG CGTGGTGGTG 6120 
CATGCCTGCA GTCCCAGCTA TTCAGGAGGC TGAGGCAACA AGAATCGCTT GAACCTGGGA 6180 
GGCGGAGGTT GCAGTGAGCC GAGATCGCGC CATAGCACTC CAGCCTTAGC CTGGGCGACA 6240 
GAGCGAGACC ACATC 



Seq ID NO: 86 Protein sequence 
Protein Accession #: XP_027172.1 

1 11 21 31 41 51 

MDSSPSLPLI RTPESSLHEA LDQCMTALDL FLTNQFSEAL SYLKPRTKES MYHSLTYATI 60 

LEMQAMMTFD PQDILLAGNM MKEAQMLCQR HRRKSSVTDS FSSLVNRPTL GQFTEEEIHA 120 

EVCYAECLLQ RAALTFLQGS SHGGAVRPRA LHDPSHACSC PPGPGRQHLF LLQDENMVSF 180 

IKGGIKVRNS YQTYKELDSL VQSSQYCKGE NHPHFEGGVK LGVGAFNLTL SMLPTRILRL 240 

LEFVGFSGNK DYGLLQLEEG ASGHSFRSVL CVMLLLCYHT FLTFVLGTGN VNIEEAEKLL 300 

KPYLNRYPKG AIFLFFAGRI EVIKGNIDAV SDGGPGRGWG SLGVSQTSRK SGTCDILRDR 360 
IDWGRGGGQE RTNQRAGAGE ALLAEQPGKT REEEAFVVPG ILTGRYRTAA LQWREVEGGA 



Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: AB007921 

Coding sequence: 143.. 1363 

1 11 21 31 41 51 

GGTGGAGACA CCGCCTCAGG GCTCGGTGCA CAGTGGACAT TTGGGGAGCG TTGTGGGTGA 60 

CCCCCACACA GGCACTGGGA ATGCAGGGGA GAGGGGGCCA AGGGGGAAAG GGGCCAGAGT 120 

GTTGGCTTTG GATTCAGGAG GGATGGATTC CAGTCCTAGC TTGCCACTTA TTAGGACTCC 180 

TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC GCCCTGGACC TCTTCCTCAC 240 

CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 300 

CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG ATGACCTTTG ACCCTCAGGA 360 

CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG CTGTGTCAGA GGCACCGGAG 420 

GAAGTCTTCT GTAACAGATT CCTTCAGCAG CCTGGTGAAC CGCCCCACGC TGGGCCAATT 480 

CACTGAAGAG GAAATCCACG CTGAGGTCTG CTATGCAGAG TGCCTGCTGC AGCGAGCAGC 540 

CCTGACCTTC CTGCAGGGTT CCTCACACGG AGGGGCAGTC AGGCCCAGAG CCTTGCATGA 600 

TCCCTCTCAC GCCTGCAGCT GCCCACCTGG GCCAGGCCGT CAGCATCTTT TCCTCCTGCA 660 

GGACGAGAAC ATGGTGAGCT TCATCAAAGG CGGCATCAAA GTTCGAAACA GCTACCAGAC 720 

CTACAAGGAG CTGGACAGCC TTGTTCAGTC CTCACAATAC TGCAAGGGTG AGAACCACCC 780 

GCACTTTGAA GGAGGAGTGA AGCTTGGTGT AGGGGCCTTC AACCTGACAC TGTCCATGCT 840 

TCCTACTAGG ATCCTGAGGC TGTTGGAGTT TGTGGGGTTT TCAGGAAACA AGGACTATGG 900 

GCTGCTGCAG CTGGAGGAGG GAGCGTCAGG GCACAGCTTC CGCTCTGTGC TCTGTGTCAT 960 

GCTCCTGCTG TGCTACCACA CCTTCCTCAC CTTCGTGCTC GGTACTGGGA ACGTCAACAT 1020 

CGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG TACCCTAAGG GTGCCATCTT 1080 

CCTGTTCTTT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC ATTGATGCAG TGAGTGATGG 1140 

GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGTCTCCCAG ACCAGCAGGA AGTCAGGCAC 1200 

ATGTGACATA CTCAGGGACA GGATAGACTG GGGGCGGGGG GGGGGGCCAA GAGAGAACCA 1260 

ACCAGAGAGC AGGGGCAGGA GAGGCCCTTC TGGCAGAGCA GCCTGGGAAG ACAAGGGAGG 1320 

AGGAGGCATT TGTGGTGCCT GGGATTTTGA CTGGGAGATA TAGGACTGCA GCATTGCAGT 1380 

GGAGGGAGGT GGAGGGAGGT GCTTGAGGGA GGCAGAGGTT AGGAAAGCCC ATCTGTTTAG 1440 

GGCATGACGA TTAGGCTGGA GTCTGGTACC TCCCCTCCAT TATAGCTCTC TCCTGCTCTT 1500 
TCATTTTGTT ACTAAAAACC AGAGTCCTAG GCGGGGGCTG TATTTGAGCC CAACGTCATG 
TAAGACTTAG GAGGTAAAAC CAGGACTGGA GGCCAGATCT CCTGGCTCCT GGGGCCCCAC 
CTGAGCCTAG CACAGGGCTG GACCACTATG CCCTGGAGGA GTCCCGGTCT GCTGTGGTGT 
TGGGAGGTTC GGAGGATGCA GAGGGGTTGG GGCTGGGTGG GCACCCGTCA GGCTGACCAG 
AAGGTGCCTG CAGGCCATCC GGCGTTTCGA GGAGTGCTGT GAGGCCCAGC AGCACTGGAA 
GCAGTTCCAC CACATGTGCT ACTGGGAGCT GATGTGGTGC TTCACCTACA AGGGCCAGTG 
GAAGATGTCC TACTTCTACG CCGACCTGCT CAGCAAGGAG AACTGCTGGT CCAAGGTGGG 
CTGATGCCAC GTGTTAGGGG CATTGGGTGA CCAGGGCTGA CTGTGTGCCT CCAGACCACG 
GGCCAAATCC CTAACTGAAC ACAGATGTCT CAGCTGGAAT CTAAACATAA CCTTAAATTC 
TAATAGGACT CAGGCTTGGA AGGAACTAAA GACCACAAGA GAAACTTCTG AACCACAATG 
TGTCACAAAG AGATTTCTAG TCACAAAGGA CAGAAACATG GCTCCCTCTG TCCAGTAGAA 
CTGTTTCTGT GGTAGAAATG TTCTACTTAT GCACTGCCCA AATATGGTAG TCACCAGCCA 
CATGTGCACA CTGAGCACAT GAAATGTGCC TAGTGCAACT GGGAAACTGA TTGTTTTCAG 
TTTTATTTAA TTTTAATTAA CTAAATGTTA AATTTAAATA GCCATGTAGG GCTTGTGGCC 
ACTATATTGG ACTATGCAGG TCCAAAACAC AAAAGGCTCA TATAACTGAA CATTCTTGGC 
ACATCCGACT TCAGGTAGGG CTGGATCCAG GAATTCAAAT GATGTCGTCT GGCTTGGTCT 
TTCCATTTGT GGCTGTGCTC TCTCCTATGA CATCTTTGTT TCTGCTGCAT CTTTGCAGGA 
AGGTTCTCTC CATGTGACAG GCAAGGTGGC CACGGGCTGC TTCTACTCAT ATCCTCCCCT 
TGGTTTCAAC CAGAGTCCCA CAGTTTGAGC CTCACTGGTC TGACTTGTCA CCTGCCAATG 
CCTGGAACAG AGGGGTGGGA AGATTCCTCA TGGGCTGAGA GTAGGAGAGG GGTGGTTCCA 
CAGAAGAAAA TGATGCACCC AGAAATAGTG GAGGGATTAA CAAGATGCCA TACAGGCAAA 
ACAAAGCCAA ACAGATGCCC GCCTACCAAA GATGAAATTT ATCATGGTAA GTATTGAAAT 
AAGTGTTAGC TTGTACCATC ATAGTAATGA TAGTGCAGAA ATTGGAACCA AGAGTCTTAC 
AACCACCTAG CTCAGCAAAC GTCTAATCTG TTATTTGTAA ATACACAGGA CATGTGTCTT 
CATGGCTTCA TCCCCTGCCC ATGGCAGACA TTGCTAATCA ATCCTCTGCC ATGAGCCTAG 
CTGTGACCTT AGATACCTTC CCTGCAAAGC TCCAGGCAGC CGTTAGAACT GACCCACATT 
GGCACCTGTT AGGAGACCTG TCATCTGCAT CTGGTCCCTC CCCCTCCTCT GCATGGAGGT 
CTCTTCCGTA GCCCTGCCTG GGGACCAAGC TGAAGGGGAA TTCTCTGGGT CTGGGGCAGG 
TGGGGCTGGG GAAGGGAGCA GGCTTCTACT GAGCTCTCAA CATGTCTGGC ACCTGTCATG 
TCCTCCACAC CACACCCAGT GAGGAGAAGT TATTATCTCC ATTGGGCACA TGAGGGCCAC 
TGATGCTAAG CGAGGACTGT GACTTTCCCA CCCACATGGC TGTTGGTGAC AGACACCGGA 

TTTGAGGACA GTCCTGTGTG ACTCAGAGGC CTCTGCTTCT CTCTTGCTCC TAGAGTCCTC 
CTGAGGAGTC GGGGCTTGCC CTGAGCCCAC CCTGCTGTTG AAGGTGCTTC CTCAGGCCCA 
GCTCCCATGG CCCCCACACC CCCTCCTCAT CACCTCCTAC TCCCAAAAAG GACAAAGCCT 
CAGGGAACCT TTTTTCTTTT TTTAGAGACG GGGTCTTGCT ATGTTGGTCA GGCTGGTCTT 
GAACTCCTGG GCCCAGGCAA TCTTCCCGCC TCTGCCTCCT AAAGTGCTGG GATTCTGGGC 
CTTAGGGAAC CTTTTTGAAC TGAAAGTGAC GCTCGAAGCC TTCCTGTAGA GTGAGGTGGG 
TGGCTGGGGC TAACCATATG GGGAAGGAGA GACCTGGTGG GGGCACACAG CTGCTATATA 
GAGGAACAGA AGGTGGGCCA GGCCTCCCAG CTGTCAGGAA ATCTGGCTCA GTCCCCAGAC 
CGCTGTGTGG CTTTGGGGTG GCCAGTCCCT CTCTCTCTGC AACACTCTCC TCATGAGGAC 
TTCTGTGAAA AATGGGGGTT GTAACCCTCT GAGTTCTGGG GCCCCACCCA TTTCTAATCT 
CCAGAACTTG GCCACCACAG ACTCCACCAA CTTCTAGTCC TGGGGCCTGG GCCTCTGGCC 
ATTGCCATAG GCACCACCTG CTCTGTGCAG GCAGCGCCCC CCTCTGCCAG GATCCTCCGA 
GGTCAGCTGC TGGGTCTGAC CCGCAGACCC TGGCTGAGCG ACGGATGAAC GGAGTATGCA 
GACACAGGCT TGCTTGTCAG CAGATGGGGG ACCCCTGCCA GAGTCAGCAG CGGCCCCCAT 
AAGCCTGCCA CGCTTGCATT TATTTAGTAC AGATGTAATG ACAAAGGCCT AAAGCAAACT 
CCATTTGTGG GTAATTAACA TTGTCGCCCC CCCAGAAAGA GCAGTCCTCC GCATGATGAT 
TAAAGGCCAG GTTCCGAGGC CTAAGTAAAC CAACTTATCT AGATCAATTC CCTTACTTCT 
TGTTATCTAC TCTGAGAGAA TTCAGCTGCC TTCAGCCAAA TCCTTTCCCG AAGCTTTTGC 
AAAACCTCCG AGCCTTCCAA GGTTTGCTTC TTTCTGTAAT TTTTCTCACC ACCCTGACCT 
ATCTCCTGCA GTCAGCCCTG TGGAGGCCTT TGTGTTTCCC CCAGTGCTGG CAGCCTAGAG 
GCTGAGATGG CCAGAAACAA GGTGGTGACA GTGGCGTGCT CAGGGCTTGG GAAACCCAAG 
GAGCTAAAGG CATGCCCAGG CAACCAAAGA GGACAGGAAG GCTTCTGAGG AGAGACCTCT 
GAGGTGGGTC TTGGAGAGGA AGGACTTAGG GAGGCAGAGT GGAGGAAGTG AGAGGACACC 
CCAAGCCAAG AGGGCGGCAG GACCAAAGGC TCAGAAGCCA GGGGGCTGCA GAGGGGCTGT 
GTGCCACAGG GTGAAGAGTT TGTGTGGCAG AAGGGCAGGG GGCTTGCATC AGGGGTGACA 
GCTGCTCTTT TGTCCCAGCA TAGCCCCTGT ACATCCCTGG AGAGCTGGGG CGTCCACAAC 
TCTAAGTCAC AGCCCCCATC CTAACCCTGG TGGTGCAGTG AGGGTGAGCT GTCTGTGGGC 
AGGAGGGAAG ACTCTTGGAG ATGAGCCTGG TGAAGGGATA ATGGCATCCC GGGCCGAGGA 
GCAGCACAGG CAGAGGCCTG GGGAGAGTTT AAGGAGTGTA GGGGAGGAAA TGGCAGAAGA 
TGAGCCAGAA AAAGAAAGGT TAGGGCAGGT CCTGGAGGAC ATGAGTGGCT GTTTGGGCTT 
TATCCAGCAG TGGGGGAGCC TTGGCAGGCT TGTGGCTTAG ATAGGTGCTT TAGAAAGCCC 
ACCAGCAGTT GCTGGGCCAC CCCGCTGGCT GGGTCCTGTT CTAAGGCAGG AAATACAAGA 
ATGAGCAGGA AAAGACCCCC TCAAGGCTCA CGTCCTAGTG GGGAGACAAG AAACACAGAT 
GGGCAATATA ACACGATGTC TGGTTCCAGT AAGTGCAGTG AAGAACAAGC GAGGCTGGAT 
GCAGGGAGTG ATGGGAGGGG CTTTGTAAGG GGAGGTCGGG GGAAGCCTGT CTCAGAGGAC 
ACCAGAATGG AGCGCAGGAG CAGCACGTGG CAGTCACATG GCAGGCCGTT AGGGCAGAGG 
GAGCTGGGCA GGGCACAGCA GGGCAGGAGT GTGTTTGATG TGTCCTGGGA ACCGCCCTGA 
GGCCGTCGTG TGGCTGGAGT GCTGCAGGTG TCAAGGAAAT TGTAGGAGAT GTCTCCTGAG 
TGTGATGGAA TATAACCAGA TTTCCAGAAG GAACTGACAT GATCTGACTT AAAAAGGTCA 
GTGTGCGAAA TGGCTTGCAG GGGACAGGAG TGGGAGCAGG GAGATAGGAG ACAATGTGTA 
CCAGGACAGC AGAAAGACAT CCCGGGTAGC CTGGAACAGG GAGACGGTGT GGAGATGGTG 
GCAGTCCGAT AATGAGAGCC GTAGGGCAAG GCCAGCAGGA TCCTAGAGTG AGACGGGAGG 
TAAAGTCACC GGGACTTGGT GTCTCCACGT CAGGGGCAGG GGAAAGGGAG AGGACAAGGG 
TGACCCGGGA GGTTAAAGAT GGGACCGGGG CCAGACGCAG TGGCTCATGC CTGTAATCCT 
AGCACTTTGG GAGGCTGAGG CGGGCGGATG GCTTGAGGTC AGGAGTTTGA AACCGGCCTG 
GCCAACATGG TGAAACCCCG TCTCTACTAA AATATACAAA AATTAGCCTG GCGTGGTGGT 
GCATGCCTGC AGTCCCAGCT ATTCAGGAGG CTGAGGCGAC AAGAATCGCT TGAACCTGGG 
AGGCGGAGGT TGCAGTGAGC CGAGATCGCG CCATAGCACT CCAGCCTTAG CCTGGGCGAC 
AGAGCGAGAC CACATC 
Seq ID NO: 88 Protein sequence 
Protein Accession #: BAA32297.1 

1 11 21 31 41 51 

| | | | | | 

VETPPQGSVH SGHLGSVVGD PHTGTGNAGE RGPRGKGARV LALDSGGMDS SPSLPLIRTP 60 

ESSLHEALDQ CMTALDLFLT NQFSEALSYL KPRTKESMYH SLTYATILEM QAMMTFDPQD 120 

ILLAGNMMKE AQMLCQRHRR KSSVTDSFSS LVNRPTLGQF TEEEIHAEVC YAECLLQRAA 180 

LTFLQGSSHG GAVRPRALHD PSHACSCPPG PGRQHLFLLQ DENMVSFIKG GIKVRNSYQT 240 

YKELDSLVQS SQYCKGENHP HFEGGVKLGV GAFNLTLSML PTRILRLLEF VGFSGNKDYG 300 

LLQLEEGASG HSFRSVLCVM LLLCYHTFLT FVLGTGNVNI EEAEKLLKPY LNRYPKGAIF 360 

LFFAGRIEVI KGNIDAVSDG GPGRGWGSLG VSQTSRKSGT CDILRDRIDW GRGGGPRENQ 420 
PESRGRRGPS GRAAWEDKGG GGICGAWDFD WEI 



Seq ID NO: 89 DNA sequence 
Nucleic Acid Accession #: AF007170 
Coding sequence: 73.. 1725 

1 11 21 31 41 51 

| | | I I I 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 

CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 

AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 

CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 

ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 

GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 

AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 

GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 

GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 

CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 

CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 

GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 

TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 

GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 

ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 

AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 

ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 

GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 

ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 

ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 

TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 

CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 

TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 

GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 

TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 

CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 

CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 

CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 

TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 

GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 

GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 

CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 

AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 

CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 

AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 

AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 

AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 

CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 



Seq ID NO: 90 Protein sequence 
Protein Accession #: AAC39582.1 

1 11 21 31 41 51 

| | | | | | 

KEAASGKSDR RTPESSLHEA LDQCMTALDL FLTNQFSEAL SYLKPRTKES MYHSLTYATI 60 
LEMQAMMTFD PQDILLAGNM MKEAQMLCQR HRRKSSVTDS FSSLVNRPTL GQFTEEEIHA 120 
EVCYAECLLQ RAALTFLQDE NMVSFIKGGI KVRNSYQTYK ELDSLVQSSQ YCKGENHPHF 180 
EGGVKLGVGA FNLTLSMLPT RILRLLEFVG FSGNKDYGLL QLEEGASGHS FRSVLCVMLL 240 
LCYHTFLTFV LGTGNVNIEE AEKLLKPYLN RYPKGAIFLF FAGRIEVIKG NIDAAIRRFE 300 
ECCEAQQHWK QFHHMCYWEL MWCFTYKGQW KMSYFYADLL SKENCWSKAT YIYMKAAYLS 360 
MFGKEDHKPF GDDEVELFRA VPGLKLKIAG KSLPTEKFAI RKSRRYFSSN PISLPVPALE 420 
MMYIWNGYAV IGKQPKLTDG ILEIITKAEE MLEKGPENEY SVDDECLVKL LKGLCLKYLG 480 
RVQEAEENFR SISANEKKIK YDHYLIPNAL LELALLLMEQ DRNEEAIKLL ESAKQNYKNY 540 
SMESRTHFRI QAATLQAKSS LENSSRSMVS SVSL 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 31.. 906 
1 11 21 31 41 51 

| | | | | | 

CGGGTCGACC CACGCGTCCG GGGAGAAAGG ATGGCCGGCC TGGCGGCGCG GTTGGTCCTG 60 

CTAGCTGGGG CAGCGGCGCT GGCGAGCGGC TCCCAGGGCG ACCGTGAGCC GGTGTACCGC 120 

GACTGCGTAC TGCAGTGCGA AGAGCAGAAC TGCTCTGGGG GCGCTCTGAA TCACTTCCGC 180 

TCCCGCCAGC CAATCTACAT GAGTCTAGCA GGCTGGACCT GTCGGGACGA CTGTAAGTAT 240 

GAGTGTATGT GGGTCACCGT TGGGCTCTAC CTCCAGGAAG GTCACAAAGT GCCTCAGTTC 300 

CATGGCAAGT GGCCCTTCTC CCGGTTCCTG TTCTTTCAAG AGCCGGCATC GGCCGTGGCC 360 

TCGTTTCTCA ATGGCCTGGC CAGCCTGGTG ATGCTCTGCC GCTACCGCAC CTTCGTGCCA 420 

GCCTCCTCCC CCATGTACCA CACCTGTGTG GCCTTCGCCT GGGTGTCCCT CAATGCATGG 480 

TTCTGGTCCA CAGTYTTCCA CACCAGGGAC ACTGACCTCA CAGAGAAAAT GGACTACTTC 540 
TGTGCCTCCA CTGTCATCCT ACACTCAATC TACCTGTGCT GCGTCAGCCT CATCCGCTTC 600 
GACTATGGCT ACAACCTGGT GGCCAACGTG GCTATTGGCC TGGTCAACGT GGTGTGGTGG 660 
CTGGCCTGGT GCCTGTGGAA CCAGCGGCGG CTGCCTCACG TGCGCAAGTG CGTGGTGGTG 720 
GTCTTGCTGC TGCAGGGGCT GTCCCTGCTC GAGCTGCTTG ACTTCCCACC GCTCTTCTGG 780 
GTCCTGGATG CCCATGCCAT CTGGCACATC AGCACCATCC CTGTCCACGT CCTCTTTTTC 840 
AGCTTTCTGG AAGATGACAG CCTGTACCTG CTGAAGGAAT CAGAGGACAA GTTCAAGCTG 900 
GACTGAAGAC CTTGGAGCGA GTCTGCCCCA GTGGGGATCC TGCCCCCGCC CTGCTGGCCT 960 

CCCTTCTCCC CTCAACCCTT GAGATGATTT TCTCTTTTCA ACTTCTTGAA CTTGGACATG 1020 

AAGGATGTGG GCCCAGAATC ATGTGGCCAG CCCACCCCCT GTTGGCCCTC ACCAGCCTTG 1080 

GAGTCTGTTC TAGGGAAGGC CTCCCAGCAT CTGGGACTCG AGAGTGGGCA GCCCCTCTAC 1140 

CTCCTGGAGC TGAACTGGGG TGGAACTGAG TGTGCTCTTA GCTCTACCGG GAGGACAGCT 1200 

GCCTGTTTCC TCCCCATCAG CCTCCTCCCC ACATCCCCAG CTGCCTGGCT GGGTCCTGAA 1260 

GCCCTCTGTC TACCTGGGAG ACCAGGGACC ACAGGCCTTA GGGATACAGG GGGTCCCCTT 1320 

CTGTTACCAC CCCCCACCCT CCTCCAGGAC ACCACTAGGT GGTGCTGGAT GCTTGTTCTT 1380 

TGGCCAGCCA AGGTTCACGG CGATTCTCCC CATGGGATCT TGAGGGACCA AGCTGCTGGG 1440 

ATTGGGAAGG AGTTTCACCC TGACCRTTGC CCTAGCCAGG TTCCCAGGAG GCCTCACCAT 1500 

ACTCCCTTTC AGGGCCAGGG CTCCAGCAAG CCCAGGGCAA GGATCCTGTG CTGCTGTCTG 1560 

GTTGAGAGCC TGCCACCGTG TGTCGGGAGT GTGGGCCAGG CTGAGTGCAT AGGTGACAGG 1620 

GCCGTGAGCA TGGGCCTGGG TGTGTGTGAG CTCAGGCACT AGGTGCGCAG TGTGGAGACG 1680 

GGTGTTGTCG GGGAAGAGGT GTGGCTTCAA AGTGTTGTGT GTGCAGGGGG TKGGTGTGTT 1740 

AAGCGTGGGT TAGGGGAACG TGTGTGCGCG TGCTGGTGGG CATGTGAGAT GAGTGACTGC 1800 

CGGTGAATGT GTCCACAGTT GAGAGGTTGG AGCAGGATGA GGGAATCCTG TCACCATCAA 1860 

TAATCACTTG TGGAGCGCCA CTTGGCCCAA GACGCCACCT GGGCGGACAG CAGGAGCTCT 1920 

CCATGGCCAG GCTGCCTGTG TGCATGTTCC CTGTCTGGTG CCCCTTTGCC CGCCTCCTGC 1980 

AAACCTCACA GGGTCCCCAC ACAACAGTGC CCTCCAGAAG CAGCCCCTCG GAGGCAGAGG 2040 

AAGGAAAATG GGGATGGCTG GGGCTCTCTC CATCCTCCTT TTCTCCTTGC CTTCGCATGG 2100 

CTGGCCTTCC CCTCCAAAAC CTCCATTCCC CTGCTGCCAG CCCCTTTGCC ATAGCCTGAT 2160 

TTTGGGGAGG AGGAAGGGGC GATTTGAGGG AGAAGGGGAG AAAGCTTATG GCTGGGTCTG 2220 
GTTTCTTCCC TTCCCAGAGG GTCTTACTGT TCCAGGGTGG CCCCAGGGCA GGCAGGGGCC 2280 
ACACTATGCC TGCGCCCTGG TAAAGGTGAC CCCTGCCATT TACCAGCAGC CCTGGCATGT 2340 
TCCTGCCCCA CAGGAATAGA ATGGAGGGAG CTCCAGAAAC TTTCCATCCC AAAGGCAGTC 2400 
TCCGTGGTTG AAGCAGACTG GATTTTTGCT CTGCCCCTGA CCCCTTGTCC CTCTTTGAGG 2460 
GAGGGGAGCT ATGCTAGGAC TCCAACCTCA GGGACTCGGG TGGCCTGCGC TAGCTTCTTT 2520 
TGATACTGAA AACTTTTAAG GTGGGAGGGT GGCAAGGGAT GTGCTTAATA AATCAATTCC 2580 
AAGCCTCAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



Seq ID NO: 92 Protein sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

| | | I I I 

MAGLAARLVL LAGAAALASG SQGDREPVYR DCVLQCEEQN CSGGALNHFR SRQPIYMSLA 60 
GWTCRDDCKY ECMWVTVGLY LQEGHKVPQF HGKWPFSRFL FFQEPASAVA SFLNGLASLV 120 
MLCRYRTFVP ASSPMYHTCV AFAWVSLNAW FWSTVFHTRD TDLTEKMDYF CASTVILHSI 180 
YLCCVRTVGL QHPAVVSAFR ALLLLHLTVH VSYLSLIRFD YGYNLVANVA IGLVNVVWWL 240 
AWCLWNQRRL PHVRKCVVVV LLLQGLSLLE LLDFPPLFWV LDAHAIWHIS TIPVHVLFFS 300 
FLEDDSLYLL KESEDKFKLD 



Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_033419.1 

Coding sequence: 18.. 980 

1 11 21 31 41 51 

CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 

CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 

AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 180 

TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 

TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 

CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 

GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 

TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 

TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 

TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 

GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 

ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 

AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 

CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 

ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 

ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 

CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 

TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 

GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 

CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 

GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 

CAGGGGGTCC CCTTCTCTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 

GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 

ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 

GGAGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 

TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 

GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 

CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 

GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860 

TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 

GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 

AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 

GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 

TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 

TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 

GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 2280 

ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 

GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 

AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 

CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 

TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 

CGCTAGCTTC TTTTGATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 94 Protein sequence 
Protein Accession #: NP_219487.1 

1 11 21 31 41 51 

I I I I I I 

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 180 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: XM_090469 

Coding sequence: 

1 11 21 31 41 51 

| | | | | | 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AGATAGCTCC 60 
TGCCGCGTGC TGGACAGGAC GCAGGTGAAC AAGCCCTGCC TGCTGGAGGC TGCCAAGCAC 120 
AGCCCACACC CCTTGCATGG ACAGTCGGTC AACAGCCACC CATCAGGTGC TCATCAGAAA 180 
CCACCTGAAG TGAAAATGTT TGGAGCCAGT CAAGGTTTGC TGACAATGGA AACAAACCAG 240 
TCCCTGGCAC AAGGCACAGG CTGCTCAGTG TGCTGGGAGG TGAACGGCCC AGGGTTGGAA 300 
ATGTCTCTTC CTGGAGTGCT CAGTGCTGAT GCTGGCCAGG TGGAGCACAG AAGACAAATG 360 
AACTCCGCAG ATACTGACTG GGGAACTGAG GAGGGACCAG GAAGTCTAGA TTTCGCCGTG 420 
GCTGCCCACC AGGAAGACAC TTTCTTTTTG AAAGACATCA AGCACACAAG TACGTTCAGG 480 
CAATCAGTTC AGCAGCAGAA TTGCATTTAC AGCCCCAGAG AAAAACCCTG TGGGAATGTC 540 
AGGGCACCTT GCGCGCCACC AAGGCGAGAG GCCCCGCTTG CCCTCAGCAG GCCCTGGCGG 600 
CTCAGCCGCA GCCCGGCGCC GTCCCCACGT ACACCCATGG CGCCCTTCCC GACCTCTGAC 660 
CGCGAGCTAG ACGCGCCAGG CCCGCCCCCC GGGCTGCGGA GCTCGGCAGC AGCCCCTCAC 720 
TGCCTGCCCG CGGCTCCAGA AGCCCAGAAA CTCTTCTTCC TGCTGCCCCT TTATCCAGAT 780 
GGCAGCCCAC CACCTAAGGA CATCTTGCAG ACGCTCCAGC ACAAGGCACC TGGGAAGAGA 840 
CTGCATCAAG AATCACCAGG TTCATTTCCA ATAGGTTTAT GTAACAGAAC AGCTGAACCA 900 
TGTATCAAAG GGGCTCTGGG TGTGGCTGCT GAAGCAGCAT TTCACTTACA GTTTTCCAGT 960 
GAATGA 

Seq ID NO: 96 Protein sequence 
Protein Accession #: XP_090469 

1 11 21 31 41 51 

| | | | | | 

MGFGDQGTVE GSLGTSKDSS CRVLDRTQVN KPCLLEAAKH SPHPLHGQSV NSHPSGAHQK 60 
PPEVKMFGAS QGLLTMETNQ SLAQGTGCSV CWEVNGPGLE MSLPGVLSAD AGQVEHRRQM 120 
NSADTDWGTE EGPGSLDFAV AAHQEDTFFL KDIKHTSTFR QSVQQQNCIY SPREKPCGNV 180 
RAPCAPPRRE APLALSRPWR LSRSPAPSPR TPMAPFPTSD RELDAPGPPP GLRSSAAAPH 240 
CLPAAPEAQK LFFLLPLYPD GSPPPKDILQ TLQHKAPGKR LHQESPGSFP IGLCNRTAEP 300 
CIKGALGVAA EAAFHLQFSS E 

Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NM_003474.2 

Coding sequence: 37.. 3036 

1 11 21 31 41 51 

| I I I I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
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GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTOGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 

TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 
GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 
GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 
TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 

GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 

ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 
AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 
AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EVVSASVRSG DLWIPVKSFD 60 

SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 

SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 

DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 

DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 

ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 

AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 

IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660 

AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

LAAGFVVYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 

DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 

ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 



Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM_003714 

Coding sequence: 135.. 1043 

1 11 21 31 41 51 

GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180 

TGGCCACCTT TGACCCGGCX3 CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

ACAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGCGG CTGGATTCCA GGACAGGAGT 1500 

GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620 

CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680 

TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740 

ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1960 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220 

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 2280 

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 2340 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120 
ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 
RR 

AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60 
CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 
TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 
AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 
CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 
CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 
GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 
CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 
TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 
TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600 
CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 
AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 
GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 
TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 
CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 
CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 
GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 
TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 
CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 
CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 
GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 
CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 
CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 
CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 
GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500 
ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 
ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 
GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 
AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 
GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 
GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860 
TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 
TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 
ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 
GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 
CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 
ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 



Seq ID NO: 102 Protein sequence 
Protein Accession #: NP_005931 



1 11 21 31 41 51 

| | | | | | 

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60 
PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120 
LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180 
FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK AIMSAFYTFR 240 
YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP PDACEASFDA 300 
VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 360 
QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 420 
PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480 
AEPANTFL 
480 



Seq ID NO: 103 DNA sequence 

Nucleic Acid Accession #: NM_033151.2 

Coding sequence: 351.. 4499 



1 11 21 31 41 51 

| | | | | | 

ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGGAGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 
TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 
GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 

TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180 
TGTCCTTAAT GGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCGGGAT TGGCTTGGAG ACAGAGGCAC 3660 
AGTTCACGGC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
CCACAGCCTC CATTGACATG GAGACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 
GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC 

Seq ID NO: 104 Protein sequence 
Protein Accession #: NP_149163.2 

1 11 21 31 41 51 

I I I I I I 

MTRKRTYWVP NSSGGLVNRG IDIGDDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 
WGKYDAALRT MIPFRPKPRF PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL 120 
SVHDASDKNV QRLHRLWEEE VSRRGIEKAS VLLVMLRFQR TRLIFDALLG ICFCIASVLG 180 
PILIIPKILE YSEEQLGNVV HGVGLCFALF LSECVKSLSF SSSWIINQRT AIRFRAAVSS 240 
FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 
YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIK LIKMYTWEKP 360 
FAKIIEDLRR KERKLLEKCG LVQSLTSITL FIIPTVATAV WVLIHTSLKL KLTASMAFSM 420 
LASLNLLRLS VFFVPIAVKG LTNSKSAVMR FKKFFLQESP VFYVQTLQDP SKALVFEEAT 480 
LSWQQTCPGI VNGALELERN GHASEGMTRP RDALGPEEEG NSLGPELHKI NLVVSKGMML 540 
GVCGNTGSGK SSLLSAILEE MHLLEGSVGV QGSLAYVPQQ AWIVSGNIRE NILMGGAYDK 600 
ARYLQVLHCC SLNRDLELLP FGDMTEIGER GLNLSGGQKQ RISLARAVYS DRQIYLLDDP 660 
LSAVDAHVGK HIFEECIKKT LRGKTVVLVT HQLQYLEFCG QIILLENGKI CENGTHSELM 720 
QKKGKYAQLI QKMHKEATSD MLQDTAKIAE KPKVESQALA TSLEESLNGN AVPEHQLTQE 780 
EEMEEGSLSW RVYHHYIQAA GGYMVSCIIF FFVVLIVFLT IFSFWWLSYW LEQGSGTNSS 840 
RESNGTMADL GNIADNPQLS FYQLVYGLNA LLLICVGVCS SGIFTKVTRK ASTALHNKLF 900 
NKVFRCPMSF FDTIPIGRLL NCFAGDLEQL DQLLPIFSEQ FLVLSLMVIA VLLIVSVLSP 960 
YILLMGAIIM VICFIYYMMF KKAIGVFKRL ENYSRSPLFS HILNSLQGLS SIHVYGKTED 1020 
FISQFKRLTD AQNNYLLLFL SSTRWMALRL EIMTNLVTLA VALFVAFGIS STPYSFKVMA 1080 
VNIVLQLASS FQATARIGLE TEAQFTAVER ILQYMKMCVS EAPLHMEGTS CPQGWPQHGE 1140 
IIFQDYHMKY RDNTPTVLHG INLTIRGHEV VGIVGRTGSG KSSLGMALFR LVEPMAGRIL 1200 
IDGVDICSIG LEDLRSKLSV IPQDPVLLSG TIRFNLDPFD RHTDQQIWDA LERTFLTKAI 1260 
SKFPKKLHTD VVENGGNFSV GERQLLCIAR AVLRNSKIIL IDEATASIDM ETDTLIQRTI 1320 
REAFQGCTVL VIAHRVTTVL NCDHILVMGN GKVVEFDRPE VLRKKPGSLF AALMATATSS 1380 
LR 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..4043 

1 11 21 31 41 51 

| | | | | | 

ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC 60 
ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120 
GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 
TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT 240 
CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 
ACCCCGCTCA TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360 
TCAGTCCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 
GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 480 
ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG 540 
CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 
CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 
TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC 720 
TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 780 
ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 
GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA 900 
TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960 
ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT 1020 
GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 
TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 
ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 
ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260 
CTCCAGGAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320 
TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 
GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440 
CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 
AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 
GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 
TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1680 
GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 
CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGCGGGGCCT CAACCTCTCT 1800 
GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 
CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 
TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980 
TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 
CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 
GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 
AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 2220 
CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2280 
TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT CGTGGTGCTG 2340 
ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 
GGGACCAATA GCAGCCGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 
AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 
GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 
CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTTCTTTGA CACCATCCCA 2640 
ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 
ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 
AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 
TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2880 
TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940 
GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA GAATAACTAC 3000 
CTGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 
CTTGTGACCT TGGCTGTTGC CCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120 
TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CCAGCTTCCA GGCCACTGCC 3180 
CGGATTGGCT TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240 
AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 
CCACAGCATG GGGAAATCAT ATTTCAGGAT TATCACATGA AATACAGAGA CAACACACCC 3360 
ACCGTGCTTC ACGGCATCAA CCTGACCATC CGCGGCCACG AAGTGGTGGG CATCGTGGGA 3420 
AGGACGGGCT CTGGGAAGTC CTCCTTGGGC ATGGCTCTCT TCCGCCTGGT GGAGCCCATG 3480 
GCAGGCCGGA TTCTCATTGA CGGCGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540 
TCCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600 
CTAGATCCCT TTGACCGTCA CACTGACCAG CAGATCTGGG ATGCCTTGGA GAGGACATTC 3660 
CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAACGGT 3720 
GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTCGCAAC 3780 
TCCAAGATCA TCCTTATCGA TGAAGCCACA GCCTCCATTG ACATGGAGAC AGACACCCTG 3840 
ATCCAGCGCA CAATCCGTGA AGCCTTCCAG GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900 
GTCACCACTG TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGGGAA GGTGGTAGAA 3960 
TTTGATCGGC CGGAGGTACT GCGGAAGAAG CCTGGGTCAT TGTTCGCAGC CCTCATGGCC 4020 
ACAGCCACTT CTTCACTGAG ATAA 



Seq ID NO: 106 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

MTRKRTYWVP NSSGGLVNRG IDIGDDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 
WGKYDAALRT MIPFRPKPRF PAPQPLDNAG LFSYLTVSWL TPLMIQSLRS RLDENTIPPL 120 
SVHDASDKNV QRLHRLWEEE VSRRGIEKAS VLLVMLRFQR TRLIFDALLG ICFCIASVLG 180 
PILIIPKILE YSEEQLGNVV HGVGLCFALF LSECVKSLSF SSSWIINQRT AIRFRAAVSS 240 
FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VLITCASLVI CSISSYFIIG 300 
YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIK LIKMYTWEKP 360 
FAKIIEGMES LTFCSKPGDG MAFSMLASLN LLRLSVFFVP IAVKGLTNSK SAVMRFKKFF 420 
LQESPVFYVQ TLQDPSKALV FEEATLSWQQ TCPGIVNGAL ELERNGHASE GMTRPRDALG 480 
PEEEGNSLGP ELHKINLVVS KGMMLGVCGN TGSGKSSLLS AILEEMHLLE GSVGVQGSLA 540 
YVPQQAWIVS GNIRENILMG GAYDKARYLQ VLHCCSLNRD LELLPFGDMT EIGERGLNLS 600 
GGQKQRISLA RAVYSDRQIY LLDDPLSAVD AHVGKHIFEE CIKKTLRGKT VVLVTHQLQY 660 
LEFCGQIILL ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 
SQALATSLEE SLNGNAVPEH QLTQEEEMEE GSLSWRVYHH YIQAAGGYMV SCIIFFFVVL 780 
IVFLTIFSFW WLSYWLEQGS GTNSSRESNG TMADLGNIAD NPQLSFYQLV YGLNALLLIC 840 
VGVCSSGIFT KVTRKASTAL HNKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQLLP 900 
IFSEQFLVLS LMVIAVLLIV SVLSPYILLM GAIIMVICFI YYMMFKKAIG VFKRLENYSR 960 
SPLFSHILNS LQGLSSIHVY GKTEDFISQF KRLTDAQNNY LLLFLSSTRW MALRLEIMTN 1020 
LVTLAVALFV AFGISSTPYS FKVMAVNIVL QLASSFQATA RIGLETEAQF TAVERILQYM 1080 
KMCVSEAPLH MEGTSCPQGW PQHGEIIFQD YHMKYRDNTP TVLHGINLTI RGHEVVGIVG 1140 
RTGSGKSSLG MALFRLVEPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGTIRFN 1200 
LDPFDRHTDQ QIWDALERTF LTKAISKFPK KLHTDVVENG GNFSVGERQL LCIARAVLRN 1260 
SKIILIDEAT ASIDMETDTL IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVMGNGKVVE 1320 
FDRPEVLRKK PGSLFAALMA TATSSLR 



Seq ID NO: 107 DNA sequence 
Nucleic Acid Accession #: NM_024022 
Coding sequence: 202.. 1563 

1 11 21 31 41 51 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 
AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGGTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTCCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCGACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAA 



Seq ID NO: 108 Protein sequence 
Protein Accession #: NP_076927 

1 11 21 31 41 51 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 109 DNA sequence
Nucleic Acid Accession #: NM_000493.2
Coding sequence: 97..2139

1 11 21 31 41 51
| | | | | |
CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60
CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120
CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180
ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240
AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300
GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 360
GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420
GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480
GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA 540
CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600
CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660
GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720
GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780
GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840
CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900
GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960
ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020
GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080
GGTCTTCCTG GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140
AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200
CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260
GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320
AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380
GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440
GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500
CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560
ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620
CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680
GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740
ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800
ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860
TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920
CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980
ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040
CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100
CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160
TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220
AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280
AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340
AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400
CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460
TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520
ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580
AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640
TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700
CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2760
CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820
CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880
GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940
AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000
CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3060
ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120
TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180
AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG

Seq ID NO: 110 Protein sequence
Protein Accession #: NP_000484.2

1 11 21 31 41 51
| | | | | |
MLPQIPFLLL VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60
TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180
VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480
ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600
YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660
GLYSSEYVHS SFSGFLVAPM

Seq ID NO: 111 DNA sequence
Nucleic Acid Accession #: NM_000949
Coding sequence: 285..2153

1 11 21 31 41 51

| | | | | |

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 

CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 

ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 

TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 

GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 

TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 

GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 

GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 

GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 

AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 

ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 

TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 

GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 

TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 

CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 

GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140 

AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 

TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 

ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 

AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 

CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 

CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 

CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 

AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 

CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1800 

CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860 

AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 

ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 

CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 

ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 

TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 

ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 

AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 

TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 

AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 

AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 

GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 

AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 

AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 

AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

Seq ID NO: 112 Protein sequence 
Protein Accession #: NP_000940 

1 11 21 31 41 51 

| | | | | | 

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 

SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 

VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLYEI RLKPEKAAEW 180 

EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 

VAVLSAVICL IIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 

FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 

SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 

PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 

SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 

ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 113 DNA sequence 
Nucleic Acid Accession #: XM_062811 
Coding sequence: 1..888 

1 11 21 31 41 51 

| | | | | | 

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 
CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 
CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 
GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420 
AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 480 
ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 
TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 
AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 
CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 
CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 
GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 



Seq ID NO: 114 Protein sequence
Protein Accession #: XP_062811

1 11 21 31 41 51
| | | | | |
MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG 60
GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120
VGSVFVAFII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180
SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240
QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTNSEQKMY PAVTV



Seq ID NO: 115 DNA sequence
Nucleic Acid Accession #: NM_013257
Coding sequence: 223..1512

1 11 21 31 41 51
| | | | | |
GGTGTGCTCT TGAGGGATTA AATGCAAAGA GATCACACCA TGGACTACAA GGAAAGCTGC 60
CCAAGTGTAA GCATTCCCAG CTCCGATGAA CACAGAGAGA AAAAGAAGAG GTTTACTGTT 120
TATAAAGTTC TGGTTTCAGT GGGAAGAAGT GAATGGTTTG TCTTCAGGAG ATATGCAGAG 180
TTTGATAAAC TTTATAACAC TTTAAAAAAA CAGTTTCCTG CTATGGCCCT GAAGATTCCT 240
GCCAAGAGAA TATTTGGTGA TAATTTTGAT CCAGATTTTA TTAAACAAAG ACGAGCAGGA 300
CTAAACGAAT TCATTCAGAA CCTAGTTAGG TATCCAGAAC TTTATAACCA TCCAGATGTC 360
AGAGCATTCC TTCAAATGGA CAGTCCAAAA CACCAGTCAG ATCCATCTGA AGATGAGGAT 420
GAAAGAAGTT CTCAGAAGCT ACACTCTACC TCACAGAACA TCAACCTGGG ACCGTCTGGA 480
AATCCTCATG CCAAACCAAC TGACTTTGAT TTCTTAAAAG TTATTGGAAA AGGCAGCTTT 540
GGCAAGGTTC TTCTTGCAAA ACGGAAACTG GATGGAAAAT TTTATGCTGT CAAAGTGTTA 600
CAGAAAAAAA TAGTTCTCAA CAGAAAAGAG CAAAAACATA TTATGGCTGA ACGTAATGTG 660
CTCTTGAAAA ATGTGAAACA TCCGTTTTTG GTTGGATTGC ATTATTCCTT CCAAACAACT 720
GAAAAGCTTT ATTTTGTTCT GGATTTTGTT AATGGAGGGG AGCTTTTTTT CCACTTACAA 780
AGAGAACGGT CCTTTCCTGA GCACAGAGCT AGGTTTTACG CTGCTGAAAT TGCTAGTGCA 840
TTGGGTTACT TACATTCCAT CAAAATAGTA TACAGAGACT TGAAACCAGA AAATATTCTT 900
TTGGATTCAG TAGGACATGT TGTCTTAACA GATTTTGGGC TTTGTAAAGA AGGAATTGCT 960
ATTTCTGACA CCACTACCAC ATTTTGTGGG ACACCAGAGT ATCTTGCACC TGAAGTAATT 1020
AGAAAACAGC CCTATGACAA TACTGTAGAT TGGTGGTGCC TTGGGGCTGT TCTGTATGAA 1080
ATGCTGTATG GATTGCCTCC TTTTTATTGC CGAGATGTTG CTGAAATGTA TGACAATATC 1140
CTTCACAAAC CCCTAAGTTT GAGGCCAGGA GTGAGTCTTA CAGCCTGGTC CATTCTGGAA 1200
GAACTCCTAG AAAAAGACAG GCAAAATCGA CTTGGTGCCA AGGAAGACTT TCTTGAAATT 1260
CAGAATCATC CTTTTTTTGA ATCACTCAGC TGGGCTGACC TTGTACAAAA GAAGATTCCA 1320
CCACCATTTA ATCCTAATGT GGCTGGACCA GATGATATCA GAAACTTTGA CACAGCATTT 1380
ACAGAAGAAA CAGTTCCATA TTCTGTGTGT GTATCTTCTG ACTATTCTAT AGTGAATGCC 1440
AGTGTATTGG AGGCAGATGA TGCATTCGTT GGTTTCTCTT ATGCACCTCC TTCAGAAGAC 1500
TTATTTTTGT GAGCAGTTTG CCATTCAGAA ACCATTGAGC AAAATAAGTC TATAGATGGG 1560
ACTGAAACTT CTATTTGTGT GAATATATTC AAATATGTAT AACTAGTGCC TCATTTTTAT 1620
ATGTAATGAT GAAAACTATG AAAAAATGTA TTTTCTTCTA TGTGCAAGAA AAATAGGGCA 1680
TTTCAAAGAG CTGTTTTGAT TAAAATTTAT ATTCTTGTTT AATAAGCTTA TTTTTAAACA 1740
ATTTAAAAGC TATTATTCTT AGCATTAACC TATTTTTAAA GAAACCTTTT TTGCTATTGA 1800
CTGTTTTTTC CCTCTAAGTT TACACTAACA TCTACCCAAG ATAGACTGTT TTTTAACAGT 1860
CAATTTCAGT TCAGCTAACA TATATTAATA CCTTTGTAAC TCTTTGCTAT GGCTTTTGTT 1920
ATCACACCAA AACTATGCAA TTGGTACATG GTTGTTTAAG AAGAAACCGT ATTTTTCCAT 1980
GATAAATCAC TGTTTGAAAT ATTTGGTTCA TGGTATGATC GAAATGTAAA AGCATAATTA 2040
ACACATTGGC TGCTAGTTAA CAATTGGAAT AACTTTATTC TGCAGATCAT TTAAGAAGTA 2100
ACAGGCCGGG CGCGGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGCGGGC 2160
AGATCACCTG AGGTCAGGAG TTGGAGACCA GCCTGACCAA CATGGACAAA CCCCGTCTCT 2220
ACTAAAAATA CAAAATTGGC AGGGTGTGGT GGCACATGCC TATAATCCCA GCTACTTGGG 2280
AGGCTAAGGC AGGAGAATCG CTTGAACCCG GGAGGCGGAG GTTGCAGTGA GCCGAGATCG 2340
CACCATTGCA CTCCTGCCTG GGCAACAAGA GTGAAACTCC ATCTCCAAAA A

Seq ID NO: 116 Protein sequence
Protein Accession #: NP_037389

1 11 21 31 41 51
| | | | | |
MALKIPAKRI FGDNFDPDFI KQRRAGLNEF IQNLVRYPEL YNHPDVRAFL QMDSPKHQSD 60
PSEDEDERSS QKLHSTSQNI NLGPSGNPHA KPTDFDFLKV IGKGSFGKVL LAKRKLDGKF 120
YAVKVLQKKI VLNRKEQKHI MAERNVLLKN VKHPFLVGLH YSFQTTEKLY FVLDFVNGGE 180
LFFHLQRERS FPEHRARFYA AEIASALGYL HSIKIVYRDL KPENILLDSV GHVVLTDFGL 240
CKEGIAISDT TTTFCGTPEY LAPEVIRKQP YDNTVDWWCL GAVLYEMLYG LPPFYCRDVA 300
EMYDNILHKP LSLRPGVSLT AWSILEELLE KDRQNRLGAK EDFLEIQNHP FFESLSWADL 360
VQKKIPPPFN PNVAGPDDIR NFDTAFTEET VPYSVCVSSD YSIVNASVLE ADDAFVGFSY 420
APPSEDLFL



Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession #: NM_004004.1 

Coding sequence: 1..681 

1 11 21 31 41 51 

| | | | | | 

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180 

AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 240 

CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCGGAGACAT 300 

GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATCGAGGAG 360 

ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420 

TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 480 

TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 540 

TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600 

ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG 660 
AAGTCAAAAA AGCCAGTTTA A 

Seq ID NO: 118 Protein sequence 
Protein Accession #: NP_003995.1 

1 11 21 31 41 51 

| | | | | | 

MDWGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILVVAA KEVWGDEQAD FVCNTLQPGC 60 
KNVCYDHYFP ISHIRLWALQ LIFVSSPALL VAMHVAYRRH EKKRKFIKGE IKSEFKDIEE 120 
IKTQKVRIEG SLWWTYTSSI FFRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC 180 
FVSRPTEKTV FTVFMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV 



Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: XM_061091.1 

Coding sequence: 1..2481 

1 11 21 31 41 51 

| | | | | | 

ATGCCAAATA CTTCAGGAAC AACCAGGATT GAAATTTGGC TTCTCCAAGA GCCGCCCGGG 60 

CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120 

CCCGGGTACC CGCCAGTGCC GGCTGCCGAT GACCGATTCA CGCTCCCGAT GATTGGAGGT 180 

CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 240 

GTTGGGAAGC CTCCTTTTGA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGGAAG 300 

ATTTCAGCTG CCAGCAAAAT GATGTGGTGC TCGGCTGCAG TGGACATCAT GTTTCTGTTA 360 

GATGGGTCTA ACAGCGTCGG GAAAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCACA 420 

GTCTGTGACG GTCTGGACAT CAGCCCCGAG AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 480 

TCCACTCCTC ATCTGGAATT CCCCTTGGAT TCATTTTCAA CCCAACAGGA AGTGAAGGCA 540 

AGAATCAAGA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC TCTGAAATAC 600 

CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660 

GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 720 

GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCATGCACTG 780 

GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 840 

GGCCTCTTCA GCACCCTCAG CAGCTCGGCC ATCTGCTCCA GCGCCACGCC AGCTGGGAGC 900 

CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTGACTCG 960 

CAGCCCTGCC AGAATGGAGG CACATGTGTT CCAGAAGGAC TGGACGGCTA CCAGTGCCTC 1020 

TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1080 

GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 1140 

GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 1200 

GTGGGTGTGG CCACATACAG CAGGGAGCTG CTGGTGGCGG TGCCTGTGGG GGAGTACCAG 1260 

GATGTGCCTG ACCTGGTCTG GAGCCTCGAT GGCATTCCCT TCCGTGGTGG CCCCACCCTG 1320 

ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG CGTGGCTTCG GGAGCGCCAC CAGGACAGGC 1380 

CAGGACCGGC CACGTAGAGT GGTGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440 

GCGGGCCCAG CGCGTCACGC AAGGGCGCGA GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG 1500 

GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GGTCTACTCG 1560 

GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG 1620 

CGGCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGCCTCA 1680 

GTAGGGCCCG AGAATTTTGC TCAGATGCAG AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT 1740 

GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800 

GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1860 

CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920 

ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGGTGGT GCTCACAGGC 1980 

GGGAGAGGCG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040 

GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC 2100 

CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220 

ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280 

TGGGAGGGCC CCCACTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCAG 2340 

GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400 

ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460 
GTCTGTGCCC CAGGTCCTTA G 

Seq ID NO: 120 Protein sequence 
Protein Accession #: XP_061091.1

1 11 21 31 41 51
| | | | | |
MPNTSGTTRI EIWLLQEPPG HRALVAALLP VSPSPELALA PGYPPVPAAD DRFTLPMIGG 60
QMHGEKVDLW SLGVLCYEFL VGKPPFEANE VHVSKETIGK ISAASKMMWC SAAVDIMFLL 120
DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHLEFPLD SFSTQQEVKA 180
RIKRMVFKGG RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 240
GVTVFAVGVR FPRWEELHAL ASEPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 300
PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR 360
VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 420
DVPDLVWSLD GIPFRGGPTL TGSALRQAAE RGFGSATRTG QDRPRRVVVL LTESHSEDEV 480
AGPARHARAR ELLLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 540
RPGCRTQALD LVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVNPDVTQVG LVVYGSQVQT 600
AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAVVVLTG 660
GRGAEDAAVP AQKLRNNGIS VLVVGVGPVL SEGLRRLAGP RDSLIHVAAY ADLRYHQDVL 720
IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 780
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP



Seq ID NO: 121 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..2424 

1 11 21 31 41 51 

| | | | | | 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500 

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340 

CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 2400 
AATGTCTGTG CCCCAGGTCC TTAG 



Seq ID NO: 122 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51
| | | | | |
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780
RTPPSNYREG LGTEMVPTFW NVCAPGP



Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 89..2356 

1 11 21 31 41 51 

| | | | | | 

GCCCCCTGGC CCGAGCCGCG CCCGGGTCTG TGAGTAGAGC CGCCCGGGCA CCGAGCGCTG 60 

GTCGCCGCTC TCCTTCCGTT ATATCAACAT GCCCCCTTTC CTGTTGCTGG AAGCCGTCTG 120 

TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 180 

AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCGG CTGCAGTGGA 240 

CATCATGTTT CTGTTAGATG GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 300 

GCACTTTGCC ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG TCAGAGTGGG 360 

AGCATTCCAG TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCAT TTTCAACCCA 420 

ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 480 

ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG CTTCTGTGCC 540 

CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGG GATGTGGCAC TGCCATCCAA 600 

GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTTTC CCAGGTGGGA 660 

GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720 

GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGCGC 780 

CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC AGGACGCTGG AGATGGTCCG 840 

GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG CGGACCCTTG CGGTGCTGGC 900 

TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA ACCCACCCTG CCACCTGCTA 960 

CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020 

TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG GCCTTTGGAG GGGAGGCTAA 1080 

CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140 

TGCGGGCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC GGTTTGTGCG 1200 

GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

GCTGGTGGCG GTGCCTGTGG GGGAGTACCA GGATGTGCCT GACCTGGTCT GGAGCCTCGA 1320 

TGGCATTCCC TTCCGTGGTG GCCCCACCCT GACGGGCAGT GCCTTGCGGC AGGCGGCAGA 1380 

GCGTGGCTTC GGGAGCGCCA CCAGGACAGG CCAGGACCGG CCACGTAGAG TGGTGGTTTT 1440 

GCTCACTGAG TCACACTCCG AGGATGAGGT TGCGGGCCCA GCGCGTCACG CAAGGGCGCG 1500 

AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560 

AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620 

TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCCTGGA 1680 

CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC GAGAATTTTG CTCAGATGCA 1740 

GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800 

CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA AACCCACCCG 1860 

GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 1920 

CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC CAGAGGGGTG CCCGGCCTGG 1980 

TGTCCCGAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG CAGCCGTTCC 2040 

TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGCGTGG GGCCTGTCCT 2100 

AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG TGGCAGCTTA 2160 

CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCCAAGCA 2220 

GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280 

TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG AGAACCGATT 2340 

CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 2400 

AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460 

CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC ACTGAGGGAG 2520 

GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580 

AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG GGGTCCTGAA 2700 

GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTCGCC AGAATGTTGT 2760 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGACGG 

Seq ID NO: 124 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP



Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_031942.1 

Coding sequence: 145..1260

1 11 21 31 41 51
| | | | | |
CCCGAGCCCC GCCCCTCCGG GCCCGGGTOG GCGCGCCCAG CCTGCCAGCC GCGCTGCTGC 60
TGCTCCTCCT GCTGTGGGAC CGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 120
CCGATCTGGG CACCCGCCAC CAGCATGGAC GCTGGCCGGG TGCCGCAGAA AGATCTCAGA 180
GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 240
TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 300
TCAGTTCGGG AAGGCTGTAG GACCCGCAGC CAGTGCAGGC ACTCTGGACC TCTCAGGGTG 360
GCGATGAAGT TTCCAGCGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 420
CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 480

tltlt AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 540 

ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 600 

GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 660 

CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 780 

ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900 

GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 960 

TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 1020 

CGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG GTGAAGAGGT CAGGGATGCT 1080 

CTGCTGGATC CGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 1140 

CGGCAGCGAG ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 1200 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 

Seq ID NO: 126 Protein sequence 
Protein Accession #: NP_114148.1 

1 11 21 31 41 51 

| | | | | | 

MDARRVPQKD LRVKKNLKKP RYVKLISMET SSSSDDSCDS PASDNPANTR LQSVREGCRT 60 

RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120 

NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 

TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

PVEEITEEEL ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300 

LRNRYGEEVR DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 360 
KSLKQEFEMQ A 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: AF305616.1 

Coding sequence: 1..863 

1 11 21 31 41 51 

ATGCACCGCT TGATGGGGGT CAACAGCACC GCCGCCGCCG CCGCCGGGCA GCCCAATGTC 60 

TCCTGCACGT GCAACTGCAA ACGCTCTTTG TTCCAGAGCA TGGAGATCAC GGAGCTGGAG 120 

TTTGTTCAGA TCATCATCAT CGTGGTGGTG ATGATGGTGA TGGTGGTGGT GATCACGTGC 180 

CTGCTGAGCC ACTACAAGCT GTCTGCACGG TCCTTCATCA GCCGGCACAG CCAGGGGCGG 240 

AGGAGAGAAG ATGCCCTGTC CTCAGAAGGA TGCCTGTGGC CCTCGGAGAG CACAGTGTCA 300 

GGCAACGGAA TCCCAGAGCC GCAGGTCTAC GCCCCGCCTC GGCCCACCGA CCGCCTGGCC 360 

GTGCCGCCCT TCGCCCAGCG GGAGCGCTTC CACCGCTTCC AGCCCACCTA TCCGTACCTG 420 

CAGCACGAGA TCGACCTGCC ACCCACCATC TCGCTGTCAG ACGGGGAGGA GCCCCCACCC 480 

TACCAGGGCC CCTGCACCCT CCAGCTTCGG GACCCCGAGC AGCAGCTGGA ACTGAACCGG 540 

GAGTCGGTGC GCGCACCCCC AAACAGAACC ATCTTCGACA GTGACCTGAT GGATAGTGCC 600 

AGGCTGGGCG GCCCCTGCCC CCCCAGCAGT AACTCGGGCA TCAGCGCCAC GTGCTACGGC 660 

AGCGGCGGGC GCATGGAGGG GCCGCCGCCC ACCTACAGCG AGGTCATCGG CCACTACCCG 720 

GGGTCCTCCT TCCAGCACCA GCAGAGCAGT GGGCCGCCCT CCTTGCTGGA GGGGACCCGG 780 

CTCCACCACA CACACATCGC GCCCCTAGAG AGCGCAGCCA TCTGGAGCAA AGAGAAGGAT 840 
AAACAGAAAG GACACCCTCT CTAG 

Seq ID NO: 128 Protein sequence 
Protein Accession #: AAL09357.1 

1 11 21 31 41 51 

| | | | | | 

MHRLMGVNST AAAAAGQPNV SCTCNCKRSL FQSMEITELE FVQIIIIVVV MMVMVVVITC 60 

LLSHYKLSAR SFISRHSQGR RREDALSSEG CLWPSESTVS GNGIPEPQVY APPRPTDRLA 120 

VPPFAQRERF HRFQPTYPYL QHEIDLPPTI SLSDGEEPPP YQGPCTLQLR DPEQQLELNR 180 

ESVRAPPNRT IFDSDLMDSA RLGGPCPPSS NSGISATCYG SGGRMEGPPP TYSEVIGHYP 240 
GSSFQHQQSS GPPSLLEGTR LHHTHIAPLE SAAIWSKEKD KQKGHPL 



Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM_004952.1 






PCT/US02/36810 



Coding sequence: 1..718 

1 11 21 31 41 51 

| | | | | | 

ATGGCGGCGG CTCCGCTGCT GCTGCTGCTG CTGCTCGTGC CCGTGCCGCT GCTGCCGCTG 60 

CTGGCCCAAG GGCCCGGAGG GGCGCTGGGA AACCGGCATG CGGTGTACTG GAACAGCTCC 120 

AACCAGCACC TGCGGCGAGA GGGCTACACC GTGCAGGTGA ACGTGAACGA CTATCTGGAT 180 

ATTTACTGCC CGCACTACAA CAGCTCGGGG GTGGGCCCCG GGGCGGGACC GGGGCCCGGA 240 

GGCGGGGCAG AGCAGTACGT GCTGTACATG GTGAGCCGCA ACGGCTACCG CACCTGCAAC 300 

GCCAGCCAGG GCTTCAAGCG CTGGGAGTGC AACCGGCCGC ACGCCCCGCA CAGCCCCATC 360 

AAGTTCTCGG AGAAGTTCCA GCGCTACAGC GCCTTCTCTC TGGGCTACGA GTTCCACGCC 420 

GGCCACGAGT ACTACTACAT CTCCACGCCC ACTCACAACC TGCACTGGAA GTGTCTGAGG 480 

ATGAAGGTGT TCGTCTGCTG CGCCTCCACA TCGCACTCCG GGGAGAAGCC GGTCCCCACT 540 

CTCCCCCAGT TCACCATGGG CCCCAATGTG AAGATCAACG TGCTGGAAGA CTTTGAGGGA 600 

GAGAACCCTC AGGTGCCCAA GCTTGAGAAG AGCATCAGCG GGACCAGCCC CAAACGGGAA 660 
CACCTGCCCC TGGCCGTGGG CATCGCCTTC TTCCTCATGA CGTTCTTGGC CTCCTAG 






Seq ID NO: 130 Protein sequence 
Protein Accession #: NP_004943.1 



1 11 21 31 41 51 

| | | | | | 

MAAAPLLLLL LLVPVPLLPL LAQGPGGALG NRHAVYWNSS NQHLRREGYT VQVNVNDYLD 60 
IYCPHYNSSG VGPGAGPGPG GGAEQYVLYM VSRNGYRTCN ASQGFKRWEC NRPHAPHSPI 120 
KFSEKFQRYS AFSLGYEFHA GHEYYYISTP THNLHWKCLR MKVFVCCAST SHSGEKPVPT 180 
LPQFTMGPNV KINVLEDFEG ENPQVPKLEK SISGTSPKRE HLPLAVGIAF FLMTFLAS 

Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: NM_012445.1 
Coding sequence: 276..1271 

1 11 21 31 41 51 

| | | | | | 

GCACGAGGGA AGAGGGTGAT CCGACCCGGG GAAGGTCGCT GGGCAGGGCG AGTTGGGAAA 60 

GCGGCAGCCC CCGCCGCCCC CGCAGCCCCT TCTCCTCCTT TCTCCCACGT CCTATCTGCC 120 

TCTCGCTGGA GGCCAGGCCG TGCAGCATCG AAGACAGGAG GAACTGGAGC CTCATTGGCC 180 

GGCCCGGGGC GCCGGCCTCG GGCTTAAATA GGAGCTCCGG GCTCTGGCTG GGACCCGACC 240 

GCTGCCGGCC GCGCTCCCGC TGCTCCTGCC GGGTGATGGA AAACCCCAGC CCGGCCGCCG 300 

CCCTGGGCAA GGCCCTCTGC GCTCTCCTCC TGGCCACTCT CGGCGCCGCC GGCCAGCCTC 360 

TTGGGGGAGA GTCCATCTGT TCCGCCAGAG CCCCGGCCAA ATACAGCATC ACCTTCACGG 420 

GCAAGTGGAG CCAGACGGCC TTCCCCAAGC AGTACCCCCT GTTCCGCCCC CCTGCGCAGT 480 

GGTCTTCGCT GCTGGGGGCC GCGCATAGCT CCGACTACAG CATGTGGAGG AAGAACCAGT 540 

ACGTCAGTAA CGGGCTGCGC GACTTTGCGG AGCGCGGCGA GGCCTGGGCG CTGATGAAGG 600 

AGATCGAGGC GGCGGGGGAG GCGCTGCAGA GCGTGCACGC GGTGTTTTCG GCGCCCGCCG 660 

TCCCCAGCGG CACCGGGCAG ACGTCGGCGG AGCTGGAGGT GCAGCGCAGG CACTCGCTGG 720 

TCTCGTTTGT GGTGCGCATC GTGCCCAGCC CCGACTGGTT CGTGGGCGTG GACAGCCTGG 780 

ACCTGTGCGA CGGGGACCGT TGGCGGGAAC AGGCGGCGCT GGACCTGTAC CCCTACGACG 840 

CCGGGACGGA CAGCGGCTTC ACCTTCTCCT CCCCCAACTT CGCCACCATC CCGCAGGACA 900 

CGGTGACCGA GATAACGTCC TCCTCTCCCA GCCACCCGGC CAACTCCTTC TACTACCCGC 960 

GGCTGAAGGC CCTGCCTCCC ATCGCCAGGG TGACACTGGT GCGGCTGCGA CAGAGCCCCA 1020 

GGGCCTTCAT CCCTCCCGCC CCAGTCCTGC CCAGCAGGGA CAATGAGATT GTAGACAGCG 1080 

CCTCAGTTCC AGAAACGCCG CTGGACTGCG AGGTCTCCCT GTGGTCGTCC TGGGGACTGT 1140 

GCGGAGGCCA CTGTGGGAGG CTCGGGACCA AGAGCAGGAC TCGCTACGTC CGGGTCCAGC 1200 

CCGCCAACAA CGGGAGCCCC TGCCCCGAGC TCGAAGAAGA GGCTGAGTGC GTCCCTGATA 1260 

ACTGCGTCTA AGACCAGAGC CCCGCAGCCC CTGGGGCCCC CGGAGCCATG GGGTGTCGGG 1320 

GGCTCCTGTG CAGGCTCATG CTGCAGGCGG CCGAGGCACA GGGGGTTTCG CGCTGCTCCT 1380 

GACCGCGGTG AGGCCGCGCC GACCATCTCT GCACTGAAGG GCCCTCTGGT GGCCGGCACG 1440 

GGCATTGGGA AACAGCCTCC TCCTTTCCCA ACCTTGCTTC TTAGGGGCCC CCGTGTCCCG 1500 

TCTGCTCTCA GCCTCCTCCT CCTGCAGGAT AAAGTCATCC CCAAGGCTCC AGCTACTCTA 1560 

AATTATGGTC TCCTTATAAG TTATTGCTGC TCCAGGAGAT TGTCCTTCAT CGTCCAGGGG 1620 

CCTGGCTCCC ACGTGGTTGC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTGAGCCCA 1680 

CTCTCCCGAG GGCGCATCCA AGCGGGGGCC ACTTGAGAAG TGAATAAATG GGGCGGTTTC 1740 

GGAAGCGTCA GTGTTTCCAT GTTATGGATC TCTCTGCGTT TGAATAAAGA CTATCTCTGT 1800 
TGCTCAC 

Seq ID NO: 132 Protein sequence 
Protein Accession #: NP_036577.1 

1 11 21 31 41 51 

| | | | | | 

MENPSPAAAL GKALCALLLA TLGAAGQPLG GESICSARAP AKYSITFTGK WSQTAFPKQY 60 

PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDFAER GEAWALMKEI EAAGEALQSV 120 

HAVFSAPAVP SGTGQTSAEL EVQRRHSLVS FVVRIVPSPD WFVGVDSLDL CDGDRWREQA 180 

ALDLYPYDAG TDSGFTFSSP NFATIPQDTV TEITSSSPSH PANSFYYPRL KALPPIARVT 240 

LVRLRQSPRA FIPPAPVLPS RDNEIVDSAS VPETPLDCEV SLWSSWGLCG GHCGRLGTKS 300 
RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_019894 
Coding sequence: 1..1314 

1 11 21 31 41 51 

| | | | | | 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

Seq ID NO: 134 Protein sequence 
Protein Accession #: NP_063947.1 

1 11 21 31 41 51 

| | | | | | 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60 

YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 

Seq ID NO: 135 DNA sequence 
Nucleic Acid Accession #: NM_003045 
Coding sequence: 148..2037 

1 11 21 31 41 51 

| | | | | | 

CGATCCTGCC GGAGCCCCGC CGCCGCCGGC TTGGATTCTG AAACCTTCCT TGTATCCCTC 60 

CTGAGACATC TTTGCTGCAA GATCGAGGCT GTCCTCTGGT GAGAAGGTGG TGAGGCTTCC 120 

CGTCATATTC CAGCTCTGAA CAGCAACATG GGGTGCAAAG TCCTGCTCAA CATTGGGCAG 180 

CAGATGCTGC GGCGGAAGGT GGTGGACTGT AGCCGGGAGG AGACGCGGCT GTCTCGCTGC 240 

CTGAACACTT TTGATCTGGT GGCCCTCGGG GTGGGCAGCA CACTGGGTGC TGGTGTCTAC 300 

GTCCTGGCTG GAGCTGTGGC CCGTGAGAAT GCAGGCCCTG CCATTGTCAT CTCCTTCCTG 360 

ATCGCTGCGC TGGCCTCAGT GCTGGCTGGC CTGTGCTATG GCGAGTTTGG TGCTCGGGTC 420 

CCCAAGACGG GCTCAGCTTA CCTCTACAGC TATGTCACCG TTGGAGAGCT CTGGGCCTTC 480 

ATCACCGGCT GGAACTTAAT CCTCTCCTAC ATCATCGGTA CTTCAAGCGT AGCGAGGGCC 540 

TGGAGCGCCA CCTTCGACGA GCTGATAGGC AGACCCATCG GGGAGTTCTC ACGGACACAC 600 

ATGACTCTGA ACGCCCCCGG CGTGCTGGCT GAAAACCCCG ACATATTCGC AGTGATCATA 660 

ATTCTCATCT TGACAGGACT TTTAACTCTT GGTGTGAAAG AGTCGGCCAT GGTCAACAAA 720 

ATATTCACTT GTATTAACGT CCTGGTCCTG GGCTTCATAA TGGTGTCAGG ATTTGTGAAA 780 

GGATCGGTTA AAAACTGGCA GCTCACGGAG GAGGATTTTG GGAACACATC AGGCCGTCTC 840 

TGTTTGAACA ATGACACAAA AGAAGGGAAG CCCGGTGTTG GTGGATTCAT GCCCTTCGGG 900 

TTCTCTGGTG TCCTGTCGGG GGCAGCGACT TGCTTCTATG CCTTCGTGGG CTTTGACTGC 960 

ATCGCCACCA CAGGTGAAGA GGTGAAGAAC CCACAGAAGG CCATCCCCGT GGGGATCGTG 1020 

GCGTCCCTCT TGATCTGCTT CATCGCCTAC TTTGGGGTGT CGGCTGCCCT CACGCTCATG 1080 

ATGCCCTACT TCTGCCTGGA CAATAACAGC CCCCTGCCCG ACGCCTTTAA GCACGTGGGC 1140 

TGGGAAGGTG CCAAGTACGC AGTGGCCGTG GGCTCCCTCT GCGCTCTTTC CGCCAGTCTT 1200 

CTAGGTTCCA TGTTTCCCAT GCCTCGGGTT ATCTATGCCA TGGCTGAGGA TGGACTGCTA 1260 

TTTAAATTCT TAGCCAACGT CAATGATAGG ACCAAAACAC CAATAATCGC CACATTAGCC 1320 

TCGGGTGCCG TTGCTGCTGT GATGGCCTTC CTCTTTGACC TGAAGGACTT GGTGGACCTC 1380 

ATGTCCATTG GCACTCTCCT GGCTTACTCG TTGGTGGCTG CCTGTGTGTT GGTCTTACGG 1440 

TACCAGCCAG AGCAGCCTAA CCTGGTATAC CAGATGGCCA GTACTTCCGA CGAGTTAGAT 1500 

CCAGCAGACC AAAATGAATT GGCAAGCACC AATGATTCCC AGCTGGGGTT TTTACCAGAG 1560 

GCAGAGATGT TCTCTTTGAA AACCATACTC TCACCCAAAA ACATGGAGCC TTCCAAAATC 1620 

TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATCAT CACCTTCTGC 1680 

ATTGTGACCG TGCTTGGAAG GGAGGCTCTC ACCAAAGGGG CGCTGTGGGC AGTCTTTCTG 1740 

CTCGCAGGGT CTGCCCTCCT CTGTGCCGTG GTCACGGGCG TCATCTGGAG GCAGCCCGAG 1800 

AGCAAGACCA AGCTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT CCTGAGCATC 1860 

TTCGTGAACG TCTATCTCAT GATGCAGCTG GACCAGGGCA CCTGGGTCCG GTTTGCTGTG 1920 

TGGATGCTGA TAGGCTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAG CGAGGAGGCG 1980 

TCCCTGGATG CCGACCAAGC AAGGACTCCT GACGGCAACT TGGACCAGTG CAAGTGACGC 2040 

ACAGCCCCGC CCCCCGGAGG TGGCAGCAGC CCCGAGGGAC GCCCCCAGAG GACCGGGAGG 2100 
CACCCCACCC TCCCCACCAG TGCAACAGAA ACCACCTGCG TCCACACCCT CACTGCA 

Seq ID NO: 136 Protein sequence 
Protein Accession #: NM_003045 

1 11 21 31 41 51 

| | | | | | 

MGCKVLLNIG QQMLRRKVVD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE 60 
NAGPAIVISF LIAALASVLA GLCYGEFGAR VPKTGSAYLY SYVTVGELWA FITGWNLILS 120 
YIIGTSSVAR AWSATFDELI GRPIGEFSRT HMTLNAPGVL AENPDIFAVI IILILTGLLT 180 
LGVKESAMVN KIFTCINVLV LGFIMVSGFV KGSVKNWQLT EEDFGNTSGR LCLNNDTKEG 240 
KPGVGGFMPF GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIPVGI VASLLICFIA 300 
YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GWEGAKYAVA VGSLCALSAS LLGSMFPMPR 360 
VIYAMAEDGL LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD LMSIGTLLAY 420 
SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADQNELAS TNDSQLGFLP EAEMFSLKTI 480 
LSPKNMEPSK ISGLIVNIST SLIAVLIITF CIVTVLGREA LTKGALWAVF LLAGSALLCA 540 
VVTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ LDQGTWVRFA VWMLIGFIIY 600 
FGYGLWHSEE ASLDADQART PDGNLDQCK 

Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: NM_032044.1 

Coding sequence: 182..658 

1 11 21 31 41 51 

| | | | | | 

AAGATATAAA AGCTCCAGAA ACGTTGACTG GGACCACTGG AGACACTGAA GAAGGCAGGG 60 
GCCCTTAGAG TCTTGGTTGC CAAACAGATT TGCAGATCAA GGAGAACCCA GGAGTTTCAA 120 
AGAAGCGCTA GTAAGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCAGG GTAGGAGGAA 180 
GATGGCTTCC AGAAGCATGC GGCTGCTCCT ATTGCTGAGC TGCCTGGCCA AAACAGGAGT 240 
CCTGGGTGAT ATCATCATGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 
TTGCTATGGT TACTTCAGGA AGCTGAGGAA CTGGTCTGAT GCCGAGCTCG AGTGTCAGTC 360 
TTACGGAAAC GGAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA GCACCATAGC 420 
AGAGTACATA AGTGGCTATC AGAGAAGCCA GCCGATATGG ATTGGCCTGC ACGACCCACA 480 
GAAGAGGCAG CAGTGGCAGT GGATTGATGG GGCCATGTAT CTGTACAGAT CCTGGTCTGG 540 
CAAGTCCATG GGTGGGAACA AGCACTGTGC TGAGATGAGC TCCAATAACA ACTTTTTAAC 600 
TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTCCTG TGCAAGTACC GACCATAGAG 660 
CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 720 
GCTAAATCTG CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 780 
ACTACACTGG CTTTTTTAGG CTTAGAGACA GAAACTTTAG CATTGGCCCA GTAGTGGCTT 840 
CTAGCTCTAA ATGTTTGCCC CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 900 
GTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 960 
CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAAGCTTCT 1020 
AGACCCCTTC AGCTTCTACA CCCTTCTGCC CTCTCTCCAT TGCCTGCACC CCACCCCAGC 1080 
CACTCAACTC CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 1140 
TAGGTTGATG TGGGCCATAC ATTCCTTTAA TAAACCATTG TGTACATAAG AAAAAAAAAA 

Seq ID NO: 138 Protein sequence 
Protein Accession #: NP_114433.1 

1 11 21 31 41 51 

| | | | | | 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 60 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWIDG AMYLYRSWSG 120 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 

Seq ID NO: 139 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52..3042 

1 11 21 31 41 51 

| | | | | | 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 
GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 
GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 
CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 
AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 
TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 
GTGGCAGGGA AACCAATGTA CCTGCACATC [illegible] TAGACGGCGT GGACATGCGG 420 
GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480 
TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 
AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 
CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 
GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 
TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 
TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 
CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 
AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 
GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 
GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080 
GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 
GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 
TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 
GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 
AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 
GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 
TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500 
ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 
ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 
AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680 
CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 
GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 
GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860 
CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 
GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 
TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 
TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 
CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160 
CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 
ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 
GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 
TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 
GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 
CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 
GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 
CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 
TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 
GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 
CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 
GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 
AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 
GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 
CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCOGGT 3060 
GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 
GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 
GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 
CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 
TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 
GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 
GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 
GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 
GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 
GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 
AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 
AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 
GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 
GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 
CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 
TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 
CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 
ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 
ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 
TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 
AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 
TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 
AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 
TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 
GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 
CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 
CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 
TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 
GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 
GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 
ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 
ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 
CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 
ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 
CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 
CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 
AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280 
CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 
AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 
GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 
ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 
TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 
ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 
GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 5700 
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 140 Protein sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

| | | | | | 

MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60 

LEDNVQSWKP GDTLVIASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNFFDFDTFG GHIKFALGFK AAHLEGTELK 180 

HMGQQLVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDVVG 240 

YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

DCNAVSTFWM ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 360 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHFI 420 

AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN SLFVGESGNV 480 

GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHTSA 540 

LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT RSTHYQQYQP VVTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAVVDVPM PKKLFGSQLK 840 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVVVIDGNQ GRVVSHTSFR 900 

NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960 

GFKGSFRPIW VTLDTEDHKA KIFQVVPIPV VKKKKL 

Seq ID NO: 141 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 261..2861 

1 11 21 31 41 51 

| | | | | | 

GAGCTAGCGC TCAAGCAGAG CCCAGCGOGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGOG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC GGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATGT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 

TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300 

AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360 

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420 

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540 

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840 

TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960 

GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 

TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 4200 

AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320 

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380 

TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 142 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 

KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 

IQATTMDGVN LSTEVVYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420 

STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540 

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDVVGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 720 

LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780 

IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 840 

FSPPCRCLLK LVTTGSPFAH VSLAHS 

Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_006017 
Coding sequence: 38..2635 



1 11 21 31 41 51 

| | | | | | 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 60 

GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 120 

TCCTAAGGCT TGGAATTATG AATTGCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 240 

GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 300 

TGATTATGAC AAGCCAGAAA CTGTAATCTT AGGTCTAAAG ATTGTCTACT ATGAAGCAGG 360 

GATTATTCTA TGCTGTGTCC TGGGGCTGCT GTTTATTATT CTGATGCCTC TGGTGGGGTA 420 

TTTCTTTTGT ATGTGTCGTT GCTGTAACAA ATGTGGTGGA GAAATGCACC AGCGACAGAA 480 

GGAAAATGGG CCCTTCCTGA GGAAATGCTT TGCAATCTCC CTGTTGGTGA TTTGTATAAT 540 

AATAAGCATT GGCATCTTCT ATGGTTTTGT GGCAAATCAC CAGGTAAGAA CCCGGATCAA 600 

AAGGAGTCGG AAACTGGCAG ATAGCAATTT CAAGGACTTG CGAACTCTCT TGAATGAAAC 660 

TCCAGAGCAA ATCAAATATA TATTGGCCCA GTACAACACT ACCAAGGACA AGGCGTTCAC 720 

AGATCTGAAC AGTATCAATT CAGTGCTAGG AGGCGGAATT CTTGACCGAC TGAGACCCAA 780

CATCATCCCT GTTCTTGATG AGATTAAGTC CATGGCAACA GCGATCAAGG AGACCAAAGA 840 

GGCGTTGGAG AACATGAACA GCACCTTGAA GAGCTTGCAC CAACAAAGTA CACAGCTTAG 900 

CAGCAGTCTG ACCAGCGTGA AAACTAGCCT GCGGTCATCT CTCAATGACC CTCTGTGCTT 960 

GGTGCATCCA TCAAGTGAAA CCTGCAACAG CATCAGATTG TCTCTAAGCC AGCTGAATAG 1020 

CAACCCTGAA CTGAGGCAGC TTCCACCCGT GGATGCAGAA CTTGACAACG TTAATAACGT 1080

TCTTAGGACA GATTTGGATG GCCTGGTCCA ACAGGGCTAT CAATCCCTTA ATGATATACC 1140 

TGACAGAGTA CAACGCCAAA CCACGACTGT CGTAGCAGGT ATCAAAAGGG TCTTGAATTC 1200 

CATTGGTTCA GATATCGACA ATGTAACTCA GCGTCTTCCT ATTCAGGATA TACTCTCAGC 1260 

ATTCTCTGTT TATGTTAATA ACACTGAAAG TTACATCCAC AGAAATTTAC CTACATTGGA 1320 

AGAGTATGAT TCATACTGGT GGCTGGGTGG CCTGGTCATC TGCTCTCTGC TGACCCTCAT 1380

CGTGATTTTT TACTACCTGG GCTTACTGTG TGGCGTGTGC GGCTATGACA GGCATGCCAC 1440 

CCCGACCACC CGAGGCTGTG TCTCCAACAC CGGAGGCGTC TTCCTCATGG TTGGAGTTGG 1500 

ATTAAGTTTC CTCTTTTGCT GGATATTGAT GATCATTGTG GTTCTTACCT TTGTCTTTGG 1560 

TGCAAATGTG GAAAAACTGA TCTGTGAACC TTACACGAGC AAGGAATTAT TCCGGGTTTT 1620

GGATACACCC TACTTACTAA ATGAAGACTG GGAATACTAT CTCTCTGGGA AGCTATTTAA 1680 

TAAATCAAAA ATGAAGCTCA CTTTTGAACA AGTTTACAGT GACTGCAAAA AAAATAGAGG 1740 

CACTTACGGC ACTCTTCACC TGCAGAACAG CTTCAATATC AGTGAACATC TCAACATTAA 1800 

TGAGCATACT GGAAGCATAA GCAGTGAATT GGAAAGTCTG AAGGTAAATC TTAATATCTT 1860 

TCTGTTGGGT GGAGCAGGAA GAAAAAACCT TCAGGATTTT GCTGCTTGTG GAATAGACAG 1920 

AATGAATTAT GACAGCTACT TGGCTCAGAC TGGTAAATCC CCCGCAGGAG TGAATCTTTT 1980

ATCATTTGCA TATGATCTAG AAGCAAAAGC AAACAGTTTG CCCCCAGGAA ATTTGAGGAA 2040 

CTCCCTGAAA AGAGATGCAC AAACTATTAA AACAATTCAC CAGCAACGAG TCCTTCCTAT 2100 

AGAACAATCA CTGAGCACTC TATACCAAAG CGTCAAGATA CTTCAACGCA CAGGGAATGG 2160 

ATTGTTGGAG AGAGTAACTA GGATTCTAGC TTCTCTGGAT TTTGCTCAGA ACTTCATCAC 2220 

AAACAATACT TCCTCTGTTA TTATTGAGGA AACTAAGAAG TATGGGAGAA CAATAATAGG 2280

ATATTTTGAA CATTATCTGC AGTGGATCGA GTTCTCTATC AGTGAGAAAG TGGCATCGTG 2340

CAAACCTGTG GCCACCGCTC TAGATACTGC TGTTGATGTC TTTCTGTGTA GCTACATTAT 2400

CGACCCCTTG AATTTGTTTT GGTTTGGCAT AGGAAAAGCT ACTGTATTTT TACTTCCGGC 2460

TCTAATTTTT GCGGTAAAAC TGGCTAAGTA CTATCGTCGA ATGGATTCGG AGGACGTGTA 2520

CGATGATGTT GAAACTATAC CCATGAAAAA TATGGAAAAT GGTAATAATG GTTATCATAA 2580

AGATCATGTA TATGGTATTC ACAATCCTGT TATGACAAGC CCATCACAAC ATTGATAGCT 2640 

GATGTTGAAA CTGCTTGAGC ATCAGGATAC TCAAAGTGGA AAGGATCACA GATTTTTGGT 2700 

AGTTTCTGGG TCTACAAGGA CTTTCCAAAT CCAGGAGCAA CGCCAGTGGC AACGTAGTGA 2760

CTCAGGCGGG CACCAAGGCA ACGGCACCAT TGGTCTCTGG GTAGTGCTTT AAGAATGAAC 2820 
ACAATCACGT TATAGTCCAT GGTCCATCAC TATTCAAGGA TGACTCCCTC CCTTCCTGTC 2880
TATTTTTGTT TTTTACTTTT TTACACTGAG TTTCTATTTA GACACTACAA CATATGGGGT 2940
GTTTGTTCCC ATTGGATGCA TTTCTATCAA AACTCTATCA AATGTGATGG CTAGATTCTA 3000
ACATATTGCC ATGTGTGGAG TGTGCTGAAC ACACACCAGT TTACAGGAAA GATGCATTTT 3060
GTGTACAGTA AACGGTGTAT ATACCTTTTG TTACCACAGA GTTTTTTAAA CAAATGAGTA 3120
TTATAGGACT TTCTTCTAAA TGAGCTAAAT AAGTCACCAT TGACTTCTTG GTGCTGTTGA 3180
AAATAATCCA TTTTCACTAA AAGTGTGTGA AACCTACAGC ATATTCTTCA CGCAGAGATT 3240
TTCATCTATT ATACTTTATC AAAGATTGGC CATGTTCCAC TTGGAAATGG CATGCAAAAG 3300
CCATCATAGA GAAACCTGCG TAACTCCATC TGACAAATTC AAAAGAGAGA GAGAGATCTT 3360
GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG AGTTGTTTTA ACAGATGCCA ATTACGGTGT 3420
ACAGTTTAAC AGAGTTTTCT GTTGCATTAG GATAAACATT AATTGGAGTG CAGCTAACAT 3480
GAGTATCATC AGACTAGTAT CAAGTGTTCT AAAATGAAAT ATGAGAAGAT CCTGTCACAA 3540
TTCTTAGATC TGGTGTCCAG CATGGATGAA ACCTTTGAGT TTGGTCCCTA AATTTGCATG 3600
AAAGCACAAG GTAAATATTC ATTTGCTTCA GGAGTTTCAT GTTGGATCTG TCATTATCAA 3660
AAGTGATCAG CAATGAAGAA CTGGTCGGAC AAAATTTAAC GTTGATGTAA TGGAATTCCA 3720
GATGTAGGCA TTCCCCCCAG GTCTTTTCAT GTGCAGATTG CAGTTCTGAT TCATTTGAAT 3780
AAAAAGGAAC TTGG

Seq ID NO: 144 NP Protein sequence
Protein Accession #: NP_006008.1

1 11 21 31 41 51
| | | | | |
MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60
IFLYVVQPRD FPEDTLRKFL QKAYESKIDY DKPETVILGL KIVYYEAGII LCCVLGLLFI 120
ILMPLVGYFF CMCRCCNKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIFYGFVAN 180
HQVRTRIKRS RKLADSNFKD LRTLLNETPE QIKYILAQYN TTKDKAFTDL NSINSVLGGG 240
ILDRLRPNII PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS 300
SLNDPLCLVH PSSETCNSIR LSLSQLKSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG 360
YQSLNDIPDR VQRQTTTVVA GIKRVLNSIG SDIDNVTQRL PIQDILSAFS VYVNNTESYI 420
HRNLPTLEEY DSYWWLGGLV ICSLLTLIVI FYYLGLLCGV CGYDRHATPT TRGCVSNTGG 480
VFLMVGVGLS FLFCWILMII VVLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWEY 540
YLSGKLFNKS KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES 600
LKVNLNIFLL GAAGRKNLQD FAACGIDRMN YDSYLAQTGK SPAGVNLLSF AYDLEAKANS 660
LPPGNLRNSL KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL ERVTRILASL 720
DFAQNFITNN TSSVIIEETK KYGRTIIGYF EHYLQWIEFS ISEKVASCKP VATALDTAVD 780
VFLCSYIIDP LNLFWFGIGK ATVFLLPALI FAVKLAKYYR RMDSEDVYDD VETIPMKNME 840
NGNNGYHKDH VYGIHNPVMT SPSQH

Seq ID NO: 145 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 38..2605

1 11 21 31 41 51
| | | | | |
CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 60
GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 120
TCCTAAGGCT TGGAATTATG AATTGCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180
AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 240
GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 300
TGATTATGAC ATTGTCTACT ATGAAGCAGG GATTATTCTA TGCTGTGTCC TGGGGCTGCT 360
GTTTATTATT CTGATGCCTC TGGTGGGGTA TTTCTTTTGT ATGTGTCGTT GCTGTAACAA 420
ATGTGGTGGA GAAATGCACC AGCGACAGAA GGAAAATGGG CCCTTCCTGA GGAAATGCTT 480
TGCAATCTCC CTGTTGGTGA TTTGTATAAT AATAAGCATT GGCATCTTCT ATGGTTTTGT 540
GGCAAATCAC CAGGTAAGAA CCCGGATCAA AAGGAGTCGG AAACTGGCAG ATAGCAATTT 600
CAAGGACTTG CGAACTCTCT TGAATGAAAC TCCAGAGCAA ATCAAATATA TATTGGCCCA 660
GTACAACACT ACCAAGGACA AGGCGTTCAC AGATCTGAAC AGTATCAATT CAGTGCTAGG 720
AGGCGGAATT CTTGACCGAC TGAGACCCAA CATCATCCCT GTTCTTGATG AGATTAAGTC 780
CATGGCAACA GCGATCAAGG AGACCAAAGA GGCGTTGGAG AACATGAACA GCACCTTGAA 840
GAGCTTGCAC CAACAAAGTA CACAGCTTAG CAGCAGTCTG ACCAGCGTGA AAACTAGCCT 900
GCGGTCATCT CTCAATGACC CTCTGTGCTT GGTGCATCCA TCAAGTGAAA CCTGCAACAG 960
CATCAGATTG TCTCTAAGCC AGCTGAATAG CAACCCTGAA CTGAGGCAGC TTCCACCCGT 1020
GGATGCAGAA CTTGACAACG TTAATAACGT TCTTAGGACA GATTTGGATG GCCTGGTCCA 1080
ACAGGGCTAT CAATCCCTTA ATGATATACC TGACAGAGTA CAACGCCAAA CCACGACTGT 1140
CGTAGCAGGT ATCAAAAGGG TCTTGAATTC CATTGGTTCA GATATCGACA ATGTAACTCA 1200
GCGTCTTCCT ATTCAGGATA TACTCTCAGC ATTCTCTGTT TATGTTAATA ACACTGAAAG 1260
TTACATCCAC AGAAATTTAC CTACATTGGA AGAGTATGAT TCATACTGGT GGCTGGGTGG 1320
CCTGGTCATC TGCTCTCTGC TGACCCTCAT CGTGATTTTT TACTACCTGG GCTTACTGTG 1380
TGGCGTGTGC GGCTATGACA GGCATGCCAC CCCGACCACC CGAGGCTGTG TCTCCAACAC 1440
CGGAGGGGTC TTCCTCATGG TTGGAGTTGG ATTAAGTTTC CTCTTTTGCT GGATATTGAT 1500
GATCATTGTG GTTCTTACCT TTGTCTTTGG TGCAAATGTG GAAAAACTGA TCTGTGAACC 1560
TTACACGAGC AAGGAATTAT TCCGGGTTTT GGATACACCC TACTTACTAA ATGAAGACTG 1620
GGAATACTAT CTCTCTGGGA AGCTATTTAA TAAATCAAAA ATGAAGCTCA CTTTTGAACA 1680
AGTTTACAGT GACTGCAAAA AAAATAGAGG CACTTACGGC ACTCTTCACC TGCAGAACAG 1740
CTTCAATATC AGTGAACATC TCAACATTAA TGAGCATACT GGAAGCATAA GCAGTGAATT 1800
GGAAAGTCTG AAGGTAAATC TTAATATCTT TCTGTTGGGT GCAGCAGGAA GAAAAAACCT 1860
TCAGGATTTT GCTGCTTGTG GAATAGACAG AATGAATTAT GACAGCTACT TGGCTCAGAC 1920
TGGTAAATCC CCCGCAGGAG TGAATCTTTT ATCATTTGCA TATGATCTAG AAGCAAAAGC 1980
AAACAGTTTG CCCCCAGGAA ATTTGAGGAA CTCCCTGAAA AGAGATGCAC AAACTATTAA 2040
AACAATTCAC CAGCAACGAG TCCTTCCTAT AGAACAATCA CTGAGCACTC TATACCAAAG 2100
CGTCAAGATA CTTCAACGCA CAGGGAATGG ATTGTTGGAG AGAGTAACTA GGATTCTAGC 2160
TTCTCTGGAT TTTGCTCAGA ACTTCATCAC AAACAATACT TCCTCTGTTA TTATTGAGGA 2220
AACTAAGAAG TATGGGAGAA CAATAATAGG ATATTTTGAA CATTATCTGC AGTGGATCGA 2280
GTTCTCTATC AGTGAGAAAG TGGCATCGTG CAAACCTGTG GCCACCGCTC TAGATACTGC 2340
TGTTGATGTC TTTCTGTGTA GCTACATTAT CGACCCCTTG AATTTGTTTT GGTTTGGCAT
AGGAAAAGCT ACTGTATTTT TACTTCGGGC TCTAATTTTT GCGGTAAAAC TGGCTAAGTA 
CTATCGTCGA ATGGATTCGG AGGACGTGTA CGATGATGTT GAAACTATAC CCATGAAAAA 
TATGGAAAAT GGTAATAATG GTTATCATAA AGATCATGTA TATGGTATTC ACAATCCTGT 
TATGACAAGC CCATCACAAC ATTGATAGCT GATGTTGAAA CTGCTTGAGC ATCAGGATAC 
TCAAAGTGGA AAGGATCACA GATTTTTGGT AGTTTCTGGG TCTACAAGGA CTTTCCAAAT 
CCAGGAGCAA CGCCAGTGGC AACGTAGTGA CTCAGGCGGG CACCAAGGCA ACGGCACCAT 
TGGTCTCTGG GTAGTGCTTT AAGAATGAAC ACAATCACGT TATAGTCCAT GGTCCATCAC 
TATTCAAGGA TGACTCCCTC CCTTCCTGTC TATTTTTGTT TTTTACTTTT TTACACTGAG 
TTTCTATTTA GACACTACAA CATATGGGGT GTTTGTTCCC ATTGGATGCA TTTCTATCAA 
AACTCTATCA AATGTGATGG CTAGATTCTA ACATATTGCC ATGTGTGGAG TGTGCTGAAC 
ACACACCAGT TTACAGGAAA GATGCATTTT GTGTACAGTA AACGGTGTAT ATACCTTTTG 
TTACCACAGA GTTTTTTAAA CAAATGAGTA TTATAGGACT TTCTTCTAAA TGAGCTAAAT 
AAGTCACCAT TGACTTCTTG GTGCTGTTGA AAATAATCCA TTTTCACTAA AAGTGTGTGA 
AACCTACAGC ATATTCTTCA CGCAGAGATT TTCATCTATT ATACTTTATC AAAGATTGGC 
CATGTTCCAC TTGGAAATGG CATGCAAAAG CCATCATAGA GAAACCTGCG TAACTCCATC 
TGACAAATTC AAAAGAGAGA GAGAGATCTT GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG 
AGTTGTTTTA ACAGATGCCA ATTACGGTGT ACAGTTTAAC AGAGTTTTCT GTTGCATTAG 
GATAAACATT AATTGGAGTG CAGCTAACAT GAGTATCATC AGACTAGTAT CAAGTGTTCT 
AAAATGAAAT ATGAGAAGAT CCTGTCACAA TTCTTAGATC TGGTGTCCAG CATGGATGAA 
ACCTTTGAGT TTGGTCCCTA AATTTGCATG AAAGCACAAG GTAAATATTC ATTTGCTTCA 
GGAGTTTCAT GTTGGATCTG TCATTATCAA AAGTGATCAG CAATGAAGAA CTGGTCGGAC 
AAAATTTAAC GTTGATGTAA TGGAATTCCA GATGTAGGCA TTCCCCCCAG GTCTTTTCAT 
GTGCAGATTG CAGTTCTGAT TCATTTGAAT AAAAAGGAAC TTGG 

Seq ID NO: 146 Protein sequence 
Protein Accession #:

1 11 21 31 41 51 

| | | | | |

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60

IFLYVVQPRD FPEDTLRKFL QKAYESKIDY DIVYYEAGII LCCVLGLLFI ILMPLVGYFF 120

CMCRCCNKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIFYGFVAN HQVRTRIKRS 180 

RKLADSNFKD LRTLLNETPE QIKYILAQYN TTKDKAFTDL NSINSVLGGG ILDRLRPNII 240 

PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS SLNDPLCLVH 300

PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG YQSLNDIPDR 360 

VQRQTTTVVA GIKRVLNSIG SDIDNVTQRL PIQDILSAFS VYVNNTESYI HRNLPTLEEY 420

DSYWWLGGLV ICSLLTLIVI FYYLGLLCGV CGYDRHATPT TRGCVSNTGG VFLMVGVGLS 480 

FLFCWILMII VVLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWEY YLSGKLFNKS 540

KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES LKVNLNIFLL 600

GAAGRKNLQD FAACGIDRMN YDSYLAQTGK SPAGVNLLSF AYDLEAKANS LPPGNLRNSL 660 

KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL ERVTRILASL DFAQNFITNN 720 

TSSVIIEETK KYGRTIIGYF EHYLQWIEFS ISEKVASCKP VATALDTAVD VFLCSYIIDP 780 

LNLFWFGIGK ATVFLLPALI FAVKLAKYYR RMDSEDVYDD VETIPMKNME NGNNGYHKDH 840 
VYGIHNPVMT SPSQH 

Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: BC012089.1

Coding sequence: 236..2806

1 11 21 31 41 51 

| | | | | |

ATGCTCTCAG CTCTCCCGCC GCGGGATGGT GCCTTGAGTG AATGACCCCC TTGGAGAACA 60 

TTCTTCCGCA TCCCTCGCCT CAAGCCAGCC TCAGACAGAA AACTGAAGAT TCAGCAGATC 120 

CAGTGCTTCC TGCTCCTCTT CTGCCCAGGA ACACGCTTGC CTTCCCCAAG GCTTCCAGAA 180 

GCTCTGAGGC AGGAGGCACC AAGTTCTACC TCATGTTTGG AGGATCTTGC TAGCTATGGC 240 

CCTCGTACTC GGCTCCCTGT TGCTGCTGGG GCTGTGCGGG AACTCCTTTT CAGGAGGGCA 300 

GCCTTCATCC ACAGATGCTC CTAAGGCTTG GAATTATGAA TTGCCTGCAA CAAATTATGA 360 

GACCCAAGAC TCCCATAAAG CTGGACCCAT TGGCATTCTC TTTGAACTAG TGCATATCTT 420 

TCTCTATGTG GTACAGCCGC GTGATTTCCC AGAAGATACT TTGAGAAAAT TCTTACAGAA 480 

GGCATATGAA TCCAAAATTG ATTATGACAA GATTGTCTAC TATGAAGCAG GGATTATTCT 540 

ATGCTGTGTC CTGGGGCTGC TGTTTATTAT TCTGATGCCT CTGGTGGGGT ATTTCTTTTG 600 

TATGTGTCGT TGCTGTAACA AATGTGGTGG AGAAATGCAC CAGCGACAGA AGGAAAATGG 660 

GCCCTTCCTG AGGAAATGCT TTGCAATCTC CCTGTTGGTG ATTTGTATAA TAATAAGCAT 720 

TGGCATCTTC TATGGTTTTG TGGCAAATCA CCAGGTAAGA ACCCGGATCA AAAGGAGTCG 780 

GAAACTGGCA GATAGCAATT TCAAGGACTT GCGAACTCTC TTGAATGAAA CTCCAGAGCA 840 

AATCAAATAT ATATTGGCCC AGTACAACAC TACCAAGGAC AAGGCGTTCA CAGATCTGAA 900 

CAGTATCAAT TCAGTGCTAG GAGGCGGAAT TCTTGACCGA CTGAGACCCA ACATCATCCC 960 

TGTTCTTGAT GAGATTAAGT CCATGGCAAC AGCGATCAAG GAGACCAAAG AGGCGTTGGA 1020 

GAACATGAAC AGCACCTTGA AGAGCTTGCA CCAACAAAGT ACACAGCTTA GCAGCAGTCT 1080 

GACCAGCGTG AAAACTAGCC TGCGGTCATC TCTCAATGAC CCTCTGTGCT TGGTGCATCC 1140 

ATCAAGTGAA ACCTGCAACA GCATCAGATT GTCTCTAAGC CAGCTGAATA GCAACCCTGA 1200 

ACTGAGGCAG CTTCCACCCG TGGATGCAGA ACTTGACAAC GTTAATAACG TTCTTAGGAC 1260 

AGATTTGGAT GGCCTGGTCC AACAGGGCTA TCAATCCCTT AATGATATAC CTGACAGAGT 1320 

ACAACGCCAA ACCACGACTG TCGTAGCAGG TATCAAAAGG GTCTTGAATT CCATTGGTTC 1380 

AGATATCGAC AATGTAACTC AGCGTCTTCC TATTCAGGAT ATACTCTCAG CATTCTCTGT 1440 

TTATGTTAAT AACACTGAAA GTTACATCCA CAGAAATTTA CCTACATTGG AAGAGTATGA 1500 

TTCATACTGG TGGCTGGGTG GCCTGGTCAT CTGCTCTCTG CTGACCCTCA TCGTGATTTT 1560 

TTACTACCTG GGCTTACTGT GTGGCGTGTG CGGCTATGAC AGGCATGCCA CCCCGACCAC 1620 

CCGAGGCTGT GTCTCCAACA CCGGAGGCGT CTTCCTCATG GTTGGAGTTG GATTAAGTTT 1680 

CCTCTTTTGC TGGATATTGA TGATCATTGT GGTTCTTACC TTTGTCTTTG GTGCAAATGT 1740 

GGAAAAACTG ATCTGTGAAC CTTACACGAG CAAGGAATTA TTCCGGGTTT TGGATACACC 1800 

CTACTTACTA AATGAAGACT GGGAATACTA TCTCTCTGGG AAGCTATTTA ATAAATCAAA 1860 

AATGAAGCTC ACTTTTGAAC AAGTTTACAG TGACTGCAAA AAAAATAGAG GCACTTACGG 1920 
CACTCTTCAC CTGCAGAACA GCTTCAATAT CAGTGAACAT CTCAACATTA ATGAGCATAC 1980
TGGAAGCATA AGCAGTGAAT TGGAAAGTCT GAAGGTAAAT CTTAATATCT TTCTGTTGGG 2040
TGCAGCAGGA AGAAAAAACC TTCAGGATTT TGCTGCTTGT GGAATAGACA GAATGAATTA 2100
TGACAGCTAC TTGGCTCAGA CTGGTAAATC CCCCGCAGGA GTGAATCTTT TATCATTTGC 2160
ATATGATCTA GAAGCAAAAG CAAACAGTTT GCCCCCAGGA AATTTGAGGA ACTCCCTGAA 2220
AAGAGATGCA CAAACTATTA AAACAATTCA CCAGCAACGA GTCCTTCCTA TAGAACAATC 2280
ACTGAGCACT CTATACCAAA GCGTCAAGAT ACTTCAACGC ACAGGGAATG GATTGTTGGA 2340
GAGAGTAACT AGGATTCTAG CTTCTCTGGA TTTTGCTCAG AACTTCATCA CAAACAATAC 2400
TTCCTCTGTT ATTATTGAGG AAACTAAGAA GTATGGGAGA ACAATAATAG GATATTTTGA 2460
ACATTATCTG CAGTGGATCG AGTTCTCTAT CAGTGAGAAA GTGGCATCGT GCAAACCTGT 2520
GGCCACCGCT CTAGATACTG CTGTTGATGT CTTTCTGTGT AGCTACATTA TCGACCCCTT 2580
GAATTTGTTT TGGTTTGGCA TAGGAAAAGC TACTGTATTT TTACTTCCGG CTCTAATTTT 2640
TGCGGTAAAA CTGGCTAAGT ACTATCGTCG AATGGATTCG GAGGACGTGT ACGATGATGT 2700
TGAAACTATA CCCATGAAAA ATATGGAAAA TGGTAATAAT GGTTATCATA AAGATCATGT 2760
ATATGGTATT CACAATCCTG TTATGACAAG CCCATCACAA CATTGATAGC TGATGTTGAA 2820
ACTGCTTGAG CATCAGGATA CTCAAAGTGG AAAGGATCAC AGATTTTTGG TAGTTTCTGG 2880
GTCTACAAGG ACTTTCCAAA TCCAGGAGCA ACGCCAGTGG CAAGGTAGTG ACTCAGGCGG 2940
GCACCAAGGC AACGGCACCA TTGGTCTCTG GGTAGTGCTT TAAGAATGAA CACAATCACG 3000
TTATAGTCCA TGGTCCATCA CTATTCAAGG ATGACTCCCT CCCTTCCTGT CTATTTTTGT 3060
TTTTTACTTT TTTACACTGA GTTTCTATTT AGACACTACA ACATATGGGG TGTTTGTTCC 3120
CATTGGATGC ATTTCTATCA AAACTCTATC AAATGTGATG GCTAGATTCT AACATATTGC 3180
CATGTGTGGA GTGTGCTGAA CACACACCAG TTTACAGGAA AGATGCATTT TGTGTACAGT 3240
AAACGGTGTA TATACCTTTT GTTACCACAG AGTTTTTTAA ACAAATGAGT ATTATAGGAC 3300
TTTCTTCTAA ATGAGCTAAA TAAGTCACCA TTGACTTCTT GGTGCTGTTG AAAATAATCC 3360
ATTTTCACTA AAAGTGTGTG AAACCTACAG CATATTCTTC ACGCAGAGAT TTTCATCTAT 3420
TATACTTTAT CAAAGATTGG CCATGTTCCA CTTGGAAATG GCATGCAAAA GCCATCATAG 3480
AGAAACCTGC GTAACTCCAT CTGACAAATT CAAAAGAGAG AGAGAGATCT TGAGAGAGAA 3540
ATGCTGTTCG TTCAAAAGTG GAGTTGTTTT AACAGATGCC AATTACGGTG TACAGTTTAA 3600
CAGAGTTTTC TGTTGCATTA GGATAAACAT TAATTGGAGT GCAGCTAACA TGAGTATCAT 3660
CAGACTAGTA TCAAGTGTTC TAAAATGAAA TATGAGAAGA TCCTGTCACA ATTCTTAGAT 3720
CTGGTGTCCA GCATGGATGA AACCTTTGAG TTTGGTCCCT AAATTTGCAT GAAAGCACAA 3780
GGTAAATATT CATTTGCTTC AGGAGTTTCA TGTTGGATCT GTCATTATCA AAAGTGATCA 3840
GCAATGAAGA ACTGGTCGGA CAAAATTTAA CGTTGATGTA ATGGAATTCC AGATGTAGGC 3900
ATTCCCCCCA GGTCTTTTCA TGTGCAGATT GCAGTTCTGA TTCATTTGAA TAAAAAGGAA 3960
CTTGGAAAAA AAAAAAAAAA AAA



Seq ID NO: 148 Protein sequence
Protein Accession #: AAH12089.1

1 11 21 31 41 51
| | | | | |
MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60
IFLYVVQPRD FPEDTLRKFL QKAYESKIDY DKIVYYEAGI ILCCVLGLLF IILMPLVGYF 120
FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 180
SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 240
IPVLDEIKSM ATAIKETKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 300
HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 360
RVQRQTTTVV AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 420
YDSYWWLGGL VICSLLTLIV IFYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 480
SFLFCWILMI IVVLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 540
SKMKLTFEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 600
LGAAGRKNLQ DFAACGIDRM NYDSYLAQTG KSPAGVNLLS FAYDLEAKAN SLPPGNLRNS 660
LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 720
NTSSVIIEET KKYGRTIIGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 780
PLNLFWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM ENGNNGYHKD 840
HVYGIHNPVM TSPSQH

Seq ID NO: 149 DNA sequence

Nucleic Acid Accession #: NM_033049.1

Coding sequence: 28..1566

1 11 21 31 41 51
| | | | | |
CCACGCGTCC GAGCAAGAAC AGCTAAAATG AAAGCCATCA TTCATCTTAC TCTTCTTGCT 60
CTCCTTTCTG TAAACACAGC CACCAACCAA GGCAACTCAG CTGATGCTGT AACAACCACA 120
GAAACTGCGA CTAGTGGTCC TACAGTAGCT GCAGCTGATA CCACTGAAAC TAATTTCCCT 180
GAAACTGCTA GCACCACAGC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC 240
CCCATAATTA GTACACATAG TTCCTCCACA ATTCCTACAC CTGCTCCCCC CATAATTAGT 300
ACACATAGTT CCTCCACAAT TCCTATACCT ACTGCTGCAG ACAGTGAGTC AACCACAAAT 360
GTAAATTCAT TAGCTACCTC TGACATAATC ACCGCTTCAT CTCCAAATGA TGGATTAATC 420
ACAATGGTTC CTTCTGAAAC ACAAAGTAAC AATGAAATGT CCCCCACCAC AGAAGACAAT 480
CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 540
GGTCCCAGCA ATCCTTGCCA AGATGATCCC TGTGCAGATA ATTCGTTATG TGTTAAGCTG 600
CATAATACAA GTTTTTGCCT GTGTTTAGAA GGGTATTACT ACAACTCTTC TACATGTAAG 660
AAAGGAAAGG TATTCCCTGG GAAGATTTCA GTGACAGTAT GAGAAACATT TGACCCAGAA 720
GAGAAACATT CCATGGCCTA TCAAGACTTG CATAGTGAAA TTACTAGCTT GTTTAAAGAT 780
GTATTTGGCA CATCTGTTTA TGGACAGACT GTAATTCTTA CTGTAAGCAC ATCTCTGTCA 840
CCAAGATCTG AAATGCGTGC TGATGACAAG TTTGTTAATG TAACAATAGT AACAATTTTG 900
GCAGAAACCA CAAGTGACAA TGAGAAGACT GTGACTGAGA AAATTAATAA AGCAATTAGA 960
AGTAGCTCAA GCAACTTTCT AAACTATGAT TTGACCCTTC GGTGTGATTA TTATGGCTGT 1020
AACCAGACTG CGGATGACTG CCTCAATGGT TTAGCATGCG ATTGCAAATC TGACCTGCAA 1080
AGGCCTAACC CACAGAGCCC TTTCTGCGTT GCTTCCAGTC TCAAGTGTCC TGATGCCTGC 1140
AACGCACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCCCTGA GTGTGCGTGC 1200
GTGCCCGGCT ACCAGGAAGA TGCTAATGGG AACTGCCAAA AGTGTGCATT TGGCTACAGT 1260



WO 03/042661 



GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGGC 1320 

ATTGTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACG 1380 

AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 1440 

ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC 1500 

TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAGACACA GCAGCATGCC CCGCCCTGAC 1560 

TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 1620 

TTAGAGTGTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 1680

TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AGTTGTGAAT GTTTGCAACG 1740 

AATCCAGCTC ACTTGCTAAA TAAGAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGCG 1800 

CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG 1860 

TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 1920 

CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 1980 

GGAGACATAC GCATCAACTG TGGAGGTCCG AGGGGATGAG AAGGGATACC CACCACCTTT 2040 

CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA 2100 

ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA 2160 

AGACTGGAAA GAGACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC 2220 

TTTTTGATGC TAATTAGAAG GCCTGGACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 2280 

TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 2340 

AGAGAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC 2400 

CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCAC TTCTCTAATC ATGCCCTCCC 2460 

GAAGACTGGC TGGGAGAAGG TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTAGGTCAG 2520 

TTTGAAATTG GAGACAAACT GTCTGGCAAA GGGTGCGAGA GGGAGCTTGT GCTCAGGAGT 2580

CCAGCCGTCC AGCCTCGGGG TGTAGGTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 2640 

CTGGTGACAG AGGCTCAGCT GTGGCCACCA ACACACAACC ACACACACAC AACCACACAC 2700 

ACAAATGGGG GCAACCACAT CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 2760 

ATTTCTAATG CCTTGTCCTC TTAAAAGTTA TTTTATTTGT TATTATTATT TGTTCTTGAC 2820 

TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA AAAAAAAAAA 2880 
AAAAAAA 

Seq ID NO: 150 Protein sequence 
Protein Accession #: NP_149038.1 

1 11 21 31 41 51 

| | | | | |

MKAIIHLTLL ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADNSLCVK LHNTSFCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 

LHSEITSLFK DVFGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI LAETTSDNEK 300

TVTEKINKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360

VASSLKCPDA CNAQHKQCLI KKSGGAPECA CVPGYQEDAN GNCQKCAFGY SGLDCKDKFQ 420

LILTIVGTIA GIVILSMIIA LIVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 480 
GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 

Seq ID NO: 151 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 87..914

1 11 21 31 41 51 

| | | | | |

AGTACGCGGG GCCGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTGCGCGAGC 60

AGCTGCTCCA ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG 120 

CTGGTAGCTC GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG 180 

AGTTCTACAA CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT 240 

TGTTTCGTCA GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG 300 

TTGTAGTGGG AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC 360 

TTGATGAACT TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA 420 

GGTATCTACC AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC 480 

TGTCTGCGGT TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA 540 

TGACCCTGGG AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA 600 

TGCGTGTGTT TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT 660 

GGATCAGTGA CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT 720

GCATGTGGCA CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT 780 

TTGATGCTGC CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA 840 

AATGGGCCTT CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG 900 

TCAAGATCAC GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT 960 

GGGCTTCCTT TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT 1020 

TTTCAAAAAT CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAGGGATT 1080 

TAAACTTTGT CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTCCCA TTTGTCTTTC 1140 

AGTATGTTAA TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA 1200 

TCTTTTCTTT AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT 1260 

AATGTGTCTG TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC 1320 

CAAAGATGGA AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA 1380 

GATGCAGTGA CTTGTTGGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA 1440 

TCCAGCCCTG TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT 1500 

TCCCAGGCGC CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC 1560 

CCTCTGACTC TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA 1620 

AAACCAAACA GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC 1680 

ATTTCTCAAC TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG 1740 

GAGAGATGGA GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA 1800 

CCTACACTCC ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA 1860 

TTTTTAAATA CTGACATCAC TTCCTCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT 1920 

CTACAGAGAA GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG 1980 

ACCTCTGTAA GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT 2040 






PCT/US02/36810 



TGGAGCAAAC ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC 2100
TATTCTGGAA ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA 2160
TCATGTTCTT CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC 2220
TCACCTTTCT CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG 2280
CAGCAGCGCA AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT 2340
GAATCTTCTC TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG 2400
GGAATTATCA ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC 2460
CATTGTTGGG AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC 2520
CTACATATGT AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA 2580
GTTAGAATTT GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA 2640
AAAATGAATG ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA 2700
GAACGCTGTT TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT 2760
TATTAAATTT ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA 2820
AATGCAAATT AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA 2880
TATGTTGGTT TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG 2940
GTCTTGCTCT ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT 3000
GTCTCCCAGG CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG 3060
CGTGCCTCCA CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT 3120
GCCCAGGCTG GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT 3180
GCTGGGAATA GGCATGAGCC GTCACGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG 3240
AATTAAATAT TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA 3300
GAACTGCTGT TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT 3360
CATTCTTATT TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA 3420
TAAATGCTCC AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT TTTTTTTTTT 3480
TTTTTTTTTT TTTTTAATGT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA 3540
TGAGCTGAGA AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAACTTCACT 3600
TTTGAACAGA AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT 3660
ACCCATCTTA TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC 3720
TCTTTAAAGA AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT 3780
TCTATTGCCC AGGTATAATC CTTGTTAATA TTTTGGTTTG GTCTCCTCTT TTTTTCCCCC 3840
AATATAGTTG TAAATAAATG ATGTCTTTCA GAGTTGACAT TTATCCTGTA GCTTGAATGG 3900
CATGTAAATG CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGTTA 3960
GCTATATGCT TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA 4020
ATCAAGCCTA ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG 4080
CCTCTGCACT TGGCTTGGTG AGTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT 4140
TTTGATATGT TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA 4200
GAATCTCCCT CC



Seq ID NO: 152 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | |

MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LFFILPPICM CLFRQYATCF 60
NSGIYLIWTL LVVVGIGSVY FHATLSFLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 120
DRGRFKVVVS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVFKLGLF 180
SGLWWTLALF CWISDRAFCE LLSSFNFPYL HCMWHILICL AAYLGCVCFA YFDAASEIPE 240
QGPVIKFWPN EKWAFIGVPY VSLLCANKKS SVKIT

Seq ID NO: 153 DNA sequence

Nucleic Acid Accession #: NM_001432.1

Coding sequence: 167..676



TCACTTGCCT 
TGACAGCCGC 
TCCGCAGCCG 
GAGGATGGAG 
TCTTCTACAG 
TGATAACTGC 
AACAAAGTGT 
GGACATGAGT 
CTTCTTTTTA 
TATTATTTTG 
AAATCGAAAA 
GTTGCCGCAA 
TAATATTAAT 
CACTGTATTT 
TTGCTAATGT 
AAGTAATTTC 
AGTGCTTAGA 
TTCTGTAAGC 
AATCAGTGAT 
ATATATTAAA 
ACACTACACA 
AGCTATAATT 
ACAGGTACTA 
TTCTGTAATC 
ATATTACAAC 
CTTGAAGCCT 
TTTGCTGAAT 
AATATATGAA 
TTTATTCAAG 



11 
I 

GATATTTCCA 
TCTCCAGCCA 
CCCTCCGCCA
ATGCTCTGTG 
GCAGTCCTCA 
ACAGCTTTAG 
AGCTCTGACA 
CAAAACTACT 
ACCGTCCACC 
TTTCTTATCA 
AGTAAAGAAC 
GTCTGAATGG 
ATTCCATTTT 
TAATGTACTT 
ATAATGTGCA 
CTGAGCTAAA 
AGTAAGCAAT 
CTATATATAT 
AATTGGTTTG 
ACAATAATGC 
TTTGAAACAA 
CTGTTATATG 
GTCACTCATC 
TTTTATTTAA 
ATTTATGTGA 
AAATTTGTGC 
TCTAACATTA 
GACAATAATT 
GAAGTTTTAA 



21 
I 

GTGTCAGAGG 
CTGCCGCGAG 
AGCCCCAGCG 
CCGGCAGGGT 
GTACAACTGT 
TTCAGACAGA 
TGAATGGCTA 
GCAGGTGTGA 
AACCTTTAAG 
CAGTCGTCGG 
CAAAGAAGGA 
CGCCATCAAA 
ATTAATAATA 
GAAAAATGTT 
GAAAATATTT 
TGCTTCATTG 
TCCCAGGTCA 
AGTCAAATCG 
ACTGTACCAT 
AATTAGAATT 
GCTAACCCTG 
AGGGGCAGTG 
TACCAGATTC 
GTAGTGGGCA 
GGTAATTATT 
TTTTTAAGAA 
AATCACAGCC 
CTACATGTTG 
CTTTAATACA 



31 



GACACAGCCA 
CCCGTCTGCT 
CCCGCTCCCA 
CCCTGCGCTG 
GATTCCATCA 
AGACAATCCA 
TTGTTTGCAT 
AGTGGGTTAT 
CAAAGAGTAT 
TTCCACATAT 
ATATGAGAGA 
CTTATGGGCA 
TTTATGTTGG 
TTTATTTTTG 
AATATCAAAA 
AAAGCTTCAA 
TAGCTCAAGA 
ATTTAGTAAG 
GGTTTGATAT 
TGGGAGAAGC 
GGGAGTCTAT 
GACAGTTCCC 
TGCCTATGTA 
TTTCATAGCT 
GCTCAACAGA 
TATTTTTAGA 
CAAAATTTGA 
TCTTAAGATG 
GCTCAGTAAA 



41 

i 

ACGTGGGGTC 
CCCGCCCTGC 
TCGCCGATGA 
CTGCTCTGCC 
TGTATCCCAG 
CGTGTGGCTC 
GGACAGTGCA 
ACTGGTGTCC 
GTGGCTTTGA 
TATTTCTGCA 
GTTACCTCAG 
GGGATAACAG 
GTCAAGTGTT 
TTTTATTTTT 
GAAAATTGAT 
AGTTTATATG 
ATTGTTAGCA 
TATGTTTTTT 
GTAGTTGGCA 
AAATATAGGT 
GGTCTCTTCA 
TATGCCAACT 
AAATGAATTG 
TCACAATGTT 
CAATTAGAAA 
CTATTTCTTT 
TGGACTAATT 
GAAATACAGT 



51 
I 

CCTTCTAGGC 
CCGTGCACTC 
CCGCGGGGAG 
TGGGTTTCCA 
GAGAGTCCAG 
AAGTGTCAAT 
TCTATCTGGT 
GATGTGAACA 
CCGTGATTCT 
GATGGTACAG 
GGGATCCAGA 
TGTGCCTGGT 
AGGTCAATAA 
GACAGACTAT 
ATTTTTATAC 
CCTGGTGCAC 
AATGACAGAT 
ATGTTCCTCA 
CCATGGTATC 
CCTGTGTTAA 
CTCAGGTCTC 
CACGACTCCT 
AAAAACAATT 
CCTTTTTTGT 
AAAGTCCACA 
TTATAGGGGC 
ATTATTTTAA 
TATTTCATCT 
TAGAATGTAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680
1740 



AGTTATGTAT TTAAAGTTGT ATCTTGACAC
TGGTGTATTT TTCCAAATGA AAAATCTCAA 
CACACCTTCC TGTGGAGGCT GAGATGAAAA 
TTTTTGGAAG AGACAAAGAT TTCTTCTGCA 
AGGAGTATTT TTGGGCTATT GCATAAGGAG 
GGAGGCTCCT TCATCGAATG CTAAACCTTT 
TCAGGGAGGA TCTGTTCTTC CTCTACGTTT 
ATAATAAGCC ATGGCTGACC TCTGGAGCAC 
CATGCATTAT ATACCCTGGT GCAATCACAC 
TTACTATTAG GAAAATAAAC AGACAAAAAC 
TATATATATT CATATACAAA CATGTATGTA 
GTCATTTGGT GCTCTGCTAA CCATTTATAT 
AATTAGATCT AAACAGTTAT TTCTGTTTCC 
AGAACACATT TTAAATACTC TACTTACAGT 
GATTCAACCA ACCAAGGACG GAAAATGCTT 
ATTATAACAA CTATTTACTT TTTTTTTTTT 
CAGGTTGGAG TGCAGTGGCA CGATCTCGGC
GATCCTCCTG CCTCAGCCTC CTGAGCAGCT 
CTAATTTTTG TATTTTTAGT AGAGGCGGGG 
CTCCTAACCT TGAGATCCAC CCTCCACAGC 
CCACCGCACG TAGCATTTAC ATTAGGTATT 
AGGAGGATGT GAATAGGTTA TATGCAAGCA 
TCTGTGCCCG ATTTTAGTAT GTGCAGGGGG 
CAAGGTACAA CTGTATTTAT TAACGCTTAC 
TGATCTTGGC TGTTTCAAAA AAATCTATTG 
ATTTCATTTA CAAAAGATTT ATTGTAAGCC 
GCATGTCAAT TCATAAAAAC AAGTCATTTT 
AAGCTAATCC CTAAAATAGT TAGATCTTTG 
CACATTACTT TTTATGGGTG AAAATAAGAC 
GAGTAAACAT AAAAACTGAT TTGCTCTCAG 
TTAGTTCACA CTTAGTCTAA CTCCCCCATT 
AAGGGGCTAT GCATATTCAA TGTATTGAGA 
AATTTATGGT CTTCAACCAA GGCCACATAA 
ATATTAAGTT CTATAACTAG TACTCAAGGT 
TAACCTTAAA ATTGATATTA TATTAAACAT 
TGAATATTTT TTGCTCTAAT CTCTCTGCCG 
TGGTATGACT ACGTCTTAAG TCAGATTTTT 
ACCACCAGGT ATCAAATCAA CTTTTATGCA 
GTTAAATTCA GTAATGCAGT TTTTAAAAAC 
CCAATATCCA TTCTGTAGAC TTTTGAAAAA 
TGACCGTTAA AAAATTCTTG TTCATATGGG 
TATTTCTGGT GTATATTTTA ATGTTTTTAA 
TCAAATTTGA TGATGTTAAA TGTAATATAA 
TTGCACTTTT TAAGTTTGAA GAGCCATTTT 
AACATAAAGT TGTATTGCAT GCAATTTAAA 
TTACTGAATT GTATCAATTT GTTTGTGTTC 
GATATTGAAG GAGAAAATAG ATAATTTACA 
GGGAATTGAA AAAAATTGAA ATAAATAAAA 
CACTGAC 

Seq ID NO: 154 Protein sequence 
Protein Accession #: NP_001423.1 

21 



AGGAAATGGG 
TTGAAAGCTT 
CTAGGGCTCA 
CTCTGAGCCC 
CCACTGCTGC 
GAGTAGAGTC 
ATCCTGGCAT 
CAGGTGCCAG 
GACTGTCATC 
AAGTAAATAT 
TACATGACCT 
AAAACTTAAA 
TATTTAATAT 
TGGCCCTCTG 
AAAAAATAAT 
CTTTTTGAGA 
TCACTGCAAC 
GGGACTACAG 
TTTCACCATG 
CTCCCAAACT 
ACAAGTAATG 
CTATGCCCTT 
GCGATCTGGG 
TAGATGTGAG 
ACTTTTCAAT 
TCTCAATCTT 
TGTATTTTTC 
TAAATGCATA 
AAAAATAATA 
CTGATGTGTC 
TTACAGATTT 
ACCAAAGCAA 
TAACCCAGTT 
TTAACCTTAA 
ACATAATACA 
AAAGTCAAAG 
ATTTATGAGT 
GCAAATATAT 
CTGTATCTGA 
AAAGTTTTTA 
AGAAGGGGGA 
AAAGAGTAAT 
TGTATTTTCT 
GGTAAACGGT 
GTAACTTATT 
AATATCAGCT 
AGATATTATT 
ATGCATTGAA 



AAAAAACTTA AAAATTAATA 
TTAAAATGTA GAAACTTAAA 
TTTTCCTGAC ATTTGTTTAT 
ATAGGTCTCA GAGAGTTAAT 
CACCACTTTT GGATTTTATG 
TCCCTGGATC ACATACCAGG 
GTGCTAGGGT AAACGAAGGC 
GACTTGTCTC CATGTGTATC 
TAAAGTCCTG GCCCTGGCCC 
ATATGGTCCT ATACATATTG 
TAATGGATCA TAGAATTGCA 
AACAAGAGAA AAGAAAAATC 
AGCTGAAGTC AAAATATGTA 
TGGTTAGTTC CACATCTGTG 
ACAACAACAA CAAAAAATAC 
TGGAGTCTOG CTCTGTTGCC 
CTCACCTCCC GGGTTCAAGA 
GCGCATGCCA CCATGCCCAG 
TTGGCCAGGA TGGTCTCAAT 
GCTGGGATTA CAGGCGTGAG 
TAAAGATGAT TTAAGTATAC 
TTATATAAGT GACTTGAACA 
AATCAGTCCC CTGTGGATAC 
GAGAGTCTGA ATATTTTCAG 
AAATCAGCTG CAATCCATTT 
GGTTTTTCAG TTGATCTTAA 
ATCTTTAAGA ATGCTTAAAA 
TTAAATAATA AAGTATGACC 
GTTTTAGTGA GGATGGTGCT 
CTGTACACAG TGGGAAGATT 
CTCACTATAT ATATTTCTAG 
CCACAAATGC ATAAATGCAT 
AACTTACTCT TTAACCAGGA 
AATTAAGATT TCCTTAACCT 
ATGTAACTCC ACTGTTCTCC 
TGATGGGAGA ATTGGTATAC 
CTTTGAGACT AAATTCAATC 
GATTCTAGTG TCTGACTTTT 
CCCACTTTGT AATTTTTGCT 
ATTTGATGCC CAATATATTC 
GTAATGACTT GTACAAACAG 
TTCATTTAAA TATCTGTTAT 
TTTTATTTTG CACTCTGTAA 
TTTTATTAAA GATGCTATGG 
TGACTATGAA TATTATCGGA 
TTGATAATTG TGTACCTTAA 
AATTTTTATT TATTTTTCTT 
CATCTTGCAT TCAAAATCTT 



51 



CGACTCCTTA 
GTCCCTCCCC 
ACCCACACGG 
CAGGAGCTGT 
GAGGACGGCC 
GTCCGCCAGA 
ACCGACTCCG 
GCGGCCGCCG 
GGGCTGTACA 
GAGGCGCAGG 
ACGTCCAGGC 
GAGAACGGOG 
AAGTGGCAGG 
AAGGAGCGGG 
CGGGAACTCT 
TTCGTGGACG 
CCCATGAGAC 
TGCCGGGTCT 
TGCGATGAGT 



11 
I 

GAGCATGGCA 
TCAGCGCCGA 
TGGACTCGCT 
TCCACGTGGA 
ATACCCTCTT 
GCCTCGTGCT 
GCTGCTGCCT 
AGACTGACAG 
AGGTCAATGA 
TGGTCAGGGT 
CGGCGCTGGA 
TGGTCCAGAT 
ACCTGGAGGT 
GCTTCTGGTA 
ACGCCAACGT 
AAGTCTTCAA 
GGAAGAGCGG 
GCGCCTGCCA 
GCGACATGGC 



21 
I 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGCCTG 
CGACTACGAG 
CCCCCACAGC 
GGGCCAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GACGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 
GGTGCTGGGG 
GATTGAGCGG 
GCCGTCCTGC 
CCTGTGCGGG 
CTTCCACATC 



31 
I 

GTGCTGGTAA 
ATCCAGGTTC 
ACCAAGGTGG 
CAGAGGCTGT 
GTCCGCCTGA 
ACCAAGGAGC 
GAGTCAGACA 
GATGAGGACA 
GCTCGGGACA 
GCCCCCTCCC 
ATTTACCACG 
GACX3TCCGAG 
GTCATGCTCA 
ATCTCCAGGA 
GATGATTCTC 
CCGGGTGAAG 
AAGCACTGCA 
GGCCGGCAGG 
TACTGCCTGG 



41 

I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGOC 
TGAAATACGA 
CGCGCGCCCG 
ACTACAACCC 
AGCGCGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGT 
ACCCCGACAA 
ACCCGCCCCT 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 



1 11 21 31 41 

I I I I I I 

MTAGRRMEML CAGRVPALLL CLGFHLLQAV LSTTVIPSCI PGESSDNCTA LVQTEDNPRV 
AQVSITKCSS DMNGYCLHGQ CIYLVDMSQN YCRCEVGYTG VRCEHFFLTV EQPLSKEYVA 
LTVILIILFL ITVVGSTYYF CRWYRNRKSK EPKKEYERVT SGDPELPQV 

Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession #: NM_013282.2 

Coding sequence: 85.. 2466 



60 
120 



51 

I 

GGTTTTTGCT 
CGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCTCC 
GGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 1200 

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260 

TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320 

GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380 

CGAGTCCAGG TCAGOGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500 

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560 

GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740 

AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 1800 

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920 

GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCGGGACAT CCAAGAAAAC CAAGGTGGAG 2100 

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520 

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880 

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940 

TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 3180 

TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 

Seq ID NO: 156 Protein sequence 
Protein Accession #: NP_037414.2 



1 
I 

MWIQVRTMDG 
YEVRLNDTIQ 
PADEDMWDET 
DVIYHVKYDD 
AEISRKRETR 
SCKHCKDDVN 
PECRNDASEV 
IPGIPVGTMW 
SGGRDLSGNK 
GGKNSKYAPA 
LGLTMQYPEG 



TPQCICCQEL 



11 
I 

RQTHTVDSLS 
LLVRQSLVLP 
ELGLYKVNEY 
YPENGWQMN 
TARELYANW 
RLCRVCACHL 
VLAGERLRES 
RFRVQVSESG 
RTAEQSCDQK 
EGNRYDGIYK 
YLEALANRER 
VEPYSLTAQQ 
VFRPITTVCQ 



21 

I 

RLTKVEELRR 
HSTKERDSEL 
VDARDTNMGA 
SRDVRARART 
LGDDSLKDCR 
CGGRQDPDKQ 
KKKAKMASAT 
VHRPHVAGIH 
LTNTNRALAL 
WKYWPEKGK 



31 
I 

KIQELPBVEP 
SDTDSGCCLG 
WFEAQWRVT 
IIKWQDLEVG 
IIPVDEVFKI 
LMCDECDMAF 



SSLIREDKSN 
HNVCKDCLDR 



GRSNDGAYSL 
NCFAPINDQE 
SGFLVWRYLL 
EQQEGGFASP 
AKLWNEVLAS 
SFRAQVFSCP 



41 
I 

GLQRLPYRGK 
QSESDKSSTH 
RKAPSRDEPC 
QWMLNYNPD 
ERPGEGSPMV 
HIYCLDPPLS 
MACVGRTKEC 
VLAGGYEDDV 
GAEAKDWRSG 
RRDDDEPGPW 
RTGKGKWKRK 
LKDRPASGSP 
ACRYDLGRSY 



51 
I 

QMEDGHTLFD 
GEAAAETDSR 
SSTSRPALEE 
NPKERGFWYD 
DNPMRRKSGP 
SVPSEDEWYC 
TIVPSNHYGP 
DHGNFFTYTG 
KPVRWRNVK 
TKEGKDRIKK 
SAGGGPSRAG 
FQLFLSKVEE 
AMQVNQPLQT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: NM_000756. 

Coding sequence: 186.. 776 



AGAAACTCAG 
GAGAGTGGGA 
TCTCTGCAGA 
CTAACATGCG 
GCCCGCCATG 
AGCACCCTCA 
CGCAGGCTCG 
TCAACAAGAG 
GCAGCCGCCC 
TGCTGCTGCC 
ATGCCCTCGG 
TCTCCCTGGA 
AGTTAGCACA 



11 
I 

AGACCAAGTC 
ACAGTAAAGA 
GAGGCGGCAG 
GCTGCCGCTG 
CAGGGCGCTC 
GCCCTTGGAT 
GCCGGTCCTG 
CCCGGCCGCT 
TTCGCCGGAA 
TCGGCGCTCG 
CGGCCACCAG 
TCTCACCTTC 
GCAAGCTCAC 



21 
I 

CATTGAGAGA 
GAAAGGAAGA 
CACCCGGCTC 
CTTGTGTCCG 
CTGAGCCGCG 
TTCTTCCAGC 
CTCCGCATGG 
CCCCTTTCGC 
CAGGCGACOG 
CTCGACAGCC 
GAGGCACCGG 
CACCTCCTCC 
AGCAACAGGA 



31 
I 

CTGAGGGGAA 
CAACCTCCAG 
ACCTGCGAAG 
CGGGAGTCCT 
GGCCGGTCCC 
CGCCGCCGCA 
GAGAGGAGTA 
CCGCCTCCTC 
CCAACTTTTT 
CCGCGGCTCT 
AGAGAGAAAG 
GGGAAGTCTT 
AACTCATGGA 



41 

I 

AGAGAGGAGA 
AGAAAGCCCC 
CGCCTGGGAA 
GCTGGTGGCT 
GGGAGCTCGG 
GTCCGAGCAG 
CTTCCTCCGC 
GCTCCTCGCC 

CGCGGAGCGC 
GCGGTCCGAG 
GGAAATGGCC 
GATTATTGGG 



51 
I 

GAAAGAAAAA 
CGGAGACGTC 
GCGAGTGCCC 
CTCCTGCCCT 
CAGGCGCCGC 
CCCCAGCAGC 
CTGGGGAACC 
GGAGGCAGCG 
CTGCAGCAGC 
GGCGCTAGGA 
GAGCCTCCCA 
AGGGCCGAGC 
AAATAAAACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 900 

GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 1020 

TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 1080 

AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140 

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 1200 

ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 

Seq ID NO: 158 Protein sequence 
Protein Accession #: NP_000747.1 

1 11 21 31 41 51 

| | | | | | 

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFPQPP PQSEQPQQPQ 60 

ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 120 

LPRRSLDSPA AIAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180 
AQQAHSNRKL MEIIGK 

Seq ID NO: 159 DNA sequence 

Nucleic Acid Accession #: NM_001200.1 

Coding sequence: 325.. 1514 

1 11 21 31 41 51 

I I I I I | 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120 

TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 

GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 300 

CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 360 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 420 

TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 

TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540 

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720 

TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840 

TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320 

TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380 

ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 

Seq ID NO: 160 Protein sequence 
Protein Accession #: NP_001191.1 

1 11 21 31 41 51 

| | | | | | 

MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL SEFELRLLSM 60 

FGLKQRPTPS RDAVVPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SFHHEESLEE 120 

LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR INIYEIIKPA 180 
TANSKFPVTR LLDT 

Seq ID NO: 161 DNA sequence 

Nucleic Acid Accession #: NM_001999.2 

Coding sequence: 1..8736 

1 11 21 31 41 51 

I I I I I I 

ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCAOGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600 

TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 
TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 
TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 
AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 
AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 
CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 
CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380 
CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 
GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 
GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 
AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 
AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 
AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 
TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 
ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 
ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 
TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 
TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 
ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 
GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 
CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 
GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 
GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 
GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 
GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 
GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640 
AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 
ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 
GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 
CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 
TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 
TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 
CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 
ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 
AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 
GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 
GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 
AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 
TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 
TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 
TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 
CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 
AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 
ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 
GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 
GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 
GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900 
ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 
ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 
GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 
GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 
AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 
AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 
GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 
CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 
GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 
GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 
TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 
TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 
AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 
AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 
CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 
AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 
CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 
TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 
GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 
GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 
CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 
TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340 
CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 
CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 
GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 
AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 
CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 
TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700 
GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 
GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 
CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 
GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 
GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 
AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 
GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 
ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 
GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 
CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300 
AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 
ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 
CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 
GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 
TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 
TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 
CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 
GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 
CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 
GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 
TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 
ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 
TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 
AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 
CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 
CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 
TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 
GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 
AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 
ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 
TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 
GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 
AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 
TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 
TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 
GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 
CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 
GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980 
AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 
TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 
TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 
CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 
GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 
TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 
CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 
CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 
AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 
GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 
AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640 
GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 
GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 
TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 
AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 
CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 
GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 
TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 9060 
CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 
GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 
AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 
CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 
CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 
TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 
CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 
CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 
GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 
TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 
ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 
GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 
TGTAGTTATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 
ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 
CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 
CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

Seq ID NO: 162 Protein sequence 
Protein Accession #: NP_001990.1 

1 11 21 31 41 51 

| | | | | | 

MGRRRRLCLQ LYFLWLGCVV LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 
PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120 
DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 
ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 240 
TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN CINTVGSFEC 300 
RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 360 
RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE EYRRLCMDGL 420 



PMGGIPGSAG 
GPIITGLTIL 
NPCTNGDCVN 
CNAGFELTTD 
CQTPGICMNG 
AVTKSECCCA 
GICENLRGSY 
VFRTETETCE 
IQDSRCEVNI 
FPGVCPNGRC 
MDACCCAVGA 
GMCTYGKCRN 
CFEGYESGFM 
NECSLSDNLC 
GSYECSCSEG 
TCIDVNECDL 
ASCLNIPGSF 
GDGFTCSDVD 
VSGTCNNLPG 
FQLNPTGVGC 
PPVNSTEYYT 
YLSEDTRICE 
RSYNGTTCEN 
HTGKAVDIDE 
QRNADCINSP 
GFKASQDQTM 
GQVCRNGRCF 
ICPPGYEVKS 
SFCFTNFENG 
VPSLHDTRED 
PCGNGTCTNV 
GYALREDQKM 
CRTKPGICEN 
RNLVTKSECC 
NGQCINTMGS 
VLQEDGKTCK 
CGGKGICQNT 
GYIQHYQWNQ 
SKNPCNYGCS 
CYECKINGYP 
RPAIQPLNNH 
ELKKLEESNE 



SRPGGTGGNG 
NQTIDICKHH 
TPGSYYCKCH 
GKNCVDHDEC 
HCINSEGSFR 
NPDYGFGEPC 
RCNCNSGYEP 
DINECESNPC 
NGATLKSECC 
VNSKGSFHCE 
AWGTECEECP 
TIGSFKCRCN 
MMKNCMDIDG 
RNGKCVNMIG 
YALMPDGRSC 
NSNICMFGEC 
KCSCREGWIG 
ECAENINLCE 
MFHCICDDGY 
VDNRVGNCYL 
LCPGGEGFRP 
DIDECFAHPG 
ELPFNVTKRM 
CKEIPGICAN 
GSYRCECAAG 
CMDVDECERH 
NEIGSFKCLC 
ENCIDINECD 
KCSVPKAFNT 
VNECLESPGI 
IGSFECNCNE 
CKDLDECAEG 
GRCVNIIGSY 
CDGGRGWGHQ 
FRCFCKVGYT 
DLDECQTKQH 
PGSFSCECQR 



FAPSGNGNGY 
ANLCLNGRCI 
AGFQRTPTKQ 
TTTNMCLNGM 
CDCPPGLAVG 
QPCPAKNSAE 
DASGRNCIDI 
VNGACRNNLG 
ATLGAAWGSP 
CPEGLTLDGT 
KPGTKEYETL 
SGFALDMEER 
CERNPLLCRG 
TYQCSCNPGY 
ADIDECENNP 
ENTKGSFICH 
NGIKCIDLDE 
NGQCLNVPGA 
ELDRTGGNCT 
KFGPRGDGSL 
NPITIILEDI 
VCGPGTCYNT 
CCCTYNVGKA 
GVCINQIGSF 
FKLSPNGACV 
PCGNGTCKNT 
NEGYELTPDG 
EDPNICLFGS 
TKAKCCCSKM 
CSNGQCINTD 
GFEPGPMMNC 
LHDCESRGMM 



GPGGTGFIPI 
PTVSSYRCEC 
ACIDIDECIQ 
CINEDGSFKC 
MDGRVCVDTH 
FHGLCSSGVG 
DECLVNRLLC 
SFNCECSPGS 
CERCELDTAC 
GRVCLDIRME 
CPRGAGFANR 
NCTDIDECRI 
GTCVNTEGSF 
QATPDRQGCT 
DICDGGQCTN 
CQLGYSVKKG 
CSNGTHQCSI 
YRCECEMGFT 
DIDECADPIN 
SCNTEIGVGV 
DECQELPGLC 
LGNYTCICPP 
GNKPCEPCPT 
RCECPTGFSY 
DRNECLEIPN 
VGSYNCLCYP 
KNCIDTNECV 
CTNTPGGFQC 



PGGNGFSPGV 
NMGYKQDANG 
NGVLCKNGRC 
ICKPGFVLAP 
MRSTCYGGIK 
ITVDGRDINE 
DNGLCRNTPG 
KLSSTGLICI 
PRGLARIKGV 
QCYLKWDEDE 
GDVLTGRPFY 
SPDLCGSGIC 
QCDCPLGHEL 
DIDECMIMNG 
IPGEYRCLCY 
TTGCTDVDEC 
NAQCVNTPGS 



NTEGGYLCGC 
KKDSRQKRSI 
IRYVISQGND 
DDYLLGELGE 



CELCPLPGTA 
TDISGTSCID 
NCQFLCVNTL 
GFSLDATGLN 
NACGSASCYN 
PPGYYRVGQG 
HEPDPTAVEQ 
DSVFRIHQRN 
ALRMRLQIQL 



GSFRCECPMG 
EDINECAQNP 
CKNLIGTFMC 
SSSGTECLDN 
QYKKICPHGP 
LDECSQSPKP 
GGFTCKCPPG 
CEDVDECDGN 
TLGSYKCACP 
HCVSGMGFNK 
ISLESVDMDS 
GLSYLHTAKK 
Y 



CVNGLCVNTP 
SRSSCCCSLG 
QGGNCINTFG 
EYMQVNGGHN 
PGTADFKTIC 
NDLLLVCEDI 
VCSHGLCVDL 
GFELTHNNDC 
ALPGSCSPGT 
LCPPGFVLSD 
LCPKDDEVAF 
YNLDYTGVRC 
LLCALRCMNT 
ICPPGMARRP 
RQGLCFAEVL 
GYTTDGRDID 
CNYICKNTEG 
FTQHHTACID 
HRCQHGCQNI 
SGFSFDQFSS 
GQYLSLDTEV 
PVNMKFNLSH 
KLMPGTYTLE 



GGAGVGAGGQ 
DCIDVDECTS 
VNSDGSFQCI 
NGRYCTDVDE 
KGVCVRPFPG 
CALDPDICAN 
SYSCTCPPGY 
DSLKGTCWLN 
TCEDVNECEV 
CIHPVPGKFR 
KDINECKAFP 
VNTPGSFECE 
SPSREDCVDI 
GCDTQCTNSE 
DGFMASMDMK 
EIGAHNCDMH 
YRCACSEGFT 
IDECSFQNIC 
GRYECNCPPD 
KAWGNPCETC 
SFQCECPQGY 
CMDMRKSFCY 
GNIPGFTFDI 



QGSYQCICHN 
LDIDECSSFF 
CQNLEGSFRC 
NGRRCFDTRQ 
QDLCPYGHGT 
VDTDECSIGN 
FGSYECTCPI 
DGEGCVDENE 
QTICQMASSS 
ECKVMPNLCT 
SYQCSCPRGY 
NNECGSQPLL 
LGGYRCGCPQ 
ACHDVNECSS 



LGSKEHILEL 
ITSIPLYKKK 



Seq ID NO: 163 DNA sequence 

Nucleic Acid Accession #: NM_013372.1 

Coding sequence: 63.. 617 



1 
I 

GCGGCCGCAC 
GTATGAGCCG 
CGGCTGCTGA 
AGCACAATGA 
GGGGCCAAGG 
CCCTGCATGT 
AGCAGACCAT 
GCCAGTGCAA 
CCTGCTCCTT 
AACTACAGCC 
CCATCGATTT 
AGGAAGTCCC 
AACCCCCAGC 
ATGGGTGCCT 
CCCTATTTTG 
CTCACATCTA 
GGGGACCAGA 
GACCTGTTTT 
CTTTCCTCCT 
ATCTCTTGTT 
TTTTGTGAAG 
TGGAGTGAGA 
GACATTGCAG 
TTTTCCTAGT 
ATTAACTTTG 
ACCACTCCTA 
CCCTCAGGTG 
AAACCKCAGA 
TCCATTCCAC 
GCCTCTGCTG 
TTTTAGCAAG 
RAGAGAAGAC 
GGTGTTAATA 
AGGATCTGAG 
CTACTGGTTG 



11 
I 

TCAGCGCCAC 
CACAGCCTAC 
AGGGAAAAAG 
CTCAGAGCAG 
GCGGGGCACT 
GACGGAGCGC 
CCACGAGGAA 
CTCTTTCTAC 
CTGCAAGCCC 
ACCTACCAAG 
GGATTAAGCC 
AGACCTAAAA 
TGCCTCCTGG 
GTGGGTGTTT 
TAAACATATC 
AAGGGGCGGG 
ATCTCCTTTC 
AGTGCTGCAT 
CCTCCTCACA 
TGCCAAGGTT 
ACCCTCCAGA 
AAGGGAGGGT 
AAGCTTGAAA 
ATTTAACAGA 
GCCGTTGCAA 
TGTTCGGACC 
GAAAAGAGAG 
GGCTGAAATT 
TATTTCCCAT 
AGTGTACCTG 
ARATATTKTG 
GACGAGAGTA 
CCTGGTAGAA 
GGGACCCTGT 
GATGGACATA 



21 

I 

GCGTCGAAAG 
ACGGTGGGAG 
AAAGGGTCCC 
ACTCAGTCGC 
GCCATGCCCG 
AAATACCTGA 
GGCTGCAACA 
ATCCCCAGGC 
AAGAAATTCA 
AAGAAGAGAG 
AAATCCAGGT 
CAACCAGATT 
CAGGAGCCTG 
TTAGACACCA 
TGCTTTAATG 
GCCGTGGTCT 
GGAATGAATG 
TCGACATGGA 
ATCCATCTCT 
CCTAAATTAA 
CTCTGGGAGA 
GGAGGGTGAG 
GGCCAATACC 
ACCCAAGTGA 
TCTGCTCAAA 
CAAGCAAGTT 
GTAGTTTAGA 
CCTAATACCT 
AATGCTTCTG 
ACAGTAAGTC 
GGGGTCTTTT 
AGGAAATAAA 
ATGTAAGGGA 
TAGGAGAGCA 
ACTATTGTAA 



31 
I 

CGCAGGCCCC 
CCCTGCTTCT 
AAGGTGCCAT 
CCCAGCAGCC 
GGGAGGAGGT 
AGCGAGACTG 
GTCGCACCAT 
ACATCCGGAA 
CTACCATGAT 
TCACACGTGT 
GCACCCAGCA 
CTTACTTGGC 
CTTGTGCGTA 
GAGAAAACAC 
GGGATGTACC 
GGTTCTGACT 
TTCATGGAAG 
AAAGTCCTTT 
TCTTAAGTTG 
TTCACTTAAC 
GGCTGGTGTG 
GCCAAATCAG 
AGAACACAGG 
ACAGAGGAGA 
CCTAACACCA 
AGCTAAACCA 
ACTCTCTGCA 
TTCCTTTATC 
AGAGCCACTA 
TAAAGATGAR 
TGTTTTAACT 
GGGRATTGCC 
TATGACCTCC 
TAGCATCATG 
CTATTCAGTA 



41 
I 

GAGGACCCGC 
CCTCTTGGGG 
CCCCCGGCCA 
TGGCTCCAGG 
GCTGGAGTCC 
GTGCAAAACC 
CATCAACCGC 
GGAGGAAGGT 
GGTCACACTC 
GAAGCAGTGT 
TGTCCTAGGA 
TTAAACCTAG 
GTTGGTGTGC 
AGTCTCTGCT 
AGAAACCCAC 
TTGTGTTTTT 
AGGCTCCTCT 
TAACCTGTGC 
ATAGTGACTA 
CATGATGCAA 
GGCAAGGACA 
GTCCAGCAAA 
CTGATGCTTC 
AATGAGATTG 
AACTGAAAAC 
AACCAACTCC 
TAGGGGTGGG 
GTGGTTATAG 
ACTTGATTGA 
AGAGTTTAGG 
ATTGTCAGGA 
TCTGGCTAGA 
CTTTCTTTAT 
ATGTATTAGC 
TTTACTGGTA 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



51 
I 

CGCACTGACA 
ACCCTGCTGC 
GACAAGGCCC 
AACCGGGGGC 
AGCCAAGAGG 
CAGCCGCTTA 
TTCTGTTACG 
TCCTTTCAGT 
AACTGCCCTG 
CGTTGCATAT 
ATGCAGCCCC 
AGGCCAGAAG 
ATGAGTGTGG 
AGAGAGCACT 
CTCACCCCGG 
GTGCCCTCCT 
GAGGGCAAGA 
TTGCATCCTC 
TGTCAGTCTA 
ATGTTTTTCA 
AGCAGGATAG 
AGTCAGTAGG 
TGAGAAAGTC 
CCAGAAAGTG 
ATAAATACTG 



AATTAATCAA 
TCAGCTCATT 
TAAAGATCCT 
GACTACTCTG 
GATTGGGCTA 
GAGTAAGTTA 
GTGCTCACTG 
TGTTCATCTG 
GGCACTGTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 164 Protein sequence
Protein Accession #: NP_037504.1

1 11 21 31 41 51

I I I I I I

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 120 

QCNSPYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 
IDLD 

Seq ID NO: 165 DNA sequence

Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

I I I I I I

GAATTGCATC GGACAGAGCT TCGCCATGGC CGAGTTGCGC GTGGTTGTGG CACTAACACT 60

GCTACGTTTC CGCCTGAGCG TGGACCGAAC GCGCAAGGTG CGGCGGAAGC CGGAGCTCAT 120
ACTGCGCACG GAGAACGGGC TCTGGCTCAA G



Seq ID NO: 166 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..1650

1 11 21 31 41 51

I I I I I I

ATGCCACCTC TGCCATCCAG AATGTCTCGG GGGCCCTCCT TGGTGACAGG CAGGATGCTG 60 

CCCATCACAG ACCGCCTGCT GCACCTCCTG GGGCTGGAGA AGACGGCGTT CCGCATATAC 120 

GCGGTGTCCA CCCTTCTCCT CTTCCTGCTC TTCTTCCTGT TCCGCCTGCT GCTGCGGTTC 180 

CTGAGGCTCT GCAGGAGCTT CTACATCACC TGCCGCCGGC TGCGCTGCTT CCCCCAGCCT 240 

CCCCGGCGCA ACTGGCTGCT GGGCCACCTG GGCATGTACC TTCCAAATGA GGCGGGCCTT 300 

CAAGATGAGA AGAAGGTACT GGACAACATG CACCATGTAC TCTTGGTATG GATGGGACCT 360 

GTCCTGCCGC TGTTGGTTCT GGTGCACCCT GATTACATCA AACCCCTTTT GGGAGCCTCA 420 

GCTGCCATCG CCCCCAAGGA TGACCTCTTC TATGGCTTCC TAAAACCTTG GCTAGGGGAT 480 

GGGCTGCTGC TCAGCAAAGG TGACAAGTGG AGCCGGCACC GTCGCCTGCT GACACCCGCC 540 

TTCCACTTTG ACATCCTGAA GCCTTACATG AAGATCTTCA ACCAGAGCGC TGACATTATG 600 

CATGCTAAAT GGCGGCATCT GGCAGAGGGC TCAGCGGTCT CCCTTGATAT GTTTGAGCAT 660 

ATCAGCCTCA TGACCCTGGA CAGTCTTCAG AAATGTGTCT TCAGCTACAA CAGCAACTGC 720 

CAAGAGAAGA TGAGTGATTA TATCTCCGCT ATCATTGAAC TGAGCGCTCT GTCTGTCCGG 780 

CGCCAGTATC GCTTGCACCA CTACCTCGAC TTCATTTACT ACCGCTCGGC GGATGGGCGG 840 

AGGTTCCGGC AGGCCTGTGA CATGGTGCAC CACTTCACCA CTGAAGTCAT CCAGGAACGG 900 

CGGCGGGCAC TGCGTCAGCA GGGGGCCGAG GCCTGGCTTA AGGCCAAGCA GGGGAAGACC 960 

TTGGACTTTA TTGATGTGCT GCTCCTGGCC AGGGATGAAG ATGGAAAGGA ACTGTCAGAC 1020 

GAGGATATCC GAGCCGAAGC AGACACCTTC ATGTTTGAGG GTCACGACAC AACATCCAGT 1080 

GGGATCTCTT GGATGCTGTT CAATTTGGCA AAGTATCCGG AATACCAGGA GAAATGCCGA 1140 

GAAGAGATTC AGGAAGTCAT GAAAGGCCGG GAGCTGGAGG AGCTGGAGTG GGACGATCTG 1200 

ACTCAGCTGC CCTTTACAAC TATGTGCATT AAGGAGAGCC TGCGCCAGTA CCCACCTGTC 1260 

ACTCTTGTCT CTCGCCAATG CACGGAGGAC ATCAAGCTCC CAGATGGGCG CATCATCCCC 1320 

AAAGGAATCA TCTGCTTGGT CAGCATCTAT GGAACCCACC ACAACCCCAC AGTGTGGCCT 1380 

GACTCCAAGG TGTACAACCC CTACCGCTTT GACCCGGACA ACCCACAGCA GCGCTCTCCA 1440 
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CTGGCCTATG TGCCCTTCTC TGCAGGACCC AGGAATTGCA TCGGACAGAG CTTCGCCATG 1500 

GCCGAGTTGC GCGTGGTTGT GGCACTAACA CTGCTACGTT TCCGCCTGAG CGTGGACCGA 1560

ACGCGCAAGG TGCGGCGGAA GCCGGAGCTC ATACTGCGCA CGGAGAACGG GCTCTGGCTC 1620 
AAGGTGGAGC CGCTGCCTCC GCGGGCCTGA 

Seq ID NO: 167 Protein sequence 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I

MPPLPSRMSR GPSLVTGRML PITDRLLHLL GLEKTAFRIY AVSTLLLFLL FFLFRLLLRF 60 

LRLCRSFYIT CRRLRCFPQP PRRNWLLGHL GMYLPNEAGL QDEKKVLDNM HHVLLVWMGP 120 

VLPLLVLVHP DYIKPLLGAS AAIAPKDDLF YGFLKPWLGD GLLLSKGDKW SRHRRLLTPA 180

FHFDILKPYM KIFNQSADIM HAKWRHLAEG SAVSLDMFEH ISLMTLDSLQ KCVFSYNSNC 240 

QEKMSDYISA IIELSALSVR RQYRLHHYLD FIYYRSADGR RFRQACDMVH HFTTEVIQER 300 

RRALRQQGAE AWLKAKQGKT LDFIDVLLLA RDEDGKELSD EDIRAEADTF MFEGHDTTSS 360

GISWMLFNLA KYPEYQEKCR EEIQEVMKGR ELEELEWDDL TQLPFTTMCI KESLRQYPPV 420

TLVSRQCTED IKLPDGRIIP KGIICLVSIY GTHHNPTVWP DSKVYNPYRF DPDNPQQRSP 480

LAYVPFSAGP RNCIGQSFAM AELRVVVALT LLRFRLSVDR TRKVRRKPEL ILRTENGLWL 540
KVEPLPPRA 

Seq ID NO: 168 DNA sequence 
Nucleic Acid Accession #: AK058088.1
Coding sequence: 252..1772

1 11 21 31 41 51

I I I I I I

AGGAAACCAA GGCAAGCTCC CCCTGTCAAA GCACCTTGGC CCATAAGAAG AAAAGGGGGA 60 

GCCCCAGATG TGATGAGCGC TTCCAGGCTT CAGGCTCAGA AGGCGCCCCC AGCTCTCCTG 120 

TAACTCAGAG GCCAGTGTGA TGGGAGTTCC TCCACTCAGC ACACTTCCCC TGTAAACACG 180 

CCTGTGGTGG GCAAAAGGGC TTTGGAACGG TTGCTTGTCT TTTCTCTCCT GCGTAATTTC 240 

CACTTTCATT CATGATAATG TCGAACACGC ACAAAGCTCG GCTGGAACGC CGGGTCACTG 300 

GCTCAACCAA CCGGTGGCGT TTGCCCAAAC AGCCTTTCTC TGGGGACCTG CTCTCACTTT 360 

CCCAGATGTG CAAGGCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCAGACAGGT 420

TATGCATTTC ACAAATCCAG AAGTTTTTCT TTGAGAATTT CAAGAACAAG GACATCCAAA 480 

GTGGGGAAGC AGATGTGATT CTCGAGTGCC TGGGCTTCAA ATGGGAGCTC CATCAGCCCC 540 

GGCTTTTTCA GTCTGAGACC TTGGCCAAGC TCTACCTGAA AGCCCTGGCG CAGGGCACCA 600 

CACACCCCCT GAGGGAGCTG GAGGAGCTTC TGCGAGCTCA ATCACCTAAG AAGACCAAAG 660 

AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA CTGGTCACTA 720 

AAGTCGCCTT CGCCACGGCC CTGAAGAACC TCTACATGAG TGAGGTGGAG ATTAACTTGG 780 

AAGACCTACT GGGAGTGCTG GCTTCCGCCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 840 

GGTGCGTGGA TGTGATGATA GCCAGACTCA AGCCAAGCAC CATCAAGAAA TTCTACGAGG 900 

CCGGCTGCAA GTACAAGGAA GAGCAGCTCA CCACCGGCTG CGAGAAGTGG CTGGAAATGA 960 

ACTTGGTTCC TCTAGGGGGG ACGCAGATCC ACCTCCACAA AATCCCACAG GACCTGCTCC 1020 

ACAAAGTGCT GAAGTCCCCC AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 1080 

TGCTTTTGTG GGTCTTCTTG CAACTGAACT ACAAGATTCA GGCAATTCCG ACTTATGAAA 1140 

CCGTGATGAC ATTTTTTAAG AGCTTTCCTG AGAACTGTTG CTTTCTGGAC CGGGACATAG 1200 

GACGGAGCTT GAGGCCGCTC TTCCTCTGCT TGCGTCTGCA CGGCATCACC AAAGGCAAGG 1260 

ATCTGGAGGT GCTGCGGCAC CTTAACTTCT TCCCAGAGTC ATGGCTCGAC CAGGTTACAG 1320 

TCAACCATTA CCACGCACTG GAGAATGGGG GCGACATGGT CCACCTGAAA GATCTTAACA 1380 

CCCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCAGGAGAA TACAACTTAT TCGAAAACGA 1440 

TTGCTCTATA TGGATTCTTC TTTAAGATAA AGGGACTCAA ACATGATACT ACCTCTTATA 1500

GTTTTTACAT GCAGAGAATA AAGCACACAG ACCTGGAATC TCCCTCTGCG GTCTACGAGC 1560 

ACAACCACGT CAGCCTGCGA GCGGCACGCC TGGTGAAGTA TGAGATCAGA GCAGAGGCCC 1620 

TGGTTGACGG CAAGTGGCAG GAGTTCAGGA CAAACCAGAT CAAGCAGAAG TTTGGGTTGA 1680 

CCACGTCATC CTGCAAAAGC CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 1740 

TAAGTTTTGC ATTCATCTTC CCAGCATCTT GACAGTTTCC AGAAGAATCT ATGGGATTTT 1800 
CCCCCCACTG GTCTGCATAA AAGAAAATAA AATGACATAA AAGGGAGC 

Seq ID NO: 169 Protein sequence
Protein Accession #: BAB71658.1

1 11 21 31 41 51

I I I I I I

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA LRNPDRLCIS 60

QIQKFFFENF KNKDIQSGEA DVILECLGFK WELHQPRLFQ SETLAKLYLK ALAQGTTHPL 120 

RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 180 

GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKFYEAGCK YKEEQLTTGC EKWLEMNLVP 240 

LGGTQIHLHK IPQDLLHKVL KSPRLFTFSE FHLLKTMLLW VFLQLNYKIQ AIPTYETVMT 300

FFKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY 360 

HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GFFFKIKGLK HDTTSYSFYM 420 

QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KWQEFRTNQI KQKFGLTTSS 480 
CKSHTLKIQT VGIPIYVSFA FIFPAS

Seq ID NO: 170 DNA sequence 

Nucleic Acid Accession #: NM_007000.1

Coding sequence: 1..777

1 11 21 31 41 51 

I I I I I I 

ATGGCGTCTG CGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GGGCCTGCTA 60 

GTTGTGGGCA ATATCATTAT TCTGCTGTCA GGCCTGTCCC TGTTTGCTGA GACCATATGG 120 

GTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 180 

TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT 240 

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PCTAJS02/36810 



GGTGTGGGTG CCGCACTCTG CCGCCGCCGG TCCATGGTCC TCACGTACCT GGTGCTCATG 300

CTCATCGTCT ACATCTTCGA GTGCGCCTCC TGCATCACGT CCTACACCCA CCGTGACTAC 360

ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATGCTGA CCTTCTACAG CGCGGACACC 420

GACCAGGGCC AGGAGCTGAC CCGCCTCTGG GACCGCGTCA TGATTGAGCA AGAATGCTGT 480

GGCACATCTG GTCCCATGGA CTGGGTGAAC TTCACGTCAG CCTTCCGGGC GGCCACTCCG 540

GAGGTGGTGT TCCCCTGGCC CCCACTGTGC TGTCGCCGGA CGGGAAACTT CATCCCCCTC 600

AACGAGGAGG GCTGCCGCCT GGGGCACATG GACTACCTGT TCACCAAGGG CTGCTTCGAA 660

CACATCGGCC ACGCCATCGA CAGCTACACG TGGGGTATCT CGTGGTTTGG GTTTGCCATC 720
CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCAT GCTCTGA



Seq ID NO: 171 Protein sequence
Protein Accession #: NP_008931.1

1 11 21 31 41 51

I I I I I I

MASAAAAEAE KGSPVVVGLL VVGNIIILLS GLSLFAETIW VTADQYRVYP LMGVSGKDDV 60

FAGAWIAIFC GFSFFMVASF GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHRDY 120

MVSNPSLITK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSGPMDWVN FTSAFRAATP 180

EVVFPWPPLC CRRTGNFIPL NEEGCRLGHM DYLFTKGCFE HIGHAIDSYT WGISWFGFAI 240
LMWTLPVMLI AMYFYTML

Seq ID NO: 172 DNA sequence 

Nucleic Acid Accession #: NM_006760.1 

Coding sequence: 39..593

1 11 21 31 41 51

I I I I I I

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AGCCCAGCAT GGCACCCCTG CTGCCCATCC 60

GGACCTTGCC CTTGATCCTG ATTCTGCTGG CTCTGCTGTC CCCAGGGGCT GCAGACTTCA 120

ACATCTCAAG CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 180

TGCCCCCCTG TCACCTCACA GGAGGCAATG CCACACTGAT GGTCCGGAGA GCCAATGACA 240

GCAAAGTGGT GACGTCCAGC TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 300

GTGTGGTGGA CAGTGGTGCT GGCTTCACAG TCACTCGGCT CAGTGCATAC CAGGTGACAA 360

ACCTCGTGCC AGGAACCAAA TTCTACATTT CCTACCTAGT GAAGAAGGGG ACAGCCACTG 420

AGTCCAGCAG AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 480

TGGGTATGGC CCGCACAGGG GGCATGGTGG TCATCACGGT GCTGCTCTCT GTCGCCATGT 540

TCCTGCTGGT GCTGGGCTTC ATCATTGCCC TGGCACTGGG CTCCCGCAAG TAAGGAGGTC 600

TGCCCGGAGC AGCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CAGCCCACCT 660

GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 720

CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCAGTGCC TCACCTTCCA 780

ACACTCCATT ATTCCTCTCA CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACCA 840

CTTTAAACAC CCCCATAACA ATTCCCCCAT CCTTCAGTGA ACTAAGTCCC TATAATAAAG 900
GCTGAGGCTG CATCTGCCAA AAAAAAAAAA AA



Seq ID NO: 173 Protein sequence
Protein Accession #: NP_006751.1

1 11 21 31 41 51

I I I I I I

MAPLLPIRTL PLILILLALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL 60

MVRRANDSKV VTSSFVVPPC RGRRELVSVV DSGAGFTVTR LSAYQVTNLV PGTKFYISYL 120

VKKGTATESS REIPMSTLPR RNMESIGLGM ARTGGMVVIT VLLSVAMFLL VLGFIIALAL 180
GSRK

Seq ID NO: 174 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2733 



1 11 21 31 41 51

I I I I I I

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60

TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180

GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCACAT 240

GGGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 360

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420

AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480

AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540

ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660

TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780

GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960

GCAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020

ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200

AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440






WO 03/042661 



AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 1620 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 

Seq ID NO: 175 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51

I I I I I I

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEVVGSSSA SELLSAIEHV AEKAKTALHK 240

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNLASVVS ILSNISSLSL 360

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPVVKWITY VGLGISIGSL 600

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFVVP ALAIVAVNFV VVLLVLTKLW RPTVGERLSR 780

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900 
IMLTQFVSNE 

Seq ID NO: 176 DNA sequence

Nucleic Acid Accession #: AB035089.1

Coding sequence: 9845..10219
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GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300

GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TX3ACTTTGTG ATGTGCTGAG 1620 

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 

TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 

TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGOGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
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AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 
GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACIGGAA TGAGTCTGAA GCCAGGTGCT 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 
ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 
ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 
CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 
AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 
TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 
AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 
AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 
GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 
TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 
AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 
AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTjGATAGjGTC 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 
ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 
TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA C3GATCAAAGA 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 
CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC Tt^AAGGAAT ATTTAGATGC 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 
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AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 



Seq ID NO: 177 Protein sequence

1 11 21 31 41 51

I I I I I I

MNSLSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120

DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180

YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240

LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 300

TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAVV VVELSSPSTN 360
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP

Seq ID NO: 178 DNA sequence

Nucleic Acid Accession #: NM_001910.1

Coding sequence: 50..1240



GGAGAGAAGA 
CCTTCTTTTG 
GCCCCTCAGG 
GTTCTGGAAA 
GAGTGCCAAG 
TGGCTCCCCA 
CCCCTCTGTG 
GTCCAGCACA 
GTCCGGGATC 
GTTTGGAGAA 

GATGGCTCAG 
AGGTGGTGCG 
CCTGAATTGG 
GGTGGGAGGC 
TTGCCTCATC 
CCCCGTGGAT 
CTTCACCATT 
CGTGGATGGA 
TGGGCCCCTC 
TGGGAATAAC 
CCTGCCTGTC 
GTTATTTTCC 
ACATGAGAAT 
ACTCCCACCA 
TGATTATGAA 
GCAGAGATCA 
CACACGGCCA 
GTACCTGGAT 
TAACATCCTT 
CTGTTTTTGT 
TTTGAAATGT 
TTGAAATCCC 
AGTTAGATCT 
CTTGTTGCAT 
CTGCTGGGTG 



11 

I 

CTGCTGGTGC 
AGGCATCCGT 
TCCCATAATT 
GAACCCCTCA 
CCACAGAACT 
TACTGCACTA 
TACAGCCAGC 
ATTGGAGCCG 
AGTGTCACAG 
GGATACCCCT 
AACCTGGTGG 
GGGAGCGAGC 
GTCCCAGTCA 
ACTGTTATGT 
ACTGGCCCTT 
GGAGAATATG 
AACGGAGTCC 
ATGCAGTTCT 
TGGATCCTGG 
CGTGTGGGAC 
TGACAGACCT 
AGAGAATGTA 
ACACACACAC 
CCGTCATGAT 
AATCAAAAAT 
TGGTATAATA 
GGCCTGTTTA 
CATTCTGAAG 
AAATATACAA 
AGTTGGATTG 
CTGTAAGTCT 
GAGGTGTCAT 
GGAGGTCTGA 
CCTGTCAGGA 
GGTTTGGAGT 



21 



31 
I 



41 



CAAGGGAGAA GCTGCTGGTC GGACTCACAA 
TCCTGGAGCT GGGAGAGGCC CAAGGATCCC 
CCCTCAAGAA GAAGCTGCGG GCACGGAGCC 
TGGACATGAT CCAGTTCACC GAGTCCTGCT 
TCAACTACTT GGATATGGAA TACTTCGGCA 
TCACTGTCAT CTTCGACACT GGCTCCTCCA 
GCCCAGCCTG CAAGACGCAC AGCAGGTTCC 
CAGGTCAATC TTTCTCCATT CAGTATGGAA 
ACCAAGTCTC TGTGGAAGGA CTAACCGTGG 
AGCCAGGCCA GACCTTTGTG GATGCAGAGT 
CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT 
ACTTGCCGAT GTTTTCTGTC TACATGAGCA 
TGATTTTTGG AGGCTACGAC CACTCCCATT 
CCAAGCAAGC TTACTGGCAG ATTGCACTGG 
TCTGCTCCGA GGGCTGCCAG GCCATTGTGG 
CCGACAAGAT TAAGCAGCTG CAAAACGCCA 
CTGTGGAGTG TGCCAACCTT AACGTCATGC 
CCTATACCCT CAGCCCAACT GCCTACACCC 
GCAGCAGTGG CTTTCAAGGA CTTGACATCC 
GGGATGTCTT CATTCGACAG TTTTACTCAG 
TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT 
TGAATATGTT AGGCTGGGGC ATTCTTTACA 
GCTGTTTCCA GGGTTGCAAC TTGAATTAAG 
ACACACATAT ACACACACAC ACACTTCACA 
GGAGGAATTA CGTTATACAT TCATATTTTG 
TTTCACATTT GATTATGAAA ATCTCCAAAC 
AATCCCTTTG CAACTCCACT CAGCCCTGAC 
TCTACACTGC TGCCCACTCC TCTCTCCAGC 
CAAATTCCGA GCATTACATC ATTTTGTCCA 
TCGGAATTCA AGCATCTCCC ATTGTCCCAC 
TTTGTATTAG GATTCAAGCA AGGCCCATAT 
CTTTCCATCT ACAGAGTTTA GCACATTTGA 
TTGACATGGT TCTCTGAACT TATCTTTCCT 
TTTTGTGGCA AAAATACTTC CTAGGTGGTG 
GGCAGATAAT GCTGGTGCCT CTCTATTGGT 
TCTTGGCTTT AATCATTCAT TACAAAGTTC 



51 
i 

TGAAAACGCT 
TTCACAGGGT 
AGCTCTCTGA 
CAATGGACCA 
CTATCTCCAT 
ACCTCTGGGT 
AGCCTTCCCA 
CCGGGAGCTT 
TTGGCCAGCA 
TTGATGGAAT 
TTGACAACAT 
GTAACCCAGA 
TCTCTGGGAG 
ATAACATCCA 
ACACAGGGAC 
TTGGGGCAGC 
CGGATGTCAC 
TACTGGACTT 
ACCCTCCAGC 
TCTTTGACCG 
TGTGTCTGTG 
CCTACAAAAA 
ACCAAACAGA 
CATACACACC 
TATTGATTTT 
ATATGCACAA 
AACCCATCCA 
TCCACATGCT 
TAAATATTTC 
AAATGTTTGG 
ATTGCATTTA 
ACGTTGCTGG 
ATAAAATGGT 
CTGGGTACTT 
AATGTTAAGA 
AGCATTTT 



Seq ID NO: 179 Protein sequence 
Protein Accession #: NP_001901.1 



1 
I 

MKTLLLLLLV 
SMDQSAKEPL 
QPSQSSTYSQ 
FDGILGLGYP 
FSGSIiNWVPV 
IGAAPVDGEY 
HPPAGPLWIL 



11 
I 

LLELGEAQGS 
INYLDMEYFG 
PGQSFSIQYG 
SLAVGGVTPV 
TKQAYWQIAL 
AVECANLNVM 
GDVFIRQFYS 



21 
I 

LHRVPLRRHP 
TISIGSPPQN 
TGSL5GI IGA 
FDNMMAQNLV 
DNIQVGGTVM 
PDVTFTINGV 
VFDRGNNRVG 



31 
I 

SLKKKLRARS 
FTVIFDTGSS 
DQVSVEGLTV 
DLPMFSVYMS 
FCSEGCQAIV 
PYTLSPTAYT 
LAPAVP 



41 
I 

QLSEFWKSHN 
NLWVPSVYCT 
VGQQFGESVT 
SNPEGGAGSE 
DTGTSLITGP 
LLDFVDGMQF 



Seq ID NO: 180 DNA sequence 

Nucleic Acid Accession ft: NM_018058. 

Coding sequence: 319. .1575 



1 
I 

TACGCGCTGC 
GACGGCCGGG 
TACACCGACA 
GTCAACGTGG 
AGAAAGGGCT 
CCTGATGCCC 
CTCAGAGATG 
GTGGGCCCCA 
AACTTCCTTT 
GTGGACGACC 
AAAGTGGACA 
ACCCATGGGA 
GTCCGCACGG 
AACATTGCCT 
GGAGACCCCC 
ACAGGGGGTG 



11 
I 

GGGACCGGCA 
AGGAGATCTA 
AGTTGTTCAA 
CCCGTGGTGT 
CTGGACGCTA 
TCATTGAAAT 
TGGCTGCTGA 
TCCTCAGCAG 
TCCACAACCG 
CCCACCAGCA 
TCGTCTATGG 
AGGTCCGCTT 
TCATCACCGC 
ACCGCAGCTC 
TCATCGAGGA 
TGGTGACCGA 



21 
I 

GGGGAACGCC 

crrccTCAAC 

GTTCCGCAAT 
GGCCAGCCTC 
CTCTATCTAC 
GGACCCTGAG 
GGCTGGGGTC 
CAGTGCCTCG 
GGGCGATGGC 
TGGGCGAGGT 
CAACTGGAAT 
CCGGGACATC 
CGACTTTGAC 
CTCAGCCAAC 
GCTCAATCCC 
CTTCGACGGA 



31 
I 

ATCGGGGTCA 
ACCAATAATG 
AACCGGTGGG 
TTTGCCGGAC 
ATTGCCAATT 
GCCAGTGACC 
AGCAAATATA 
GATATCTTCT 
ACCTTTGTGG 
GTCGCCCTGG 
GGCCCCCACC 
GCCTCACCCA 
AATGACCAGG 

CGccTcrrcc 

GGCGACGCCT 
GACGGGATGC 



41 

I 

CAGCCTGCGA 
CCTTCTCGGG 
AAGACATCCT 
GCTCTGTGGC 
ACGCCTACGG 

CAGGGGGCCG 
GCGACAATGA 
ACGCTGCGGC 
CTGACTTCAA 
GCCTCTATCT 
AGTTCTCCAT 
AGCTGGAGAT 
GCGTCATCCG 
TGGAGCCTGA 
TGGACCTCAT 



€0 
120 
180 
240' 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



51 
I 

LDMIQFTESC 
SPACKTHSRF 
EPGQTFVDAE 
LIFGGYDHSH 
SDKIKQLQNA 
CSSGFQGLDI 



60 
120 
180 
240 
300 
360 



51 
I 

CATCGACGGG 
GGTGGCCACG 
GAGCGATGAG 
CTGTGTGGAC 
TAATGTGGGC 
CATTCTGGCG 
AGGCGTCAGC 
GAATGGGCCT 
CAGTGCTGGT 
CCGTGATGGC 
GCAAATGAGC 
GCCCTCCCCT 
CTTCTTCAAC 
TAGAGAGCAC 



CTTGTCCCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1025 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC 
TGGCTGCGAG TGGTGCCACG CACCCGGGTT 
CTCTACACCA AGAAGAGTGG GGCCCACCTG 
TGTGAGATGG AGCCCGTGGC ACACTTTGGC 
GTGACGTGGC CAGATGGCAA GATGGTGAGC 
GTGCTGGAGA TCCTCTACCC CCGGGATGAG 
ACACCAATGA ATGCATCCAG TTCCCATTCG 
ACACCTATGG AAGCTACAGG TGCCGGACCA 
ACGAGGATGG CACAGCCTGC GTGGGGACTC 
CCCCCACCGC TGCTGCTGCC ACTGCCGCTG 
CACCGGTCCT CGTAGATGGA GATCTCAATC 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA 
CCCAAGCCCA TCCATGCACA TTACTTAGCT 
CTGTGCTGGG CACATAGCTG TGATCACAGC 
ATTCCAGTGG GTCTAATGAC CATATCTTAG 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC 
AAAGCTATGT GACCTTACAC CAGTCACTTA 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT 
GACACTTGGC ACAAAACCTG GCACATAGTA 
GGGCTTTGTC AACAGGTG 



TTCCGGGGCA 
GGGGCCTTTG 
AGGATCATCG 
CTGGGGAAGG 
CGGAACGTGG 
GACACACTTC 
TGTGCCCTCG 
ACAAGAAGTG 
TCGGCCAGTC 
CTGCTGCCGC 
TGGGGTCGGT 
CAGCGGATGG 
GGGATGTAAA 
AACAATTAGG 
AGACAGGGTC 
GACACAGATG 
CTGAGTTCAA 
ACTTGTTAGC 
TAGTGTGGAG 
AAGGCTCAAT 



ATCAGGGCTT 
CCAGGGGAGC 
ACGGGGGCTC 
ATGAAGCCAG 
CCAGCGGGGA 
AGGACCCAGC 
AGACAAGCCC 
CAGTCGGGGC 
ACCGGGCCCC 
TGCTGGAGCT 
GGTTAAGGAG 
AGTCCAGCAG 
GGCCTGGGAG 
GAGACTCGTA 
GCTGCCCTGA 
TGCCCAGGGA 
ATCCTGATTC 
CATCCATTAT 
ATTAGATTAA 
AAAAACAAGT 



CAACAACAAC 
TAAGGTCGTG 
AGGCTACCTG 
CAGTGTGGAG 
GATGAACTCA 
CCCACTGGAG 
GTATGTGTCA 
TACGAGCCCA 
CGCCCCACCA 
GCCACTGCTG 
AGCTGCGAGC 
GGGAGTGGGA 
CTAGACCCTC 
AGGCCAGGCC 
TGGCGCTTAC 
GGTGGTGTCA 
AGGAACTCAC 
CGCATCTGCA 
ATGTATGTAA 
GCCTCTCACT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



Seq ID NO: 181 Protein sequence
Protein Accession #: NP_060528.1

1 11 21 31 41 51
| | | | | |
MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPILS SSASDIFCDN ENGPNFLFHN 60
RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120
FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFFNNIAYRS SSANRLFRVI RREHGDPLIE 180
ELNPGDALEP EGRGTGGVVT DFDGDGMLDL ILSHGESMAQ PLSVFRGNQG FNNNWLRVVP 240
RTRVGAFARG AKVVLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG 300
KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 360
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS


Seq ID NO: 182 DNA sequence 
Nucleic Acid Accession #: AJ279016 
Coding sequence: 1..1962

1 11 21 31 41 51

| | | | | |

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 

CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

CACAACCGGG GGGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 

GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080

CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 

ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 

GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500

GATGGCAAGA TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1560

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 

GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740

AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 1860 

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920 

CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980 

CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040 

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220 

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 

AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 
AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 






WO 03/042661 PCT/US02/36810 



Seq ID NO: 183 Protein sequence
Protein Accession #: CAC08451

1 11 21 31 41 51
| | | | | |
MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD 60
FEIVVAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 120
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 180
GRYSIYIANY AYGNVGPDAL IEMDPEASDL SRGILALRDV AAEAGVSKYT GGRGVSVGPI 240
LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 300
VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV ITADFDNDQE LEIFFNNIAY 360
RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV VTDFDGDGML DLILSHGESM 420
AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKVVLYTK KSGAHLRIID GGSGYLCEME 480
PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI LYPRDEDTLQ DPAPLECGQG 540
FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACVGT 600
LGQSPGPRPT TPTAAAATAA AAAAAGAATA AFVLVDGDLN LGSVVKESCE PSC



Seq ID NO: 184 DNA sequence
Nucleic Acid Accession #: FGENESHH
Coding sequence: 1..4794

1 11 21 31 41 51

| | | | | |

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 60 
AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 120
GTTCTGAAGT ATGACCGGGC CCAGAAGCGG CTGGTGAACA TCGCGGTCGA TGAGCGCAGC 180 
TCACCCTACT ACGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 240 
ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 300
CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 360
CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 420
TCCTCCCTGG GTCAGGCTTC TCCGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 480 
TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 540 
GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 600 
CTGAGCGATG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 660 
GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC 720
GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 780 
GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 840 
TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 900 
GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGCCAACTG 960
TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 1020
CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 1080 
TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 1140 
GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 1200 
CCCCTTGTCA CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC CCGGAGTGTC 1260 
CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC 1320
CTGATGGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 1380 
CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TGTCCAGATG TGCACTCAGG 1440 
GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 1500 
CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGGAG AAGGGACTCG 1560 
CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 1620
GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 1680 
CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 
GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 
CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 1860 
TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920
CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 1980 
TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 2040 
ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 2100
TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 2160
GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 2220
CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 2280 
CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 2340 
TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT 2400 
CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCCC 2460 
ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA GCCTGATGAC CCACAGCTAT 2520
TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 2580 
GCCCTGGCTG ACTTCAACCG TGATGGCAAA GTGGACATCG TCTATGGCAA CTGGAATGGC 2640 
CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 2700
TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 2760
GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820
CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 
GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2940
AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 
AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 
AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG 3120
CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 3180 
GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 
CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 
GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 
AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 3420
AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 
CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 
ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 
AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 



GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 3840

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 



Seq ID NO: 185 Protein sequence 
Protein Accession #: FGENESHH



1 11 21 31 41 51 

1 I 1 I I I 

MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNL VLKYDRAQKR LVNIAVDERS 60 

SPYYALRDRQ GNAIGVTACD IDGDGREEIY FLNTNNAFSG HSSSAQVPSG LHRNRPVLKP 120 

PPTTPAGLLG LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE PEPFLLRPKS 180 

GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV ACVDRKGSGR YSIYIANYAY 240 

GNVGPDALIE MDPEASDLSR GILALRDVAA EAGVSKYTEG FSHTASPSIG EISGRTEERE 300 

GGDPEEADEE HSGDGSTSQL CRLGWKDGQF KEEAAALVEE QREAGAAGVP RGRVRTALQT 360 

SKSHLADKNL FGPPCYYSVC APSPAHPFPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420 

PHPRAPGMDP KCKGRHAEPG LMAEALGAWP ALSTTVVPGG LRSWEESRQK GQAMSRCALR 480

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PKVTQECHLV ATMPALGGLE 540

GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAALPGNP GNWVLDMAKA 600 

LAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL LQFPSGLRGS PVLQVGLGLA 660 

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL FHNRGDGTFV DAAASAERRL 720 

AFIVHLKYHL CRDFPHSLCH LAETGPSSSC CPWHARLLQA PHCHHGLSMS FTRTGSRFYS 780 

FLTQGLASSA HRRTLSLQGS QGAPPCLLAR APCVLGSLIP TAYYIVLWSA IPESLMTHSY 840

LSSERVNVGV DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRLYLQMST HGKVRFRDIA 900 

SPKFSMPSPV RTVITADFDN DQELEIFFNN IAYRSSSANR LFRCSILARG SSSLTAGGRN 960 

GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AKEPASAIAG 1020

KGKGNVAQSV PRTQAPQDTK PHYHKKGLQG PITTRKRGYG VQSLPGKGAT GSNHYQEKGL 1080 

RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK RGYGLQSLPG KGATGSNHYH 1140 

RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200

NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL RGPITTRKRG YGLQSLPGKE 1260 

AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320 

GRGTGGVVTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNNWLRVVPR TRFGAFARGA 1380

KVVLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440

MNSVLEILYP RDEDTLQDPA PLECGQGFSQ QENGHCMDTN ECIQFPFVCP RDKPVCVNTY 1500 

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSQG ICTPVWSFFL 1560 
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE 



Seq ID NO: 186 DNA sequence

Nucleic Acid Accession #: NM_000584.1

Coding sequence: 75..374

1 11 21 31 41 51

| | | | | |

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60 

CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120 

CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180 

GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240 

AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300 

TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360 

CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420 

GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540 

CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600 

CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA 660 

TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 720 

GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 780 

TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 840 

CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 900 

TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG TGAGGACATG 960 

TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 1020 

CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT GTGCAAGAAT TTGGAAAAAT 1080

AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA 1140 

GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 1200 

ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 1260 

AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 1320 

AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 1380 

AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 1440 

TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 1500 
AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 1560
TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 1620 
AAGTAAAAAA AAAAAAAAA 

Seq ID NO: 187 Protein sequence
Protein Accession #: NP_000575.1

1 11 21 31 41 51

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 60
CANTEIIVKL SDGRELCLDP KENWVQRVVE KFLKRAENS
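
The relationship between a "Coding sequence: start..end" entry and the protein record that follows it can be sketched as below. This is a minimal illustration, not part of the listing: the helper names `clean_listing` and `translate_cds` are hypothetical, the standard genetic code is assumed, and the input is only the first 120 bases of Seq ID NO: 186 as printed above.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the listing uses the standard table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# First two rows of Seq ID NO: 186, including the trailing position counters.
ROWS = """
AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60
CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120
"""


def clean_listing(text):
    """Keep only the A/C/G/T letters, dropping spaces and position counters."""
    return "".join(re.findall(r"[ACGT]", text.upper()))


def translate_cds(seq, start, end):
    """Translate seq[start..end] (1-based, inclusive) until a stop codon."""
    cds = seq[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unreadable codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    dna = clean_listing(ROWS)
    print(translate_cds(dna, 75, 120))
```

With the coordinates 75..120 the sketch prints MTSKLAVALLAAFLI, which agrees with the first fifteen residues of Seq ID NO: 187. The same check can be used to spot residues or bases that were damaged in transcription.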

Seq ID NO: 188 DNA sequence 
Nucleic Acid Accession #: NM_003661.1 
Coding sequence: 1..1152

1 11 21 31 41 51

| | | | | |

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG AGGGCAGAGG AAGCTGGAGC GAGGGTGCAA 60 

CAAAACGTTC CAAGTGGGAC AGATACTGGA GATCCTCAAA GTAAGCCCCT CGGTGACTGG 120

GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGG CTGCTGCTGA ACTGCCCAGG AATGAGGCAG ATGAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360 

TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTGA GGATAACATA 420

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 480 

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT 540 

CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTACGGAAA GAAGTGGTGG 660 

ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720

GAGTTTTTGG GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 780 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

ACGGATGTGG CCCCTGTAAG CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020
TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 
CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140 
CAAGAACTGT GA 

Seq ID NO: 189 Protein sequence
Protein Accession #: NP_003652.1

1 11 21 31 41 51
| | | | | |
MSALFLGVGV RAEEAGARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF 60
KEKVSTQNLL LLLTDNEAWN GFVAAAELPR NEADELRKAL DNLARQMIMK DKNWHDKGQQ 120
YRNMFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NVVSGSLSIS SGILTLVGMG 180
LAPFTEGGSL VLLEPGMELG ITAALTGITS STKDYGKKWW TQAQAHDLVI KSLDKLKEVR 240
EFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG 300
EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDVVYLVYE SKHLHEGAKS ETAEELKKVA 360
QELEEKLNIL NNNYKILQAD QEL

Seq ID NO: 190 DNA sequence 
Nucleic Acid Accession #: NM_014452.1 
Coding sequence: 1..1968

1 11 21 31 41 51

| | | | | |

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 60 

GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA 120

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 180 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 240 

ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 360 

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 540 

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 600 

AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 720

GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 780

TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 840

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020

CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140 

CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 1260 

CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCAGTTTCT TTGCAATGCC 1320

AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 1380 

GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGCCA GCCTCGCCCA GCTAATTAGC 1440 

GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 
AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 1620
TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680
GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740
AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800
GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 1860
GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG


Seq ID NO: 191 Protein sequence
Protein Accession #: NP_055267.1

1 11 21 31 41 51
| | | | | |
MGTSPSSSTA LASCSRIARR ATATMIAGSL LLLGFLSTTT AQPEQKASNL IGTYRHVDRA 60
TGQVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIEKCHDCS QPCPWPMIEK 120
LPCAALTDRE CTCPPGMFQS NATCAPHTVC FVGWGVRKKG TETEDVRCKQ CARGTFSDVP 180
SSVMKCKAYT DCLSQNLVVI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF .......... 240
VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QEGTVPDNTS SARGKEDVNK TLPNLQVVNH 300
QQGPHHKHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PWMIVLFLLL 360
VLVVIVVCSI RKSSRTLKKG PRQDPSAIVE KAGLKKSMTP TQNREKWIYY CNGHGIDILK 420
LVAAQVGSQW KDIYQFLCNA SEREVAAFSN GYTADHERAY AALQHWTIRG PEASLAQLIS 480
ALRQHRRNDV VEKIRGLMED TTQLETDKLA LPMSPSPLSP SPIPSPNAKL ENSALLTVEP 540
SPQDKNKGFF VDESEPLLRC DSTSSGSSAL SRNGSFITKE KKDTVLRQVR LDPCDLQPIF 600
DDMLHFLNPE ELRVIEEIPQ AEDKLDRLFE IIGVKSQEAS QTLLDSVYSH LPDLL



Seq ID NO: 192 DNA sequence 

Nucleic Acid Accession #: XM_044533 

Coding sequence: 238..2751

1 11 21 31 41 51

| | | | | |

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTGGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCGGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 
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10 

15 

20 
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TGGCCTCTTC 
CAGGACCAGC 
CTGCCGTCGT 
GGTCCTGGGC 
GAGAGGACAG 
CAGGGAAGAG 
CCACCATATC 
GTCCTCTCCC 
TCAACACTGC 
ATGCACTTTA 



ACCTTCCACA 
TTGGGCTGCG 
CCCACCACCT 
TCGGACCCAA 
CGCGAGCTCA 
ACTGTCGCCT 
CACCCTCGCT 
CAGTCCCCAG 
CCAGCACAGG 
TGTCATTTTT 



TTATCCCGCT 
TGCGTTCTGC 
CAGGGACCAG 
CTCCTGGACC 
GGAGAGATTT 
GCCTTCCTCC 
CCATCTTTGA 
TTCACCCTCC 
GGCCCTGAAT 
TAATAAAGTC 



GCCACCGGCT 
CTTGCCAGTC 
AGGGCTAGGT 
TTTCCAGCCT 
CGTGACAATG 
GTTGTTGCGT 
ACTCAAACAC 
ATCCCTCACC 
TTATGTGGTT 
TGAAGAATTA 



GCCCTGTCTC 
AGCCGAGGAT 
TGGCACTGCG 
GTATCAGGCT 
TACGCCTTTC 
GAGAACCCGT 
GAGGAACTAA 
TTCCTCCACT 
TTTATACATT 
CTGTTT 



Seq ID NO: 193 Protein sequence 
Protein Accession #: XP_044533.3



1 
I 

MLRTAMGliRS 
RFEAEHISNY 
SFKGKDPQRD 
GKGRCPFDPN 
VASAYIPESL 
TSFLKAQLLC 
FTMKDVQRVF 
NFLKDHFLMD 
PRVHIIEELQ 
PYCAWSGSSC 
FQPNTVNTLA 
EGFQQLVASY 
MCTLFVLAVL 
PLDHRGYQSL 



11 
I 

WLAAPWGALiP 
TALLLSRDGR 
CQNYIKILLP 
PKSTALWDG 
GSLQGDDDKI 
SRPDDGFPFN 
SGLYKEVNRE 
GQVRSRMLLL 
IFSSGQPVQN 
KHVSLYQPQL 
CPLLSNLATR 
CPEWEDGVA 
LPVLFLLYRH 
SDSPPGSRVF 



21 
I 

PRPPLLLLLL 
TLYVGAREAL 
LSGSHLFTCG 
ELYTGTVSSF 
YFFFSETGQE 
VLQDVFTLSP 
TQQWYTVTHP 
QPQARYQRVA 
LLLDTHRGLL 
ATRPWIQDIE 
LWLRNGAPVN 
DQTDEGGSVP 
RNSMKVFLKQ 
TESEKRPLSI 



31 
I 

LLLLLQPPPP 
FALSSNIiSFL 
TAAFSPMCTY 
QGNDPAISRS 
FEFFENTIVS 
SPQDWRDTLF 
VPTPRPGACI 
VHRVPGLHHT 
YAASHSGWQ 
GASAKDLCSA 
ASASCHVLPT 
VIISTSRVSA 
GECASVHPKT 
QDSFVEVSPV 



41 
I 

TWALSPRISL 
PGGEYQELLW 
INMENFTLAR 
QSLRPTKTES 
RIARICKGDE 
YGVFTSQWHR 
TNSARERKIN 
YDVLFLGTGD 
VPMANCSLYR 
SSWSPSFVP 
GDLLLVGTQQ 
PAGGKASWGA 
CPWLPPETR 
CPRPRVRLGS 



ACTGCAGATT 
GTAGTTGTTG 
GCCCTCACCA 
GTGGCCACAC 
CCTCAGAATT 
GTGCCCCTTC 
CTGCACCCTG 
CTAAGGGATA 
TTTTAATAAG 



51 
I 

PLGSEERPFL 
GADAEKKQQC 
DEKGNVLLED 
SUJWLQDPAF 
GGERVLQQRW 
GTTEGSAVCV 
SSLQLPDRVL 
GRliHKAVSVG 
SCGDCLLARD 
TGEKPCEQVQ 
LGEFQCWSLE 
DRSYWKEFLV 
PLNGLGPPST 
EIRDSW 



Seq ID NO: 194 DNA sequence 

Nucleic Acid Accession #: NM_022819.1 

Coding sequence: 1..635 



1 
I 

ATGGCAGATG 
TTCTCTGGGT 
AGCCTGGGTA 
GCTCACGGCA 
ATCCTGTCCT 
GATGAGGTGG 
GGCTGTCACC 
TGCAGTGACC 
GTTCZGTGCC 
TGCCAGGGCC 
AGTCACCAAT 



Seq ID NO: 195 Protein sequence 
Protein Accession #: NP_073730 

21 




41 
I 

AGAAGGTGCT 
CTTCAAGAAC 
CTGGCAGCGT 
CCGTCACAGG 
GGGGCCGTGG 
ACCAGGAACT 
AGAACAACAC 
GCATGTGTGA 
GTGGCTTCCT 
CCCCTGAGGA 



GGCATTGATG 
GTGTGTGTGT 
AGACTGGGGG 
ATGGCCTGGG 



CTGCTCTCTG 
GGGTCACCGT 
CCCAGCCAGG 
CGCCCGCCCC 
GAGTCCAGGC 
CTGCCCCTAC 
CAGGAATGCA 
CTCCCGCCAT 
ATGCTCCACC 
AGAGGCAAGG 
AGGCAGGGCC 
TGGAGGACGC 
AGGCTGCCCA 
GCTGGGCCTG 



11 

I 

CTGTGTGCGC 
GTCTGGAGTC 
GCTGCAGAGG 

CAAAGCAGGG 
GAGCGGGTGG 
AGGCCCCATG 
TGTGCTCCCT 
ACCTTCCTTC 
AGCGGCTGAG 
CTGCCACCAC 
CCTTTAGCCC 
GACCCTGCAG 
TGCATGGCTG 
AAGTGATGGG 
CAGGGTCCAQ 
ATCCTACAGT 
GGCCCTGCCT 
GGCAGCTAAG 



21 
I 

GTGCGTGTGT 
ATGGCAGGGT 
TGAGGGTATC 
GAGGCCAGGC 
GAGGCGCCAA 
AGTCAGGGAA 
TAGCACCCTG 
CCCCAGGCCC 
CCACCCACAT 
TCAGTGTGTG 
CTCACCTTCA 
AGGCCTGCTC 
ACCCCTCTGG 
GCAAACCATG 
ACCGCAGAGA 
GGCCCAGGAG 
GGGGGCAAGG 
GCTTGGCTGG 
CTGGAGCTTT 



31 
I 

CCCTTTCTGT 
TGGCCTCAAC 
TGACTTGGAC 
TGTGGAGGAA 
GAGCTGAGCT 
GTTCCCCTGC 
TAGGCAGGCG 
GCCGAAGGGT 
TGGAATGTTC 
TCCTCAGGOG 
AGTGAGCTCC 
GCTTCCAAGT 
GTGGGCCCCA 
TGAGACCCCC 
AGAGAAGCAG 
GTGCTCTGAG 
GGCTGGGGGC 
GGCCAGGGTC 



41 
I 

CTGTCTCCTT 
AGCTGCTTAT 
ATGGCAAGAG 
CAGAGTCTCC 
GGGGAGTCAC 
CTGTAGGTGA 
GGTACAGGGG 
GGCCAGGCAG 
TGGCCGCTCC 
CTGCGGCCCT 
GCCGACAGCC 
TCCTGGGGGC 
GCTGTGGTGC 
AGGGATGAGA 
GGAGGGAGAG 
GTCCGGTGAA 
TGCTGGGAGG 
CAGAGCCTCC 



3240 
3300 
3360 
3420 
3480
3540 
3600 
3660 
3720 



51 
I 

GGATAGATGC 
CTCCAGGTCT 
TCTGTCCACA 
GAGGAGCGCC 
CCAGCCCAAG 
CTTTGACCAA 
TGAGATAGTC 
CAAGAACATG 
CAATGTCTAC 
GGTCACCTGC 



1 11 21 31 41 51 

I I I I I I

MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFFTV AILAGSVLST 
AHGSLLNLKA MVEAVTGRSA ILSFVGYGCY CGLGGRGQPK DEVDWCCHAH DCCYQELFDQ 
GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTOICDKNM VLCLMNQTYR EEYRGFLNVY 
CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 

Seq ID NO: 196 DNA sequence 

Nucleic Acid Accession #: XM_028196.1 

Coding sequence: 1315..1791



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



51 
I 

GTGTGTGTGT 
GCTCTGCCCC 
TCCCGATGGG 
GGGTCCCAGG 
TGGCTGGCTG 
CCTGGGCCTG 
CAGGAGCCAG 
CCAGCAGCTO 
GCAGGTGGAC 
CAGCTGCACC 
GAGCCCCTGC 
AGCCCTGCTC 
TGCAGTGAAC 

TGGGACCCCC 
AGCTTCCTGG 
GGCAGGGACT 
TGGCTGGGAG 
CTCCCTTCAG 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 




WO 03/042661 



PCT/US02/36810 





CTTCCTGCTG 
ACCCAGCACT 
TCCCAGCTGA 
CACCCACCAG 
ATTCCAGACC 
GATAACCCAG 
CCAGGGCCCC 
TGCCTCCTCT 
GAGTCCGTGG 
TCCTTGCTCA 
GGCGAGTTCA 
TGGCAGGGAA 
GGGCCCGAGG 
TTTGGGTGGG 
CTGGCCTGGA 
CTCTGATGAG 
CCTGATGTGG 
CTCTCCCTGG 
CGGTTCTGOG 
TGGGGAGCTG 
GTGGGCCTGA 
GTCAGCGTCT 
TGCCCCGTGT 
GCCTGGAGGG 
GCTGGGTGGG 
AGGCGGAGCT 
GGCATGAGCC 
TGGAGCACTG 
GCCACAGCCC 
CTGATGGGCA 
TGAAGCCCCT 
TACGTGCCCA 
GGACTTGCAG 
AGAAAGACAG 
CTGGTGCCCT 
CTGCCGCTCC 
CCCCTGCAGC 
CCCCTGCGGC 
CGCCTGCCCT 
GCACTTGCCC 



CACAGAACCC 
GGCTGCTGCT 
CCCAGCCTCC 
TCTCTCCCAG 
TTGTCGCCGG 
CCAAGTGGGG 
GCTGGGCTCT 
GTGCTGCCTG 
GTCTGGGCAG 
CTCAGTCCAG 
GCCCCAGGGA 
AGTGGGTGAA 
GAGCCACAGC 

GGCGGGGGGT 
GCATGATGTC 
ATGGCCTGGA 
AGTTCGACTT 
AGTTTCCGGA 
ACAGGGCAGG 
GGCAGGCAGC 
CCACCCAGGC 
TTGACGAGAC 
CTGGATGGGC 
CCTGAGCTAG 
GCCAGGGGCC 
CCTGGGTGAG 
GTACCTGCTG 
AAGGCAGAGC 
GCATTTTCGG 
CTTGCTGCCC 
GCTCAGGCCG 
AGCCCTACGT 
CCACCAAAAA 
TCAGCCAGGT 
GAACTGAGCC 
ACTGGGCAGA 
CAGCCAGGGA 
TGCCCCACTC 
AGGCCGCCCT 



TCGCCCCTGG 
AGTCAGATGG 
TGGGCCGCTT 
TGCCCCGGCC 
GACCCCCTGT 
GCTGCAGCTG 
CATTGCCGGC 
CTGCTGCTGC 



AGAGGGCTTG 
TGGTTTAACC 
CAGAGGTGAG 
GGGTTCTTGA 
CTCACAGAGC 
CTTGACCCAT 
AGCACCACCT 
GTCCAGCCCG 
TGGAAGCCAG 
AAGGGTGACG 
GGCCCTTGGC 
CGACCTGAGG 
CGGACACAGA 



CCACCCCGTG 
GGTAGCGGGC 
CTTCCAAACC 
CCAGCTGGCA 
GAGTTGTGGG 
TCCACAGATG 
GCCCTTGCCG 
CGCCGCCACA 
ACCACCACCA 
AAATCCAGGC 
CCCACAGAGG 
AAGGAGGCCA 
GGAAGGCAGG 
GAAGCCGACG 
GTCATGCAAG 
GCCCCTTGTC 
GGGGATGCTC 
GAGGTGAAGG 
GGGGAAGGGC 
TGAGCCCACC 
CCTGGGGGCA 
CATGAGACAA 
CACGTGAGTC 



CTGGGCTGGG 
GGCAGCAGGG 
ACCCTGCAGG 
CTCCGTCTGC 
GGCCCGCCGG 
TGGCAGGGAC 
GGGTCTGAGC 
ACAGCCCGAG 
GCTGACCGTG 
GAAGGTCCAG 
GGGCACGGCG 
CCAGAATGTG 
CGTAGGCAAG 
CATGCTGGCC 
GGTGGACCGC 
CTGAATGCAC 
GCAGGACCAC 



CCTGGCTCAC 
TGCAGCTTTT 
CACTGGGCAC 
CTGCCACTCA 
CCTGCCCTAT 
CCCAACTCGG 
CAGGTCGGGG 
GTGGTGCTGG 
CTCATGCTGA 
GCCCCCTACT 
GACCTGGTGC 
GTGCACCTGG 
CACGCCCGGC 
ATGCTGGCCC 
CACATGCCTC 
TGCAATAAAC 



CTGCCTCCTT 
AGGGGCCGGA 
AGCAGGGTAG 
CCACAGCTAT 
ATTCCCAAGA 
CACTCAGCCT 
CGGGCGTCCT 
GGAAGAAGCC 
CCCACCTGGT 
TCCAGAGCCC 
CAGGGCGTTG 
TGCAACAGGG 
GGGTACCCCA 
ATTTGTGCCT 
GGCTGCCCGG 
CCAACTCACT 
AGCAATGGGG 
GCCCCGCTGC 
AGACCCCATG 
CCGCTGGCTC 
CCGTGGACCC 
AAGTGCACCG 
AGGGATGGTC 
CAGCTGGGTG 
GCCGCTGCCT 
CAACTTCAAG 
CGTGGATCTG 
GGTGAGGTGC 
GGGCCATCGG 
CCAGAATCAC 
AGCTGTGCTT 
AGGCTCGAGG 
ACCAGAGGAA 
TCAATGAGGC 
TGGCTGTCTG 
GTGCCCGGGC 
GGCCCATTGC 
TGCAGCCCCG 
TGTCTCCCCG 
GCCTTCTCCT 



GCCCTGGCAG 
GGGGCCACCC 
AAAGATGGGG 
ACCTGGGCTT 
GGGGTGTGGG 
GGCCTCTACC 
CCTCGTCTCC 
CAGGGACAAG 
GAGGAGCGGC 
AGGGCAGCGA 
AGGACCTTCC 
GCTGCCCCAT 
GATGCCACGT 
GTTGGGTGGC 
GAGCCCAGGG 
CCAGGTGCAA 
GTGCCTGCAG 
GCAGGACCAG 
CCCTGGGTGG 
CCAGATCAGG 
CTATGCCCGG 
AGGCACGCTC 
GGCTGGGTGG 
GGCCTGGGCA 
CAGATCCCGC 
CGCTTCTCGG 
CAGCATGTTC 
TGGTCACCAG 
AAAGACAGGC 
CCTCCCGGGC 
CTCTCTCCGG 
CCTGCGTCCA 
GTGGAAGAAG 
CTTCACCTTC 
GGACCGCAGC 
CTCGGGGCAG 
CCAGCGGCAC 
CCTTCGCCTG 
CTGAGCCCAG 
GGC 



Seq ID NO: 197 Protein sequence 
Protein Accession #: XP_028196.1 



1 
I 

TTCTCCCGCA 
CTCCCCCCCT 
TTCTGTTTCT 
TCTCCCCCTC 
CTCTGCCCCC 
CCGAAAAGTA 
CAGACGAATT 
ACATTCGGCC 
CCAACGCCCG 
TTCCAGACAC 
GCCTTOGCCT 
CTGGTGGACA 
AGCCGTGTGA 
CTGGCCCTCC 
CCTCCGACCG 
GACACCTGGA 
GGCCGGGGTC 
CCCCTGATTG 
AGCAATCGGA 
CTCCTCCTGA 
CCCCCTGGGG 
CCTCGGCCCC 
GATTGGCTTT 




GCCCTCTCTT 
CTCTATCCTT 
CAACATCTGG 



CCCGCGACTC 
CTGTTCGGTT 
CAATGGGAAT 
CGTGCTGCAT 
CCCTCCAGTT 
GCCGTCGCAG 
TGGAGACGTA 



AGCAGTCCAC 
ACGTGCTCGC 
CTCTACCCAC 
AGTGAGCAAA 
CCACGGACGT 
CTTCTCCTGA 
CTCCATCGGG 
AAACACCCTT 



GATACAACAG 
CCCGCCCCAG 
CCCCAAAAAA 
GGCCAGAGCG 
TGCGACACGC 
CCCAATGGGG 
TGCTGCTTAC 
OGTCTGTGGG 
COGTGGCATC 
CTGTGCTACC 
CAACTTCCCC 
CCAGCGCCTG 
CAAGGAGCTC 
CCAAGACCCC 
ACTGCCGCAA 
TTCCATCAGG 
CCCAGTCCCC 
CTGAGGAAGC 
CACATACCCT 



31 



CGCCGAACCA 
CTGTGOGCCT 
CTTTCACGTT 
CTGACCTCAT 
CCCGAAGACA 
AAAAGCCATC 
GCGCTGGCAG 
AGCAGGGAGG 
AAGTCGATGC 
CGCCCCAGTG 
GACCGCGGCT 
GTTGAGGAGT 
CCCGCCAAGT 
AGATACCCCG 
CGCAGGGGCC 
GAGGCGTTCA 
GCCCACGGGG 
GTCTGCAGCC 
TTCCATCCCG 
GTGCCCCGCC 
ACAGCAGCAT 
CCCCCC 



41 
I 

AGCTCCTAGC 
AAGTGGATTA 
GCCCGCCTCT 
CACTCTGTCT 
TTCCCGATAC 
GCCCGTCCTC 
CCCCCGCTCT 
AGGAGTGTCC 
TGGGCGGCAG 
TGGTGCTTCT 
AGACCCTGTG 
TCTACTTCAG 
GCTGTTTCCG 
CCGAGAGGGA 
TGGGCAAGTT 
TGCCTGCCCT 
GGGAGGCCAA 
GCGCCCCCCC 
CGGCGCCACC 
AAAATCTCTC 
TCCCCGAAAC 
CTTCAAACAT 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



1 11 21 31 41 51 

I I I I I I

MGHPPVSPSA PAPAGTTAIP GLIPDLVAGT PCELWDSQEG CGDNPAKWGL QLSTDALSLA 
STPGPRWALI AGALAAGVUj VSCLLCAACC CCRRHRKKPR DKESVGLGSA RGTTTTHLVR 
SGSLLTQSRE GLKSRLQSPG QRGEPSPRDG LTPTEAGR 

Seq ID NO: 198 DNA sequence 

Nucleic Acid Accession #: NM_000612.2

Coding sequence: 553..1095



60 
120 



51 
I 

CTCCGACTCC 
ATTACACGCT 
CGCTGTCCTC 
CTCCCACTAT 
CTTTTCCCCC 
CCTGGACAAT 
GCCCCGTCGC 
GGCAGGAGGG 
CGTCGCCGGC 
CACCTTCTTG 
CGGCGGGGAG 
CAGGCCCGCA 
CAGCTGTGAC 
CGTGTCGACC 
CTTCCAATAT 
CCTGCGTGCC 
ACGTCACCGT 
AGAGATGGCC 
ATCCTGCAGC 
GGTTCCACGT 
AGGCTACTCT 
GTACAAAATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: 199 Protein sequence 
Protein Accession #: NP_000603.1





I I I I I I 

MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGF YPSRPASRVS 
RRSRGIVEEC CFRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 
QSTQRLRRGL PALLRARRGH VLAKELEAFR EAKRHRPLIA LPTQDPAHGG APPEMASKRK 

Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession #: AK057131.1 

Coding sequence: 61..1146



60 
120 



1 11 21 31 41 

I I I I I 

AGTCTGGGCG TTTAGGTCAG AACTACCCCG GTAGCCTGAC AGCAGGAGCT 
ATGGCTCAGC GGTGCGTTTG CGTGCTGGCC CTGGTGGCTA TGCTGCTCCT 
ACCGTCTCCA GATCGATGGG CCCGAGGAGC GGGGAGCATC AAAGGGCGTC 
TCTCAGTTCA GCAAAGAGGA ACGCGTCGCG ATGAAAGAGG CGCTGAAAGG 
ATTCCAACAG TGACTTTTAG CTCTGAGAAG TCCAATACTA CAGCCCTGGC 
AAATACATTC ATAAAGTCTT TCCTACAGTG GTCAGCACCA GCTTTATCCA 
GTGGAAGAGT ATAGCCACCT GTTCACTATC CAAGGCTCGG ACCCCAGCTT 
CTGCTGATGG CTCACTTTGA TGTGGTGCCT GCCCCTGAAG AAGGCTGGGA 
TTCTCTGGGT TGGAGCGTGA TGGCGTCATC TATGGTTGGG GCACACTGGA 
TCTGTGATGG CATTACTGCA GGCCTTGGAG CTCCTGCTGA TCAGGAAGTA 
AGATCTTTCT TCATTTCTCT GGGCCATGAT GAGGAGTCAT CAGGGACAGG 
ATCTCAGCCC TGCTACAGTC AAGGGGCGTC CAGCTAGCCT TCATTGTGGA 
TTCATCTTGG ATGATTTCAT TCCTAACTTC AAGAAGCCCA TCGCCTTGAT 
GAGAAGGGTT CCATGAACCT CATGCTGCAA GTAAACATGA CTTCAGGCCA 
CCTCCAAAGG AGACAAGCAT TGGCATCCTT GCAGCTGCTG TCAGCCGATT 
CCAATGCCTA TCATATTTGG AAGCGGGACA GTGGTGACTG TATTGCAGCA 
GAGGTTTATG GAGAGAAATC CCTTAACCAA TGCAATAATC AGGACCACCA 
CATATTCAAA GCAGGGGTCA AGTTCAATGT CATCCCCCCA GTGGCCCAGG 
CTTCCGGATT CACCCTGGAC AGACAGTCCA AGAGGTCCTA GAACTCACGA 
GGCTGATAAC AGAGTCCAGT TCCATGTGTT GAGTGCCTTT GACCCCCTCC 
TTCTGATGAC AAGGCCTTGG GCTACCAGCT GCTCCGCCAG ACCGTACAGT 
GGAAGTCAAT ATTACTGCCC CAGTTACTTC TATTGGCAAC ACAGACAGCC 
AAACCTCACC ACTGGCATCT ACAGGTTCTA CCCCATCTAC ATACAGCCTG 
ACGCATCCAT GGAGTCAACG AGAAAATCTC AGTCCAAGCC TATGAGACCC
CATCTTTGAG TTGATTCAGA ATGCTGACAC AGACCAGGAG CCAGTTTCTC 
ACTGTGAGGT CAAGGGGCCT GCTGGGTTAG GCATGCCCGA CCCCGGGACA 
AAGGGGGAAA GCTAGTGTTG ATGAAACTTT TGATCAAAAC CACATTGTAA 
ATCTGTCTTG CTCACTCTTA AACTCTCCCA AGAACAAGGC CGGGGTAAGG 
AGAAATCTGG CTTCTCCCTT CCTCCCGACA TCTGCATCCC TTGATCCACT 
GCCCTCTTGT CCCTTATCTG TCTTATGCTG GTTATTTCAC TGCTTCACCT 
ACTTAACAAA TGTAGATTTG AGAAATCTCA ACCAGTTGTT ACCTGATAGG 
TTAGGGCACT CTTGCTGGGA TGCTTTCTCC AGAGCTTATA TATTTCTTCT 
TTCTTCOCCC TTTTATTCCC CTCTCTTCTT GGACTCATGA GCTGTCTCTT 
CTCTCTCCTG CATCTCTCCC CTTACTCTTC AATTTATTCT ACTTCTGGAC 
CCAAACTGTG ATACTACCAT AATTGTCACC ATAATCAGTC AAATAAAGTG 
C 



51 
I 

CGAGAGAAGC 
AGTTTTCCCT 
GCGAATCCCT 
TGCCATCCAG 
TGAGTTCGGA 
GCATGAAGTC 
GCAGCCCTAC 
GGTGCCCCCA 
CGACAAGAAC 
CATCCCCCGA 
GGCTCAGAGG 
CGAGGGGGGC 
TGCAGTCTCA 
CTCTTCAGCT 
GGAGCAGACA 
ACTGGCAAAT 
CGGCACTCAC 
CCACAGTCAA 
AGAACATTGT 
CCGTCAGCCC 
CCGTCTTCCC 
GATTCTTTAC 
AAGACTTCAA 
AAGTGAAATT 
ACCTGCACAA 
GGACTAACCC 
AACATTGCCC 
TAAAGTCAGC 
GGCATTTGCT 
TCCAGGCTTG 
AGTCTTTAAT 
TACTAGAACT 
CATCTCTCCT 
CTGGACTTAC 
ATCTGTGCAT 



51 



1 
I 

GGCCGGGAGA 
GCCCAGAAGG 
AGCACCCTGC 
ATGAGCCGCT 
AGTGGGACAC 
CCTCTGGGCA 
ATCGTGGCGC 
CTGCGCAGAG 
AACTATTTTT 
CTCTACAAGG 
GAGGAGTTGG 
AAGCCAGAAA 
TCTGTGCATG 
CCAGAGATGA 
CTTTGCTATG 
TATCGCCGCA 
GACCTCATCT 



11 

! 

GTAGCAGTGC 
AGAACTCCTA 
CCCAGCGAGT 
CCAATGTCCA 
CCGACATCTT 
AAGGCAAGTT 
TCAAGGTCCT 
AGATCGAAAT 
ATGACCGGAG 
AGCTGCAGAA 
CAGATGCTCT 
ATCTGCTCTT 
CGCCCTCCCT 
TTGAGGGGCG 
AGCTGCTGGT 
TCGTCAAGGT 
CCAAACTGCT 



21 
I 

CTTGGACCCC 
CCCCTGGCCC 
CCTCCGGAAA 
GCCCACAGCT 
AACGGGGCAC 
TGGAAACGTG 
CTTCAAGTCC 
CCAGGCCCAC 
GAGGATCTAC 
GAGCTGCACA 
AATGTACTGC 
AGGGCTCAAG 
GAGGAGGAAG 
CATGCACAAT 
GGGGAACCCA 
GGACCTAAAG 
CAGGCATAAC 



31 
I 

AGCTCTCCTC 
TACGGCCGAC 
GAGCCTGTCA 
GCCCCTGGCC 
TTCACAATTG 
TACTTGGCTC 
CAGATAGAGA 
CTGCACCATC 
TTGATTCTAG 
TTTGACGAGC 
CATGGGAAGA 
GGAGAGCTGA 
ACAATGTGTG 
GAGAAGGTGG 
CCCTTTGAGA 
TTCCCCGCTT 
CCCTCGGAAC 



41 

I 

CCCCTTTCTC 
AGACGGCTCC 
CCCCATCTGC 
AGAAGGTGAT 
ATGACTTTGA 
GGGAGAAGAA 
AGGAGGGCGT 
CCAACATCCT 
AGTATGCCCC 
AGCGAACAGC 
AGGTGATTCA 
AGATTGCTGA 
GCACCCTGGA 
ATCTGTGGTG 
GTGCATCACA 
CTGTGCCCAC 
GGCTGCCCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



Seq ID NO: 201 Protein sequence 
Protein Accession #: BAB71368.1 

1 11 21 31 41 51

I I I I I I

MAQRCVCVLA LVAMLLLVFP TVSRSMGPRS GEHQRASRIP SQFSKEERVA MKEALKGAIQ
IPTVTFSSEK SNTTALAEFG KYIHKVFPTV VSTSFIQHEV VEEYSHLFTI QGSDPSLQPY
LLMAHFDVVP APEEGWEVPP FSGLERDGVI YGWGTLDDKN SVMALLQALE LLLIRKYIPR
RSFFISLGHD EESSGTGAQR ISALLQSRGV QLAFIVDEGG FILDDFIPNF KKPIALIAVS
EKGSMNLMLQ VNMTSGHSSA PPKETSIGIL AAAVSRLEQT PMPIIFGSGT VVTVLQQLAN
EVYGEKSLNQ CNNQDHHGTH HIQSRGGVQC HPPSGPGHSQ LPDSPWTDSP RGPRTHEEHC
G 

Seq ID NO: 202 DNA sequence 

Nucleic Acid Accession #: NM_004217.1 

Coding sequence: 58..1092



60 
120 
180 
240 
300 
360 



51 
I 

TCTAAGGATG 

ATCTGGCCTG 

ACTTGTCCTC 

GGAGAATAGC 

GATTGGGCGT 

AAGCCATTTC 

GGAGCATCAG 

GOGTCTCTAC 

CCGCGGGGAG 

CACGATCATG 

CAGAGACATA 

CTTCGGCTGG

CTACCTGCCC 

CATTGGAGTG 

CAACGAGACC 

GGGAGCCCAG 

GGCCCAGGTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 




TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 203 Protein sequence 
Protein Accession #: NP_004208



1 11 21 31 41 51 

I I I I I I

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMEN 
SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH FIVALKVLFK SQIEKEGVEH 
QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG ELYKELQKSC TFDEQRTATI 
MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG WSVHAPSLRR KTMCGTLDYL
PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KFPASVPTGA
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA

Seq ID NO: 204 DNA sequence 
Nucleic Acid Accession #: AK055663
Coding sequence: 38..1423

1 11 21 31 41 51 

I I I I I I 

AGAACGGCTT CCGGCGGGAG CTGTGCAGCT CCTTATCATG GGGACAATTC ATCTCTTTCG 
AAAACCACAA AGATCCTTTT TTGGCAAGTT GTTACGGGAA TTTAGACTTG TAGCAGCTGA 
CCGAAGGTCC TGGAAGATAC TGCTCTTTGG TGTAATAAAC TTGATATGTA CTGGCTTCCT 
GCTTATGTGG TGCAGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 
TTTTGATCTT TTTAGTTTAA TGACATGTTT AATAAGTTAC TGGGTAACAT TGAGGAAACC 
TAGCCCTGTC TATTCATTTG GGTTTGAAAG ATTAGAAGTC CTGGCTGTAT TTGCCTCCAC 
AGTCTTGGCA CAGTTGGGAG CTCTCTTTAT ATTAAAAGAA AGTGCAGAAC GCTTTTTGGA 
ACAGCCCGAG ATACACACGG GAAGATTATT AGTTGGTACT TTTGTGGCTC TTTGTTTCAA 
CCTGTTCACG ATGCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAG AAGCTGCTAG 
TACGAGCTGG CTTCAAGAGC ATGTTGCAGA TCTTAGTCGA AGCTTGTGTG GAATTATTCC
GGGACTTAGC AGTATCTTCC TTCCCCGAAT GAATCCATTT GTTTTGATTG ATCTTGCTGG 
AGCATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TXGCCGTAGA 
CACTGCCTCT GCTATAGCTA TTGCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 
GTACAGTGGG AAAGTCTTAC TCCAGACAAC ACCACCCCAT GTTATTGGTC AGTTGGACAA 
ACTCATCAGA GAGGTATCTA CCTTAGATGG AGTTTTAGAA GTCCGAAATG AACATTTTTG 
GACCCTAGGT TTTGGCTCAT TGGCTGGATC AGTGCATGTA AGAATTCGAC GAGATGCCAA
TGAACAAATG GTTCTTGCTC ATGTGACCAA CAGGCTGTAC ACTCTAGTGT CTACTCTAAC 
TGTTCAAATT TTCAAGGATG ACTGGATTAG GCCTGCCTTA TTGTCTGGGC CTGTTGCAGC 
CAATGTCCTA AACTTTTCAG ATCATCACGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 
TGATTTGAAC CCAGTTACAT CAACTCCAGC TAAACCTAGT AGTCCACCTC CAGAATTTTC 
ATTTAACACT CCTGGGAAAA ATGTGAACCC AGTTATTCTT CTAAACACAC AAACAAGGCC 
TTATGGTTTT GGTCTCAATC ATGGACACAC ACCTTACAGC AGCATGCTTA ATCAAGGACT
TGGAGTTCCA GGAATTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAG 
TAGATATGGA ACTAATAATA GAATTGGACA ACCAAGACCA TGATAGACTC TAACTTATTT 
TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAG TAATCCAACT TTGCATTGAC 
TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCATGAAACC 
TATGAAACTA TATTTTTGTA AAATGTATTT GTGACAGTGA AATCCTCGTA AATGTTAAAG 
GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG 
GTTTTGTAGT TGACTGCAGT GTGATGTGAC CTTACCTTTA TAAGAGCCAC TTGATGGAGT 
AGATCTGTCA CATTACTAAG ATACGATATT TCTTTTTTTT TCCGAGACGG AGTCTTGCTC 
TGCCACTGTG CCCGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 
CTTAGTTTTT GTTTTGTTTT GTTTTTTGAG ATGGAGTCTC ACTCTGTCGC CCAGGCTGGA 
ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGAGTTCAA ATGATTCTCC 
TGCCTCAGCC TCCCGAGTAG CTGGGATTAC AGGCACCTGC CACCACGCCC AGCTAATTTT 
TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 
ACCTCATGAT CCACCCACCT TAGCCTCCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 
CCTGGCCGAT ATTTTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 
GGGAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 
AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGCAAAAA 
TTTTTAGCAG AAATTTTGGA ATACATTCTA TCTAGCACAA TTTGAATTTT TAATTATCAA 
GATTTTTGTT AAAGTTTCTC TCCTTTAAAA ATTTTAGTAC ATTTGTAAAT 

Seq ID NO: 205 Protein sequence 
Protein Accession #: BAB70980.1 



1 
I 

MGTIHLFRKP 
TAYTYLTIFD
ESAERFLEQP
RSLCGIIPGL
GTMYPMSVYS
VRIRRDANEQ 
MPLLKGTDDL 
SSMLNQGLGV 



11 
I 

QRSFFGKLLR 
LFSLMTCLIS 
EIHTGRLLVG 
SSIFLPRMNP 
GKVLLQTTPP 
MVLAHVTNRL
NPVTSTPAKP 
PGIGATQGLR 



21 
I 

EFRLVAADRR 
YWVTLRKPSP 
TFVALCFNLF 
FVLIDLAGAF 
HVIGQLDKLI
YTLVSTLTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 

! 

SWKILLFGVI 
VYSFGFERLE 
TMLSIRNKPF 
ALCITYMLIE 
REVSTLDGVL 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 
I 

NLICTGFLLM 
VLAVFASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG
P 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



51 
I 

WCSSTNSIAL 
AQLGALFILK 
WLQEHVADLS 
SAIAIALMTF 
GFGSLAGSVH 
LNFSDHHVIP 
FGLNHGHTPY 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 206 DNA sequence 

Nucleic Acid Accession #: NM_016361.1 

Coding sequence: 397..1662



11 



21 



31 






GGAACTCAGG 
TTTAATGTTC 
AGGAGCGACC 
TTGTTCTCTC 
TGCTTGCTGA 
TGAGCCCCGG 
CGAGGGCCTA 
GTCCTGACCT 
GAGGCCGATG 
GTGTTTCGAC 
TGGAACCCCC 
CTAGCTGGTG 
AAGGGGGGCA 
GGAGAGAGAC 
CCACAGGAGG 
TTGCTGGCTG 
GCAGATTCAG 
AGAGGCCGGA 
AAGGACAGGA 
GTGGCTGCCG 
ATGATCGAAC 
AGTCTTCAGA 
ATGGACTCTG 
GTGACCTTCA 
GCTGTTGACC 
CTCTATTACC 
GACATGTTCT 
TGCTCTCAAA 
ATGTGTTGAT 





I 

GCCGGCTCCT 
CTCGTCGGGC 
AACCCCGACC 
CCCCGTTCCC 
AGGGCTGGAT 
CGCCCGCAGA 
CCACCATGAT 
CGCTGGCGTA 
GCCAGTGTCC 
ACGGGGCTCG 
AGCTATTAGA 
GTCCGAAACC 
TGTTTGCTGG 
TGAGGAAGAA 
TCTTTATTCG 
GGCTTTTCCA 
AAGTCTTGTA 
GGCAGACTGC 
TGGGCATTGA 
AGCAGGCACA 
AGAGAGCTGT 
TGGCAGTAGG 
CCACTGCCCC 
TACCGCTCTT 
TGACCATGGA 
ACGGGAAGGA 
TGAATGCCAT 
CTCAGGTGAT 
TTTAAAATAA 



GTTCCTTCAA 
AAAAGATAAG 
CGGGTTAAAA 
ACTCGGGGTC 
GTACGCATCC 
AGACTTGTGT 
CACTGGTGTG 
CTGCCTGCAC 
GGTCGACCGC 
GAGTCCTCTC 
GGTCCCACCC 
ATATTCTCCT 
GCAGCTGACC 
CTATGTGGAA 
TTCCACTAAC 
GTGTCAGAAA 
TCCCAACTAC 
CTCTTTACAG 
CAGTAGTGAT 
CAACCTCCCA 
GGACACATCC 
CCCATTCCTC 
CGACAAGATC 
AATGACCCTG 
ACTTTACCAG 
GCAGGTGCCG 
GTCAGTTTAT 
GGAAGTTGGA 
AGTGCCTTTA 



GAGTGCTGGA 
GATCCGATCT 
CTCCCAGGGA 
TCCCTCAGGG 
GCAGGTTCCC 
TTGCCTCCTG 
TTCAGCATGC 
CAGCGGCGGG 
AGCCTGCTGA 
AAGCCGCTCC 
CAAACTCAGT 
TACGACTCTC 
AAGGTGGGCA 
GACATTCCCT 
ATTTTTCGGA 
GAAGGACCCA 
CAAAGCTGCT 
CCAGGAATCT 
AAAGTGGACT 
AGCTGCCCCA 
TTGTACATAC 
CACATCCTAG 
AGAAAGCTGT 
GGGATTTTTG 
CACCTGGAAT 
AGAGGTTGCC 
ACCTTAAGCC 
AATGAAGAGT 
TACAAAAAAA 



GGCCAAACTT 
CCCCCGGCCC 
CTCTTCGCTG 
CCGGGAGGCA 
GCGGACTTGG 
CAGCCTCAAC 
GCTTGTGGAC 
TGGCCCTGGC 
AGTTGAAAAT 
CGCTGGAGGA 
TTGATTACAC 
AATACCATGA 
TGCAGCAAAT 
TTCTTTCACC 
ATCTGGAGTC 
TCATCATCCA 
GGAGCCTGAG 
CAGAGGATTT 
TCTTCATCCT 
TGCTGAAGAG 
TGCCCAAGGA 
AGAGCAACCT 
ATCTCTATGC 
ACCACAAATG 
CTAAGGAGTG 
CTGATGGGCT 
CAGAAAAATA 
AACTGATTTA 
AAAAAAAAAA 



GAAATACAAG 
GGTGTGCAGC 
CTGCCACCTC 
CAGCGGTCCC 
GGGCGCCCGC 
CCGGAGGCAG 
CCCAGTGGGC 
CGAGCTGCAG 
GGTGCAGGTC 
GCAGGTAGAG 
AGTCACCAAT 
GACCACCCTG 
GTTTGCCTTG 
AACCTTCAAC 
CACCCGTTGT 
CACTGATGAA 
GCAGAGAACC 
GAAAAAGGTG 
CCTGGACAAC 
ATTTGCACGG 
AGACAGGGAA 
GCTGAAAGCC 
GGCTCATGAT 
GCCACCGTTT 
GTTTGTGCAG 
CTGCCCGCTG 
CCATGCACTC 
TAAAAGCAGG 
A 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 207 Protein sequence 
Protein Accession #: NP_057445.1



MRLWTPVGVL 
LPLEEQVEWN 
GMQQMFALGE 
PIIIHTDEAD 
DFFILLDNVA 
LESNLLKAMD 
ESKEWFVQLY 
E 



11 
I 

TSLAYCLHQR 
PQLIiEVPPQT 
RLRKNYVEDI 
SEVLYPNYQS 
AEQAHNLPSC 
SATAPDKIRK 
YHGKEQVPRG 



21 
I 

RVALAELQEA 
QFDYTVTNLA 
PFLSPTPNPQ 
CWSLRQRTRG 
PMLKRFARMI 
LYLYAAHDVT 
CPDGI/CPLDM 



31 
I 

DGQCPVDRSL 
GGPKPYSPYD 
EVFIRSTNIF 
RRQTASLQPG 
EQRAVDTSLY 
FIPLLMTLGI 
FLNAMSVYTL 



41 
I 

LKLKMVQWF 
SQYHETTLKG 
RNLESTRCLL 
ISEDLKKVKD 
ILPKEDRESL 
FDHKWPPFAV 
SPEKYHALCS 



51 
I 

RHGARSPLKP 
GMFAGQLTKV 
AGLFQCQKEG 
RMGIDSSDKV 
QMAVGPFLHI 
DLTMELYQHL 
QTQVMEVGNE 



Seq ID NO: 208 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



50 | 



11



TTTGAGGGGG 
TAACAGCCTG 
TGCTGGGGGC 
TAGTGGCAGA 

GGAGCCCTCC 
CGTCTTTATT 
GCAGGTTCCT 
TGAGCTGGCA 



CCCTTGGAGA 
CAGTGTTAGG 
GCCTGGAGTG 
CTTGCATTTG 
GCAGATCAGT 
GGCAAATGGG 
CACTGCTCCT 
GCGCAG 



21 
I 

AGTTTAATTC 
GAAGTCCTTC 
CTTCAGGCCA 
ATGAGTGGGA 
CATTTGCCAC 
CCGCTCAGCT 
TCATTGGCTT 
TCACCAGTGT 



31 
I 

ATAAAGAAGC 
CTTGAGGATA 
TCCCTGGAGG 
TGGCCTTCTC 
TCAGAACTGC 
GCAGGTTTTT 
CCAAGGCAGT 
CCTGCGAGGT 



41 
1 

CTCCTGATCA 
AGGCCTCCCA 
CCAGTCCTGT 
AGGTACAGGA 
CGCGATCCCA 
CCAGTCATAG 
CAGGCCAACT 
CACCTTGGCG 



Seq ID NO: 209 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..564 



60 
120 
180 
240 
300 
360 
420 



51
I 

GAAAGGGGCC 
GGGGAGGAGG 
GCTCAGCAAG 
CTGTGCTGCT 
GCAATGGCCA 
TAGAAGGGAT 
GTGTGACTCT 
AGGGCTCACC 



60 
120 
180 
240 
300 
360 
420 
480 



ATGGAGCCCT 
TCAGATCCGT 
CAGAGTCACA 
GATCCCTTCT 
CTGGCCATTG 
AAGCAGCACA 
ACCTTGGCCA 
AGCTACACAT 
GATGCTGCTG 
GGAACGAAGG 



11 
I 

GGGCGTGGCT 
TCTCTGCGCT 
CAGTTGGCCT 
ACTATGACTG 
CTGGGATCGC 
GTCCTGTACC 
AATCAAATAA 
CAGTCTTCCG 
CCCTCACATC 
GCTGTACCTA 



21 
I 

GCAGGGTTTA 
GCCAGCTCAG 
GACTGCCTTG 
GAAAAACCTG 
GGCAGTTCTG 
TGAGAAGGCC 
ACCTTTATCT 
AGTGCCTCTT 
AGGCCATCCA 
ATGA 



31 
I 

AAGAGCCGAC 
GACACTGGTG 
GAAGCCAATG 
CAGCTGAGCG 
AGTGGCAAAT 
ATCCCACTCA 
CCAAGCACCT 
TCTGCGTCCC 
AGCATGCAGA 



41 
I 

CCACGTGCCC 
AAGGAGCAGT 
ACCCATTTGC 
GACTGATCTG 
GCAAATGCAA 
TCACTCCAGG 
TTGTCTTGGT 
TGTACCCTGC 
ACATAAGCAT 



51 

I 

AGCAGCCTCC 
GAGGAACCTG 
CAATAAAGAC 
CGGAGGGCTC 
GAGCAGCCAG 
CAGATTTCTC 
GTTTGGCATC 
CATTCCTGGT 
GCAGAACACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 210 Protein sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

I I I I I I

MEPWAWLQGL KSRPTCPAAS SDPFSALPAQ DTGEGAVRNL QSHTVGLTAL EANDPFANKD 60

DPFYYDWKNL QLSGLICGGL LAIAGIAAVL SGKCKCKSSQ KQHSPVPEKA IPLITPGRFL 120

TLAKSNKPLS PSTFVLVFGI SYTSVFRVPL SASLYPAIPG DAAALTSGHP SMQNISMQNT 180
GTKGCT 




Seq ID NO: 211 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..318 

1 11 21 31 

I I I I



Seq ID NO: 212 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 

I I I I 



HPSMQNISMQ NTGTKGCT 



41 51
I I

CCCGGCTGCC CCGTCTGGGA 60
GTGCCTGCTQ CGCTGGGCCC 120
TCCACCCCAG GGCCTGCTCC 180
GGGTCTTCAT TCTCACCCTG 240
ACAACTGTCT CACCCCAGCA 300
GCCTCACTCT TCGCGCCCAT 360
CCCTTCACGT CCAGTTATCC 420
ATCACGACAT AGCGCAGCAC 480
GTCACACAGT TGGCCTGACT 540
CCTTCTACTA TGACTGGAAA 600
CCATTGCTGG GATCGCGGCA 660
AGCACAGTCC TGTACCTGAG 720
TGGCCAAATC AAATAAACCT 780
ACACATCAGT CTTCCGAGTG 840
CTGCTGCCCT CACATCAGGC 900
CGAAGGGCTG TACCTAA




41 51
I I

RPPVPCQALR PPTFHPRACS 60
GNSPGSRAPC GVAGLTLRAH 120
IWDTGEGAVR NLQSHTVGLT 180
VLSGKCKCKS SQKQHSPVPE 240
PLSASLYPAI PGDAAALTSG 300



Seq ID NO: 213 DNA sequence 
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..1758

1 11 21 31 41 51

I I I I I I

ATGATGGGGT CTCATGTTGC CCAGGCTGGT CTTGAACTCC TGGGCTCGAG TGACCCTCCT 60 

GCCTTGGCCT CCGAAAGTGC TGGGATTACA GGACTGTTAT TACAGGAATC CATAACACTG 120 

GAGGATGTGG CTGTGGACTT CACTTGGGAG GAGTGGCAAC TCCTGGGCGC TGCTCAGAAG 180 

GACCTGTACC GGGATGTGAT GTTGGAGAAC TACAGCAACC TGGTGGCAGT GGGGTATCAA 240 

GCCAGCAAAC CGGATGCACT CTTCAAGTTG GAACAAGGAG AACAACTGTG GACAATTGAA 300

GATGGAATCC ACAGTGGAGC CTGTTCAGGT TCTCCAAAGG TCCCGTTCTC CATTTTCTCA 360

TCTGTGCCCT TCACTCTTCA AAATTGCCTT CATTCTAACA TATGGAAAGT TGATCATGTG 420 

CTGGAGCGCT TGCAGAGTGA AAGCCTGGTG AACAGAAGGA AACCATGTCA TGAACATGAT 480 

GCATTTGAAA ATATTGTTCA TTGCAGCAAA AGTCAGTTTC TGTTAGGGCA AAATCATGAT 540 

ATATTTGACT TACGTGGAAA AAGTTTGAAA TCCAATTTAA CTTTAGTTAA CCAGAGCAAA 600 

GGCTATGAAA TAAAGAACTC TGTTGAGTTT ACTGGAAATG GGGACTCCTT TCTTCATGCT 660 

AACCATGAAC GACTTCATAC TGCAATTAAA TTCCCTGCAA GTCAAAAACT CATCAGCACT 720

AAGTCCCAAT TCATCAGTCC CAAGCATCAG AAAACACGAA AATTAGAGAA GCATCATGTG 780 

TGCAGTGAAT GTGGGAAAGC CTTCATCAAG AAGTCTTGGC TAACTGATCA CCAGGTAATG 840 

CATACAGGAG AGAAACCCCA CAGATGTAGT CTATGTGAGA AAGCCTTCTC CAGAAAGTTC 900 

ATGCTTACTG AACATCAGCG AACTCATACA GGAGAAAAAC CTTATGAATG CCCTGAATGT 960 

GGCAAAGCCT TTCTCAAGAA ATCACGGCTC AACATACATC AGAAAACACA TACCGGAGAG 1020 

AAACCCTATA TATGCAGTGA ATGTGGAAAA GGCTTCATCC AGAAAGGAAA TCTCATTGTA 1080 

CACCAGCGAA TTCATACAGG TGAGAAACCT TATATATGCA ATGAATGTGG AAAAGGCTTC 1140 

ATTCAGAAGA CGTGTCTCAT AGCACATCAG AGATTTCACA CAGGAAAGAC GCCCTTTGTG 1200

TGCAGTGAAT GTGGAAAATC CTGTTCTCAG AAATCAGGTC TCATTAAACA TCAAAGAATT 1260

CACACAGGAG AGAAACCCTT TGAATGTAGT GAATGTGGGA AAGCCTTTAG CACAAAGCAA 1320 

AAGCTCATTG TCCATCAAAG GACTCATACA GGAGAGAGAC CCTATGGCTG TAACGAGTGT 1380 

GGGAAAGCGT TTGCGTATAT GTCGTGTCTG GTTAAGCATA AGAGAATACA CACAAGGGAG 1440 

AAACAAGAGG CAGCCAAGGT GGAAAATCCT CCTGCAGAGA GGCACAGCTC ATTACACACC 1500

AGTGATGTCA TGCAGGAGAA AAACTCTGCT AACGGGGCGA CTACACAAGT GCCTTCTGTG 1560

GCCCCTCAGA CATCATTAAA CATCAGCGGC CTCCTCGCAA ACAGGAACGT AGTCCTTGTG 1620 

GGACAGCCAG TGGTCAGATG TGCAGCCTCA GGAGATAACA GAGGATTTGC ACAGGACAGA 1680 

AACCTTGTGA ATGCAGTGAA TGTGGTTGTG CCTTCCGTGA TCAATTATGT CTTATTTTAT 1740 
GTTACAGAAA ACCCATAG 



Seq ID NO: 214 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I | 

MMGSHVAQAG LELLGSSDPP ALASESAGIT GLLLQESITL EDVAVDFTWE EWQLLGAAQK 60
DLYRDVMLEN YSNLVAVGYQ ASKPDALFKL EQGEQLWTIE DGIHSGACSG SPKVPFSIFS 120
SVPFTLQNCL HSNIWKVDHV LERLQSESLV NRRKPCHEHD AFENIVHCSK SQFLLGQNHD 180
IFDLRGKSLK SNLTLVNQSK GYEIKNSVEF TGNGDSFLHA NHERLHTAIK FPASQKLIST 240 

KSQFISPKHQ KTRKLEKHHV CSECGKAFIK KSWLTDHQVM HTGEKPHRCS LCEKAFSRKF 300 

MLTEHQRTHT GEKPYECPEC GKAFLKKSRL NIHQKTHTGE KPYICSECGK GFIQKGNLIV 360 

HQRIHTGEKP YICNECGKGF IQKTCLIAHQ RFHTGKTPFV CSECGKSCSQ KSGLIKHQRI 420 

HTGEKPFECS ECGKAFSTKQ KLIVHQRTHT GERPYGCNEC GKAFAYMSCL VKHKRIHTRE 480 

KQEAAKVENP PAERHSSLHT SDVMQEKNSA NGATTQVPSV APQTSLNISG LLANRNVVLV 540
GQPVVRCAAS GDNRGFAQDR NLVNAVNVVV PSVINYVLFY VTENP

Seq ID NO: 215 DNA sequence 

Nucleic Acid Accession #: NM_032190.1

Coding sequence: 502..1332

1 11 21 31 41 51

I I I I I I

GATTCCGTGT TCTTGGCCAT GTTAGCCATA ATATCCTGTG CAGTATGTTT TTCCTGTGCA 60 

GAGGCAAAAA CATATTGGGC ATATGTTCCC AAGCCCCCAG CAGTATGACC CATACTTTGG 120 

AGTGACACTC CTCCTAAGAT TTATCATGAT TAAGGAGCAT GGGCTCCAGG ACCCCTAACT 180 

CCACCTGACA TAGAACAGTT AGACTCTCAG AATAATGTCA TTAATTATAC CGCTCCATTG 240 

GAAGGACTTC CTTTGTGTGT CACCACAAAG ACATCACTCA GCCATAGCTG TCTTACAGTT 300

CAAGCTCACA CATGGTTGAG TCACTATGGG AAAATCATGT ACTTATTAAG TCTTGGTTAT 360 

ATTAATGTAA CCGGTGTGCT AACCAACCAT TCCTGGCCCA ATCGCCTTCA TTGTGCTGAC 420 

TATACAGAAT GGATTCCCTT CAATAGTTCC TACCCCCCTC CATAGACCCA GTGTCTTGGC 480 

CCACTGGCTA GAAAACAATC TATGTTAACT GGAGACATTG TGGATTGGGG ACCTAAAGGC 540 

CAATTAGATG GAAAAGAAGA AAATCAGAAA TCGTGGCACA AACTTTGCTG GCATTGGTGG 600 

CAAGCTTTTA ATGCTTCTTC TTTATATAAC ACTGGGATCC AATCCCAGTC GGCOGCCCAG 660 

ATTGCTTGGC ATGGAGCAGG CTTTAGCCCG CCTCTTCCTC AGTGGCATTA TCTAGGGAGG 720 

AAAGGACCAA TTCAAAAGAT GATATGGAAG GCAGCATTCC CATTTATGAA TGGCAACATC 780 

TGGGTTGCCA TAATACTATC CAATAATAGC AATAGTAAGC AACACAGTCT TAATGTTACA 840 

TTTGTAAAGA ATATCACCAC TCAATTTACA GTTTGTGTTT TTAATCCTTA TGTGTTTTTG 900 

GCAGCTAAGA AGGACCAGCT CCAGGTAAAC AATACCCAAT TGACCTGTAA ATCTTGCCAG 960 

TTATATCACT GCATTAATCA TAGCACATTG CAAACACATA ATATCTCTAC TTTGATGATT 1020 

TTAGGTTGCA TCCCTGGGCT ATGGATTCCT GTTAATCTGT CTGAGCCATG GGCTGCCACA 1080 

ATTGCTTTAC ATTTTGTGAA ACTTCTTCTA ACTCAGTTTA CTCATTGTGT CCGTAGAGGC 1140 

TTAGGCATGA TAATTTTTGC TATTGTTTAC TTGGTCACAC TAATAATTTC TGTTGTGATG 1200 

TCCTCTGTAG CTTTGCATAG TTCTATTCAA ACAGCTCAGT ATGTGGAGAA CTGGACACGC 1260 

ACAGTCAACC AAGGGTGGCT ACTTGAGAAT AAAATTAACA CTGAGTTACA AACTGAAGTG 1320 

GCAGTGTTAT AATCCACGAT TCTATGGTTA GGGGAACAAG TACAAAGCTT GCAATTGCAG 1380 

CAGTAATTGT GTTGTCATTT TAATCACACT CATATTTGTG TAACCAACTT AGAATATAAC 1440 

CAAAGTGAGT ATCCATGGGA TCTTGTGAAA GCCCATTTGC AGGGAGCTTT CACATCCGAC 1500 

ATCACCTTTG ATATTGGTGA ATTACAAAAC AAAATTCTTG ATTTAAATAA ACAAATTCCA 1560 

GAGTTTCAGC CTTCTTTAGA AGACTGGACT GAATTCCAGC AAGGCCTGGA GAGCGTCAAC 1620 

CCTTGGACCT ATCTAAAGCA CCACATTAAC ATCTTATATA TAGTTCTTGG AATAATGTTG 1680 

TTTTGTCTCT GTCTTCTGTT CATAGTCTGT AAAATCGGAT GGACTGCCAA TCGGAGAATG 1740 

AAAGCTACCC AGCCTGGCCT TACATTCTTT CACTTAATAC ATAAACAAGA AGGGGGAAAT 1800 

GTTGGGAGCC AAAAAGGCCA AAGGGATGGT GACCAACTCA GCATTCCACT GGAGGCTACA 1860 

TGATCAAACA GCAAACTGTT TATCATGAAT ACAGAATGTG GGCAAACTCG CTTCTGTGCC 1920 

TGCCCAGAAG GTTTGCTGAG GGCCATOGCT CCCTGGCCCC GGCTCCTTGA GGTTATCTAC 1980 

TGGGACATCT AGAGCCTATT GTTCGAGGAA TGCAGTCTTG CAAGCCTACT CTGGACCGAG 2040 

CAGCTGACCT CTTCTTCCAC ACCCCTTCTC ACTATCTCTT TTGCCTAATA AATATGGAGG 2100 

GCTGTGTAAA GCTCAGGGCC CTTGTCCACT AGAGGCAAGG TGTCCCCTGA CCCTTCTTCC 2160 
AAACAT 

Seq ID NO: 216 Protein sequence 
Protein Accession #: NP_115566.1 

1 11 21 31 41 51 

I I I I I I 

MLTGDIVDWG PKGQLDGKEE NQKSWHKLCW HWWQAFNASS LYNTGIQSQS AAQIAWHGAG 60 

FSPPLPQWHY LGRKGPIQKM IWKAAFPFMN GNIWVAIILS NNSNSKQHSL NVTFVKNITT 120

QFTVCVFNPY VFLAAKKDQL QVNNTQLTCK SCQLYHCINH STLQTHNIST LMILGCIPGL 180 

WIPVNLSEPW AATIALHFVK LLLTQFTHCV RRGLGMIIFA IVYLVTLIIS VVMSSVALHS 240
SIQTAQYVEN WTRTVNQGWL LENKINTELQ TEVAVL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..1566

1 11 21 31 41 51 
I I I I I I 

ATGGTGAACC CCAAATCCAC TTCCTCCCTC TTCAGGTTAT GTTTTTTGCT CCTGAGGAGT 60 

CAGAACCTGT GGGTTGAAGA GCAAATTCAA TGCAAAAACA TATTGGGCAT ATGTTCCCAA 120 

TCCCCCAGCA GTATGGCCTA TACTTTGGAG CTCACTCCTC CTGAGATTTA TCACGATCAG 180 

GGAGAGTGGG CTCCAGGACC CCTAACTCCC CGTGACATAG AAAAGTTAGA CTCTCAGAAC 240 

AATGTCATTA ATTATACCAC TCCACTGGAA GGACTCCCTT TGTTTATCAC CACAAAGACG 300 

TCGCTCAGCC ATAGCTGTCT TGCAATTCAA GCTCAAACAT GGTTGAGTCA CTATGGAAAA 360 

ATTATGTACT TATTAGGTCT TGGTTCTATT AATGTAACTG GTGTGCTAAC CAATCATTCC 420 

CAGTCCAGTC ACCCTAATTG TGCTGATTAT ACAGAATGGA TTCCATTCAA TAGTTCCTAC 480 

CCCACTCTGT GGACCCAGTG TCTTGATCCA CTGGCTAGTA AACAATATAT GTCAACTGAA 540 

GACACTGTGG ATTGGGAACC TAAAGGTCAA TTAGATGGAA AAGGTGAAAG TCAGAAATCA 600 

TGGCACAAAC TTCACTGGCA TTGGCGGCAA GCTTTTAATG CTTCTTCTTT ATACAACAGC 660 

AGAATCCAAT CCCAGTCTGC TGCTCAGATT GCTTGGCATG GAGCAGGCTT TAGCCCACCT 720 

CTTCCTCAGT TGCATTATCT GGGGAGGAAA GGACCAATTC AAGAAACTAT ATGGAAGGCA 780 

GCACTCCCAT TTATGAATGG CAACATCTGG ATTGGAACAC TGTCTAATAA TAGCAATAGT 840 

AAGCAACACA GTCTTAATGT TGCATTTGTA AAGAATATCA CCACTCAGTT TACAGTTTGT 900 

GTTTTTAATC CTTATGCCTT TTTGGCAGCT AAGAAGAACC AGCTTCAGGT GGAGAACTGG 960 






WO 03/042661 



ACACGCACAG CTGACCAAGC GAGGCTACTT CAGAATAAAA TTAACACTGA GTTACAAACT 
GAAGTGGCAA TGTTGAAATC CATGGTTCTG TGGTTAGGAG AACAGGTACA AAGCTTGCAG 
TTGCAGCAGC AATTGCGTCA TCATTTTAAT CACATTCATA TTTGCGTAAC TAACTCAGAA 
TATAACCAAA GTGAGTATCC GTGGGACCTT GTGAAAGCCC ATTTGCAAGG AGCTTTCACA 
TCCAACATCA CCTTTGATAT TGGTGAATTA CAAAACAAAA TTATTGATTT AAATAGGCAA 
ACTCAAGAAT TTCAGCCTTC TTTAGAAGAC TGGACOGAAT TCCAGGAAGG CCTGGAGAGC 
CTCAACCCTT GGACCTATCT AAGGCACCAC ATTAACATCT TATATGTAGT TCTTGGAATA 
ATGTTGTTTT GTCTCTGTCT TCGGTTTATA GTCTGTAAAA TCGGATGGAC CACCAATTGG 
AGAATGAGAG CCTCCCAGCC CAGCCTTACA TTCTTTCAAT TAATACATAA ACAGAAAGGG 
GGATATGCAG GGAGCCAAAG GCCTGTGGGA CGTGACCAAC TCAGCATTCT GCTGGAGGCT 
ATATGA 

Seq ID NO: 218 Protein sequence
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MVNPKSTSSL FRLCFLLLRS QNLWVEEQIQ CKNILGICSQ SPSSMAYTLE LTPPEIYHDQ 
GEWAPGPLTP RDIEKLDSQN NVINYTTPLE GLPLFITTKT SLSHSCLAIQ AQTWLSHYGK 
IMYLLGLGSI NVTGVLTNHS QSSHPNCADY TEWIPFNSSY PTLWTQCLDP LASKQYMSTE
DTVDWEPKGQ LDGKGESQKS WHKLHWHWRQ AFNASSLYNS RIQSQSAAQI AWHGAGFSPP
LPQLHYLGRK GPIQETIWKA ALPFMNGNIW IGTLSNNSNS KQHSLNVAFV KNITTQFTVC 
VFNPYAFLAA KKNQLQVENW TRTADQARLL QNKINTELQT EVAMLKSMVL WLGEQVQSLQ 
LQQQLRHHFN HIHICVTNSE YNQSEYPWDL VKAHLQGAFT SNITFDIGEL QNKIIDLNRQ 
TQEFQPSLED WTEFQEGLES LNPWTYLRHH INILYVVLGI MLFCLCLRFI VCKIGWTTNW
RMRASQPSLT FFQLIHKQKG GYAGSQRPVG RDQLSILLEA I 

Seq ID NO: 219 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..900 

1 11 21 31 41 51

| | I I I I 

ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60 

CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120 

GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180 

CGGCCCACTG CGCTGGACAC CTTCTCTGGT ACGTACGTTC AATCGCCCGT GCGGCCGCGT 240 

GGCTGCGGCG GGGCTGTGCA CCGGGGAGCT GGGGCGGGCG TCTCGGCGGG AGGGCGCAGA 300 

GGACCCCGGG GAGGAGACTG GAGCAGGCCC CGAGGTGGCG CTGGTGCGGC CCAGGACGCT 360 

CTTCCTAACT CAGGCTCTCC CCGCCCCGCC CCTGCAGTGC AAGTCCTGGT GGATGGAGCT 420 

CCGGTGCGCA TTGAGCTCTG GGACACAGCG GGACAGGAGG ATTTTGACCG ACTTCGTTCC 480 

CTTTGCTACC CGGATACCGA TGTCTTCCTG GCGTGCTTCA GCGTGGTGCA GCCCAGCTCC 540 

TTTCAAAACA TCACAGAGAA ATGGCTGCCC GAGATCCGCA CGCACAACCC CCAGGCGCCT 600 

GTGCTGCTGG TGGGCACCCA GGCCGACCTG AGGGACGATG TCAACGTACT AATTCAGCTG 660 

GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 720 

ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA 780 

TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 840
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

Seq ID NO: 220 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51

| I I 1 1 I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 
RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 120 
LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSVVQPSS 180
FQNITEKWLP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 240
IRACCYLECS ALTQKNLKEV FDSAILSAIE HKARLEKKLN AKGVRTLSRC RWKKFFCFV

Seq ID NO: 221 DNA sequence 

Nucleic Acid Accession #: XM_063832.2

Coding sequence: 1..711

1 11 21 31 41 51 

| | i 1 I i 

ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60 
CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120 
GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180 
CGGCCCACTG CGCTGGACAC CTTCTCTGTG CAAGTCCTGG TGGATGGAGC TCCGGTGCGC 240 
ATTGAGCTCT GGGACACAGC GGGACAGGAG GATTTTGACC GACTTCGTTC CCTTTGCTAC 300
CCGGATACCG ATGTCTTCCT GGCGTGCTTC AGCGTGGTGC AGCCCAGCTC CTTTCAAAAC 360 
ATCACAGAGA AATGGCTGCC CGAGATCCGC ACGCACAACC CCCAGGCGCC TGTGCTGCTG 420 
GTGGGCACCC AGGCCGACCT GAGGGACGAT GTCAACGTAC TAATTCAGCT GGACCAGGGG 480 
GGCCGGGAGG GCCCCGTGCC CCAACCCCAG GCTCAGGGTC TGGCCGAGAA GATCCGAGCC 540 
TGCTGCTACC TTGAGTGCTC AGCCTTGACG CAGAAGAACT TGAAGGAAGT ATTTGACTCG 600 
GCTATTCTCA GTGCCATTGA GCACAAAGCC CGGCTGGAGA AGAAACTGAA TGCCAAAGGT 660 
GTGCGCACCC TCTCCCGCTG CCGCTGGAAG AAGTTCTTCT GCTTCGTTTG A 

Seq ID NO: 222 Protein sequence 
Protein Accession #: XP_063832.1 

1 11 21 31 41 51 

I I 1 I I I 
MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60
RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVFLACF SVVQPSSFQN 120
ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 180
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV



Seq ID NO: 223 DNA sequence 
Nucleic Acid Accession #: Eos 
Coding sequence: 1..1161 



1 
I 

ATGAATCGGC 
TTCCGAGATG 
GGGCTTCTGG 
TCCAGCCGGA 
CCGTTCGTGA 
CGGCTGGTGC 
GTGGCGGTAG 
AATTGGACAG 
CACCTCCTGA 
AGCATCTGCC
CTGGGCATCA 
GACCGGCATG 
GTCATCTGCT 
TCGGGCACGC 
AGCTTCACCT 
TTTCCCAACT 
CCAGATAATA 
GCTCCAGAGG 
ACCTCAAATA 
CAGTTGGGAT 



11 

I 

ACCATCTGCA 
ACTTCATTGC 
GCAATGGCCT 
TTTTCCTGTT 
TGGACTACTA 
TCTTCATGTT 
ACAGGTATTT 
CAGCCATCAT 
AGAAGAAGTT 
ATACCTTCCG 
TCCTGTTCTG 
CCAAGATCAA 
TCCTTCCCAG 
AGAATTGTGA 
ACATGAACAG 
TCTTCTCCAC 
ACCGCAGCAC 
CGTTAATGGC 
ACCATTCCAA
GTTGCATCGA 




TGCCATGAAC 
CCGGGTGGTC 
CTCTTGCCTT 
GCTGATCCAG 
GTGGCACGAA 
CTCAGCCAGA 
GAGAGCCATC 



AGTGTACCGC 
CATGCTGGAC 
TTTGATCAAC 
GAGCGTCGAG 
CAACTCCGGT 
GAAGGGACAT 
G 



sequence 



31 
I 

CTGGAAATAG 
CCGCCGGTGT 
ATTTTCTGTT 
GTAGCTGACT 
TCAGACTGGA 
CGCCAGGGCA 
CATCCCCACC 
CTGTGGGGCA 
AATGGCCCTG 
GCTATGTTCC 
ATTATCTGGA 
ACCTTCATCA
CGGATCCGCA 
TCGGTGGACC 
CCCGTGGTGT 
CGCTGCCTCC 
CTCACAGGGG 
GAGCCATGGA 
TGTCACCAAG 



41 
I 

ACAAGAAGAA 
TGGGGCTGGA 
TCCACCTCAA 
TTCTACTGAT 
AGTTTGGGGA 
GCATCATCTT 
ACGCCCTGAA 
TCACTGTTGG 
CAAATGTGTG 
TCCTGGAGTT 
GCCTGCGGCA 
TGGTGGTGGC 
TCTTCTGGCT 
TGGCGTTCTT 
ACTACTTCTC 
AGAGGAAGAT 
ACCCCAACAA 
GCCCCTCTTA 
AACCAGCATC 



51 
I 

GTTTATCTTT 
GTCCTGGAAA 
CATCTGCCTG 
CATCCCTTGC 
CCTCACGGTG 
CAAGATCTCC 
CCTAACAGTC 
CATCAGCTTC 
CCTCCTGCCC 
GAGACAAATG 
CATCGTCTTT 
CCTGCACACT 
TATCACTCTC 
CAGCCCATCC 
GACAGGTGAG 
AACCAGAGGC 
TCTGGGCCCA 
TCTGGAGAAA 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 224 Protein sequence 
Protein Accession #: Eos sequence



1 11 21 31 41 51
| | | | | |

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK
SSRIFLFNIA VADFLLIICL PFVMDYYVRR SDWKFGDIPC RLVLFMFAMN RQGSIIFLTV
VAVDRYFRVV HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF
SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMVVAIVF
VICFLPSVVV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PVVYYFSSPS
FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP
TSNNHSKKGH CHQEPASLEK QLGCCIE



Seq ID NO: 225 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..1092 



1 
I 

ATGAATCGGC 
TTCCGAGATG 
GGGCTTCTGG 
TCCAGCCGGA 
CCCTTCCTGA 
CGGCTGATGC 
GTGGCGGTAG 
AATCGGACAG 
CACCTCCTGA 
AGCATCTGCC 
CTGGGCATCA 
GACCGGCATG 
GTCATCTGCT 
TCGGGCACGC 
AGCTTCACCT 
TTTCCCAACT 
CCAGATAATA 
GCTCCAGAGG 



11 
1 

ACCATCTGCA 
ACTTCATTGT 
GCAATGGCCT 
TTTTCCTGTT 
TGGACAACTA 
TCTTCATGTT 
ACAGGTATTT 
CAGCCATCAT 
AGAAGAAGAT 
ATACCTTCCA 
TCCTGTTCTG 
CCAAGATCAA 
TCCTTCCCAG 
AGAATTGTGA 
ACATGAACAG 
TCTTCTCCAC 
ACCGCAGCAC 
CGTTAATGGC 




AGTGTACCGC 
CATGCTGGAC 
TTTGATCAAC 
GAGCGTCGAG 
CAACTCCGGT 



31 
I 

CTGGAAATAG 
CCGCCGGTGT 
ATTTTCTGTT 
GTGGCTGACT 
TGGGACTGGA 
CGCCAGGGCA 
CATCCCCACC 
CTGTGGGGCA 
AATGGCGGTG 
GCCATGTTCC 
ATTATCTGGA 
ACCTTCATCA
CGGATCCGCA 
TCGGTGGACC 
CCCGTGGTGT 
CGCTGCCTCC 
CTCACAGGGG 
GAGCCATGGA 



41 

I 

ACAAGAAGAA 
TGGGGCTGGA 
TCCACCTCAA 
TTCTACTGAT 
AGTTTGGGGA 
GCATCATCTT 
ACGCCCTGAA 
TCACTATTGG 
CAAATTTGTG 
TCCTGGAGTT 
GCCTGCGGCA 
TGGTGGTGGC 
TCTTCTGGCT 
TGGCGTTCTT 
ACTACTTCTC 
AGAGGAAGAT 
ACCCCAACAA 
GCCCCTCTTA 



Seq ID NO: 226 Protein sequence 
Protein Accession #: Eos sequence



1 
I 

MNRHHLQDHF 
SSRIFLFNLA 
VAVDRYFRW 
SICHTFQWHE 
VICFLPSVW 
FPNFFSTLIN 
TSP 



11 
I 

LEIDKKNCCV 
VADFLLIICL 
HPHHALNKIS 
AMFLLEFFLP 
RIRIFWLLHT 
RCLQRKMTGE 



21 



51 



60 
120 
180 
240 
300 
360 



51 
I 

CTGCTGTGTG 
GTTTATCTTC 
GTCCTGGAAA 
CATCTGCCTG 
CATCCCTTGC 
CCTCACGGTG 
CAAGATCTCC 
CCTGACAGTC 
CAGCAGCTTC 
CTTCCTGCCC 
GAGACAAATG 
CATCGTCTTT 
CCTGCACACT 
TATCACTCTC 
CAGCCCATCC 
GACAGGTGAG 
AACCAGAGGC 
TCTGGGCCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



31 41 

I I I 

FRDDFIVKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 
PFLMDNYVRR WDWKFGDIPC RLMLFMLAMN RQGSIIFLTV 
NRTAAIISCL LWGITIGLTV HLLKKKMPIQ NGGANLCSSF 
LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF 
SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 
PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 



60 
120 
180 
240 
300 
360 



Seq ID NO: 227 DNA sequence

Nucleic Acid Accession #: NM_006018

Coding sequence: 61..1224

1 11 21 31 41 51 

I I I I I I 

CGCCACTTTG CTGGAGCATT CACTAGGCGA GGCGCTCCAT CGGACTCACT AGCCGCACTC 60 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 120 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 180 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 240 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 300 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA ACTTTGGGGA CATCCCTTGC 360 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 420

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 480 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 540 

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 600 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 660 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 720 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 780 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 840 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 900 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 960 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 1020 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1080 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1140 

ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1200 

CAGTTGGGCT GTTGCATCGA GTAATGTCAC TGGACTCGGC CTAAGGTTTC CTGGAACTTC 1260 

CAGATTCAGA GAATCTGATT TAGGGAAACT GTGGCAGATG AGTGGGAGAC TGGTTGCAAG 1320 

GTGTGACCAC AGGAATCCTG GAGGAACAGA GAGTAAAGCT TCTAGGCATC TGAAACTTGC 1380 

TTCATCTCTG ACGCTCGCAG GACTGAAGAT GGGCAAATTG TAGGCGTTTC TGCTGAGCAG 1440 

AGTTGGAGCC AGAGATCTAC TTGTGACTTG TTGGCCTTCT TCCCACATCT GCCTCAGACT 1500 

GGGGGGGGCT CAGCTCCTCG GGTGATATCT AGCCTGCTTG TGAGCTCTAG CAGGGATAAG 1560 

GAGAGCTGAG ATTGGAGGGA ATTGTGTTGC TCCTGGAGGA AGCCCAGGCA TCATTAAACA 1620 

AGCCAGTAGG TCACCTGGCT TCCGTGGACC AATTCATCTT TCAGACAAGC TTTAGAGAAA 1680 

TGGACTCAGG GAAGAGACTC ACATGCTTTG GTTAGTATCT GTGTTTCCGG TGGGTGTAAT 1740 

AGGGGATTAG CCCCAGAAGG GACTGAGCTA AACAGTGTTA TTATGGGAAA GGAAATGGCA 1800 

TTGCTGCTTT CAACCAGCGA CTAATGCAAT CCATTCCTCT CTTGTTTATA GTAATCTAAG 1860 

GGTTGAGCAG TTAAAACGGC TTCAGGATAG AAAGCTGTTT CCCACCTGTT TCGTTTTACC 1920 

ATTAAAAGGG AAACGTGCCT CTGCCCCACG GGTAGAGGGG GTGCACGTTC CTCCTGGTTC 1980 

CTTCGCTTGT GTTTCTGTAC TTACCAAAAA TCTACCACTT CAATAAATTT TGATAGGAGA 2040 
CAAAAAAAAA A 

Seq ID NO: 228 Protein sequence 
Protein Accession #: NP_006009.1

1 11 21 31 41 51

| | | | | |

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 60

SSRIFLFNLA VADFLLIICL PFVMDYYVRR SDWNFGDIPC RLVLFMFAMN RQGSIIFLTV 120

VAVDRYFRVV HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180

SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMVVAIVF 240

VICFLPSVVV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PVVYYFSSPS 300

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360
TSNNHSKKGH CHQEPASLEK QLGCCIE

Seq ID NO: 229 DNA sequence 

Nucleic Acid Accession #: NM_014398.1 

Coding sequence: 64..1314

1 11 21 31 41 51 

I I 1 I I I 

GGCACCGATT CGGGGCCTGC CCGGACTTCG CCGCACGCTG CAGAACCTCG CCCAGCGCCC 60 

ACCATGCCCC GGCAGCTCAG CGCGGCGGCC GCGCTCTTCG CGTCCCTGGC CGTAATTTTG 120 

CACGATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180 

ACTGCAGCAG CAACAGTACA GGACATAAAA AAACCTGTCC AGCAACCAGC TAAGCAAGCA 240 

CCTCACCAAA CTTTAGCAGC AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 300 

ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGCAAC CACCAGCCCA 360 

ATTACCTACA CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420 

GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCCACC CACCATCACC 480 

CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC TGGGAACACC 540 

ACTCAACCCA GTAACCAGAC CACCCTTCCA GCAACTTTAT CGATAGCACT GCACAAAAGC 600

ACAACCGGTC AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC AGCTGCCCAC 660 

AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 720 

CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT CTGTATAAAA 780 

GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTTTTTC ACCTCGGAGA 840 

TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTGTGGCAC CCGAAAATCC 900 

AACCTTCTGT TGAATTTTCA GGGCGGATTT GTGAATCTCA CATTTACCAA GGATGAAGAA 960 

TCATATTATA TCAGTGAAGT GGGAGCCTAT TTGACCGTCT CAGATCCAGA GACAGTTTAC 1020 

CAAGGAATCA AACATGCGGT GGTGATGTTC CAGACAGCAG TCGGGCATTC CTTCAAGTGC 1080 

GTGAGTGAAC AGAGCCTCCA GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 1140 

CTTCAAGCCT TTGATTTTGA AGATGACCAC TTTGGAAATG TGGATGAGTG CTCGTCTGAC 1200 

TACACAATTG TGCTTCCTGT GATTGGGGCC ATCGTGGTTG GTCTCTGCCT TATGGGTATG 1260 

GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT CTAATTGTTG 1320

CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT CATCCCTTCC AGGATGGATG 1380 






PCT/US02/36810 



TTGGGAAATT
AATGAAGTGA 
GTTTATTTTA 
GCCACTCAAA 
AGCCTTCAAA 
ATTTTATTTT 
GCTTTTACTA 
ACTCCTTTTC 
CAGGCTGGAG 
TGATTCTCCT 
GCTAATTTTT 
CTCTTGACCT 
AGCCATTGCG 
GTTGTCTAAG 
CTTGATGACT 
CTAAACAATA 
TAGGCTAAGC 
ACTGAGACTT 
GAGCTTGAAT 
CCTACAAGAA 
TCACCTTACA 
AGCTTTGCAG 
TGAGGGGCTT 
GTAAAGATGA 
ACTTCGCTAA 
TTCTGCACTT 
GAATTTTATT 
AGAAAAGTCC 
CCATGTTGAC 
TAGTCTTAAT 



CCCTCAGAGT 
GTCATGTGTG 
TGAAAGATAT 
GTCAACATTT 
TTATAAACCA 
ACCCTTGATC 
TCTGTGTTTT 
CACTTTAAAT 
TACAGTGGCA 
GCTTCAGCTT 
GTATTTTTAT 
CAGGTGATCC 
CCCGGCCTTA 
TGTTTTTATG 
CCTGCTCCAG 
AGCAAGAGAC 
ACTTTATCTA 
AAGGGAACTG 
TCATGTTGGT 
CAATGACACC 
GGGAAATGGG 
ATAACAAAAT 
TGTAAAACAT 
AGGCATCAAA 
CCAACTGTTC 
CATATCCATA 
TCTGCTGTTT 
ACATAACCCT 
TTTCCTCATG 
AAAACATTGA 



GTGGGTCCTT 
ATTTAAGTTC 
AGTGAGCTGT 
GAGATATGTT 
AGGGTCAATT 
TTAACAAAGC 
ATGGTTTCAT 
TTGTTTTTGT 
CGATCTCGGC 
CCCGAGTAGC 
TATAGACGGG 
ACCCACCTCA 
AATGTTTTTT 
TAAAACCAAC 
AATTGCTAGA 
AATAATAATG 
TATCTCATTT 
AATCACTTAA 
CTGACATCAA 
ACACTCTGCC 
TTTATCCAGG 
AGCCTATCCT 
TAGTCAGTTG 
TAAACTCAAA 
TTTCTTGAGT 
TTTCCTATTG 
TTTTTGCTGC 
AGAATTCTTA 
TGTTTCCTTA 
ATTGTAGTAA 



CAAACAATGT 
AGGCAGCACA 
TTATTTTCTA 
GAATTAACAT 
GTAACTAATA 
CTTTGCTTTG 
GTAACATACA 
TTTTTGAGAC 
TTATGGCAAC 
TGGGATTACA 
TTTCACCATG 
GCCTCCCAAA 
TTAATCATCA 
AAAAAGAACA 
CTAAGAATTA 
GCCCTTAATT 
CATTCTCACA 
ATGTCACCTG 
GGTCTTTGGT 
TGAAGGCTCA 
ATCATGAGAC 
TAATAAATCC 
CTCATTTTTA 
GTATTTTTAA 
GTATAGCCCC 
TTCACTTTAT 
TAAAGAAAGG 
GTCAAGGAAT 
TGACTCAGTA 
AGGTTTTTGC 



AAACCACCAT 
TCAATTTCTA 
GTTTCCTTTA 
AATATATGTA 
CTACTGTGTG 
TTATCAAATG 
TATTCCTGGT 
GGAGTTTCAC 
CTCCGCCTCC 
GGCACACACT 
TTGGCCAGAC 
GTGCTGGGAT 
AAAAGAACAA 
AATCAGCTTA 
GGTGGCTACA 
ATTAACAAAG 
ACTTATAAGT 
GCTAACTGAT 
CTTCTCCCTA 
CACCTCATAC 
ATTAGGGTAG 
TCCACTCTCT 
TGGGATTGCT 
ATTTTTTTGA 
ATCTTGTGGT 
TCTGTAGAGC 
AACTAAGTCA 
AATTCAAGTC 
AGTTGGCAAG 
AATAAAAACT 



CTTCTATTCA 
AATACTTTTT 
GAATATTTTA 
AAGTAGAATA 
TGCATTGAAG 
GACTTTCAGT 
GTAGCACTTA 
TCTTGTCACC 
CGGGTTCAAG 
ACCACGCCTG 
TGGTCTTGAA 
TACAGGCATG 
CATATCTCAG 
TATTTTTTAT 
GATGGTAGAA 
TGCCAGAGTC 
GAATGAGTAA 
GGCAGAGCCA 
CACCAAGTTA 
CAGCATACGC 
ATGAAAGGAG 
GGAAGGAGAC 
TAGCTGGGCT 
TAATAGAGAA 
AACTTGCTGC 
AGCCTGCCAA 
GGATGTTAAC 
AGCCTAGAGA 
GTCCTGACTT 
TACTTTGG 



1440

1500 

1560 

1620

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 



Seq ID NO: 230 Protein sequence 
Protein Accession #: NP_055213.1 



1 11 21 31 41 51
| | | | | |

MPRQLSAAAA LFASLAVILH DGSQMRAKAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP 60
HQTLAARFMD GHITFQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120
TEVTVGPSLA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIALHKST 180
TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTLAPQP SSVKTGIYQV LNGSRLCIKA 240
EMGIQLIVQD KESVFSPRRY FNIDPNATQA SGNCGTRKSN LLLNFQGGFV NLTFTKDEES 300
YYISEVGAYL TVSDPETVYQ GIKHAVVMFQ TAVGHSFKCV SEQSLQLSAH LQVKTTDVQL 360
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI VVGLCLMGMG VYKIRLRCQS SGYQRI



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94..378



1 

! 

TTCCTTTCAT 
CAACAGCACC 
TTGGCTGTGA 
TGTCTTTGCA 
ATAATGTACC 
AAAGGACAAC 
GAAAGAAAGA 
AACCTAGAAC 
AGACTTTTCT 
GGGTGAAAGG 
CTGCCCAAAG 
GGTTACCATC 
GCATTTCTAG 
GAGAACATTT 
CTGTGGTTAC 
CATCTATGTG 
CCAAATATCA 
TTTATAACCA 
TGGGATACTG 
GATGTTTTTC 
TGTACTTTTT 
TACAAAATGT 
AATCACTTTT 
TTGTTCATGC 
GTCATTTTTT 



11 

I 

GTTCAGCATT 
AGCAGCAACA 
TATTGTGTGC 
TAGGCCCTGG 
CAAGTAACAA 
GATGCCTAAA 
ATTTTTAAAA 
AAGTTTAACT 
ATGGTTTTGT 
ACCAAAAACA 
GAGTCCAGCA 
GGAGTTTACA 
GCTAGAGAAC 
CTGTCTCTAG 
AGTGGAGACA 
TCGTAAAGCA 
TGTAGCACAT 
ATTCATTAAA 
GCAACAGTGC 
AACTTTTATT 
GTTTTGATCC 
TTTTGTCTAC 
ACTTTTTGTA 
CTATATACTG 
TCTCTAATAA 



21 
I 

TCTACTCCTT 
GCAAAAAACA 
TACAGTTGTT 
GGTAAAAGCA 
CTGTGACAAA 
TCCCAAATCG 
ATATCAAAAC 
GTGACTACTG 
GACTTTCAAC 
GAAATACAGT 
ATTAAATGGA 
AAGTGCTTTC 
CTTCTAGATT 
AAGTTATCTG 
TTGACATTAT 
TTCCTCAAAC 
CAATATGTAG 
TGTAATTCAT 
ACATATTTCA 
CATTGAGATG 
GTTTGTATAA 
CAAAGAAAAA 
ATTCTGTCTC 
TAAAATTTAG 
ACTACCACAA 



31 
I 

CCAAGAAGAG 
AACATGAGTG 
CAAGGCTTCC 
GTGAAAGTGG 
ATAGAAGTGA 
AAGCAAGCAA 
ATATGAAGTC 
AAATGACAAG 
TTTTGTACAG 
CTTCCTGAAT 
TTTCTAGGAA 
ACGTTCTTAC 
TGATGCTTAC 
TCTGTATTGA 
TACTGGAGTC 
ATTTTTTCAT 
GGAAACATTC 
AAAATGTACT 
TAACCAAATT 
TTTTGAAGCA 
ATGATAGCAA 
TGTTGAAAAA 
TTAGAAAAAT 
GTATACTCAA 
CCTTTCTTTT 



41 
I 

CAGCAAAGCT 
TGAAGGGCAT 
CCATGTTCAA 
CAGATATTGA 
TTATTACCCT 
GGCTTATAAT 
CTGGAAAAGG 
AATTCTACAG 
TTATGTGAAG 
GAATGACAAT 
AAGCTACCTT 
TTGTTGTATT 
AACTATTCTG 
TCTTTATGCT 
AAGCCCTTAT 
GCAAATACAC 
TTATGCATCA 
ATGAAAAAAA 
AGCAGCACCG 
ATTAGGATAT 
TATCTTGGAC 
TAAGCAAATG 
ACATAATCTA 
GACTAGTTTA 
TTAAAAAAAA 



51 

i 

GAAGTAGCAG 
GGCTATAGCC 
AAGAGGACGC 
GAAAGCCTCC 
GAAAGAAAAT 
CAAAAAAGTT 
GCATCTGAAA 
TAGGAAACTG 
GATGAAAGGT 
CAGAATTCCA 
AAGAAAGGCT 
ATACATTCAT 
TTGTGACTAT 
ATATTACTAT 
AAGTCAAAAG 
ACTTCTTTCC 

TTATACGCTA 
GTCTTAATTT 
GTGTGTTTAC 
ACATTTGAAA 
TATACCTAGC 
ATCAATTTCT 
AAGAATCAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO: 232 Protein sequence 
Protein Accession #: NP_005400.1



1 11 21 31 41 51

I I I I I I 

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 






Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #: NM_000577

Coding sequence: 41..520






1
I 

GGCACGAGGG 
CCGACCCTCT 
GAAGACCTTC 
CAATTTAGAA 
CCATGGAGGG 
GGAGGCAGTT 
CATCCGCTCA 
CCTCTGCACA 
CGTCATGGTC 
TCCCATTCTT 
GCTATGGGGG
CCTGGTCACA 
GTCTTTCTAA 
TCTGCATTCA 
CTTCCTCCCT 
ACCAAGTGGC 
TTTAAGGGTT 
GAAGGAGAGC 
ATTCCTGCAT 
TTTTTTTGTG 
ATTTTTTTTT 
CCCAGCCTCC 
TCCTATCAGA 
TCTTCCTCTG 
ACTTGTATGA 
GAGCAGGAAA 
CTCTTGGCAG 
CTGTGACTTC 
AAAAAAAAAA 



11 
I 

GAAGACCTCC 
GGGAGAAAAT 
TATCTGAGGA 
GAAAAGATAG 
AAGATGTGCC 
AACATCACTG 
GACAGTGGCC 
GCGATGGAAG 
ACCAAATTCT 
GCATGGCAAG 
CACTGAGGAC 
GGACTCTGCC 
TGTGTGAATC 
GGATCAAACC 
CATTCCACCT 
TCCCACACCC 
TGTGGAAAAT 
CCTTCATTTG 
TTGTGAAATG 
ATGTCCCAAC 
TCCTTTTAAA 
AAGCTCCATC 
AGTTTCTCAG 
CTGAAGGAAT 
AAGATGGCTG 
CATGACTCGT 
GTACTCAGCG 
AGCTCTGTTT 
AAAAAAAAAA 



21 
I 

TGTCCTATCA 
CCAGCAAGAT 
ACAACCAACT 
ATGTGGTACC 
TGTCCTGTGT 
ACCTGAGCGA 
CCACCACCAG 
CTGACCAGCC 
ACTTCCAGGA 
GACTGCAGGG 
CAGCCATTGA 
TCCTCTTCAA 
AGAGCACAGC 
CCGACCACCT 
TCCCATGCCC 
TGTTTTACAA 
GAAAATTAGG 
GAGATTATGT 
ATGGTGAAAG 
TTGTAAAAAT 
ACACTTCCAT 
TCCACTCCAG 
CTCCCAAGGC 
AAATTGCTCC 
TGCCTCTGCC 
ATATGTCTCA 
AATGAATGCT 
TACAATAAAA 
AAAAAAAAAA 



31 

1 

GGCCCTCCCC 
GCAAGCCTTC 
AGTTGCCGGA 
CATTGAGCCT 
CAAGTCTGGT 
GAACAGAAAG 
TTTTGAGTCT 
GGTCAGCCTC 
GGACGAGTAG 
ACTGCCAGTC 
GGGGTGGACC 
CTGACCAGCC 
AGCCCCTGCA 
GCCCAACCTG 
TGGATCCATC 
AAAAGAAAAG 
ATTTCATGAT 
TCTTTCGGGG 
TAAGTGGTAG 
TAAAAGTTAT 
AATCTGGACT 
ATTTTTTACA 
TCTGAGCAAA 
TTGACATTGT 
TGTCTCCCCC 
GGTCCCTGCA 
GTATATGTTG 
TCTTGAAAAT 
AAAAAA 



41 
I 

ATGGCTTTAG 
AGAATCTGGG 
TACTTGCAAG 
CATGCTCTGT 
GATGAGACCA 
CAGGACAAGC 
GCCGCCTGCC 
ACCAATATGC 
TACTGCCCAG 
CCCCTGCCCC 
CTCAGAAGGC 
TCCATGCTGC 
CAAAGCCCTT 
CTCTCCTCTT 
AGGCCACTTG 
ACCAGTCCAT 
TTTTTTTTTT 
AGAGGCTGAG 
CTTTTCCCTT 
GGTACTATGT 
CCTCTGTCCA 
GCTGCCTGCA 
TGTGGCTCCT 
AGAGCTTCTG 
ACCAGGCTGG 
GGGCCAAGCA 
GGTGCAAAGT 
GCCTAAAAAA




CTGACGAAGG 
GCCTGCCTGT 
AGGGCTCCCG 
GTCACAACAA 
CTCCAGAATG 
CCATGTCGCC 
GCCACTGCCT 
ATGACCCCCA 
GAGGGAGGTT 
CAGTCCCCGT 
GACTTAAAAT 

TAGCCCCATA 
GGCACTGCTG 
GTACTTTACC 
GGGGGTTCTT 
GCACTTGGAG 
GAGCTCTGCA 
CCTAGCCTCG 
TCCCTACTTC 
AAAAAAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 234 Protein sequence 
Protein Accession #: NP_000568.1 



1 11 21 31 41 51 

I I I I 1 I 

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDVVPIEP
HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE

Seq ID NO: 235 DNA sequence 

Nucleic Acid Accession #: NM_001840.1 

Coding sequence: 149..1567



1 
I 

GGGGACTACG 
TCCCGAGGAC 
GAGCTCAGCC 
CACCTTCCGC 
AGACATCAAA 
TTCCTTTAGG 
CCCAGCAGAC 
GAATGAGGAG 
GAACCCCAGC 
CCTGGAGAAC 
TTCCTACCAC 
CTACAGCTTC 
CAAACTGGGT 
CATCGACAGG 
CAAGGCCGTG 
TCTCCTGGGC 
TGATGAAACC 
GTATGCGTAC 
TGGCACCCAG 
GCCAGACCAA 
GTTGATCATC 
GAACAAGCTC 
CGTGAACCCC 
GTTTCCCTCT 
CCTGCACAAA 
GAGCACGGTC 
TCTGTGAGCC 
ATCTAGAAGA 
AAGGTGATTG 
TCGTAGGAAA 




TGGAACTGCG 
TACCTGATGT 
ATGTATATTC 
AAGAGCATCA 
GCCCGCATGG 
TGCTGGGGCC 
ATTAAGACGG 
ATCATCTATG 
TGTGAAGGCA 
CACGCAAACA 
AAGATTGCCA 
TGATGCCTCC 
GTCTATTGTC 
CCACATGTCA 
CCTTT 



21 31 

I I 
CAGGGAGCCG AGGCCCCCGC 
GAAGGGATTG CCCCCTGTGG 
CTGAGGTTAT GAAGTCGATC 
CTGACCTCCT GTACGTGGGC 
CATCCAAATT AGGGTACTTC 
TCCAAGAGAA GATGACTGCG 
TTACAGAATT TTACAACAAG 
GTGGGGAGAA CTTCATGGAC 
CCATTGCAGT CCTGTCCCTC 
TGTGCGTCAT CCTCCACTCC 
GCCTGGCGGT GGCAGACCTC 
ACGTGTTCCA CCGCAAAGAT 
CCTCCTTCAC TGCCTCCGTG 
TTCACAGGCC CCTGGCCTAT 
GCCTGATGTG GACCATAGCC 
AGAAACTGCA ATCTGTTTGC 
TCTGGATCGG GGTCACCAGC 
TCTGGAAGGC TCACAGCCAC 
TCATCCACAC GTCTGAGGAT 
ACATTAGGTT AGCCAAGACC 
CTCTGCTTGC AATCATGGTG 
TGTTTGCATT CTGCAGTATG 
CTCTGAGGAG TAAGGACCTG 
CTGCGCAGCC TCTGGATAAC 
ATGCAGCCAG TGTTCACAGG 
AGGTAACCAT GTCTGTGTCC 
CTGGCAGCAC AGGAAAAGAA 
TCCTTGGTTA TATTTTTTTA 
CTTATTTGCT TAGTTTCCGT 



41 
I 

CCGGGCCAAG 
GTCACTTTCT 
CTAGATGGCC 
TCAAATGACA 
CCACAGAAAT 
GGAGACAACC 
TCTCTCTCGT 
ATAGAGTGTT 
ACGCTGGGCA 
CGCAGCCTCC 
CTGGGGAGTG 
AGCCGCAACG 
GGCAGCCTGT 
AAGAGGATTG 
ATTGTGATCG 
TCAGACATTT 
GTACTGCTTC 
GCCGTCCGCA 
GGGAAGGTAC 
CTGGTCCTGA 
TATGATGTCT 
CTCTGCCTGC 
CGACACGCTT 
AGCATGGGGG 
GCCGCAGAAA 
ACAGACACGT 
TTTTTTTTTT 
ACTTTACCAT 
TTGGGCTAAT 



60 
120 



51 
I 

GGAGCTTCTG 
CAGTCATTTT 
TTGCAGATAC 
TTCAGTACGA 
TCCCTTTAAC 
CCCAGCTAGT 
CCTTCAAGGA 
TCATGGTCCT 
CCTTCACGGT
GCTGCAGGCC 
TCATTTTTGT 
TGTTTCTGTT 
TCCTCACAGC 
TCACCAGGCC 
CCGTGCTGCC 
TCCCACACAT 
TGTTCATCGT 
TGATTCAGCG 
AGGTGACCCG 
TCCTGGTGGT 
TTGGGAAGAT 
TGAACTCCAC 
TCCGGAGCAT 
ACTCGGACTG 
GCTGCATCAA 
CTGCCGAGGC 
TAAGCTCAAA 
GCTCAATGAA 
CTTCCGGGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 236 Protein sequence 
Protein Accession #: NP_001831.1






1 11 21 31 41 51 

1 I I I I I 

MKSILDGLAD TTFRTITTDL LYVGSNDIQY EDIKGDMASK LGYFPQKFPL TSFRGSPFQE 60

KMTAGDNPQL VPADQVNITE FYNKSLSSFK ENEENIQCGE NFMDIECFMV LNPSQQLAIA 120

VLSLTLGTFT VLENLLVLCV ILHSRSLRCR PSYHFIGSLA VADLLGSVIF VYSFIDFHVF 180

HRKDSRNVFL FKLGGVTASF TASVGSLFLT AIDRYISIHR PLAYKRIVTR PKAVVAFCLM 240

WTIAIVIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK 300

AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDIR LAKTLVLILV VLIICWGPLL 360

AIMVYDVFGK MNKLIKTVFA FCSMLCLLNS TVNPIIYALR SKDLRHAFRS MFPSCEGTAQ 420
PLDNSMGDSD CLHKHANNAA SVHRAAESCI KSTVKIAKVT MSVSTDTSAE AL

Seq ID NO: 237 DNA sequence 

Nucleic Acid Accession #: NM_016083.2 

Coding sequence: 64..1482

1 11 21 31 41 51 

I I I I I I 

GATTGCCCCC TGTGGGTCAC TTTCTCAGTC ATTTTGAGCT CAGCCTAATC AAAGACTGAG 60 

GTTATGAAGT CGATCCTAGA TGGCCTTGCA GATACCACCT TCCGCACCAT CACCACTGAC 120

CTCCTGTACG TGGGCTCAAA TGACATTCAG TACGAAGACA TCAAAGGTGA CATGGCATCC 180 

AAATTAGGGT ACTTCCCACA GAAATTCCCT TTAACTTCCT TTAGGGGAAG TCCCTTCCAA 240 

GAGAAGATGA CTGCGGGAGA CAACCCCCAG CTAGTCCCAG CAGACCAGGT GAACATTACA 300 

GAATTTTACA ACAAGTCTCT CTCGTCCTTC AAGGAGAATG AGGAGAACAT CCAGTGTGGG 360 

GAGAACTTCA TGGACATAGA GTGTTTCATG GTCCTGAACC CCAGCCAGCA GCTGGCCATT 420 

GCAGTCCTGT CCCTCACGCT GGGCACCTTC ACGGTCCTGG AGAACCTCCT GGTGCTGTGC 480

GTCATCCTCC ACTCCCGCAG CCTCCGCTGC AGGCCTTCCT ACCACTTCAT CGGCAGCCTG 540 

GCGGTGGCAG ACCTCCTGGG GAGTGTCATT TTTGTCTACA GCTTCATTGA CTTCCACGTG 600 

TTCCACCGCA AAGATAGCCG CAACGTGTTT CTGTTCAAAC TGGGTGGGGT CACGGCCTCC 660 

CTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATCG ACAGGTACGT ATCCATTCAC 720 

AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG CCGTGGTGGC GTTTTGCCTG 780 

ATGTGGACCA TAGCCATTGC GATCGCCGTG CTGCCTCTCC TGGGCTGGAA CTGCGAGAAA 840 

CTGCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG AAACCTACCT GATGTTCTGG 900 

ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG CGTACATGTA TATTCTCTGG 960 

AAGGCTCACA GCCACGCCGT CCGCATGATT CAGCGTGGCA CCCAGAAGAG CATCATCATC 1020 

CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG CATGGACATT 1080 

AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG GGGCCCTTTG 1140 

CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA GACGGTGTTT 1200 

GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT CTATGCTCTG 1260 

AGGAGTAAGG ACCTGCGACA CGCTTTCCGG AGCATGTTTC CCTCTTGTGA AGGCACTGCG 1320 

CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC AAACAATGCA 1380 

GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT TGCCAAGGTA 1440 

ACCATGTCTG TGTCCACAGA CACGTCTGCC GAGGCTCTGT GAGCCTGATG CCTCCCTGGC 1500

AGCACAGGAA AAGAATTTTT TTTTTTAAGC TCAAAATCTA GAAGAGTCTA TTGTCTCCTT 1560 

GGTTATATTT TTTTAACTTT ACCATGCTCA ATGAAAAGGT GATTGTCACC ATGATCACTT 1620 

ATCAGTTTGC TAATGTTTCC ATAGTTTAGG TACTCAAACT CCATTCTCCA GGGGTTTACA 1680

GTGAAGAAAG CCTGTTGTTT AAGTGACTGA ACGATCCTTC AAAGTCTCAA TGAAATAGGA 1740 

GGGAAACCTT TGGCTACACA ATTGGAAGTC TAAGAACCCA TGGAAAAATG CCATCAAATG 1800 

AATAATGCCT TTGTAACCAC AACTTTCACT ATAATGTGAA ATGTAACTGT CCGTAGTATC 1860 

AGAGATGTCC ATTTTTACAA GTTATAGTAC TAGAGATATT TTGTAAAATG TATTATGTCC 1920 

TGTGAGATGT GTATCAGTGT TTATGTGCTA TTAATATTTG TTTAGTTCAG CAAAACTGAA 1980 

AGGTAGACTT TTATGAGAAC AATGGACAAG CAGTGGATAC GTGTCAATGT GTGCACTTTT 2040 

TTTCTATATT ATTGCCCATG ATATAACTTT AGAAATAAAC CTTAATATTT CTTCAAATAT 2100

CTCTATTTAA TTTTGACACT GAAATAACCG TAAAGGTTTA TTTTTCTGTT ACCTCAACAA 2160 

GAAGAATTTG AAGACTTCAA AATATTGAGC AGAATTCATT CATACTTAAA AATTTATTAG 2220 

CCCTGCATTT TCATAGGAAG ACACATTATC TTCTGGACTA TAGCTGTTCT AATGGATTAT 2280 

AATCAGAATG GAAGAGAGAA AGCATATTGA CTTTTTTTGA GCGACATCTC TGACTTTCTT 2340 

TAGTCTTTAG CTATTACTGG ATCTCTTAAG ACAGCATGTG TTAATCTTAA TGTATATCGT 2400 

TATCACTGTG CAGTTGCTGT TTACTTGAAT AGTATTGTGT TCCTATATTC CAGGTTTAAG 2460

TAGATTTCAT GCCTGGGTGG CCAAACAACA GTCTTCATTT TTTTTAATTG AAAAGAAGTA 2520 

GTGTCTGGAT CAGTAAAATT ATACTGTGTG TGAGTGTGAA TATAAATGTG TGTATGTGTG 2580

TTTCTGTCCG TAACTGTTAC AGTAATGTCA TAAAGTGAGA AAACTGTGAC CAAGTATAAA 2640 

CTTTTACCAC TTGCTGCACT CTTGCACATG GATTCAGTTT CTAAAATTGA GTTCTTCCTG 2700 

TAATCTTGTT GATAAAAATA CTGACTCCAA CCATTCAAAA ATTTCACCCC ATCCCTCCTT 2760 

AAGAGATTGG ATCAAGTATT ACTAAATTGA CCTTTAGGTA TTACACAAGA CCAGTGCTTA 2820 

GCAAAAAATA ATGACAGGCA TCCAAGGAAG GGATGTATTT GTAGTGTTAT TGCCAGGAAA 2880 

GGAGAGTACT TTGGTTTCTG AGCACCGAAT ATTGAGCAAT ATGTCAGTCA CTAAAAGGAA 2940 

GACAGTTCTA CAGAAAAACA AATGGTAACA TTTTTCAATA GCGTGTGTAG ATAGTATGCA 3000 

CTATATACAT CACGTTAAAG TAGGACTATC ACACCCAGCC CATGTGGCTA AAAAAGCTGA 3060 

ATCAGACAGT GGATGAGACA CACAACGGCA GTGAAGAACC GATACACTTG GCATTGACGT 3120 

CTAGCTATGC TGTATCTGTG CTTTGCCCAC ATGCCCTTGG TGACAGCTGA GCACCCAGCT 3180 

CTGTCTTGGT AGGTTTGGGC TAAGGAACAA ATCTCTCCTT TGCTCGTGGT TAGCAAGATA 3240 

CACTCAAGCA TGAAGATAAA CACAGCTGCT TTCTTCTTAC ACCCCGGTCT CATGCTCCTT 3300 

AATGGCGCCA TGGGTGCTTG TTGGGCCTTT TTCCAGTAAG GAATGATATT GCTGAAGAAT 3360 

CTACTTAACC CTGACAAATT TTAATTATAA TCTCTTCTTA TACAGATAAA ACATGACTCC 3420 

TACAAGGCCC CAAGGTTTAC ATAGTCTGAA GTGAAGTACA GAGCTGGCAT CTATCTGGTG 3480 

ATTTCTAGCT CTCGAGATAC CCAAGCAGCC TGATGGGGCA GTTCCCCTTC TTACGGTTCA 3540

CGCTCTAAGG CAGGATGTGG CTTATGAGAT ACTTTGCATT GTCTGTCTGC ACACCTTGAA 3600 

TCTGCCTGCT GGCTCCCTTA CTTTACCTCT CTGTCATGTG CAGATGAAGG CTCAGGGTGC 3660 

TAGAGGATTA GTAAGATCTC TTTCTAAAGA CAGGAGAGAT TATTTACAAG AAGAACTCAC 3720 

CAGGGTTTAG TTTGCATTTA AGAATTGCCA GTCTTTTGTC CTGCATCATC TTGAACATTA 3780 

ATCCACATGT TTCAGAGCTC ACCAGGCAGT ACCAATGCTC TTTTCACAGC TATGAAGAGC 3840

TAGAGAAATT CTTGTTATGG TAGAAAAATT TCACGGTTCA TTTTTGAAAC TGCATTTGTG 3900 

CGTATGCAGT GTAGATTTTA TAGTGTGTTG TGCTTTCAAG ATCTAAATCA TATATAATAA 3960 




ATTAAGGGAC AATGGGGCTG ACAGCACTAA ACTTGGTGCT TATTGATATT CTAAGAAATA 4020
TCTGTGAAAT ATCATCACGT ATGTTATACA ACCTTCATTT AAAAAGGTTT AAAACTAGTT 4080
AGATTCACTT TGACACTTTT CATATCATTT CTTAACCCAA GTGACGAAAA CATTGTCCCC 4140
AATGAATATA CTCATTAGAA TTACCATTTG TTAATATCAC TCATTAATTA ACCCCATAAT 4200
TAGATCCATT AATTTAAATG ATTTAAATTT AAGTAAGTTT TATAAGGTCT GACATCAGAG 4260
GTATCTTACT TTCCTCTGAG GATGATGTAC TTGCCCTGAC CATGCATTTT ACCATCACAC 4320
ATGTTCAGAA AGGGCCAAAT TCCCAACCTG CTCATTTTTT TTTTTATCAG AGTCATGATG 4380
AATCAGTCCT AGAATGTTTC ATTTGCACAA GTAGGGCTGC CTCCAAGAGG AACCTCTGAT 4440
TTATTTTGTA TGAAATATAT GTGAAAGGAT ATGAATCTGA GAGATGCTGT AGACATCTGT 4500
CCTACACTTG AGATGATTTC CAAGCCTCTC TGGCACTTTG AGTTAAGTCT ATCTGGTATT 4560
AAATGCCAAG GACCTTTTGC TGCCTAAATC CACTCTGCAG GAAATAGGCC CAACCACCAG 4620
ATGAGAATTA GGCCCTGGAT GAGTAGCGCT ATAGTTACTG TCCTGTTGAT TAATTTCTGC 4680
CATTTCATGT CCATAAAAGA GACCACCCAT ATCATGCACA CAATTAGATT TCTCACACTC 4740
TAACTGTATA TTTGTATGAT ATTTTAAAAT CTCCTAAATG CTGGGCAATG GCTATTAACA 4800
ATTAATTGTC TTGCACTGGC CTTCTGATGA AATGTTAACA ATGCCTATTG TAATATAGAA 4860
AAAAACATTC TATCTACTGA TTTGGGCTGA ATGTATGTAA ATAGGTTTCT AAAAAGTCAG 4920
ATGTTTGAGC AGTGGCCTAC AAATCAGTAA TTTTCGGGTG GGAGAGTTTC TTTACATTGC 4980
CGTGGCATCT TAAAAGCTAT CTTCATGTAA ATTGACTGTA CTAGGCCTAC TGGGGATCAG 5040
AGTTCCCAAG AAAGGAAACC TTTTCTTGTA TCTGGATTCA AATTTATTTC CAATGTTTCA 5100
AGCGGGAAAC ATGACTCTTT ATTGTCTGTA AATCTAACAT TATTACTTTT CCTCTTAGAA 5160
GAATATTGTA TTGTTAGATG TTTGTTGAGC TGGTAACATC GTTGCAACCA CTGCAATATC 5220
TTCGTTAGTA ATCTGTATAA TACTTTGTAT ACAAGTACTG GTAAGATTGT TATTAAATGT 5280
AGCTTCAGTC ATTAAATTAC TATAGCAAAG TAGTACTTCT TCTGTAATAT TTACAATGTA 5340
TTAAGCCCAC AGTATATTTT ATTTCAATGT AATTAAACTG TTAACTTATT CAAAGAGAAA 5400
ACATCTCATC ATGTCTATTG TCCAAAGTTA CCTGGAATCA AATAAAAATT CTAGATTACC 5460
ATGAAGAACA TA



Seq ID NO: 238 Protein sequence 
Protein Accession #: NP_057167.1 



1 11 21 31 41 51

I I I I I I

MKSILDGLAD TTFRTITTDL LYVGSNDIQY EDIKGDMASK LGYFPQKFPti TSFRGSPFQE 60
KMTAGDNPQL VPADQVNITE FYNKSLSSFK ENEENIQCGE NFMDIECFMV LNPSQQLAIA 120
VLSLTLGTFT VLENLLVLCV ILHSRSLRCR PSYHFIGSLA VADLLGSVIF VYSFIDFHVF 180
HRKDSRNVFL FKLGGVTASL TASVGSLFLT AIDRYVSIHR PLAYKRIVTR PKAVVAFCLM 240
WTIAIAIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK 300
AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDIR IAKTLVLILV VLIICWGPLL 360
AIMVYDVFGK MNKLIKTVFA FCSKLCLLNS TVNPIIYALR SKDLRHAFRS KFPSCEGTAQ 420
PLDNSMGDSD CLHKHANNAA SVHRAAESCI KSTVKIAKVT MSVSTDTSAE AL



Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession #: NM_033181.1 

Coding sequence: 17.. 1252 



1 11 21 31 41 51

I I I I I I

ATGAAGTCGA TCCTAGATGG CCTTGCAGAT ACCACCTTCC GCACCATCAC CACTGACCTC 60
CTGTACGTGG GCTCAAATGA CATTCAGTAC GAAGACATCA AAGGAGAATG AGGAGAACAT 120
CCAGTGTGGG GAGAACTTCA TGGACATAGA GTGTTTCATG GTCCTGAACC CCAGCCAGCA 180
GCTGGCCATT GCAGTCCTGT CCCTCACGCT GGGCACCTTC ACGGTCCTGG AGAACCTCCT 240
GGTGCTGTGC GTCATCCTCC ACTCCCGCAG CCTCCGCTGC AGGCCTTCCT ACCACTTCAT 300
CGGCAGCCTG GCGGTGGCAG ACCTCCTGGG GAGTGTCATT TTTGTCTACA GCTTCATTGA 360
CTTCCACGTG TTCCACCGCA AAGATAGCCG CAAGGTGTTT CTGTTCAAAC            420
CACGGCCTCC TTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATCG ACAGGTACAT 480
ATCCATTCAC AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG CCGTGGTGGC 540
GTTTTGCCTG ATGTGGACCA TAGCCATTGT GATCGCCGTG CTGCCTCTCC TGGGCTGGAA 600
CTGCGAGAAA CTGCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG AAACCTACCT 660
GATGTTCTGG ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG GGTACATGTA 720
TATTCTCTGG AAGGCTCACA GCCACGCCGT CCGCATGATT CAGCGTGGCA CCCAGAAGAG 780
CATCATCATC CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG 840
CATGGACATT AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG 900
GGGCCCTCTG CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA 960
GACGGTGTTT GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT 1020
CTATGCTCTG AGGAGTAAGG ACCTGCGACA CGCTTTCCGG AGCATGTTTC CCTCTTGTGA 1080
AGGCACTGCG CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC 1140
AAACAATGCA GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT 1200
TGCCAAGGTA ACCATGTCTG TGTCCACAGA CACGTCTGCC GAGGCTCTGT GA



Seq ID NO: 240 Protein sequence 
Protein Accession #: NP_149421.1 



1 11 21 31 41 51

I I I I I I

MALQIPPSAP SPLTSCTWAQ MTFSTKTSKE NEENIQCGEN FMDIECFMVL NPSQQLAIAV 60
LSLTLGTFTV LENLLVLCVI LHSRSLRCRP SYHFIGSLAV ADLLGSVIFV YSFIDFHVFH 120
RKDSRNVFLF KLGGVTASFT ASVGSLFLTA IDRYISIHRP LAYKRIVTRP KAVVAFCLMW 180
TIAIVIAVLP LLGWNCEKLQ SVCSDIFPHI DETYLMFWIG VTSVLLLFIV YAYMYILWKA 240
HSHAVRMIQR GTQKSIIIHT SEDGKVQVTR PDQARMDIRL AKTLVLILVV LIICWGPLLA 300
IMVYDVFGKM NKLIKTVFAF CSMLCLLNST VNPIIYALRS            FPSCEGTAQP 360
LDNSMGDSDC LHKHANNAAS VHRAAESCIK STVKIAKVTM SVSTDTSAEA L



Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #: NM_003596.1 






Coding sequence: 82.. 1194 

1 11 21 31 41 51 

I I I I I I 

GTAGACTGTC CATGGCCTGA ACATTTTCCG AAAATCATTT TGAGCAAAAT ATCTGTTTAA 60 

TAACAAGATA ACCACATCAA GATGGTTGGA AAGCTGAAGC AGAACTTACT ATTGGCATGT 120 

CTGGTGATTA GTTCTGTGAC TGTGTTTTAC CTGGGCCAGC ATGCCATGGA ATGCCATCAC 180 

CGGATAGAGG AACGTAGCCA GCCAGTCAAA TTGGAGAGCA CAAGGACCAC TGTGAGAACT 240 

GGCCTGGACC TCAAAGCCAA CAAAACCTTT GCCTATCACA AAGATATGCC TTTAATATTT 300 

ATTGGRGGTG TGCCTCGGAG TGGAACCACA CTCATGAGGG CCATGCTGGA CGCACATCCT 360 

GACATTCGCT GTGGAGAGGA AACCAGGGTC ATTCCCCGAA TCCTGGCCCT GAAGCAGATG 420 

TGGTCACGGT CAAGTAAAGA GAAGATCCGC CTGGATGAGG CTGGTGTTAC TGATGAAGTG 480 

CTGGATTCTG CCATGCAAGC CTTCTTACTA GAAATTATCG TTAAGCATGG GGAGCCAGCC 540 

CCTTATTTAT GTAATAAAGA TCCTTTTGCC CTGAAATCTT TAACTTACCT TTCTAGGTTA 600 

TTCCCCAATG CCAAATTTCT CCTGATGGTC CGAGATGGCC GGGCATCAGT ACATTCAATG 660 

ATTTCTCGAA AAGTTACTAT AGCTGGATTT GATCTGAACA GCTATAGGGA CTGTTTGACA 720 

AAGTGGAATC GTGCTATAGA GACCATGTAT AACCAGTGTA TGGAGGTTGG TTATAAAAAG 780 

TGCATGTTGG TTCACTATGA ACAACTTGTC TTACATCCTG AACGGTGGAT GAGAACACTC 840 

TTAAAGTTCC TCCAGATTCC ATGGAACCAC TCAGTATTGC ACCATGAAGA GATGATTGGG 900 

AAAGCTGGGG GAGTGTCTCT GTCAAAAGTG GAGAGATCTA CAGACCAAGT AATCAAGCCA 960 

GTCAATGTAG GAGCTCTATC AAAATGGGTT GGGAAGATAC CGCCAGATGT TTTACAAGAC 1020 

ATGGCAGTGA TTGCTCCTAT GCTTGCCAAG CTTGGATATG ACCCATATGC CAACCCACCT 1080 

AACTACGGAA AACCTGATCC CAAAATTATT GAAAACACTC GAAGGGTCTA TAAGGGAGAA 1140 

TTCCAACTAC CTGACTTTCT TAAAGAAAAA CCACAGACTG AGCAAGTGGA GTAGCAGAAC 1200 

CAGGAGCCTC TTCCATACAT GAGGAAAGAT TGCTGCCTTT TCAGCAGAAG GGAAATTCCT 1260 

AGGATTGGCT GTCCCCTGCC AAGCTTGGTG GAGCGTCTGC ACCTTGGCTG CGCCGCCTGT 1320 

GCATTTGCCA GTTTCCTCCC ACTGAGAGGA TGGAGGTGTC CGCACAGCTT TGGGCCTCGT 1380 

GAGGGATCTG CCTCCTGAGC AAAGAGCTCT TGATCCCGAT TTCATGCACA GCCCTGCAGT 1440 

AAGGAGCCCA GAAGGAACAT GTGTTTCCTG TTAAAACTCC TCTTGTTCTC TTTTCTTACA 1500 

TTATGACGTT TGTTTTCAAG GA6A0GGTTT AAAAATGGGA TCCTGTAAGC AGACTTGGGC 1560 

AGTCTCCTTT TGAAATAGGT TGTCTGTACA TGTTCTAATG TTTTGTAGAA CACGTGTGCC 1620 

TGTTTAAGTG TATTGATGTG AATAATATTA AATATCCTAA TTATTTAATT CATTGTATTG 1680 

TTTCTGAGAA GTTGGGAAAT TACCATTATA CATTTACAAC CTAATGACTT TTGTATTTTA 1740 
TTTTTCAAAA TAAAAGCTTT CAATGTGA 

Seq ID NO: 242 Protein sequence 
Protein Accession #: NP_003587.1 

1 11 21 31 41 51 

I I I I I I 

MVGKLKQNLL LACLVISSVT VFYLGQHAME CHHRIEERSQ PVKLESTRTT VRTGLDLKAN 60 

KTFAYHKDMP LIFIGGVPRS GTTLMRAMLD AHPDIRCGEE TRVIPRILAL KQMWSRSSKE 120 

KIRLDEAGVT DEVLDSAMQA FLLEIIVKHG EPAPYLCNKD PFALKSLTYL SRLFPNAKFL 180 

LMVRDGRASV HSMISRKVTI AGFDLNSYRD CLTKWNRAIE TMYNQCMEVG YKKCMLVHYE 240 

QLVLHPERWM RTLLKFLQIP WNHSVLHHEE MIGKAGGVSL SKVERSTDQV IKPVNVGALS 300 

KWVGKIPPDV LQDMAVIAPM LAKLGYDPYA NPPNYGKPDP KIIENTRRVY KGEFQLPDFL 360 
KEKPQTEQVE 

Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #: NM_001492.3 

Coding sequence: 1395.. 2513 

1 11 21 31 41 51 

I I I I I I 

ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

CGAGCGGGCG GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGGGGCC CGAGCCCATG 120 

CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGCGCGT CGCGGCCTGG CTGAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 360 

GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 540 

AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTCG 600 

GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTCCTC CTACGCCTTC 660 

CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 780 

GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

CCCCGTGGAC CCGGCCCCAC CCCGAATACC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 1200 

TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

GGCCGGTTCC CCCGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620 

CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCCGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG 1800 



TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GCGGCGGCGG 1860 

CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920 

CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 1980 

CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCCGCCTGG 2040 

CGCTGGCGCT ACGCCCCCGG GCCCCTGCCG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 2100 

TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCGGCCGCGG CGCGACGCCG 2160 

AACCCGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGCGC GCGGCGGCTG TACGTGAGCT 2220 

TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280 

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400 

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 

TGGTGCTGCG GCAGTATGAG GACATGGTGG TGGACGAGTG CGGCTGCCGC TAACCCGGGG 2520 
CGGGCAGGGA CGCGGGCCCA ACAATAAATG CCGCGTGGTC TGCTC 

Seq ID NO: 244 Protein sequence 
Protein Accession #: NP_001483.2 

1 11 21 31 41 51 

I I I I I I 

MPPPQQGPCG HHLLLLLALL LPSLPLTRAP VPPGPAAALL QALGLRDEPQ GAPRLRPVPP 60 

VMWRLFRRRD PQETRSGSRR TSPGVTLQPC HVEELGVAGN IVRHIPDRGA PTRASEPVSA 120 

AGHCPEWTVV FDLSAVEPAE RPSRARLELR FAAAAAAAPE GGWELSVAQA GQGAGADPGP 180 

VLLRQLVPAL GPPVRAELLG AAWARNASWP RSLRLALALR PRAPAACARL AEASLLLVTL 240 

DPRLCHPLAR PRRDAEPVLG GGPGGACRAR RLYVSFREVG WHRWVIAPRG FLANYCQGQC 300 

ALPVALSGSG GPPALNHAVL RALMHAAAPG AADLPCCVPA RLSPISVLFF DNSDNVVLRQ 360 
YEDMVVDECG CR 



Seq ID NO: 245 DNA sequence 

Nucleic Acid Accession #: NM_021267.1 

Coding sequence: 17.. 1125 



1 11 21 31 41 51 

I I I I I I 

ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

CGAGCGGGCG GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGGGGCC CGAGCCCATG 120 

CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGCGCGT CGCGGCCTGG CTGAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 360 

GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 540 

AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTCG 600 

GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTCCTC CTACGCCTTC 660 

CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 780 

GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

CCCCGTGGAC CCGGCCCCAC CCCGAATACC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 1200 

TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

GGCCGGTTCC CCCGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620 

CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCCGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG 1800 

TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GCGGCGGCGG 1860 

CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920 

CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 1980 

CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCCGCCTGG 2040 

CGCTGGCGCT ACGCCCCCGG GCCCCTGCCG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 2100 

TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCGGCCGCGG CGCGACGCCG 2160 

AACCCGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGCGC GCGGCGGCTG TACGTGAGCT 2220 

TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280 

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400 

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 
TGGTGCTGCG GCAGTATGAG GACATGGTGG TGGACGAGTG CGGCTGCCGC TAACCCGGGG 



Seq ID NO: 246 Protein sequence 
Protein Accession #: NP_067090.1 

1 11 21 31 41 51 

I I I I I I 

MAAAGPAAGP TGPEPMPSYA QLVQRGWGSA LAAARGCTDC GWGLARRGLA EHAHLAPPEL 60 

LLLALGALGW TALRSAATAR LFRPLAKRCC LQPRDAAKMP ESAWKFLFYL GSWSYSAYLL 120 

FGTDYPFFHD PPSVFYDWTP GMAVPRDIAA AYLLQGSFYG HSIYATLYMD TWRKDSVVML 180 






LHHVVTLILI VSSYAFRYHN VGILVLFLHD ISDVQLEFTK LNIYFKSRGG SYHRLHALAA 240 
DLGCLSFGFS WFWFRLYWFP LKVLYATSHC SLRTVPDIPF YFFFNALLLL LTLMNLYWFL 300 
YIVAFAAKVL TGQVHELKDL REYDTAEAQS LKPSKAEKPL RNGLVKDKRF 

Seq ID NO: 247 DNA sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence: 222..1898 

1 11 21 31 41 51 

I I I I I I 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120 

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 

GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 360 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420 

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 480 

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540 

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 

GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 
TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200 

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2760 

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 
CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 
CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 
TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 
TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 
CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 
CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 
CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 3480 
CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540 
TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 
CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3660 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 248 Protein sequence 
Protein Accession #: NP_002072.1 

1 11 21 31 41 51 

I I I I I I 

MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 60 
ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 120 
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 180 
QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDVVRKVAQV 240 
PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300 
TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 360 
RGKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKMALSTA SDDRCWNGMA 420 
RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480 
ASDDGSGSGS 
Seq ID NO: 249 DNA sequence 

Nucleic Acid Accession #: NM_001492.3 

Coding sequence: 8.. 1864 

1 11 21 31 41 51 

I I I I I I 

GAAGGCCATG GTCTCCCCAC GGATGTCCGG GCTCCTCTCC CAGACTGTGA TCCTAGCGCT 60 

CATTTTCCTC CCCCAGACAC GGCCCGCTGG CGTCTTCGAG CTGCAGATCC ACTCTTTCGG 120 

GCCGGGTCCA GGCCCTGGGG CCCCGCGGTC CCCCTGCAGC GCCCGGCTCC CCTGCCGCCT 180 

CTTCTTCAGA GTCTGCCTGA AGCCTGGGCT CTCAGAGGAG GCCGCCGAGT CCCCGTGCGC 240 

CCTGGGCGCG GCGCTGAGTG CGCGCGGACC GGTCTACACC GAGCAGCCCG GAGCGCCCGC 300 

GCCTGATCTC CCACTGCCCG ACGGGCTCTT GCAGGTGCCC TTCCGGGACG CCTGGCCTGG 360 

CACCTTCTCT TTCATCATCG AAACCTGGAG AGAGGAGTTA GGAGACCAGA TTGGAGGGCC 420 

CGCCTGGAGC CTGCTGGCGC GCGTGGCTGG CAGGCGGCGC TTGGCAGCCG GAGGCCCGTG 480 

GGCCCGGGAC ATTCAGCGCG CAGGCGCCTG GGAGCTGCGC TTCTCGTACC GCGCGCGCTG 540 

CGAGCCGCCT GCCGTCGGGA CCGCGTGCAC GCGCCTCTGC CGTCCGCGCA GCGCCCCCTC 600 

GCGGTGCGGT CCGGGACTGC GCCCCTGCGC ACCGCTCGAG GACGAATGTG AGGCGCCGCT 660 

GGTGTGCCGA GCAGGCTGCA GCCCTGAGCA TGGCTTCTGT GAACAGCCCG GTGAATGCCG 720 

ATGCCTAGAG GGCTGGACTG GACCCCTCTG CACGGTCCCT GTCTCCACCA GCAGCTGCCT 780 

CAGCCCCAGG GGCCCGTCCT CTGCTACCAC CGGATGCCTT GTCCCTGGGC CTGGGCCCTG 840 

TGACGGGAAC CCGTGTGCCA ATGGAGGCAG CTGTAGTGAG ACACCCAGGT CCTTTGAATG 900 

CACCTGCCCG CGTGGGTTCT ACGGGCTGCG GTGTGAGGTG AGCGGGGTGA CATGTGCAGA 960 

TGGACCCTGC TTCAACGGCG GCTTGTGTGT CGGGGGTGCA GACCCTGACT CTGCCTACAT 1020 

CTGCCACTGC CCACCTGGTT TCCAAGGCTC CAACTGTGAG AAGAGGGTGG ACCGGTGCAG 1080 

CCTGCAGCCA TGCCGCAATG GCGGACTCTG CCTGGACCTG GGCCACGCCC TGCGCTGCCG 1140 

CTGCCGCGCC GGCTTCGCGG GTCCTCGCTG CGAGCACGAC CTGGACGACT GCGCGGGCCG 1200 

CGCCTGCGCT AACGGCGGCA CGTGTGTGGA GGGCGGCGGC GCGCACCGCT GCTCCTGCGC 1260 

GCTGGGCTTC GGCGGCCGCG ACTGCCGCGA GCGCGCGGAC CCGTGCGCCG CGCGCCCCTG 1320 

TGCTCACGGC GGCCGCTGCT ACGCCCACTT CTCCGGCCTC GTCTGCGCTT GCGCTCCCGG 1380 

CTACATGGGA GCGCGGTGTG AGTTCCCAGT GCACCCCGAC GGCGCAAGCG CCTTGCCCGC 1440 

GGCCCCGCCG GGCCTCAGGC CCGGGGACCC TCAGCGCTAC CTTTTGCCTC CGGCTCTGGG 1500 

ACTGCTCGTG GCCGCGGGCG TGGCCGGCGC TGCGCTCTTG CTGGTCCACG TGCGCCGCCG 1560 

TGGCCACTCC CAGGATGCTG GGTCTCGCTT GCTGGCTGGG ACCCCGGAGC CGTCAGTCCA 1620 

CGCACTCCCG GATGCACTCA ACAACCTAAG GACGCAGGAG GGTTCCGGGG ATGGTCCGAG 1680 

CTCGTCCGTA GATTGGAATC GCCCTGAAGA TGTAGACCCT CAAGGGATTT ATGTCATATC 1740 

TGCTCCTTCC ATCTACGCTC GGGAGGTAGC GACGCCCCTT TTCCCCCCGC TACACACTGG 1800 

GCGCGCTGGG CAGAGGCAGC ACCTGCTTTT TCCCTACCCT TCCTCGATTC TGTCCGTGAA 1860 
ATGAATTGGG TAGAGTCTCT GGAAGGTTTT AAGCCCATTT TCAGTTCTAA CTTACTTTCA 

Seq ID NO: 250 Protein sequence 
Protein Accession #: NP_058637.1 



1 11 21 31 41 51

I I I I I I

MVSPRMSGLL SQTVILALIF LPQTRPAGVF ELQIHSFGPG PGPGAPRSPC SARLPCRLFF 60
RVCLKPGLSE EAAESPCALG AALSARGPVY TEQPGAPAPD LPLPDGLLQV PFRDAWPGTF 120
SFIIETWREE LGDQIGGPAW SLLARVAGRR RLAAGGPWAR DIQRAGAWEL RFSYRARCEP 180
PAVGTACTRL CRPRSAPSRC GPGLRPCAPL EDECEAPLVC RAGCSPEHGF CEQPGECRCL 240
EGWTGPLCTV PVSTSSCLSP RGPSSATTGC LVPGPGPCDG NPCANGGSCS ETPRSFECTC 300
PRGFYGLRCE VSGVTCADGP CFNGGLCVGG ADPDSAYICH CPPGFQGSNC EKRVDRCSLQ 360
           LGHALRCRCR AGFAGPRCEH DLDDCAGRAC ANGGTCVEGG GAHRCSCALG 420
FGGRDCRERA DPCAARPCAH GGRCYAHFSG LVCACAPGYM GARCEFPVHP DGASALPAAP 480
PGLRPGDPQR YLLPPALGLL VAAGVAGAAL LLVHVRRRGH SQDAGSRLLA GTPEPSVHAL 540
PDALNNLRTQ EGSGDGPSSS VDWNRPEDVD PQGIYVISAP SIYAREVATP LFPPLHTGRA 600
GQRQHLLFPY PSSILSVK



Seq ID NO: 251 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51

I I I I I I

GAAATATAAC CATTGCAATT AGAAAATATC CAAAATAGCC TGTTATCTTC CACGTGGCCT 60
AGATTATTGA CAATCCCAAA TATACAATTT TTCTTTAAAA GTAGTACAAT TTCCTTTGTA 120
GCTTCAATTC CTTATATGAC TTCAGACTGG AGAAGCCTGT TAAACCACTG TTAGTTTCAG 180
TTAGAAAGTC TGAGAGACTT TATACATAAA TTCTCAATTT GGCTGCTGTA CACGTGCCAG 240
AGTTTTACTA CTGTAGTGAC CGTTGAGAAG ACCCTTGTTT ATTTACATTT GAAGCACTGT 300
TTGTGCAAAC AACCTTTCAT TGTTAAGTGC CTGTATTCCT TTCATTTACT TCATGTCCAG 360
GGGTGCTATT TACCTAGAAC CATTGTCTAC TACAATTAAC ATTTACATTA CAAAGTGTGT 420
GGTTTTCTTT TTCAAGGAGG TTCAATTAAG GCAATAAGAT GTTTGCTGGA GAAACCTATT 480
GTTTACTGAA AGCACTCAAT GAAGTCAAAT TACTGAAGCT TTTGCCTACA TCTTGGTCTT 540
TTATGTAAAT ATGTTAAATA TAAGATCTAA GGAAAATAAA CAATATTATA ATTATGTGTT 600
TGCCATTGTC ATATCAAACT TGCTTTGTAT CATACTAATG TTACATAACT TATCGATCAA 660
TAAAAATACA TTTCAATGTT AAAAAAAAAA AAAAAAAA






Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

AGGTACTGCC AGAAAGGATC AGGACCTGGA GTCTGGCAAG AGGAAGACAG AGGCCTGTGT 60 

GGGAAGCGAG TTGTTATCTT TGGTTATCTA GCTGTATGAG TGTATTGGTC TTCATAAAGC 120 

TAGATAACCG AAAGTAAAAA CTTCTTCAAG ATCGCCGGGG AGCGTGTGAG AATGAAAGAC 180 

TACAGCCGAG AGACAGTAAA AACCAGAAAG GTCAGGAATA CTTATTGAAT CTAACTTTGT 240 

TTTTGTTTTG TTTTTTTCCT TATGATTAAA GGTGGGATGA GAGAAAATTA AATGACACAC 300 

ACATGCTAAA ATATCAAGGT TCCAGATATG TCTTGAGAGG GGTTGTTGCA GCTGCAAAGG 360 

AGAAGTGTAT AGTGATAATG AGTAAAGATG CATGTGCAGT TTGTTCTATT TTAAGGCAAA 420 

AGTTATATCA GGGATTTTTT TCTTAGAAAG GTGTTGCAGA GATGTCTGGT ACCTAGTTTA 480 

AAAATGATTC CATAATATGT AGACTTGGGC AGTTCCTTTG GGAGGCACCT CCCTCTCAAA 540 

ATTTGAAGAT TGTGCTTGGA AATTACTTTA CATGTATTTG GGCTGTATGT CAATTTGTAC 600 

GAAATTAGAC TTTCAGAAAA GTTTATACTG GAAGGTTAAT AATTTGTATC TACTGAGGAC 660 

TTAGAGCTAG CAGGCAAAAT GAAAAAAAAA AAAAAAAARG CAAGGGCTGA TTTTTATTCT 720 

TCTATTCAAA ATACAAGGAC AGATGCTTCT CTGTTCCAAG AGGGTTTCCT TGAGGAAGCT 780 

ACTGAAGCAG AAAGACATGA TGGAGACGAG ATCGCCTCCC CTCTTGTCAA AGTGTTAAAA 840 

AAATGTTCTG TCTTACTCTG CGCCTAGCAT TGGAAATGAA AGTGACATTT ACGCCACAAC 900 

CCACGTGTGC GCCTCCTCTC TTTTTGTTTA AGGATGATCA GGTCTATCCA GGAAACAGCT 960 

CTGGCATCCC AAACTGAAAT AATTAGGACG TATATAGACC TGACAAAAAT GGAAAGGGGG 1020 

TGGGGAATCT GAGGGTCTGT CTTGCCTAAT TGATTCCGCT AAACGGAATG CAGGAGATGT 1080 

GAACGGCAGG ACGCTCCGAT TCCCACGCTC GGGGGCAAGT GATAAAGCGG GGCCGGGCAG 1140 

CCTATGACAG ACAGCCCTGT TGGGGGGTGG GGGTATGAAA AAAACATCAA GTGCACACAC 1200 

CATACTCATC TCCATCGCTT AAGAAAGTAA AGGCATTTCC CACCCACAGC CATCTGCAGC 1260 

TTCCCAAATG GCAGCACCAA CTGGTCTGTA GCTGCTACAT AGTGTGCTTC TGTTAATTTT 1320 

TTAACCACTG TTTAATCTGG CCATAATTAA GTTTGGCTTT CTTCGTTGTT TGAGATTTTC 1380 

AGAATTCAAG GCAAGCTAGT AGAAAGCAAT TCCAAGAAAG TCCCATGACT GCCTGCCCCT 1440 

AATGTCAAAA TCTCAGTCCA TGAGATTATG GCCTTGTGAC CACATTTTTG CTTTTGTTTT 1500 

GGGTGGGCAA ATGTGTATAG AGATAAAATA CATATCTCTA TATAACAGTC GTTATTATAA 1560 

TTTCATGAGG CTTTTTACCT CTTAACATGA TACATCTAGG AACTTGGTCT AATTGTGCTA 1620 

GTAGATATAC ACTAGAAGTA AAACTATAAA GTCATTAGCC TGTAGAAAGT TGATTATGAT 1680 

AACAATATGA TAAAAAGTTT GTTTTGGAAT AGTCTCAGCT AGATGGGTCT AAATAGCCAT 1740 

TTTAATGTAA TCTAAAAATA ACACTATGCC TAGCAGAAAC TTTGGCGCTT TGGAGGTCCC 1800 

CATTGTCGCC TTTTCATAAA AGTCCTTAAG TTTTCCATAT GTCACCAAGC AAACATTTGT 1860 

ATAGGCATTT ACACAGAAAT ATAACCATTG CAATTAGAAA ATATCCAAAA TAGCCTGTTA 1920 

TCTTCCACGT GGCCTAGATT ATTGACAATC CCAAATATAC AATTTTTCTT TAAAAGTAGT 1980 

ACAATTTCCT TTGTAGCTTC AATTCCTTAT ATGACTTCAG ACTGGAGAAG CCTGTTAAAC 2040 

CACTGTTAGT TTCAGTTAGA AAGTCTGAGA GACTTTATAC ATAAATTCTC AATTTGGCTG 2100 

CTGTACACGT GCCAGAGTTT TACTACTGTA GTGACCGTTG AGAAGACCCT TGTTTATTTA 2160 

CATTTGAAGC ACTGTTTGTG CAAACAACCT TTCATTGTTA AGTGCCTGTA TTCCTTTCAT 2220 

TTACTTCATG TCCAGGGGTG CTATTTACCT AGAACCATTG TCTACTACAA TTAACATTTA 2280 

CATTACAAAG TGTGTGGTTT TCTTTTTCAA GGAGGTTCAA TTAAGGCAAT AAGATGTTTG 2340 

CTGGAGAAAC CTATTGTTTA CTGAAAGCAC TCAATGAAGT CAAATTACTG AAGCTTTTGC 2400 

CTACATCTTG GTCTTTTATG TAAATATGTT AAATATAACA TCTAAGGAAA ATAAACAATA 2460 

TTATAATTAT GTGTTTGCCA TTGTCATATC AAACTTGCTT TGTATCATAC TAATGTTACA 2520 
TAACTTATCG ATCAATAAAA ATACATTTCA ATGTT 



Seq ID NO: 253 DNA sequence 

Nucleic Acid Accession #: NM_001650.2 

Coding sequence: 40..1011 



1 11 21 31 41 51 

I I I I I I 

GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 60 

AGGCGGTGGG GTAAGTGTGG ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 120 

GGGGTCTGGA CTCAAGCTTT CTGGAAAGCA GTCACAGCGG AATTTCTGGC CATGCTTATT 180 

TTTGTTCTCC TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA GCCTTTACCG 240 

GTCGACATGG TTCTCATCTC CCTTTGCTTT GGACTCAGCA TTGCAACCAT GGTGCAGTGC 300 

TTTGGCCATA TCAGCGGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 360 

AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG CCCAGTGCCT GGGGGCCATC 420 

ATTGGAGCAG GAATCCTCTA TCTGGTCACA CCTCCCAGTG TGGTGGGAGG CCTGGGAGTC 480 

ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC TGGTTGAGTT GATAATCACA 540 

TTTCAATTGG TGTTTACTAT CTTTGCCAGC TGTGATTCCA AACGGACTGA TGTCACTGGC 600 

TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC ATTTATTTGC AATCAATTAT 660 

ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG CAGTTATCAT GGGAAATTGG 720 

GAAAACCATT GGATATATTG GGTTGGGCCC ATCATAGGAG CTGTCCTCGC TGGTGGCCTT 780 

TATGAGTATG TCTTCTGTCC AGATGTTGAA TTCAAACGTC GTTTTAAAGA AGCCTTCAGC 840 

AAAGCTGCCC AGCAAACAAA AGGAAGCTAC ATGGAGGTGG AGGACAACAG GAGTCAGGTA 900 

GAGACGGATG ACCTGATTCT AAAACCTGGA GTGGTGCATG TGATTGACGT TGACCGGGGA 960 

GAGGAGAAGA AGGGGAAAGA CCAATCTGGA GAGGTATTGT CTTCAGTATG ACTAGAAGAT 1020 

CGCACTGAAA GCAGACAAGA CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 1080 

GAAACAGATT TGTTATAAAT TAGAAATGTG CAGGTTTGTT GTTTCATGTC ATATTACTCA 1140 

GTCTAAACAA TAAATATTTC ATAATTTACA AAGGAGGAAC GGAAGAAACC TATTGTGAAT 1200 

TCCAAATCTA AAAAAAGAAA TATTTTTAAG ATGTTCTTAA GCAAATATAT ACCTATTTTA 1260 

TCTAGTTACC TTTCATTAAC AACCAATTTT AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320 

CCTGACAGAA CTCAAAGACA CGTCTATCAG CTTATTCCTT CTCTACTGGA ATATTGGTAT 1380 
AGTCAATTCT TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C 



Seq ID NO: 254 Protein sequence 
Protein Accession #: NP_001641.1 

1 11 21 31 41 51 

I I I I I I 

MSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA EFLAMLIFVL LSLGSTINWG 60 
GTEKPLPVDM VLISLCFGLS IATMVQCFGH ISGGHINPAV TVAMVCTRKI SIAKSVFYIA 120 
AQCLGAIIGA GILYLVTPPS VVGGLGVTMV HGNLTAGHGL LVELIITFQL VFTIFASCDS 180 
KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG 240 
AVLAGGLYEY VFCPDVEFKR RFKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGVVH 300 
VIDVDRGEEK KGKDQSGEVL SSV 

Seq ID NO: 255 DNA sequence 
Nucleic Acid Accession #: U26742.1 
Coding sequence: 325..1449 

1 11 21 31 41 51 

I I I I I I 

CAGGAAACCC TGGTACTGGC AGCAGCCAGC CTCTGCTGTG CCCACATGAC CCACAACTCT 60 

GGCAGCGGAC CCGGCACTTC CAACATTATT AAATAATAAG AAAGCGGCTC CTACTCCAGG 120 

CTCAAACCTC CCTGCAGACC AATGGACACC TTCTAAGAGT TTGGCGAGTC AGTGACTGAA 180 

GCGCCCGTCC ATTCCAAGAT AAATAGGATT TACCAATCCT TGGATGAAGT GCTTGGGAAG 240 

TCTTTAAGTG CCATAATCAA CTGCCATTTC AAAGAATATA GATGGTTTTG AAAAGTTCAT 300 

GCTGTCCCTT CATTGAATTT TAGAATGATT GAAGATAGTG GGAAAAGAGG AAATACCATG 360 

GCAGAAAGAA GACAGCTGTT TGCAGAGATG AGGGCTCAAG ATCTGGATCG CATCCGACTC 420 

TCCACCTACA GAACAGCATG CAAGCTTAGG TTTGTTCAGA AGAAATGCAA TTTGCACCTG 480 

GTGGACATAT GGAATGTCAT AGAAGCATTG CGGGAAAATG CTCTGAACAA CCTGGACCCA 540 

AACACTGAAC TCAACGTGTC CCGCTTAGAG GCTGTGCTCT CCACTATTTT TTACCAGCTC 600 

AACAAACGGA TGCCAACCAC TCACCAAATC CATGTGGAGC AGTCCATCAG CCTCCTCCTT 660 

AACTTCCTGC TTGCAGCGTT TGATCCGGAA GGCCATGGTA AAATTTCAGT ATTTGCTGTC 720 

AAAATGGCTT TAGCCACATT GTGTGGAGGG AAGATCATGG ACAAATTAAG ATATATTTTC 780 

TCAATGATTT CTGACTCCAG TGGGGTGATG GTTTATGGAC GATATGACCA ATTCCTTCGG 840 

GAAGTTCTCA AACTACCCAC GGCAGTTTTT GAAGGTCCTT CATTTGGTTA CACAGAACAG 900 

TCAGCCAGAT CCTGTTTCTC CCAACAGAAA AAAGTCACGT TAAATGGTTT CTTGGACACG 960 

CTTATGTCAG ATCCTCCCCC GCAGTGTCTG GTCTGGTTGC CTCTTCTGCA TCGACTAGCA 1020 

AATGTGGAAA ATGTCTTCCA TCCGGTTGAG TGTTCCTACT GCCACAGTGA GAGTATGATG 1080 

GGATTTCGCT ACCGATGCCA ACAGTGTCAC AATTACCAGC TCTGTCAGGA CTGCTTCTGG 1140 

AGGGGACATG CCGGTGGTTC TCATAGCAAC CAGCACCAAA TGAAAGAGTA CACGTCATGG 1200 

AAATCACCTG CTAAGAAGCT GACTAATGCA TTAAGCAAGT CCCTGAGCTG TGCTTCCAGC 1260 

CGTGAACCTT TGCACCCCAT GTTCCCAGAT CAGCCTGAGA AGCCACTCAA CTTGGCTCAC 1320 

ATCGTTGATA CTTGGCCTCC CAGACCTGTA ACCAGCATGA ACGACACCCT GTTCTCCCAC 1380 

TCTGTTCCCT CCTCAGGAAG TCCTTTTATT ACCAGGAGCT CGGACGGTGC TTTTGGTGGA 1440 

TGCGTCTAGA TGGATAACAT GACTTCTTCT ACCCTAAAAT ATTCCTATAA TACTTTGAGC 1500 

TGTTCTGGTT CCTCCAGGGT GCATGGTACC CATTAACCCA AAATATGATT ATTTCCCTTT 1560 

TTTCCCATTT TCAGTCATTT TGGAATGTTC TCTGTGAACC ACAGTTGGGT TGTTTAAAGC 1620 

TCACATTTCT TTCTGTCACC ACAGAGATTG GCCTACGGTT TCTGTTTTGA GGGTGCTGTT 1680 
CAATAAAGCT GTGTACACTA AATGTCC 

Seq ID NO: 256 Protein sequence 
Protein Accession #: AAC50424.1 

1 11 21 31 41 51 

I I I I I I 

MIEDSGKRGN TMAERRQLPA EMRAQDLDRI RLSTYRTACK LRFVQKKCNL HLVDIWNVIE 60 

ALRENALNNL DPNTELNVSR LEAVLSTIFY QLNKRMPTTH QIHVEQSISL LLNFLLAAFD 120

PEGHGKISVP AVKMALATLC GGKIMDKLRY IFSMISDSSG VMVYGRYDQF LREVLKLPTA 180 

VPEGPSFGYT EQSARSCFSQ QKKVTLNGFL DTLMSDPPPQ CLVWLPLLHR LANVENVFHP 240 

VECSYCHSES MMGFRYRCQQ CHNYQLCQDC FWRGHAGGSH SNQHQMKEYT SWKSPAKKLT 300

NALSKSLSCA SSREPLHPMF PDQPEKPLNL AHIVDTWPPR PVTSMNDTLF SHSVPSSGSP 360
FITRSSDGAF GGCV 

Seq ID NO: 257 DNA sequence 

Nucleic Acid Accession #: NM_004172.1 

Coding sequence: 179..1807

1 11 21 31 41 51 

I I I I I I 

GCGGATTGTT GCTCCGTTGT ACCTGCTGGG GAATTCACCT CGTTACTGCT TGATATCTTC 60 

CACCCCTTAC AAAATCAGAA AAGTTGTGTT TTCTAATACC AAAGAGGAGG TTTGGCTTTC 120

TGTGGGTGAT TCCCAGACAC TGAAGTGCAA AGAAGAGACC CTCCTAGAAA AGTAAAATAT 180 

GACTAAAAGC AATGGAGAAG AGCCCAAGAT GGGGGGCAGG ATGGAGAGAT TCCAGCAGGG 240 

AGTCCGTAAA CGCACACTTT TGGCCAAGAA GAAAGTGCAG AACATTACAA AGGAGGATGT 300 

TAAAAGTTAC CTGTTTCGGA ATGCTTTTGT GCTGCTCACA GTCACCGCTG TCATTGTGGG 360 

TACAATCCTT GGATTTACCC TCCGACCATA CAGAATGAGC TACCGGGAAG TCAAGTACTT 420 

CTCCTTTCCT GGGGAACTTC TGATGAGGAT GTTACAGATG CTGGTCTTAC CACTTATCAT 480 

CTCCAGTCTT GTCACAGGAA TGGCGGCGCT AGATAGTAAG GCATCAGGGA AGATGGGAAT 540

GCGAGCTGTA GTCTATTATA TGACTACCAC CATCATTGCT GTGGTGATTG GCATAATCAT 600 

TGTCATCATC ATCCATCCTG GGAAGGGCAC AAAGGAAAAC ATGCACAGAG AAGGCAAAAT 660 

TGTACGAGTG ACAGCTGCAG ATGCCTTCCT GGACTTGATC AGGAACATGT TCCCTCCAAA 720

TCTGGTAGAA GCCTGCTTTA AACAGTTTAA AACCAACTAT GAGAAGAGAA GCTTTAAAGT 780 

GCCCATCCAG GCCAACGAAA CGCTTGTGGG TGCTGTGATA AACAATGTGT CTGAGGCCAT 840 

GGAGACTCTT ACCCGAATCA CAGAGGAGCT GGTCCCAGTT CCAGGATCTG TGAATGGAGT 900 

CAATGCCCTG GGTCTAGTTG TCTTCTCCAT GTGCTTCGGT TTTGTGATTG GAAACATGAA 960 

GGAACAGGGG CAGGCCCTGA GAGAGTTCTT TGATTCTCTT AACGAAGCCA TCATGAGACT 1020 

GGTAGCAGTA ATAATGTGGT ATGCCCCCGT GGGTATTCTC TTCCTGATTG CTGGGAAGAT 1080 

TGTGGAGATG GAAGACATGG GTGTGATTGG GGGGCAGCTT GCCATGTACA CCGTGACTGT 1140 
CATTGTTGGC TTACTCATTC ACGCAGTCAT CGTCTTGCCA CTCCTCTACT TCTTGGTAAC 1200 
ACGGAAAAAC CCTTGGGTTT TTATTGGAGG GTTGCTGCAA GCACTCATCA CCGCTCTGGG 1260 
GACCTCTTCA AGTTCTGCCA CCCTACCCAT CACCTTCAAG TGCCTGGAAG AGAACAATGG 1320 
CGTGGACAAG CGCGTCACCA GATTCGTGCT CCCCGTAGGA GCCACCATTA ACATGGATGG 1380
GACTGCCCTC TATGAGGCTT TGGCTGCCAT TTTCATTGCT CAAGTTAACA ACTTTGAACT 1440 
GAACTTCGGA CAAATTATTA CAATCAGCAT CACAGCCACA GCTGCCAGTA TTGGGGCAGC 1500 
TGGAATTCCT CAGGCGGGCC TGGTCACTAT GGTCATTGTG CTGACATCTG TCGGCCTGCC 1560 
CACTGACGAC ATCACGCTCA TCATCGCGGT GGACTGGTTC CTGGATCGCC TCCGGACCAC 1620 
CACCAACGTA
ACTGAAGAAC 
ACCATATCAA 
GATGTAGACT 
ATCTTTCCAT 
CTTCTGATAA 
GTGTACAATT 
TAAGGTACCA 
TCCTATAAAT 
TAAAATCAAC 
TGTTAACAAC 
TTCTGACTGG 
TGAGGACAGA 
GGCCTCAGTG 
CAGCCCAGAC 
CTGTACCATC 
CTAACTAAGG 
TTATGCAAGT 
CCCAGTTCAT 
TGCCACTCCT 
GGTGCTTGTA 
ATTACAAACA 
AAGAAGAGGT 
AATCATTCCC 
AGTCAAATGA 
AAGTGTTTTT 
GGAATCCCAC 
GTTTGTGTCC 
TTATGAGAAA 
TTAATTTTTA 
TAGCAGCTGT 
AAACTGAAAA 
CCCAAGCATT 
GAGGCTCTAT 
CTAGTAGAGC 
ATATACTAAC 
CACAGGCTCT 
TTTTCAATAA 



CTGGGAGACT 
AGAGATGTTG 
CTGATTGCAC 
AACATAAAGA 
CTCATTACAG 
GACTGGAAAA 
TTCATCCCAC 
AGATCACAAA 
GTGATTTTTT 
TTTTGAAATT 
TTGAATTACA 
TATACCTATT 
CAGAAGGCAA 
TCCTCATCTA 
ATCCTGTGAT 
CATGGAGCAC 
ACAGGATGGC 
CGCCAGGCAG 
CCCAATGGGG 
CAACTGATGA 
GTCACACTGT 
AAGGATTTGA 
TCATTTTTCT 
CCCGTGAAGT 
ATGAGCTCTC 
CTTTAATCAA 
TTATCAAATC 
ATAAGAGAAA 
AAAACAGCAG 
CCCCCTAAGG 
GTTTGAGTTT 
ATCTAGACAT 
CTGTGCATAA 
TTCGGAAATA 
AAAGACTTAA 
CATTTCTTAT 
GCATACACAT 
AGAAAAAAAT 



CCCTGGGAGC 
AAATGGGTAA 
AGGACAATGA 
AACACTTTCT 
CTCATTCGCT 
TAGTCCTCCA 
AATTGAAATT 
TAGTGTTGAT 
TATATAAGTT 
TAAAAATCTT 
ACCGGTTATC 
TCATTAGTAG 
AGTTTCCATG 
TAAAATGAGG 
GTCATGAAAG 
GAGGCCTTCA 
TGTGTGTGGT 
GAGGCCATTC 
GAAGTATTCC 
TAGACTTCGA 
GGAACACTAA 
AAACACCAAG 
GGCTTTTTTT 
CTGCTTACCA 
TAATAGAAGT 
AAGTCCTTAG 
ATTCAAAACT 
TGGTAGAAGA 
GGGCATTAGT 
AATATCCAGT 
CTGGCTGAAA 
AGATCCTCTG 
AAGTTAACAT 
CACTACAAAT 
TCATATCAAC 
GGAAAGGTCC 
GCACTCAGTG 
AGTTTTACAT 



TGGGATTGTG 
CTCAGTGATT 
AACTGAGAAA 
TGAGCACCAG 
CCAGCAAGCC 
AAACACAAGG 
TTTAAATCAT 
CAGATCTTAC 
AAAGAGACAA 
TCAGAATACA 
AGTTGGACAG 
CTAGGTGCAC 
TGGCCTTGAG 
GACTTCCCTA 
CACCTGCCCT 
GAAAAGACAC 
GGTCACCAGG 
CAGGAGTGGG 
CTTCTTTCCT 
AAACAGATGA 
AGAGCTAGGA 
AGTACAGGTC 
TTCACCTGAA 
AAACATAAGA 
CCATGAGTTG 
AATGAGGGAA 
TTCAGCTGGA 
TGAATCAGTA 
TTCAGGCAAG 
CAAAGACGCT 
ATGGTGAAGA 
ATATACAATT 
TAGGCTGTGG 
GTTAAAGTAC 
TTAATTCTGT 
TGTGGGGAGC 
TGGACTGGGA 
T 



GAGCACTTGT 
GAAGAGAATG 
CCCATCGACA 
GTGTTAAAAA 
CGTCATCTTC 
GAGGATTTTG 
TTCATGTTAG 
AAGTTTATGT 
ATAGTAGGCT 
ATTCAGTTTT 
TAAGATTTTA 
ATATACATCT 
CAAGTCCCAT 
GAAGTCTTCA 
CTGTTTCCCC 
TTCAATGGGA 
TCCTGTGAGC 
ATTATTCATC 
ACTCTGGGAA 
GAAGACTAGC 
AAGAGTTGAG 
TTCTTTAAGG 
ACACTTTTTC 
CGACTTATAT 
AGTGGGTATT 
ACAAAATATT 
GTGGGGTTTG 
TGAAGACACT 
GCAGCTCCCA 
GAGTGGGAGC 
ATGGACTTAA 
AGAGATATTT 
TGCAGTAACC 
GTGGCTGTCC 
TACACAATAT 
CCATCCTCTC 
AGCATTACTT 



CACGACATGA 
AAATGAAGAA 
GTGAAACCAA 
CCATTATAAA 
CCTTTCCTCC 
GGTGGCCAAA 
TCTTACCGAA 
GGCACACAAT 
AAAAACATTT 
AGTTTCAAAA 
TCCCTTTCTC 
AGCACAGCTG 
CTCACCTCTA 



TCAGAACACC 
GTGAACATTT 
AAAGTGCAGG 
AAACTCTTTG 
GAATGTCTCC 
AGCTAGCAAG 
CACAGGCAAC 
AAGAATAAAA 
TCGAGTCCAA 
ATTTGAAAGA 
TCTTATTTGA 
TATTTGTTTT 
CTTTTGTTTT 
GTCAATGAGG 
GGTTTAGAGA 
TGTCAGGCAG 
TTATGCTAAC 
TTATATAGAC 
ATTTAATGTC 
TCTTAAGACA 
GTGTTTTTTA 
GCCAAGCCAT 
TGTAGATGTA 
Seq ID NO: 258 Protein sequence
Protein Accession #: NP_004163.1 



1 11 21 31 41 51

I I I I I I

MTKSNGEEPK MGGRMERFQQ GVRKRTLLAK KKVQNITKED VKSYLFRNAF VLLTVTAVIV 60
GTILGFTLRP YRMSYREVKY FSFPGELLMR MLQMLVLPLI ISSLVTGMAA LDSKASGKMG 120
MRAVVYYMTT TIIAVVIGII IVIIIHPGKG TKENMHREGK IVRVTAADAF LDLIRNMFPP 180
NLVEACFKQF KTNYKKRSFK VPIQANETLV GAVINNVSKA METLTRITEE LVPVPGSVNG 240
VNALGLVVFS MCFGFVIGNM KEQGQALREF FDSLNEAIMR LVAVIMWYAP VGILFLIAGK 300
IVEMEDMGVI GGQLAMYTVT VIVGLLIHAV IVLPLLYFLV TRKNPWVFIG GLLQALITAL 360
GTSSSSATLP ITFKCLEENN GVDKRVTRFV LPVGATINMD GTALYEALAA IFIAQVNNFE 420
LNFGQIITIS ITATAASIGA AGIPQAGLVT MVIVLTSVGL PTDDITLIIA VDWFLDRLRT 480
TTNVLGDSLG AGIVEHLSRH ELKNRDVEMG NSVIEENEMK KPYQLIAQDN ETEKPIDSET 540
KM



Seq ID NO: 259 DNA sequence 

Nucleic Acid Accession #: NM_021948.

Coding sequence: 48..2783



TGTGGCACTG 
TCCTGCCCCT 
TGGAAGGAGA 
TGCAGGGCGT 
CGCCGAGCCG 
GCCGGGAGGC 
GGTTCGGOGT 
GCGAGCTGCG 
ACAGCAGCGA 
CTGCCOGCTA 
ACATCGCCAC 
CTGGCTGGCT 
ACGGAGACAT 
TCTATGATGT 
CAGAGAAGCT 
CCACCACGGG 
GGCTAGCTGA 
GCTTGCCTGG 
ACAGCCGCTT 
CCTCCAACCC 
AGACCCTGGA 
TCTACTCCAT 
CAGAGGCCCC 



11 
I 

CCTGCGTACC 
GCTGGCAGCC 
CAGCTCAGAG 
GCTCGGCGGC 
CCGGGCTGTG 
AGAGGTGCTG 
GGCACTGCCT 
CCCCAACGAC 
OGCTGTGGAG 
TGCTTTCTCC 
CCCGGAGCAG 
GTCGGATCAG 
GGATGGCTTC 
GTACTGTTAT 
GACATTGGAG 
CCAACTGTAT 
TGGCAGTGTG 
TGTCAAGACT 
CAACGTCTAC 
AGCCTCCAAC 
GGAACTGCAG 
CCCCATCATG 
TAGGACGCTC 



21 
I 

CAACCCCAGC 
CTGGTCCTGG 
GACCGCGCTT 
GCCCTCACCA 
CTGGGCTCTC 
GTGGCGCGGG 
GCGTACCCAG 
TCAGGTATCT 
GTCAAGGTCA 
TTTTCTGGGG 
CTCTATGCCG 
ACCGTGAGGT 
CCCGGGGTCC 
GCTGAAGACC 
GAAGCACGGG 
GCAGCCTGGG 
CGCTACCCCA 
CTCTTCCTCT 
TGCTTCCGAG 
CCAGCCTCTG 
CTGCCTCAGG 
GAGGACGGAG 
CTAGAATTTG 



31 
I 

CCTGGGTAGC 
CCCAGGCTCC 
TTCGCGTGCG 
TCCCTTGCCA 
CGCGGGTCAA 
GAGTGCGCGT 
CGTOGCTCAC 
ATCGCTGTGA 
AAGGGGTCGT 
CCCAGGAGGC 
CCTACCTTGG 
ATCCCATCCA 
GGAACTATGG 
TAAATGGAGA 
CGTACTGCCA 
ATGGTGGCCT 
TCGTCACACC 
TCCCCAACCA 
ACTCGGCCCA 
ATGGACTAGA 
AAGCCACAGA 
GAGGTGGAAG 
AAACACAATC 



41 
I 

CTGCAGCATG 
TGCAGCTTTA 
CATCGCGGGC 
CGTCCACTAC 
GTGGACTTTC 
CAAGGTGAAC 
CGACGTCTCC 
GGTCCAGCAC 
CTTTCTCTAC 
CTGTGCCCGC 
GGGCTATGAG 
GACCCCACGA 
TGTGGTGGAC 
ATTGTTCCTG 
GGAGCGGGGT 
GGACCACTGC 
CAGCCAGCGC 
GACTGGCTTC 
GCCTTCTGCC 
GGCTATCGTC 
GAGTGAATCC 
CTCCACTCCA 
CATGGTACGG 




GAGGCCTACC 
CTGGGGCTGA 
GGCATCGATG 
CGAGAGGGCT 
ATTGGAGCCC 
CAATGTGATG 
GAGGCCTGTT 
CCGGATGACC 
GGTGACCCTC 
GCAGAGATTG 
AGCCCAGGGT 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
TCTCAGAAGA GGAAGGTAAG GCATTGGAGG AAGAAGAGAA ATATGAAGAT GAAGAAGAGA 1440

AAGAGGAGGA AGAAGAAGAG GAGGAGGTGG AGGATGAGGC TCTGTGGGCA TGGCCCAGCG 1500 

AGCTCAGCAG CCCGGGCCCT GAGGCCTCTC TCCCCACTGA GCCAGCAGCC CAGGAGGAGT 1560 

CACTCTCCCA GGCGCCAGCA AGGGCAGTCC TGCAGCCTGG TGCATCACCA CTTCCTGATG 1620 

GAGAGTCAGA AGCTTCCAGG CCTCCAAGGG TCCATGGACC ACCTACTGAG ACTCTGCCCA 1680 

CTCCCAGGGA GAGGAACCTA GCATCCCCAT CACCTTCCAC TCTGGTTGAG GCAAGAGAGG 1740 

TGGGGGAGGC AACTGGTGGT CCTGAGCTAT CTGGGGTCCC TCGAGGAGAG AGCGAGGAGA 1800

CAGGAAGCTC CGAGGGTGCC CCTTCCCTGC TTCCAGCCAC ACGGGCCCCT GAGGGTACCA 1860

GGGAGCTGGA GGCCCCCTCT GAAGATAATT CTGGAAGAAC TGCCCCAGCA GGGACCTCAG 1920 

TGCAGGCCCA GCCAGTGCTG CCCACTGACA GCGCCAGCCG AGGTGGAGTG GCCGTGGTCC 1980 

CCGCATCAGG TGACTGTGTC CCCAGCCCCT GCCACAATGG TGGGACATGC TTGGAGGAGG 2040 

AGGAAGGGGT CCGCTGCCTA TGTCTGCCTG GCTATGGGGG GGACCTGTGC GATGTTGGCC 2100 

TCCGCTTCTG CAACCCCGGC TGGGACGCCT TCCAGGGCGC CTGCTACAAG CACTTTTCCA 2160

CACGAAGGAG CTGGGAGGAG GCAGAGACCC AGTGCCGGAT GTAGGGCGCG CATCTGGCCA 2220 

GCATCAGCAC ACCCGAGGAA CAGGACTTCA TCAACAACCG GTACCGGGAG TACCAGTGGA 2280

TCGGACTCAA CGACAGGACC ATCGAAGGCG ACTTCTTGTG GTCGGATGGC GTCCCCCTGC 2340 

TCTATGAGAA CTGGAACCCT GGGCAGCCTG ACAGCTACTT CCTGTCTGGA GAGAACTGCG 2400 

TGGTCATGGT GTGGCATGAT CAGGGACAAT GGAGTGACGT GCCCTGCAAC TACCACCTGT 2460

CCTACACCTG CAAGATGGGG CTGGTGTCCT GTGGGCCGCC ACCGGAGCTG CCCCTGGCTC 2520 

AAGTGTTCGG CCGCCCACGG CTGCGCTATG AGGTGGACAC TGTGCTTCGC TACCGGTGCC 2580 

GGGAAGGACT GGCCCAGCGC AATCTGCCGC TGATCCGATG CCAAGAGAAC GGTCGTTGGG 2640 

AGGCCCCCCA GATCTCCTGT GTGCCCAGAA GACCTGCCCG AGCTCTGCAC CCAGAGGAGG 2700 

ACCCAGAAGG ACGTCAGGGG AGGCTACTGG GACGCTGGAA GGCGCTGTTG ATCCCCCCTT 2760 

CCAGCCCCAT GCCAGGTCCC TAGGGGGCAA GGCCTTGAAC ACTGCCGGCC ACAGCACTGC 2820 
CCTGTCACCC AAATTTTCCC TCACACCCTG CGCTCACCAC AGGAAGTGAC AACATGAC 

Seq ID NO: 260 Protein sequence 
Protein Accession #: NP_068767.1

1 11 21 31 41 51 

I I I I I I

MAQLFLPLLA ALVLAQAPAA LADVLEGDSS EDRAFRVRIA GDAPLQGVLG GALTIPCHVH 60 

YLRPPPSRRA VLGSPRVKWT FLSRGREAEV LVARGVRVKV NEAYRFRVAL PAYPASLTDV 120 

SLALSELRPN DSGIYRCEVQ HGIDDSSDAV EVKVKGVVFL YREGSARYAF SFSGAQEACA 180

RIGAHIATPE QLYAAYLGGY EQCDAGWLSD QTVRYPIQTP REACYGDMDG FPGVRNYGVV 240

DPDDLYDVYC YAEDLNGELF LGDPPEKLTL EEARAYCQER GAEIATTGQL YAAWDGGLDH 300 

CSPGWLADGS VRYPIVTPSQ RCGGGLPGVK TLFLFPNQTG FPNKHSRFHV YCFRDSAQPS 360 

AIPEASNPAS NPASDGLEAI VTVTETLEEL QLPQEATESE SRGAIYSIPI MEDGGGGSST 420

PEDPAEAPRT LLEFETQSMV PPTGFSEEEG KALEEEEKYE DEEEKEEEEE EEEVEDEALW 480 

AWPSELSSPG PEASLPTEPA AQEESLSQAP ARAVLQPGAS PLPDGESEAS RPPRVHGPPT 540 

ETLPTPRERN LASPSPSTLV EAREVGEATG GPELSGVPRG ESEETGSSEG APSLLPATRA 600 

PEGTRELEAP SEDKSGRTAP AGTSVQAQPV LPTDSASRGG VAVVPASGDC VPSPCHNGGT 660

CLEEEEGVRC LCLPGYGGDL CDVGLRFCNP GWDAFQGACY KHFSTRRSWE EAETQCRMYG 720 

AHIASISTPE EQDFINNRYR EYQWIGLNDR TIEGDFLWSD GVPLLYENWN PGQPDSYFLS 780 

GENCVVMVWH DQGQWSDVPC NYHLSYTCKM GLVSCGPPPE LPLAQVFGRP RLRYEVDTVL 840

RYRCREGLAQ RNLPLIRCQE NGRWEAPQIS CVPRRPARAL HPEEDPEGRQ GRLLGRWKAL 900 
LIPPSSPMPG P 

Seq ID NO: 261 DNA sequence 

Nucleic Acid Accession #: NM_004386.1

Coding sequence: 2..3967

1 11 21 31 41 51 

I I I I I I

GATGGGGGCC CCGTTTGTCT GGGCCTTGGG CCTTTTGATG CTGCAGATGC TGCTCTTTGT 60

GGCTGGGGAA CAGGGCACAC AGGATATCAC CGATGCCAGC GAAAGGGGGC TCCACATGCA 120 

GAAGCTGGGG TCTGGGTCAG TGCAGGCTGC GCTGGCGGAG CTGGTGGCCC TGCCCTGTCT 180 

CTTTACCCTG CAGCCACGGC CAAGCGCAGC CCGAGATGCC CCTCGGATAA AGTGGACCAA 240

GGTGCGGACT GCGTCGGGCC AGCGACAGGA CTTGCCCATC CTGGTGGCCA AGGACAATGT 300 

CGTGAGGGTG GCCAAAAGCT GGCAGGGACG AGTGTCACTG CCTTCCTACC CCCGGCGCCG 360 

AGCCAACGCC ACGCTACTTC TGGGGCCACT GAGGGCCAGT GACTCTGGGC TGTACCGCTG 420 

CCAGGTGGTG AGGGGCATGG AGGATGAGCA GGACCTGGTG CCCTTGGAGG TGACAGGTGT 480 

TGTGTTCCAC TACCGATCAG CCCGGGACCG CTATGCACTG ACCTTCGCTG AGGCCCAGGA 540 

GGCCTGCCGT CTCAGCTCAG CCATCATTGC AGCCCCTCGG CATCTACAGG CTGCCTTTGA 600

GGATGGCTTT GACAACTGTG ATGCTGGCTG GCTCTCTGAC CGCACTGTTC GGTATCCTAT 660 

CACCCAGTCC CGTCCTGGTT GCTATGGCGA CCGTAGCAGC CTTCCAGGGG TTCGGAGCTA 720 

TGGGAGGCGC AACCCACAGG AACTCTACGA TGTGTATTGC TTTGCCCGGG AGCTGGGGGG 780 

CGAGGTCTTC TACGTGGGCC CGGCCCGCCG CCTGACACTG GCCGGCGCGC GTGCACAGTG 840 

CCGCCGCCAG GGTGCCGCGC TGGCCTCGGT GGGACAGCTG CACCTGGCCT GGCATGAGGG 900 

CCTGGACCAG TGCGACCCGG GCTGGCTGGC CGACGGCAGC GTGCGCTACC CGATCCAGAC 960

GCCGCGCCGG CGCTGCGGGG GCCCAGCCCC GGGCGTGCGC ACCGTCTACC GCTTCGCTAA 1020 

CCGGACCGGC TTCCCCTCAC CCGCCGAGCG CTTCGACGCC TACTGCTTCC GAGCTCATCA 1080 

CCCCACGTCA CAACATGGAG ACCTAGAGAC CCCATCCTCT GGGGATGAGG GGGAGATTCT 1140 

GTCAGCAGAG GGGCCCCCAG TTAGAGAACT GGAGCCCACC CTGGAGGAGG AAGAGGTGGT 1200 

CACCCCTGAC TTCCAGGAGC CTCTGGTGTC CAGTGGGGAA GAAGAAACCC TGATTTTGGA 1260 

GGAGAAGCAG GAGTCTCAAC AGACCCTCAG CCCTACCCCT GGGGACCCCA TGCTGGCCTC 1320 

ATGGCCCACT GGGGAAGTGT GGCTAAGCAC GGTGGCCCCC AGCCCTAGCG ACATGGGGGC 1380 

AGGCACTGCA GCAAGTTCAC ACACGGAGGT GGCCCCAACT GACCCTATGC CTAGGAGAAG 1440 

GGGGCGCTTC AAAGGGTTGA ATGGGCGCTA CTTCCAGCAG CAGGAACCGG AGCCGGGGCT 1500 

GCAAGGGGGG ATGGAGGCCA GCGCCCAGCC CCCCACCTCA GAGGCTGCAG TGAACCAAAT 1560 

GGAGCCTCCG TTGGCCATGG CAGTCACAGA GATGTTGGGC AGTGGCCAGA GCCGGAGCCC 1620 

CTGGGCTGAT CTGACCAATG AGGTGGATAT GCCTGGAGCT GGTTCTGCTG GTGGCAAGAG 1680 

CTCCCCAGAG CCCTGGCTGT GGCCCCCTAC CATGGTCCCA CCCAGCATCT CAGGCCACAG 1740 

CAGGGCCCCT GTCCTGGAGC TAGAGAAAGC CGAGGGCCCC AGTGCCAGGC CAGCCACCCC 1800 
AGACCTGTTT TGGTCCCCCT TGGAGGCCAC TGTCTCAGCT CCCAGCCCTG CCCCCTGGGA 1860
GGCATTCCCT GTGGCCACCT CCCCAGATCT CCCTATGATG GCCATGCTGC GTGGTCCCAA 1920 
AGAGTGGATG CTACCACACC CCACCCCCAT CTCCACCGAG GCCAATAGAG TTGAGGCACA 1980 
TGGTGAGGCC ACCGCCACGG CTCCACCCTC CCCTGCTGCA GAGACCAAGG TGTATTCCCT 2040
GCCTCTCTCT TTGACCCCAA CAGGACAGGG TGGAGAGGCC ATGCCCACAA CACCTGAGTC 2100 
CCCCAGGGCA GACTTCAGAG AAACTGGGGA GACCAGCCCT GCTCAGGTCA ACAAAGCTGA 2160 
GCACTCCAGC TCCAGCCCAT GGCCTTCTGT AAACAGGAAT GTGGCTGTAG GTTTTGTCCC 2220 
CACTGAGACT GCCACTGAGC CAACGGGCCT CAGGGGTATC CCGGGGTCTG AGTCTGGGGT 2280 
CTTCGACACA GCAGAAAGCC CCACTTCTGG CTTGCAGGCC ACTGTAGATG AGGTGCAGGA 2340 
CCCCTGGCCC TCAGTGTACA GCAAAGGGCT GGATGCAAGT TCCCCATCTG CCCCCCTGGG 2400 
GAGCCCTGGA GTCTTCTTGG TACCCAAAGT CACCCCAAAT TTGGAGCCTT GGGTTGCTAC 2460 
AGATGAAGGA CCCACTGTGA ATCCCATGGA TTCCACAGTC ACGCCGGCCC CCAGTGATGC 2520 
TAGTGGAATT TGGGAACCTG GATCCCAGGT GTTTGAAGAA GCCGAAAGCA CCACCTTGAG 2580 
CCCTCAGGTG GCCCTGGATA CAAGCATTGT GACGCCCCTC ACGACCCTGG AGCAGGGGGA 2640 
CAAGGTTGGA GTTCCAGCCA TGTCTACACT GGGCTCCTCA AGCTCCCAAC CCCACCCAGA 2700 
GCCAGAGGAT CAGGTGGAGA CCCAGGGAAC ATCAGGAGCT TCAGTGCCTC CGCATCAGAG 2760 
CAGTCCCCTA GGGAAACCGG CTGTTCCTCC TGGGACACCG ACTGCAGCCA GTGTGGGCGA 2820 
GTCTGCCTCA GTTTCCTCAG GGGAGCCTAC GGTACCGTGG GACCCCTCCA GCACCCTGCT 2880 
GCCTGTCACC CTGGGCATAG AGGACTTCGA ACTGGAGGTC CTGGCAGGGA GCCCGGGTGT 2940
AGAGAGCTTC TGGGAGGAGG TGGCAAGTGG AGAGGAGCCA GCCCTGCCAG GGACCCCTAT 3000 
GAATGCAGGT GCGGAGGAGG TGCACTCAGA TCCCTGTGAG AACAACCCTT GTCTTCATGG 3060 
AGGGACATGT AATGCCAATG GCACCATGTA TGGCTGTAGC TGTGATCAGG GCTTCGCCGG 3120 
GGAGAACTGT GAGATTGACA TTGATGACTG CCTCTGCAGC CCCTGTGAGA ATGGAGGCAC 3180 
CTGTATTGAT GAGGTCAATG GCTTTGTCTG CCTTTGCCTC CCCAGCTATG GGGGCAGCTT 3240 
TTGTGAGAAA GACACCGAGG GCTGTGACCG CGGCTGGCAT AAGTTCCAGG GCCACTGTTA 3300 
CCGCTATTTT GCCCACCGGA GGGCATGGGA AGATGCCGAG AAGGACTGCC GCCGCCGCTC 3360 
CGGCCACCTG ACCAGCGTCC ACTCACCGGA GGAACACAGC TTCATTAATA GCTTTGGGCA 3420 
TGAAAACACG TGGATCGGCC TGAACGACAG GATCGTGGAG AGAGATTTCC AGTGGACGGA 3480
CAACACCGGG CTGCAATTTG AGAACTGGCG AGAGAACCAG CCGGACAATT TCTTCGCGGG 3540
TGGCGAGGAC TGTGTGGTGA TGGTGGCGCA TGAAAGCGGG CGCTGGAACG ATGTCCCCTG 3600
CAACTACAAC CTACCCTATG TCTGCAAGAA GGGCACAGTG CTCTGTGGTC CCCCTCCGGC 3660 
AGTGGAGAAT GCCTCACTCA TCGGTGCCCG CAAGGCCAAG AACAATGTCC ATGCCACTGT 3720 
AAGGTACCAG TGCAATGAAG GATTTGCCCA GCACCATGTG GTCACCATTC GATGCCGGAG 3780 
CAATGGCAAG TGGGACAGGC CCCAAATTGT CTGCACCAAA CCCAGACGTT CACATCGGAT 3840 
GCGGGGACAC CACCACCACC ACCAACACCA CCACCAGCAT CACCACCACA AATCCCGCAA 3900 
GGAGCGCAGA AAACACAAGA AACACCCAAC GGAGGACTGG GAGAAGGACG AAGGGAATTT 3960
TTGCTGAAGA ACCAGAAAAA AGAAAGCACA ACACCTTTCC CATGCCTCCT CTGGAGCCTT 4020 
CGCCTGGGGA GACAGAACCC AGAGAGAAAC AAGAGAGTCC AGAAGTCCCT GAACCCCAAA 4080 
CTGTTCTCGC AAAAAAAATA TTCCTTTGAA CAAAGGTCTT CTTTTCCTTT TTTTACATAC 4140 
ACAAGATCTT CTTGGCAGGT GGAGCCAGGT GTCTGAAAAG TTCATTCTCG TCTGGCTGAA 4200 
CTCTGGGAGT GTGTCCCAGC TGAGGGAAGC ACAAGTAGCA AAGCTCATTG GTCTGGTCTC 4260 
TTGTTTGCCA GGCTGATTGA AGCAGGCCTT GATGAGGGTG CATGAGTGTA TGTTTGCATT 4320 
CACATGAAGG AATTGCTTTT CACACCAGAA ATTCAGACTT AGTCAATGTT GGCTGAATTC 4380 
CTAAATCCAG GAAGAAGCCT GGACGTAGGG TCATTAGCTT TGGGAATAGA AGGCTACACA 4440 
GAAGCACACT GTTTTTGAAC TTGACAACAG CTCTCCCTTT ACCCTGGACT TCAGCCCAAG 4500 
TTCCGTCTTT GGTCTTGGTG GATAAACACA CAGTGTGGAG ATCCCACGTA CTGCATTTTA 4560 
GGGATGTTTT TAGGACAACC TCCCTCCATG CCTTCAGAGT TAGGAGTGAG AATGATCAAA 4620 
GCAATATGTA GGTGATGGAG GGAGAGTGTA TTGCTAACCC TTCCAGGTCT AGTCCAGCGC 4680 
TGAGATTTGG TGGTTCTGCA TGTGTGATGA ATCTCTTTCA CACAAATAGA CGAGAGGATA 4740 
TTTAGGGCTA GATGAGCCCA GATTTCTTCC CCCTCCATCT CTCAGGGAGA CAAAGAACCT 4800 
CCTTCCTGGA CCAAGGAGGT GCTGCCAAGT TTTCTAGCCC AGTGCACATA CCGAGTCCTT 4860 
AAGCAGACAT TGGTAGTGCC CCTGCCCTGG GTCCCACTCC TGCCCCACCC CACCCTTGTC 4920 
CCTGGCCATT GCCTGGTGGT CTAGAAACAC TTAAAACTTG AAGTAGTGAC ACCTACCTGC 4980 
GGTCATATTG TAGAGAGATG CTCAGTGTTA AAACTGAAAC ACACAAACAC ACACACACAC 5040 
ACATTTTTCT CTTGTAGATT TTAATTTTTT AAGTGGGAAA GAACTCACCT TGCCTTCCTC 5100 
CCCCAAATGT GCAACCTGTA AAAGGTCTCT CCACACCAGG GGCCAGGATC CAGTTCCCTC 5160 
ATCTCTGGCA GGAAAGATCC ACAGCTTTTC CTCCATGTCT GTTACTCACT TTCAGCAGTC 5220 
CGGGTAAAAT CTGTGGATCA GGGTTAAAAA AGCACCGTGG AGAATGGCCC TCTTCAGGAA 5280 
AGAAAAATAA GCAAATGAAT GGTCCACCTA GGGGTTCAGT AAAGAAAGAA ATGTGTTAAC 5340 
TGAGCCTGAA TCCCTTCTGG GAAGTAATAA TGACCATTGA CAACTAAGAA GTAGACACCA 5400 
TGCTAAAGAC TTACATACAA TCTCCTTGAA TCTTCTCAAT AGCCCATTGA CTTAGAAACT 5460 
GTTACTTTCC CATTTTACAC ACAGTGAAAC TGAGGCTCAG ATATAAAGGA AAGGTACTGG 5520 
CTTGAAGTCA CAACCACGAC AGGAGTAAGG ATTTGGAATA AGGATTTGGT CCTGTTTTCT 5580 
GGACCAAATC CTTACTCTGG CTCTGCTTAC ACTTTCTCTC CATCACCAAA TCCTTACTCC 5640
AAATCCAGAA GTCAGAGCCA ACTCCCATCT TGGTTCTGAC CGAAATCCTG CTCTGGACTC 5700 
TGGAGAGGAG ATTGAAATAT AATTGCACCC TCATACACAT TTAGGAAATG GTTAAGAAGT 5760 
GTAAACTGAA CCCTTATCCT TGTCTTCAAT CTTCCTCCCT GTAGACATCT ATCTTATTAT 5820 
GGTTATTATT CAGAAAACCC AGGGATACAG GTTTGTCTTC TTACTTTGAT AACTCTTCTT 5880 
AGTTTAAAAT AATAATAATA ACACATCTTT GGTCATCTAT GTCACACAAA AATTTTCCTT 5940 
TGTTTGCGGG GGGCTGGGGA TGCAGTGTTT TTTGGGGGGT CTTGGTTTAT GCTCCCTGCC 6000 
CTTGAGCCCC TCAGCCGTTT GCCCTGCCCC CACCTCGGCT CCATGGTGGG AGGGGGCTCT 6060 
GGTCTTTTCT AAAGTGGGCG GTTTGTCTTT TGATCTTTCC CTTTTGGATG TGCGTGTGTG 6120 
TCTGCGTGTG CCATGTGCGT GGCACGCATA TGAGTGTGTG TGCGTGTGAA CGGCTTTGGG 6180
TCCTGCTGGT TTTGCTGTGA GCTGCAGTGT TCTGTGGGTC TGTGGTATCT GACACTGTGG 6240 
ACATTAATGT ACTTCTTGGA CATTTTAATA AATTTTTTAA CAGTTCAAAA AAAAAAAAAA 6300 
AAAAAAAAAA 

Seq ID NO: 262 Protein sequence 
Protein Accession #: NP_004377.1



1 11 21 31 41 51 

I I I I I I

MGAPFVWALG LLMLQMLLFV AGEQGTQDIT DASERGLHMQ KLGSGSVQAA LAELVALPCL 60
FTLQPRPSAA RDAPRIKWTK VRTASGQRQD LPILVAKDNV VRVAKSWQGR VSLPSYPRRR 120
ANATLLLGPL RASDSGLYRC QVVRGIEDEQ DLVPLEVTGV VFHYRSARDR YALTFAEAQE 180
ACRLSSAIIA APRHLQAAFE DGFDNCDAGW LSDRTVRYPI TQSRPGCYGD RSSLPGVRSY 240
GRRNPQELYD VYCFARELGG EVFYVGPARR LTLAGARAQC RRQGAALASV GQLHLAWHEG 300
LDQCDPGWLA DGSVRYPIQT PRRRCGGPAP GVRTVYRFAN RTGFPSPAER FDAYCFRAHH 360
PTSQHGDLET PSSGDEGEIL SAEGPPVREL EPTLEEEEVV TPDFQEPLVS SGEEETLILE 420
EKQESQQTLS PTPGDPMLAS WPTGEVWLST VAPSPSDMGA GTAASSHTEV APTDPMPRRR 480
GRFKGLNGRY FQQQEPEPGL QGGMEASAQP PTSEAAVNQM EPPLAMAVTE MLGSGQSRSP 540
WADLTNEVDM PGAGSAGGKS SPEPWLWPPT MVPPSISGHS RAPVLELEKA EGPSARPATP 600
DLFWSPLEAT VSAPSPAPWE AFPVATSPDL PMMAMLRGPK EWMLPHPTPI STEANRVEAH 660
GEATATAPPS PAAETKVYSL PLSLTPTGQG GEAMPTTPES PRADFRETGE TSPAQVNKAE 720
HSSSSPWPSV NRNVAVGFVP TETATEPTGL RGIPGSESGV FDTAESPTSG LQATVDEVQD 780
PWPSVYSKGL DASSPSAPLG SPGVFLVPKV TPNLEPWVAT DEGPTVNPMD STVTPAPSDA 840
SGIWEPGSQV FEEAESTTLS PQVALDTSIV TPLTTLEQGD KVGVPAMSTL GSSSSQPHPE 900
PEDQVETQGT SGASVPPHQS SPLGKPAVPP GTPTAASVGE SASVSSGEPT VPWDPSSTLL 960
PVTLGIEDFE LEVLAGSPGV ESFWEEVASG EEPALPGTPM NAGAEEVHSD PCENNPCLHG 1020
GTCNANGTMY GCSCDQGFAG ENCEIDIDDC LCSPCENGGT CIDEVNGFVC LCLPSYGGSF 1080
CEKDTEGCDR GWHKFQGHCY RYFAHRRAWE DAEKDCRRRS GHLTSVHSPE EHSFINSFGH 1140
ENTWIGLNDR IVERDFQWTD NTGLQFENWR EKQPDNFFAG GEDCVVMVAH ESGRWNDVPC 1200
NYNLPYVCKK GTVLCGPPPA VENASLIGAR KAKNNVHATV RYQCNEGFAQ HHVVTIRCRS 1260
NGKWDRPQIV CTKPRRSHRM RGHHHHHQHH HQHHHHKSRK ERRKHKKHPT EDWEKDEGNF 1320
C



Seq ID NO: 263 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 7..2085



1 11 21 31 41 51 

I I I I I I 

GCCGCGATGG CCAGCACCAG GAGTATCGAG CTGGAGCACT TTGAGGAACG GGACAAAAGG 60 

CCGCGGCCGG GGTCGCGGAG AGGGGCCCCC AGCTCCTCCG GGGGCAGCAG CAGCTCGGGC 120 

CCCAAGGGGA ACGGGCTCAT CCCCAGTCCG GCGCACAGTG CCCACTGCAG CTTCTACCGC 180 

ACGCGGACCC TGCAGGCCCT CAGCTCGGAG AAGAAGGCCA AGAAGGCGCG CTTCTACCGG 240 

AACGGGGACC GCTACTTCAA GGGCCTGGTG TTTGCCATCT CCAGCGACCG CTTCCGGTCC 300 

TTCGATGCGC TCCTCATAGA GCTCACCCGC TCCCTGTCGG ACAACGTGAA CCTGCCCCAG 360 

GGTGTCCGCA CTATCTACAC CATCGACGGC AGCCGGAAGG TCACCAGCCT GGACGAGCTG 420 

CTGGAAGGTG AGAGTTACGT GTGTGCATCC AATGAACCAT TTCGTAAAGT CGATTACACC 480 

AAAAATATTA ATCCAAACTG GTCTGTGAAC ATCAAGGGTG GGACATCCCG AGCGCTGGCT 540 

GCTGCCTCCT CTGTGAAAAG TGAAGTAAAA GAAAGTAAAG ATTTCATCAA ACCCAAGTTA 600 

GTGACTGTGA TTCGAAGTGG AGTGAAGCCT AGAAAAGCCG TGCGGATCCT TCTGAATAAA 660 

AAGACTGCTC ATTCCTTTGA ACAAGTCTTA ACAGATATCA CCGAAGCCAT TAAACNAGCC 720 

TCAGGAGTCG TCAAGAGGCT CTGCACCCTG GATGGAAAGC AGGTGAGAGT TACGTGTGTG 780 

CATCTGCCAG ACTTTTTTGG TGATGACGAT GTTTTTATTG CATGTGGACC AGAAAAATTT 840 

CGTTATGCCC AAGATGACTT TGTCCTGGAT CATAGTGAAT GTCGTGTCCT GAAGTCATCT 900 

TATTCTCGAT CCTCAGCTGT TAAGTATTCT GGATCCAAAA GCCCTGGGCC CTCTCGACGC 960 

AGCCAGATTT CTGCTCATGG CAGATCTTCT TCCAATGTAA ACGGTGGACC TGAGCTTGAC 1020 

CGTTGCATAA GTCCTGAAGG TGTGAATGGA AACAGATGCT CTGAATCATC AACTCTTCTT 1080 

GAGAAATACA AAATTGGAAA GGTCATTGGT GATGGCAATT TTGCAGTAGT CAAAGAGTGT 1140 

ATAGACAGGT CCACTGGAAA GGAGTTTGCC CTAAAGATTA TAGACAAAGC CAAATGTTGT 1200 

GGAAAGGAAC ACCTGATTGA GAATGAAGTG TCAATACTGC GCCGAGTGAA ACATCCCAAT 1260 

ATCATTATGC TGGTCGAGGA GATGGAAACA GCAACTGAGC TCTTTCTGGT GATGGAATTG 1320 

GTCAAAGGTG GAGATCTCTT TGATGCAATT ACTTCGTCGA CCAAGTACAC TGAGAGAGAT 1380 

GGCAGTGCCA TGGTGTACAA CTTAGCCAAT GCCCTCAGGT ATCTCCATGG CCTCAGCATC 1440 

GTGCACAGAG ACATCAAACC AGAGAATCTC TTGGTGTGTG AATATCCTGA TGGAACCAAG 1500 

TCTTTGAAAC TGGGAGACTT TGGGCTTGCG ACTGTGGTAG AAGGCCCTTT ATACACAGTC 1560 

TGTGGCACAC CCACTTATGT GGCTCCARAA ATCATTGCTG AAACTGGCTA TGGCCTGAAG 1620 

GTGGACATTT GGGCAGCTGG TGTGATCACA TACATACTTC TCTGTGGATT CCCACCATTC 1680 

CGAAGTGAGA ACAATCTCCA GGAAGATCTC TTCGACCAGA TCTTGGCTGG GAAGCTGGAG 1740 

TTTCCGGCCC CCTACTGGGA TAACATCACG GACTCTGCCA AGGAATTAAT CAGTCAAATG 1800 

CTTCAGGTAA ATGTTGAAGC TCGGTGTACC GCGGGACAAA TCCTGAGTCA CCCCTGGGTG 1860

TCAGATGATG CCTCCCAGGA GAATAACATG CAAGCTGAGG TGACAGGTAA ACTAAAACAG 1920 

CACTTTAATA ATGCGCTCCC CAAACAGAAC AGCACTACCA CCGGGGTCTC CGTCATCATG 1980 

GTGAGTGGAA GGCGGCAGGT CTGGCCTGAC TGCGGAGCCG GCCTTGAAGT TTTTGAATTA 2040 

GGTAGCCGGG AGCTGCCCTC ACATGGAAGT TGGTGCCTTC CGTAGTCCTA TTTCATATGA 2100 

AGATTGGCTT GGCATGTGGA GGGCACTCAT TCGGCAACTC CCAGGCTTTG GGCACTGTGT 2160 

GGAGGGGCTT GTGTAGGGAC CAGCAGGCCT GGTGTGAGGG GTCCAGGCGT CAAGGAGCTC 2220 

CTGGCTGGGC CCTCTGGGCA GCTGCTTCCA CTCTTGTCTC TGCCTTCTCA TCTAGAGAGA 2280 

CTCCCAAGCC CTGGAGGGGT GTGTTGTGTT AGGAATTAAC TCCCTGCCTA CCCCAAGGCC 2340 

TCAGAAATAG ATTATTAGAG ATGTGAATTA TTCTTTGAGA CTTGGGATAA GAAACAGCCA 2400 

AAGCTAAACA TATTTCAGTT TTAAAAAATC AGTGTTTTAT AAAACACAGT TTGGGGCTTT 2460 

TAAAGGTACA TAATCAAGGA AAAAAATATA TATTCATTTT TCAGGGTTGG TAACATTTTA 2520 

TGAGATGTCA GTGACAACGA TGGCCTTATT TTTTTCAGCC TTTTCTTCTT CCAAAATGTT 2580 

TCTTAAGGCA ACTCTCCTAA ATACATAAAC ACAACAAATT AAAATGAAAA GTGACATGAG 2640 

AGTAAATGAA TCAAAAGGAA AAAACATTGA ACCAGAGGTG AGGGCAGCAC ACCCGCAGCA 2700 

GCTGTCCAGG CCTGAGCCAA TGCAACCCTG GGCGGGAAGG CCAGCTCACC GTGAGCAGGT 2760

AGAAGCCAGC CAGCCACCCA GGCAGGGACC TTGGTTCTCC CCACACACTC CCAGGAGCAG 2820 

GGAACAGGGG TGGAGTGGCC TTTCCCAGAG CTGGAGTTGG CTGCAGCAGC TTTCGAATCA 2880 

GACCTGCCAA GGTGATGGGC GTCTGAGTTT CACATCTGGG CCCCCCGTGA CCCCACTGAG 2940 

TCCTGACAGC TAAGGATGGG CCACCTCCAC AGCTCCGTCA CTCGTACTTG GGACAGGCCT 3000 

CTCATCCTCT GGGAAGGTCC TCCTTGTTTC CTACCCAACT AGAAGGGAAA CAGTGGCATA 3060 

TTCTCATGGT ACATGGTTGT CTGAAAGCCT TACCTAGGAA GACGCAGGGT CTAGATAGAA 3120 

GCTATAAGGA AGCCACACAC ATAACCCACA TCCCCACACC CCCAACATCC CCCACACTCC 3180 
CCACACCCCC CACACCCCCC ACATCCCCAC CATAATTACC CCCACCTCCA AATATCTCAT

Seq ID NO: 264 Protein sequence

1 11 21 31 41 51

I I I I I I

MASTRSIELE HFEERDKRPR PGSRRGAPSS SGGSSSSGPK GNGLIPSPAH SAHCSFYRTR 60
TLQALSSEKK AKKARFYRNG DRYFKGLVFA ISSDRFRSFD ALLIELTRSL SDNVNLPQGV 120
RTIYTIDGSR KVTSLDELLE GESYVCASNE PFRKVDYTKN INPNWSVNIK GGTSRALAAA 180
SSVKSEVKES KDFIKPKLVT VIRSGVKPRK AVRILLNKKT AHSFEQVLTD ITEAIKXASG 240
VVKRLCTLDG KQVRVTCVHL PDFFGDDDVF IACGPEKFRY AQDDFVLDHS ECRVLKSSYS 300
RSSAVKYSGS KSPGPSRRSQ ISAHGRSSSN VNGGPELDRC ISPEGVNGNR CSESSTLLEK 360
YKIGKVIGDG NFAVVKECID RSTGKEFALK IIDKAKCCGK EHLIENEVSI LRRVKHPNII 420
MLVEEMETAT ELFLVMSLVK GGDLFDAITS STKYTERDGS AMVYNLANAL RYLHGLSIVH 480
RDIKPENLLV CEYPDGTKSL KLGDFGLATV VEGPLYTVCG TPTYVAPXII AETGYGLKVD 540
IWAAGVITYI LLCGFPPFRS ENNLQEDLFD QILAGKLEFP APYWDNITDS AKELISQMLQ 600
VNVEARCTAG QILSHPWVSD DASQENNMQA EVTGKLKQHF NNALPKQNST TTGVSVIMVS 660
GRRQVWPDCG AGLEVFELGS RELPSHGSWC LP



Seq ID NO: 265 DNA sequence 

Nucleic Acid Accession #: AB020684.1

Coding sequence: 1..1744 

1 11 21 31 41 51 

I I I I I I 

CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC GCTGAGGTAA TTTTGGCCAG 60
TTGGTACCGG ATTTATACCA AAATAATGGA CTTGATTGGT ATTCAAACCA AGATATGTTG 120
GACGGTTACC AGAGGAGAAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 180
TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240
CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 300
CTTTTTCAAT CATGGAGAGT GTACCCGTGT AATGTGGACA CCACCTCTCC GTGAAAGCTT 360
CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420
ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 480
GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540
CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600
ACTTTGTTTT GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660
TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720
TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780
ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840
ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900
TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960
TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020
TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080
ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140
GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200
CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260
AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320
AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 1380
CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440
TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500
AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560
AGAGTCATGG TGTGTAAGAA GATCGAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620
AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680
ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740
ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800
GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860
AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 1920
CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980
TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040
GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100
TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160
TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220
ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280
CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340
CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATTATC 2400
GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCCCATA TTTTCTGTCA 2460
AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT TATTCTTGGT TTTAAAAAAA 2520
ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 2580
CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACTATTTTGA 2640
ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700
TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATAAAACA 2760
TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2820
CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880
TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2940
TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 3000
ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAACG ATGAAACCTC 3060
ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 3120
AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180
TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCCCACT CCTGGAAGCC 3240
GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 3300
CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360
CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCGCTGGAGG 3420
CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480
CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3540
TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600
CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660
TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720
ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780
TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3840
CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 3900
GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960
TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020
TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080
TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140
AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200
TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260
AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320
GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG



Seq ID NO: 266 Protein sequence 
Protein Accession #: BAA74900.1



1 11 21 31 41 51

I I I I I I

PLVINTLKRF NLYPEVILAS WYRIYTKIMD LIGIQTKICW TVTRGEGLSP IESCEGLGDP 60
ACFYVAVIFI LNGLMMALFF IYGTYLSGSR LGGLVTVLCF FFNHGECTRV MWTPPLRESF 120
SYPFLVLQML LVTHILRATK LYRGSLIALC ISMVFFMLPW QFAQFVLLTQ IASLFAVYVV 180
GYIDICKLRK IIYIHMISLA LCFVLMFGNS MLLTSYYASS LVIIWGILAM KPHFLKINVS 240
ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFFSYKDFD TLLYTCAAEF 300
DFMEKETPLR YTKTLLLPVV LVVFVAIVRK IISDMWGVLA KQQTHVSKHQ PDHGELVYHA 360
LQLLAYTALG ILIMRLKLFL TPHMCVMASL ICSRQLFGWL FCKVHPGAIV FAILAAMSIQ 420
GSANLQTQWN IVGEFSNLPQ EELIEWIKYS TKPDAVFAGA MPTMASVKLS ALRPIVNHPH 480
YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV 540
EDPANAGKTP LCNLLVKDSK PHFTTVFQNS VYKVLEVVKE



Seq ID NO: 267 DNA sequence
Nucleic Acid Accession #: U26744.1
Coding sequence: 59..1600

1 11 21 31 

I I I I 

CTTCAAAGAA TATAGATGGT TTTGAAAAGT TCATGCTGTC 
GATTGAAGAT AGTGGGAAAA GAGGAAATAC CATGGCAGAA 
GATGAGGGCT CAAGATCTGG ATCGCATCCG ACTCTCCACC 
TAGGTTTGTT CAGAAGAAAT GCAATTTGCA CCTGGTGGAC 
ATTGCGGGAA AATGCTCTGA ACAACCTGGA CCCAAACACT 
AGAGGCTGTG CTCTCCACTA TTTTTTACCA GCTCAACAAA 
AATCCATGTG GAGCAGTCCA TCAGCCTCCT CCTTAACTTC 
GGAAGGCCAT GGTAAAATTT CAGTATTTGC TGTCAAAATG 
AGGGAAGATC ATGGACAAAT TAAGATATAT TTTCTCAATG 
GATGGTTTAT GGACGATATG ACCAATTCCT TCGGGAAGTT 
TTTGGAGGGT CCTTCATTTG GTTACACAGA ACAGTCAGCC 
GAAAAAAGTC ACGTTAAATG GTTTCTTGGA CAOGCTTATG 
TCTGGTCTGG TTGCCTCTTC TGCATCGACT AGCAAATGTG 
TGAGTGTTCC TACTGCCACA GTGAGAGTAT GATGGGATTT 
TCACAATTAC CAGCTCTGTC AGGACTGCTT CTGGAGGGGA 
CAACCAGCAC CAAATGAAAG AGTACACGTC ATGGAAATCA 
TGCATTAAGC AAGTCCCTGA GCTGTGCTTC CAGCCGTGAA 
AGATCAGCCT GAGAAGCCAC TCAACTTGGC TCACATCGTT 
TGTAACCAGC ATGAACGACA CCCTGTTCTC CCACTCTGTT 
TATTACCAGG AGCATGCTTG AGAGTTCAAA CCGGCTTGAT 
CAGGTATGCG GCAAGGCTGG CAGCAGAGTC CTCTTCGTCT 
TGCTCCTGAC ATCTCTTTCA CCATCGATGC GAATAAGCAG 
GCTAGAAAAC AAGAACAGAG AAATCTTACA GGAGATCCAG 
ACAAGCTTCT CAGCCCACGC CAGAGAAGGC ACAGCAAAAC 
CCGGCTCCTC AGACAGCGCA AAGATGAGCT GGAACAGAGA 
CCGGAGAGAG CTAATGGTCC AGTTGGAGGG TCTCATGAAG 
GAAGCAGGGA GTAAGTTATG TCCCCTACTG CAGGTCTTAA 
CGACCTGCGG TTTTCTCATT GCTTTTGCTC TAATGTATGT 
AGAAAAAAGT CATACTAATT TGCTTCTTTT TCAATGTAGT 
ATTTAGCATT TTTTATAACT ATCACTACTA TCCACATCAA 
TTAGAAAAGG GAACGAATTG TCATTTATTG GAAACATTTT 
TTCAAACCAG TCTTAGCTTT TCAAGTTGTT GATCAGACCC 
CACAGTCACT AGAGATACCC TGAGGTTCAT GTCATCCCAA 
TAACCTCTAC ACCCACTCAC ACTGTGAGTA TTCAGTTCGG 
GTGAAACCTC TTTTTATAAA AATCAGGCAA TTAAATCCCT 
GCCTTGTTCC CCATGGCTCA CCAAAATGTG CTCAATTTTG 
ATAACTGACT ATTCACGTCC CATCTTTTTG GCTCTTCCCC 
GGTTGACAGT AATCTCTTTT TAAAAAGTAA CTCTCAGCTT 
TTCGGCTCCG GGAGACGAGA GGGTCATTAC ATACTTTTTT 
ATTGTGACTT TATAGCCTAA ACTGGAGCTG TCTGAACCTG 
CAGGGGGAGC AGCAAACTC 

Seq ID NO: 268 Protein sequence 
Protein Accession #: AAC50426.1 



41 
I 

CCTTCATTGA 
AGAAGACAGC 
TACAGAACAG 
ATATGGAATG 
GAACTCAACG 
CGGATGCCAA 
CTGCTTGCAG 
GCTTTAGCCA 
ATTTCTGACT 
CTCAAACTAC 
AGATCCTGTT 
TCAGATCCTC 
GAAAATGTCT 
CGCTACCGAT 
CATGCCGGTG 
CCTGCTAAGA 
CCTTTGCACC 
GATACTTGGC 
CCCTCCTCAG 
GAAGAACACA 
CAGCCACCTC 
CAAAGGCAGC 
AGACTTCGGC 
CCCACCCTGC 
ATGTCTGCTC 
CTACTAAAGG 
CTAACAGTGG 
TCATGCTTCA 
GCTTGAATTG 
AAGAAGAACT 
AGATCCCCAG 
TTCTCTTAAC 
AACCCACAGC 
TTTCATTTTA 
TTTCATCACA 
TGAGAGAAAG 
AAAGCAGAAT 
TTTCCTTAGC 
TTTTTTCTGG 
TGGTCAGGCT 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 



60 
120 
180 
240 
300 
360 
420 
480 
540 



51 
I 

ATTTTAGAAT 
TGTTTGCAGA 
CATGCAAGCT 
TCATAGAAGC 
TGTCCCGCTT 
CCACTCACCA 
CGTTTGATCC 
CATTGTGTGG 
CCAGTGGGGT 
CCACGGAAGT 
TCTCCCAACA 
CCCCGCAGTG 
TCCATCCGGT 
GCCAACAGTG 
GTTCTCATAG 
AGCTGACTAA 
CCATGTTCCC 
CTCCCAGACC 
GAAGTCCTTT 
GGCTAATTGC 
AGCAGAGAAG 
TGATTGCTGA 
TAGAGCATGA 
TGGCAGAACT 
TCCAGGAGAG 
AAGAAGAACT 
AGGGGCCTGC 
GTTTGGAAAG 
AGATATATAA 
ATGACATCTT 
AGGTATAAGT 
AGAGAGATAC 
ACTCAGAAGC 
CTGAAAACCT 
CAATTATTGA 
ACTGTACTCC 
CCTTACTGTT 
ACCAGAGCCT 
AAATAGGGGC 
CAAGAGCCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



11 
I 



21 
1 



31 



51 






WO 03/042661 



PCT/US02/36810 






MIEDSGKRGN 
ALRENALNNL 
PEGHGKISVP 
VLEGPSFGYT 
VECSYCHSES 
NALSKSLSCA 
FITRSMLESS 
ELENKNREIL 
SRRELMVQLE 



TMAERROLFA 
DPNTELNVSR 
AVKMALATLC 
EQSARSCFSQ 
MMGFRYRCQQ 
SSREPLHPMF 
NRLDEEHRLI 
QEIQRLRLEH 
GLMKLLKEEE 



EMRAQDLDRI 
LEAVLSTIFY 
GGKIMDKLRY 
QKKVTLNGFL 
CHNYQIiOQDC 
PDQPEKPLNL 
ARYAARIiAAE 
EQASQPTPEK 
LKQGVSYVPY 



RLSTYRTACK 
QLKKRMPTTH 
IFSMISDSSG 
DTLMSDPPPQ 
FWRGHAGGSH 
AHIVDTWPPR 
SSSSQPPQQR 
AQQNPTLLAE 
CRS 



LRFVQKKCNL 
QIHVEQSISL 
VMVYGRYDQF 
CLVWLPLLHR 
SNQHQMKEYT 
PVTSMNDTLF 
SAPDISFTID 
LRLLRQRKDE 



HLVDIWNVIE 
LLNFLLAAFD 
LREVLKLPTE 
LANVENVFHP 
SWKSPAKKLT 
SHSVPSSGSP 
ANKQQRQLIA 
LEQRMSALQE 



60 
120 
180
240 
300 
360 
420 
480 



Seq ID NO: 269 DNA sequence 

Nucleic Acid Accession #: NM_001276.1

Coding sequence: 127.. 1278 



11



AGTGGAGTGG 
TAGCTGGCAC 
GCCAGAATGG 
TGCTCTGCAT 
GGGAGCTGCT 
GCCAATATAA 
ATGCTCAACA 
TGGAACTTTG 
TTCATCAAGT 
TGGCTCTACC 
GCCGAATTTA 
TCTGCGGGGA 
GATTTCATTA 
CACAGTCCCC 
TATGCTGTGG 
CCCACCTTCG 
TCAGGACCGG 
ATCTGTGACT 
GCCACCAAGG 
CAGTACCTGA 
TTCCAGGGCT 
GCACTCGCTG 
CCCCCTCTGG 
GCCTCAGTCT 
GCCCTGGTGG 
GACTCGGGAT 
TGGCAAGGGA 
TGGCAAGCTC 
TACCCCCTGC 
ACTTCCCCTT 
CGCTTTGCTT 
TCTTCTGGGT 
ATGTT 



GACAGGTATA 
CAGGAGCCGT 
GTGTGAAGGC 
ACAAACTGGT 
TCCCAGATGC 
GCAACGATCA 
CACTCAAGAA 
GGTCTCAAAG 
CAGTACCGCC 
CTGGACGGAG 
TAAAGGAAGC 
AGGTCACCAT 
GCATCATGAC 
TGTTCCGAGG 
GGTACATGTT 
GGAGGAGCTT 
GAATTCCAGG 
TCCTCCGCGG 
GCAACCAGTG 
AGGATAGGCA 
CCTTCTGCGG 
CAACGTAGCC 
CTCCAGCTGG 
CCCTCCCTTG 
GCAGAGAGGT 
TAGTACACAC 
ATTTCTTCAA 
TATCACCAAG 
AAAGCCAGCT 
CCTAATTCCA 
TGGTCTATCT 
TCCTTCCTCT 



21 
I 

TAAAGGAAGT 
GGGCAAGGGA 
GTCTCAAACA 
CTGCTACTAC 
CCTTGACCGC 
CATCGACACC 
CAGGAACCCC 
ATTTTCCAAG 
ATTCCTGCGC 
AGACAAACAG 
CCAGCCAGGG 
TGACAGCAGC 
CTACGATTTT 
TCAGGAGGAT 
GAGGCTGGGG 
CACTCTGGCT 
CCGGTTCACC 
AGCCACAGTC 
GGTAGGATAC 
GCTGGCAGGC 
CCAGGATCTG 
CTCTGTTCTG 
CCGGGAGCCT 
GGGCCTATGC 
AGGGATGGGG 
TTGTTGATGA 
CTCCCTGCCC 
GAGCCAAACA 
TGAAACCTTC 
CAGCTGCTCA 
TTGAGCGCCC 
GAGCCTTGGG 



31 
I 

ACAGGGCCTG 
AGAGGCCACA 
GGCTTTGTGG 
ACCAGCTGGT 
TTCCTCTGTA 
TGGGAGTGGA 
AACCTGAAGA 
ATAGCCTCCA 
ACCCATGGCT 
CATTTTACCA 
AAAAAGCAGC 
TATGACATTG 
CATGGAGCCT 
GCAAGTCCTG 
GCTCCTGCCA 
TCTTCTGAGA 
AAGGAGGCAG 
CATAGAACCC 
GACGACCAGG 
GCCATGGTAT 
CGCTTCCCTC 
CACACAGCAC 
GATCACCTGC 
AGAGGTCCAC 
CTGTGGGGAT 
TTAATGGAAA 
CCTAGCCCTC 
TCCTACAAGA 
ACTTAGGAAC 
ATAAAGTACA 
ACTAGACCCA 
ACCCCTGAGC 



41 
I 

GGGAAGAGGC 
CCCTGCCCTG 
TCCTGGTGCT 
CCCAGTACCG 
CCCACATCAT 
ATGATGTGAC 
CTCTCTTGTC 
ACACCCAGAG 
TTGATGGGCT 
CCCTAATCAA 
TCCTGCTCAG 
CCAAGATATC 
GGCGTGGGAC 
ACAGATTCAG 
GTAAGCTGGT 
CTGGTGTTGG 
GGACCCTTGC 
TCGGCCAGCA 
AAAGCGTCAA 
GGGCCCTGGA 
TCACCAATGC 
GGGGGCCAAG 
CCTGCTGAGT 
AACACACAGA 
AGTGAGGCAT 
TGTTTACAGA 
CTTATCAAAG 
CACAGTGACC 
GTAATCGTGT 
AGAGTTTAAC 
CTGGACTCAC 
TTGCAGAGAT 



51
I 

CCTGTCTAGG 
CTCTGCTGCA 
GCTCCAGTGC 
GGAAGGCGAT 
CTACAGCTTT 
GCTCTACGGC 
TGTCGGAGGA 
TCGCCGGACT 
GGACCTTGCC 
GGAAATGAAG 
CGCAGCACTG 
CCAACACCTG 
CACAGGCCAT 
CAACACTGAC 
GATGGGCATC 
AGCCCCAATC 
CTACTATGAG 
GGTCCCCTAT 
AAGCAAGGTG 
CCTGGATGAC 
CATCAAGGAT 
GATGCCCCGT 
CCCAGGCTGA 
TTTGAGCTCA 
CGCAATGTAA 
TCCCCAAGCC 
GACACCATTT 
ATACTAATTA 
CCCCTATCCT 
AGTGTGTTGG 
CTCCCCCATC 
GAAGGCCGCC 



Seq ID NO: 270 Protein sequence
Protein Accession #: NP_001267.1



1 
I 

MGVKASQTGF 
ISNDHIDTWE 
KSVPPFLRTH 
GKVTIDSSYD 
VGYMLRLGAP 
DFLRGATVHR 
GSFCGQDLRF 



11 

I 

WLVLLQCCS 
WNDVTLYGML 
GFDGLDLAWI* 
IAKISQBLDF 
ASKLVMGIPT 
TLGQQVPYAT 
PLTNAIKDAI* 



21 
I 

AYKLVCYYTS 
NTLKNRNPNL 
YPGRRDKQHF 
ISIMTYDFHG 
FGRSFTLASS 
KGNQWVGYDD 
AAT 



31 
I 

WSQYREGDGS 
KTLLSVGGWN 
TTLIKEMKAE 
AWRGTTGHHS 
ETGVGAPISG 
QESVKSKVQY 



41 
I 

CFPDALDRFL 
FGSQRFSKIA 
FIKEAQPGKK 
PLFRGQEDAS 
PGIPGRFTKE 
LKDRQIiAGAM 



Seq ID NO: 271 DNA sequence 

Nucleic Acid Accession #: NM_006474.

Coding sequence: 181.. 669 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680
1740 
1800 
1860 
1920 



51 
I 

CTHIIYSFAN 
SNTQSRRTFI 
QLLLSAALSA 
PDRFSNTDYA 
AGTLAYYEIC 
VWAliDLDDFQ 



60 
120 
180 
240 
300 
360 



GCTGCCTAGG 
TCCGGCCCCC 
TTCCCCCAGC 
ATGTGGAAGG 
GAAGGAGCCA 
GTTGCCATGC 
AAGTCTGGCT 
GAGGATCTGC 
GCCTCAAACG 
GTTGAGAAAG 
GCCATCGGTT 
TCGCCCTAAA 
TTCTGACTCT 
CGGGCCCATT 



11 
I 

GTCTGGAAAG 
CCACCGTCGC 
TCAGAATCTT 
TGTCAGCTCT 
GCACAGGCCA 
CAGGTGCCGA 
TGACAACTCT 
CAACTTCAGA 
TGGCCACCAG 
ATGGTTTGTC 
TCATTGGTGG 
GAGCTGAAGG
GTGGCCCTGT 
CAGATTCCAC 



21 
I 

CTCGGGCACC 
GCTCCTCCAG 
GCTGCTCGGC 
GCTCTTCGTT 
GCCAGAAGAT 
AGATGATGTG 
GGTGGCAACA 
AAGCACAGTC 
TCACTCCACG 
AACAGTGACC 
AATCATCGTT 
GTTACGCCCT 
CCCTGAGCTC 
GGTGACTTTC 



31 
1 

CTCCCTCTCC 
GCTGGGCCTG 
CCCCAGGAGA 
TTGGGAAGCG 
GACACTGAGA 
GTGACTCCAG 
AGTGTCAACA 
CACGCGCAAG 
GAGAAAGTGG 
CTGGTTGGAA 
GTGGTTATGC 
GCTTGCCAAC 
GTGGGGAGAA 
CGTTTGCCAA 



41 
I 

GGGGCTCCTG 
TGGCCGCGGT 
GCAACAACTC 
OGTCGCTCTG 
CTACAGGTTT 
GAACCAGCGA 
GTGTAACAGG 
AACAAAGTCC 
ATGGAGACAC 
TCATAGTTGG 
GAAAAATGTC 
GTGCTTTAAA 
GATGACCCTG 
ATTAACCGAG 



51 
I 

CTCCCACCCC 
GCTTTTAATT 
AACGGGAACG 
GGTCCTGGCA 
GGAAGGCGGC 
AGACCGCTAT 
CATTCGCATC 
AAGCGCCACA 
ACAGACAACA 
GGTCTTACTA 
GGGAAGGTAC 
AAAAGACCGT 
GGAACAT7TG 
GAAAGACCTT 



60 
120 
180
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 






TCACCAGATT TGGTTCTTAA ACTTT 



Seq ID NO: 272 Protein sequence 
Protein Accession #: NP_006465.1 

1 11 21 31 41 51 

I I I I I 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 60 
KSGLTTLVAT SVKSVTGIRI EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 120 
VEKDGLSTVT LVGIIVGVIiL AIGPIGGIIV WMRKMSGRY SP 



Seq ID NO: 273 DNA sequence 

Nucleic Acid Accession #: CAT cluster



1 11 21 31 41 51 

I I I I I I 

GCGGCCGCCA GCTTGCAAAG CCGAAGTCTG GCCGOGCTCT TCGACTCGCT GCGCCACGTC 60 

CCCGGGGGTG CCGAGCCGGC GGGGGGTGAG GTGGCTGCGC CGGCGGCCGG GCTAGGAGGT 120 

GCGGGCACTG GGGGCGCGGG AGGGGACGTG GCAGGCCCCG CGGGGGCCAC GGCGATCCCA 180 

GGGGCCAGGA AGGTCCCGCT GOGGGCACGC AATCTGCCTC CGTCCTTCTT CACGGAGCCG 240 

TCCCGGGCAG GCGGCGGCGG GTGTGGCCCG TCGGGGCGGG ACGTGAGCTT GGGCGACCTG 300 

GAGAAGGGCG CGGAGGCCGT GGAGTTCTTT GAGCTGCTGG GGCCCGACTA CGGCGCCGGC 360 

ACGGAGGCGG CAGTCTTGCT TGCCGCCGAG CCTCTOGACG TGTTCCCCGC CGGAGCCTCC 420 

GTACTGCGGG GACCCCCGGA GCTGGAGCCC GGCCTCTTTG AGCCGCCGCC GGCAGTGGTG 480 

GGAAACCTAC TGTACCCCGA GCCCTGGAGC GTCCCGGGCT GCTCCCCGAC CAAAAAGAGC 540 

CCCTTGACTG CCCTCCGCGG CGGGTTGACC TTGAACGAGC CCTTGAGCCC CCTGTACCCC 600

GCCGCTGCGA ATTTCTCCCG GCGGGGAGGA CGGGCOGGGC CATTTGGCTT CTTTCGCCCC 660 
CTTCTTTCCA GACTGCGCTT TGC 



Seq ID NO: 274 DNA sequence 

Nucleic Acid Accession #: Eos sequence

1 11 21 31 41 51 

| | I I I I 

CAAAGAGGCC GGGCTCCAGC TCCGGGGGTC CCCGCAGTAC GGAGGCTCCG GCGGGGAACA 60 
CGTCGAGAGG CTCGGCGGCA AGCAAGACTG CCGCCTCCGT GCCGGCGCCG TAGTCGGGCC 120 
CCAGCAGCTC AAAGAACTCC ACGGCCTCCG CGCCCTTCTC CAGGTCGCCC AAGCTCACGT 180 
CCGGCCCCGA CGGGCCACAC CCGCCGCCGC CTGCCCGGGA CGGCTCGGTG AAGAAGGAOG 240 
GAGGCAGATT GCGTGCCCGC AGCGGGACCT TCCTGGCCCC TGGGATCGCC GTGGCCCCCG 300 
CGGGGCCTGC CACGTCCCCT CCCGCGCCCC CAGTGCCCGC ACCTCCTAGC CCGGCCGCCG 360 
GCGCAGCCAC CTCACCCCCC GCCGGCTCGG CACCCCCGGG GACGTGGCGC AGCGAGTCGA 420 
AGAGCGCGGC CAGACTTCGG CTTTGCAAGC TGGCGGCCGC 



Seq ID NO: 275 DNA sequence 

Nucleic Acid Accession #: NM_001118.1

Coding sequence: 74..1651

1 11 21 31 41 51 

AGCCCAGAGA CACATTGGGG CTGACCTGCC GCTGCTGTCA GTGGGAGGCC AGTGGTGCTG 60 

GCCAAGAAGT GTCATGGCTG GTGTCGTGCA CGTTTCCCTG GCTGCTCACT GCGGGGCCTG 120 

TCCGTGGGGC CGGGGCAGAC TCCGCAAAGG ACGCGCAGCC TGCAAGTCCG CGGCCCAGAG 180 

ACACATTGGG GCTGACCTGC CGCTGCTGTC AGTGGGAGGC CAGTGGTGCT GGCCAAGAAG 240 

TGTCATGGCT GGTGTCGTGC AOGTTTCCCT GGCTGCTCTC CTCCTGCTGC CTATGGCCCC 300 

TGCCATGCAT TCTGACTGCA TCTTCAAGAA GGAGCAAGCC ATGTGCCTGG AGAAGATCCA 360 

GAGGGCCAAT GAGCTGATGG GCTTCAATGA TTCCTCTCCA GGCTGTCCTG GGATGTGGGA 420 

CAACATCACG TGTTGGAAGC CCGCCCATGT GGGTGAGATG GTCCTGGTCA GCTGCCCTGA 480 

GCTCTTCCGA ATCTTCAACC CAGACCAAGT CTGGGAGACC GAAAOCATTG GAGAGTCTGA 540 

TTTTGGTGAC AGTAACTCCT TAGATCTCTC AGACATGGGA GTGGTGAGCC GGAACTGCAC 600 

GGAGGATGGC TGGTOGGAAC CCTTCCCTCA TTACTTTGAT GCCTGTGGGT TTGATGAATA 660 

TGAATCTGAG ACTGGGGACC AGGATTATTA CTACCTGTCA GTGAAGGCCC TCTACACGGT 720

TGGCTACAGC ACATCCCTCG TCACCCTCAC CACTGCCATG GTCATCCTTT GTCGCTTCCG 780 

GAAGCTGCAC TGCACACGCA ACTTCATCCA CATGAACCTG TTTGTGTCGT TCATGCTGAG 840 

GGCGATCTCC GTCTTCATCA AAGACTGGAT TCTGTATGCG GAGCAGGACA GCAACCACTG 900 

CTTCATCTCC ACTGTGGAAT GTAAGGCCGT CATGGTTTTC TTCCACTACT GTGTTGTGTC 960 

CAACTACTTC TGGCTGTTCA TCGAGGGCCT GTACCTCTTC ACTCTGCTGG TGGAGACCTT 1020 

CTTCCCTGAA AGGAGATACT TCTACTGGTA CACCATCATT GGCTGGGGGA CCCCAACTGT 1080 

GTGTGTGACA GTGTGGGCTA CGCTGAGACT CTACTTTGAT GACACAGGCT GCTGGGATAT 1140 

GAATGACAGC ACAGCTCTGT GGTGGGTGAT CAAAGGCCCT GTGGTTGGCT CTATCATGGT 1200 

TAACTTTGTG CTTTTTATTG GCATTATCGT CATCCTTGTG CAGAAACTTC AGTCTCCAGA 1260 

CATGGGAGGC AATGAGTCCA GCATCTACTT GOGACTGGCC CGGTCCACCC TGCTGCTCAT 1320 

CCCACTATTC GGAATCCACT ACACAGTATT TGCCTTCTCC CCAGAGAATG TCAGCAAAAG 1380 

GGAAAGACTC GTGTTTGAGC TGGGGCTGGG CTCCTTCCAG GGCTTTGTGG TGGCTGTTCT 1440 

CTACTGTTTT CTGAATGGTG AGGTACAAGC GGAGATCAAG CGAAAATGGC GAAGCTGGAA 1500 

GGTGAACCGT TACTTCGCTG TGGACTTCAA GCACCGACAC CCGTCTCTGG CCAGCAGTGG 1560 

GGTGAATGGG GGCACCCAGC TCTCCATCCT GAGCAAGAGC AGCTCCCAAA TCCGCATGTC 1620 
TGGCCTCCCT GCTGACAATC TGGCCACCTG AGCCATGCTC CCCT 



Seq ID NO: 276 Protein sequence 
Protein Accession #: NP_001109.1

1 11 21 31 41 51

I I I I I I

MAGWHVSLA AHCGACPHGR GRLRKGRAAC KSAAQRHIGA DLPLLSVGGQ WCWPRSVMAG 




WHVSLAALL LLPMAPAMHS DCIFKKEQAM CLEKIQRANE LMGFNDSSPG CPGMWDNITC 120 

WKPAHVGEMV LVSCPELPRI FNPDQVWETB TIGESDFGDS NSLDLSDMGV VSRNCTEDGW 180 

SEPFPHYFDA CGFDEYESET GDQDYYYLSV KALYTVGYST SLVTLTTAMV ILCRFRKLHC 240 

TRNFIHMNLF VSFMLRAISV FIKDWILYAE QDSNHCFIST VECKAVMVFP HYCWSNYFW 300 

LFIEGLYLFT LLVETFFPER RYFYWYTIIG WGTPTVCVTV WATLRLYFDD TGCWDMNDST 360 

ALWWVIKGPV VGSIMVNFVL FIGIIVILVQ KLQSPDMGGK ESSIYLRLAR STLLLIPLFG 420 

IHYTVFAFSP ENVSKRERLV FELGLGSFQG FWAVLYCFL NGEVQAEIKR KWRSWKVNRY 480 
FAVDFKHRHP SLASSGVNGG TQLSILSKSS SQIRMSGLPA DNLAT 

Seq ID NO: 277 DNA sequence 

Nucleic Acid Accession #: NM_004000.1 

Coding sequence: 36.. 1193 

1 11 21 31 41 51 

I I I I I I 

AGAAGAAGCT GGCCAAGGAT ATGGGAGCAA CCACCATGGA CCAGAAGTCT CTCTGGGCAG 60 

GTGTAGTGGT CTTGCTGCTT CTCCAGGGAG GATCTGCCTA CAAACTGGTT TGCTACTTTA 120 

CCAACTGGTC CCAGGACCGG CAGGAACCAG GAAAATTCAC CCCTGAGAAT ATTGACCCCT 180 

TCCTATGCTC TCATCTCATC TATTCATTCG CCAGCATCGA AAACAACAAG GTTATCATCA 240 

AGGACAAGAG TGAAGTGATG CTCTACCAGA CCATCAACAG TCTCAAAACC AAGAATCCCA 300 

AACTGAAAAT TCTCTTGTCC ATTGGAGGGT ACCTGTTTGG TTCCAAAGGG TTCCACCCTA 360 

TGGTGGATTC TTCTACATCA CGCTTGGAAT TCATTAACTC CATAATCCTG TTTCTGAGGA 420 

ACCATAACTT TGATGGACTG GATGTAAGCT GGATCTACCC AGATCAGAAA GAAAACACTC 480 

ATTTCACTGT GCTGATTCAT GAGTTAGCAG AAGCCTTTCA GAAGGACTTC ACAAAATCCA 540 

CCAAGGAAAG GCTTCTCTTG ACTGCGGGCG TATCTGCAGG GAGGCAAATG ATTGATAACA 600 

GCTATCAAGT TGAGAAACTG GCAAAAGATC TGGATTTCAT CAACCTCCTG TCCTTTGACT 660 

TCCATGGGTC TTGGGAAAAG CCCCTTATCA CTGGCCACAA CAGCCCTCTG AGCAAGGGGT 720 

GGCAGGACAG AGGGCCAAGC TCCTACTACA ATGTGGAATA TGCTGTGGGG TACTGGATAC 780 

ATAAGGGAAT GCCATCAGAG AAGGTGGTCA TGGGCATCCC CACATATGGG CACTCCTTCA 840

CACTGGCCTC TGCAGAAACC ACCGTGGGGG CCCCTGCCTC TGGCCCTGGA GCTGCTGGAC 900 

CCATCACAGA GTCTTCAGGC TTCCTGGCCT ATTATGAGAT CTGCCAGTTC CTGAAAGGAG 960 

CCAAGATCAC GCGCCTCCAG GATCAGCAGG TTCCCTACGC AGTCAAGGGG AACCAGTGGG 1020

TGGGCTATGA TGATGTGAAG AGTATGGAGA CCAAGGTTCA GTTCTTAAAG AATTTAAACC 1080

TGGGAGGAGC CATGATCTGG TCTATTGACA TGGATGACTT CACTGGCAAA TCCTGCAACC 1140 

AGGGCCCTTA CCCTCTTGTC CAAGCAGTCA AGAGAAGCCT TGGCTCCTTG TGAAGGATTA 1200 

ACTTACAGAG AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT 1260 

CTCATGTGGG ATTCCCCTTG CCAGGCTGGC CTTTGGATCT CTCTTCCAAG CCTTTCCTGA 1320 

CTTCCTCTTA GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGTT GACTTGTTGC 1380 
CCTGAAGTAC AATAAAAAAA ATTCATTTTG CTCCAGTA 

Seq ID NO: 278 Protein sequence 
Protein Accession #: NP_003991.1 

1 11 21 31 41 51 

I I 1 I I I 

MDQKSLWAGV WLLLLQGGS AYKLVCYFTN WSQDRQEPGK FTPENTDPFL CSHLIYSFAS 60 

IENNKVIIKD KSEVMLYQTI NSLKTKNPKL KILLSIGGYL PGSKGFHPMV DSSTSRLEFI 120

NSIILFLRNH NFDGLDVSWI YPDQKENTHF TVLIHELAEA FQKDFTKSTK ERLLLTAGVS 180 

AGRQMIDNSY QVEKLAKDLD FINLLSFDFH GSWEKPLITG HNSPLSKGWQ DRGPSSYYNV 240 

EYAVGYWIHK GMPSEKWMG IPTYGHSFTL ASAETTVGAP ASGPGAAGPI TESSGFLAYY 300 

EICQFLKGAK ITRLQDQQVP YAVKGNQWVG YDDVKSMETK VQFLKNLNLG GAMIWSIDMD 360 
DFTGKSCNQG PYPLVQAVKR SLGSL 

Seq ID NO: 279 DNA sequence 

Nucleic Acid Accession #: NM_015166.1

Coding sequence: 116.. 1249 

1 11 21 31 41 51 

I 1 I I 1 I 

TGCTGGAAGT CCCTCACCCA GAGACCAGTG CTCCCAAOGG CAGAGCAGCG GGGGAGATAA 60 

AGAACTGGTG ACACGTGGCT GTACATTCAG CACAGCTGTG GTGTCCCCAA GTGCCATGAC 120 

CCAGGAGCCA TTCAGAGAGG AGCTGGCCTA TGACCGGATG CCCACGCTGG AGOGGGGCCG 180 

GCAAGACCCC GCCAGCTATG CCCCAGACGC GAAGCCGAGC GACCTGCAGC TGTCGAAGAG 240 

ACTGCCCCCC TGCTTCAGCC ACAAGACGTG GGTCTTCTCT GTGCTGATGG GGAGCTGCCT 300 

CCTGGTGACC TCGGGGTTTT CGCTGTACCT GGGGAACGTG TTCCCGGCTG AGATGGATTA 360 

CTTGCGCTGT GCTGCAGGCT CTTGCATCCC CTCGGCAATT GTGAGCTTCA CCGTCTCCAG 420 

GAGGAACGCC AATGTGATTC CCAACTTTCA GATATTGTTT GTTTCCACGT TTGCTGTGAC 480 

CACTACGTGT TTAATTTGGT TTGGATGCAA ACTAGTCCTG AACCCATCAG CAATAAACAT 540 

CAACTTCAAC CTCATCCTGC TGCTCCTGCT GGAGCTGCTC ATGGCGGCCA CGGTGATCAT 600 

CGCTGCACGG TCCAGCGAGG AGGACTGCAA GAAAAAGAAG GGCTCCATGT CTGACAGCGC 660 

CAACATTCTG GACGAAGTGC CATTTCCTGC TCGGGTCCTG AAATCTTACT CAGTCGTCGA 720 

GGTAATCGCA GGCATCTCTG CCGTCCTCGG GGGGATCATT GCCCTGAACG TGGATGACTC 780 

AGTTTCAGGC CCACACCTCT CAGTGACGTT CTTTTGGATC CTAGTGGCCT GCTTTCCAAG 840 

TGCCATTGCC AGTCATGTGG CAGCAGAGTG TCCCAGCAAG TGTCTGGTGG AGGTCCTGAT 900 

TGCCATAAGC AGCCTCACGT CTCCGCTGCT GTTCACAGCC TCTGGATATC TGTCATTCAG 960 

CATCATGAGA ATCGTGGAGA TGTTTAAGGA TTACCCGCCA GCCATAAAAC CATCCTACGA 1020 

TGTGCTGCTG CTGCTGCTGC TGCTAGTGCT CCTGCTGCAG GCCGGCCTCA ACACGGGCAC 1080 

CGCCATCCAG TGCGTGCGCT TCAAGGTCAG TGCAAGGCTG CAGGGTGCAT CCTGGGACAC 1140 

CCAGAACGGC CCGCAGGAGC GCCTGGCTGG GGAGGTGGCC AGGAGCCCCC TGAAGGAGTT 1200 

CGACAAGGAG AAAGCCTGGA GAGCCGTCGT GGTGCAAATG GCCCAGTGAC CCCCAGACGC 1260 

GGAAACCGGG TGGCAGCGCC CAGCCTGGCC CCAAGCATGG AAACGCACAA CCCCTAATCG 1320 

CCCTGAGCTA CTGCTTCTAA CACCTCTTTT CCCTTGTGTG AGGGCAAACC AGGCTGCAGG 1380 

TGGGGTTTTC ACTTCCTAGG GTAGTTTAAT TTTAAAATAG GCCAATGTTG GCTAGTCTGT 1440 

GCCTCAGTGA GATCAGTCAG CTCCGAGTGG CTCCCGTGTC GTAACAGCAG GAGCATGGCC 1500 
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10 




GCAACTTCCC 
CTGGCCCCAG 
CTGA.CCAGAC 
GCTTAGGGTC 
GGTGCACCTG 
TTGGCGCCTC 
GCGACACAGG 
TGCCCTGTCT 
CCCACGTCCT 
CX3GGCACTGT 
TCATGCCCAG 
GGGCTGCATC 
GGGAAGCTGA 
GGTCAGAGCC 
TTGTCTTGAT 
GGGGTCTTAG 
GGTGTGAGAA 
TCCTCGTCCC 
GGGCGACTGT 
TTCACAATGG 
CGCCGTGGGG 
GTGGCCTGGG 
CGGCCTGCAG 
GAGTGACTCT 
CAACCATTGA 
CAGGCCCTCT 
CTTTGGCCAC 
CCGGATGCAG 
GCGGGGTCTG 
TCTTCTTGGO 
TTTTTGCTGG 
ATGGGGTTTA 
AAACGCTGAC 



AGGCCGAGGA 
CTCCTCTTCC 
TGTGGGTCCC 
TGTCCCGGGT 
CCCCAGGCTC 
TGTCCTGGCC 
CTGCCGCTGC 
CCCTACTTAG 
GGGTGCTGTC 
CATGCCCAGC 
CTTGGAGTGC 
CAAGGCTTTG 
GCCCCTCCCT 
CAGGAATTCT 
CCTCACTGAG 
GGTACAGCCC 
TAACACTGGC 
GAGGGCGCCT 
GAAACTGGAA 
TTTTGTTTCT 
TGTATTGACT 
GCCATCAAGA 
ACCCTGCCAG 
GGCTGCCAAG 
AGGGGGATGC 
CTGCAGTCCT 
AGCCCCAAAC 
ACCCCACCCT 
CTCCTGGGAA 
GGCAGACCAG 
CTTGCACTTT 
AAAGAACTGT 
TTGCAGACGT 



AGGGCCCCCG 
TGCCTCTCTC 
TGCGTCTCCT 
ACTCAGTCAG 
AGGCTGCCCA 
CGGGAGCCGC 
TCTCCCAGGT 
ACCCAGTCAT 
ATGCCCAGCT 
TTGGAGTGCC 
CACGTGGCCG 
TAAACCGGCT 
OCTGTGTTTG 
CACAGGTTCA 
AAGGCCCCAG 
AGGCGGTCAC 
TTTGGGTAGT 
GGGTCCTCCC 
TGCTGTTACT 
TTCCTTCTGA 
GGTCCCCCAT 
TCAAAGAACC 
GGGGTTTGAG 
ATACCTTCAT 
AGGCAAGACA 
CGGCAGCTCC 
AAGCGCCACC 
GTCTGCAGGC 
TGGGGTGGGA 
AGAGCTCAAG 
CTTATTTTCT 
CTCTTTCAGT 
G 



GCTCGGCCTC 
ATGGCTTGGG 
GCCCACTCTG 
CCCAGTGGGA 
GCGGCTCTTC 
AGGGGCCCCT 
GAAATCCACA 
TCTAGAGGGA 
TGGAGTGCCA 
ACATGGCCGC 
CTGCTGTGAC 
GGACCACGTC 
CTCCCATTAC 
CCCAGCGCCC 
GGCAGOGGTC 
TGCCCACCTG 
GCCATGGCCA 
AGCTGACGGC 
TTGATAATTA 
TCTGAGAAGA 
GGGCTGCTGG 
AGGAGGCCTG 
GACCCTCCCA 
GGTGTTCATG 
CCTTCCCAGC 
GGAAGGACAC 
GTGGGAGAGG 
CACCCCCACC 
GCCACAGGGA 
TTTCAGAGTC 
TATTTTAGAG 
CTACATTTTT 



TTGAGAGCCC 
CTGGAGTGGG 
ACCGGGCTTC 
TCTTACCCAC 
CTGGACAGTG 
CCTCCAGAGC 
CCAGTCCACG 
TCCACCGCCA 
CGTGGCCGCT 
TGCCCACGTC 
AGGCAGTGTT 
TCCCTGGCCC 
TCAAAATGCA 
TCTACCTCCT 
TTCTCCATCT 
CCAGGCTGCA 
GGAGTGGGTT 
AGTAAATCCA 
CTTTCCAGCA 
CATGAACGTT 
AAAGGCCCGG 
GGAGATGCAG 
GGTTTCCCAC 
ACAAGTGGAA 
TGCTCCTAGA 
AGTCAGGGGC 
AGAGGCTGCT 
TCCCTGCAGC 
CGACCCGGGG 
AGAATTAGGC 
CGCTTAAAAA 
GTTTAATACG 



CACCCCTGAA 
CTCTCTGGAC 
CTCCCTCCAC 
TTCCCTGCAA 
AGAGCAGGGC 
CTGGGCGCAA 
CCGGGTCGCC 
CACTGGCCGG 
GCCCACGTCC 
CCGGGCACTG 
CTTGGGGGTG 
CAGTGACOGG 
GGACAGATCA 
AGCAAGTACT 
CCGCTGTTTT 
GGGACAGTTG 
TCCCTGOGTC 
CAGTGAGTTG 
GGTGTTTTCC 
TTCTCTTCAC 
AGATGCATCT 
CTGGATGGGG 
TGCGGAACAG 
TCATTATTTT 
GGGGACAAGC 
CGGGCAAACA 
GTCACTGGTA 
TTTGAGGCTG 
CGGGCTGATG 
ACTTGGAACG 
ATCCGGAAAA 
CTTGAGCAAT 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



Seq ID NO: 280 Protein sequence 
Protein Accession #: NP_055981.1



MTQEPFREEL 
CLLVTSGPSL 
VTTTCLIWFG 
SANILDEVPF 
PSAIASHVAA 
YDVLLLLLLL 
EFDKEKAWRA 



11 
I 

AYDRMPTLER 
YLGNVFPAEM 
CKLVLNPSAI 
PARVLKSYSV 
ECPSKClrVEV 
VLLLQAGLNT 
WVQMAQ 



21 
I 

GRQDPASYAP 
DYLRCAAGSC 
NINFNLILLL 
VEVIAGISAV 
LIAISSLTSP 
GTAIQCVRPK 



31 

I 

DAKPSDLQLS 
IPSAIVSFTV 
LLELLMAATV 
LGGIIALNVD 
LLFTASGYLS 
VSARLQGASW 



41 

I 

KRLPPCFSHK 
SRRNANVIPN 
IIAARSSEED 
DSVSGPHLSV 
FSIMRIVEMP 
DTQNGPQERL 



51 
I 

TWVFSVLMGS 
FQILFVSTFA 
CKKKKGSMSD 
TFFWILVACF 
KDYPPAIKPS 
AGEVARSPLK 



Seq ID NO: 281 DNA sequence 

Nucleic Acid Accession #: NM_004518.

Coding sequence: 43.. 2577 



11 



21 



31 



GCTGAGCCTG 
AACGGCGGCG 
CTGGACCCCG 
GCCCCCAAGC 
CCCCCCAAGC 
CGGCCGCGCG 
CTCGTGCTGT 
TACATCCTGG 
GCCGCAGGCT 



TCCCAGGGCA 
CGGATGATCC 
GCCCACAGCA 

GCACTCTGGT 
ACCTGGAACG 
CTGCCTGCAG 
AAGCACTTTG 
TACGCCACCA 
GTCACCGTGC 
CTCAAGAGTA 
CAGAAGGTCA 
AAGGGGTCCC 
GACAGCCCCA 
GCTTTCCGCA 
GAGGACATTG 
GGCCTCAAAG 
TTCAAGGAGA 
CACCTGGACA 
CGGGGCCCAG 
GAGGACCCCA 



AGCCCGACCC GGGGCGCCTC 
TATACCCCGG CCCGAGCGGG 
GCGCGCCCGA CTCCACCCGG 
GCGGCAGCAT CCTCAGCAAA 
GCAACGCCTT CTACCGCAAG 
GCTGGGCGTT CATCTACCAC 
CTGTGTTTTC CACCATCAAG 
AAATCGTGAC TATCGTGGTG 
GCTGCTGCOG GTACOGTGGC 
TGATTGACAT CATGGTGCTC 
ACGTCTTTGC CACATCTGCG 
GCATGGACCG GCGGGGAGGC 
AGGAGCTGGT CACTGCCTGG 
TGTACTTGGC AGAGAAGGGG 
GGGGCCTGAT CACGCTGACC 
GCAGGCTCCT TGCGGCAACC 
GCATCTTGGG GTCTGGGTTT 
AGAAGAGGCG GAACCCGGCA 
ACCTCTCGCG CACAGACCTG 
CCATGTACAG ACTTATCCCC 
AATCTGGACT CGCTTTCAGG 
GTTTGAAAGA TCGTGTCTTC 
CGCAGGCCCA GACTGTGAGG 
GCAAGGTGCC CAAGAGCTGG 
TCAAGGGTGC CGOGTCACGG 
TGGATGACAA GAGCTGCCCC 
TCAGCATCAG AGCCGTGTGT 
GCCTGCGGCC CTACGACGTG 
TGCTGTCCCG AATTAAGAGC 
CGATCACGGA CAAGGACCGC 
GCATGATGGG ACGGCTCGGG 



41 
I 



CCGCCAGGCA CCATGGTGCA 
GAGAAGAAGC TGAAGGTGGG 
GACGGGGCGC TGCTGATCGC 
CCrCGCGOGG GCGGCGCGGG 
CTGCAGAATT TCCTCTACAA 
GCCTACGTGT TCCTCCTGGT 
GAGTATGAGA AGAGCTCGGA 
TTTGGCGTGG AGTACTTCGT 
TGGAGGGGGC GGCTCAAGTT 
ATCGCCTCCA TTGCGGTGCT 
CTCCGGAGCC TGCGCTTCCT 
ACCTGGAAGC TGCTGGGCTC 
TACATCGGCT TCCTTTGTCT 
GAGAACGACC ACTTTGACAC 
ACCATTGGCT ACGGGGACAA 
TTCACCCTCA TCGGTGTCTC 
GCCCTGAAGG TTGAGGAGCA 
GCAGGCCTGA TCCAGTCGGC 
CACTCCACGT GGCAGTACTA 
CCGCTGAACC AGCTGGAGCT 
AAGGACCCCC CGCCGGAGCC 
TCCAGCCCCC GAGGOGTGGC 
CGGTCACCCA GCGCCGACCA 
AGCTTCGGGG ACCGCAGCCG 
CAGAACTCAG AAGAAGCAAG 
TGCGAGTTTG TGACCGAGGA 
GTCATGCGGT TCCTGGTGTC 
ATGGAOGTCA TCGAGCAGTA 
CTGCAGTCCA GAGTGGACCA 
ACCAAGGGCC CGGCCGAGGC 
AAGGTGGAGA AGCAGGTCTT 




GGGGGCCCTC 
GCGGATCTGG 
TGCCCGGAAA 



GCAGATTCTG 
TGTGGTCTAT 
CATCCTGGCC 
CTAOGCGGAT 
GTACCCCCAG 
CTTCTTCGOG 
GCACAGGCAG 
CTGGAGATTC 
CGAGCGAACG 
GCTGAGGAAC 
GTCTCCAAGC 
TGCCAAGGGG 
GAGCCTCGAG 
GGCACGCCAG 
CCTCCCCGGA 
CCTGACCCCG 
CAAGCGGAAG 
CTCAGCCGGC 
GATCGTGGGG 
GGAGCTGCCC 
GTCCATGGAG 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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AAGAAGCTGG ACTTCCTGGT GAATATCTAC ATGCAGCGGA TGGGCATCCC CCCGACAGAG 1920 
ACCGAGGCCT ACTTTGGGGC CAAAGAGCCG GAGCCGGCGC CGCCGTACCA CAGCCCGGAA 1980 
GACAGCCGGG AGCATGTCGA CAGGCACGGC TGCATTGTCA AGATCGTGCG CTCCAGCAGC 2040 
TCCACGGGCC AGAAGAACTT CTCGGCGCCC COGGCOGOGC CCCCTGTCCA GTGTCCGCCC 2100 
TCCACCTCCT GGCAGCCACA GAGCCACCCG CGCCAGGGCC ACGGCACCTC CCCCGTGGGG 2160 
GACGACGGCT CCCTGGTGCG CATCCCGCCG CCGCCTGCCC ACGAGCGGTC GCTGTCCGCC 2220 
TAOGGCGGGG GCAACCGCGC CAGCATGGAG TTCCTGCGGC AGGAGGACAC CCCGGGCTGC 2280 
AGGCCCCCCG AGGGGACCCT GCGGGACAGC GACACGTCCA TCTCCATCCC GTCCGTGGAC 2340 
CACGAGGAGC TGGAGCGTTC CTTCAGCGGC TTCAGCATCT CCCAGTCCAA GGAGAACCTG 2400 
GATGCTCTCA ACAGCTGCTA CGCGGCCGTG GCGCCTTGTG CCAAAGTCAG GCCCTACATT 2460 
GCGGAGGGAG AGTCAGACAC CGACTCCGAC CTCTGTACCC CGTGCGGGCC CCCGCCACGC 2520 
TCGGCCACCG GCGAGGGTCC CTTTGGTGAC GTGGGCTGGG CCGGGCCCAG GAAGTGAGGC 2580
GGCGCTGGGC CAGTGGACCC GCCCGCGGCC CTCCTCAGCA CGGTGCCTCC GAGGTTTTGA 2640 
GGCGGGAACC CTCTGGGGCC CTTTTCTTAC AGTAACTGAG TGTGGCGGGA AGGGTGGGCC 2700 
CTGGAGGGGC CCATGTGGGC TGAAGGATGG GGGCTCCTGG CAGTGACCTT TTACAAAAGT 2760 
TATTTTCCAA CAGGGCACTC CCAGGCCCTG TOGCCATTGA GGTGCCTCCG CTGGGCTGTC 2820 
TCCTCACCCC TCCCTGTGCT GGAGCCTGTC CCAAAAAGGT GCCAACTGGG AGGCCTCGGA 2880 
AGCCACTGTC CAGGCTCCCA CTGCCTGTCT GCTCTGTTCC CAAAGGCAGC GTGTGTGGCC 2940 
TOGGGCCCTG OGGTGGCATG AAGCATCCCT TCTGGTGTGG GCATCGCTAC GTGTTTTGGG 3000 
GGCAGCGTTT CACGGCGGTG CCCTTGCTGT CTCCCTTGGG CTGGCTCGAG CCTGGGGTCC 3060 
ATGTCCCTTT GCCGTCCCGT CATGGGGCAG GGAATCCATA GCGGGGCCCA CAGGCAGGGG 3120 
TATGAGTGCG TCCCACCCAA CGCAGCACCA GCCCCGGCCA CCGCTCCCCG TGTCCCCAGT 3180 
TCCGTCTCAG CTACCTGGAC TCCAGGACCC TGGAGAAGGG AGACCTGGCA GTGGAGGGAG 3240 
GCTGTGCTGT GTGTCCCCCT GCAGGTGTGA CCCCGCCTGC TCTTTCCTCC CCCGCCAGGT 3300 
GTGGCCCCGC CTGCTCTTTC CTCCCCCACC AGTATGGCCC CACCTGCTCT TTCCTNCCCC 3360 
CCCAAGGTGT GGCCCCACCT GTTCTTTCCT CCCCTGCOGA GGTGTGACCC CACCTGCTCT 3420 
TTCCTCCCTC CCAGTATGGC CCCACCTGCT CTTTCCTCCC CCGAGGTGAG GCCCCGCCTG 3480 
CTCTTTCCTC CCATGGGAGC CGCTGAGGCG TGCGCACCTG GGCACAGGTT GGGGCTCTGC 3540 
AGGATGAGGA AGACAGGCCA ATCCCTTCCC TCCCAGAAGC TGGCCGCCCA GCAGGAGGGA 3600 
CTGAGGCCAG ACTCATGTCC AGCAAGGAAC GTGTGGTGTG TCCCCTGGGA AGTCTCTGGG 3660 
CCCTGGGAAG AGGGAAGGTG CACGTCCTGG GATGGTTGCG GGGCCCTGTT TTGGGAGACA 3720 
AAGGGGTAGA GGGTCTGTCT TGNGCCCCCC CAGACTCTAG CCCGAGCAGT GCAGCCACCT 3780 
ACTGCCCCAC CTCAGAGAAG TGCAGCGGGA AGGAGGCTGG AGGTGGTGCG GCGCTGCCTC 3840 
GGGTGTCTGC GTGAATGAGC GTGGCCAAGG ACCAGTGCCA CCTCATGGCA AAGAGCTCCC 3900 
GCAGTGTTTG TTAGAGTGCA CATCCTACGT GCCCACTGGC ACACACACGT GCTCACATAC 3960 
ANGTCNGKGT ACAGGNGTAC ACATGCANGC TTGCACACAT GCACACAGAC CACATAGCAC 4020 
ACATGTGCAC TGACCACACC TGTATAGACC ATGCACAGTA CACATACGTG CATACACATG 4080 
CCTGCATACA GGCATACACA TGCACGCTTA CATGTACACG TGCACAGATC ACACACATGC 4140 
ACACACGTGT AGCTCACACA CAGTATACAC ATACACAAGT GCACAGACCA CACACAGCAC 4200 
TAACACATGC ACACACAAAG TGCATAGGCC ACACAGCACA TGCACACAGG TGCACAGACC 4260 
ACACAGCACA CACAAGTGCA CAGAGCACAC TGCACACATG CACACACACA CGCGTGCATG 4320 
CACACTCCTC GCACTTCCAG CCTTGGAGCC CTTCTGTCTC TGGTCTTTCT CTTTGACCCT 4380 
GCTGAGTGTA AGCTGCCTGG GGAGGGGCTA CAAGGAGTAA TTGTGGCTTT AGGGGTCGTG 4440 
GTGATGCTGG AATGTCAAGC GCCGTCGTGG GGTATCCGAC TGTCCGGGCT CCTGGTCCGC 4500 
AGTGGCAGAG CGCCAGGCAG AGCCAATCAG GGTCTCGTGC TGCCCTTCCC CCCCACAGCC 4560 
TGGCAGCCAT CCAGAGGAGG GGCTCTACCA GATGCCAAGG TGCCCCGGTG TCTGTATGGG 4620 
TGTCCGGTTG GGTCCTGTGT TTGGTCTGCC CTGGAGGTGG CTGGGCCCTC CTGGGATGGG 4680 
TGGCTCAGCC TCGAATCCCA GGCCCCAGCC CAGGCAGGTG CTGCTGCCTG TTGTGGTTTC 4740 
CTGGCCCAGC TTCTCCTTCT CCCTCTGCAT AAAATCACAG TCCGTGAGTC TTCCAGCTGC 4800 
CACCACGGCT GGGACACGCT GGGGGAGGGC TCCTCCCATG CCTCCTGCAC ACAGCCGTCT 4860 
GAGCAGGGCA GGTGCCAACA CCCCCCACCG GAGACACGCT GCCCCTCAGC GATGCCCCTA 4920 
CCTTTTGGGG GGCCTCGTCT CAAGCCCCCC CTTGGAGGCT GAAATCACCC CAGGCACTGT 4980 
GAGGGCTTCT CCAGGGGGAC ACCCTTTGAG CTGTGGGTCT GATCACCCCA AGTCCCGCAC 5040 
ACGGAGGAGA GGCACAGCCA GGGCGTGTGG TTTAATGTTT GCCCCTTCGG GGCTGGAGGT 5100 
CTCAGTGTTT CTAGATTCCA GACCCTGCTG CCAGAGAGAC CTGCTGCCGG AGAGAAGGGG 5160 
AGGAGGACTC CAGCTGGGCT CGGTCCCCCA CAGTCAGGGA CCCCCATAAA GGACACCCCC 5220 
TTCTCTCTAG AAAGAGCTGG GCTCTCAGCT ATTTCTAGTT GCTTCCCAGA AGCCGAGGAG 5280 
CAGAAGGAGC TGTGAGAGCT TTGCAGAAAC GCCCTTGTCC CCGCCCTCCT GAGCTATGAA 5340 
TGCCGTACAG AGCAGAGGCT GGGGCATTGG CAAGATCACA GGTTGATGCT GCACAGCCCC 5400 
ATTGACACAA ACCCTCAAAG CAGACGTGAG AGGGACGGTT CACAAAGCTT GGACCTGCCG 5460 
TGGAGGGTGC CCGGCAGACG TGGCGTGAGA GGGACGGCTC ACGAGGCTTG GACCTGCTGT 5520 
GGAGGGTGCC CAGCAGACGT GGTGTGAGAG GAACGGCTCA CGAGACTTGG ACCTGGTGGA 5580 
GGGTGCCCAG CAGACGTGGT GTGAGAGGGA CGGCTCACAG GGCTTGGACC GGAGAGAGAT 5640 
GGCTCATGAG ACTTGGACCT GCCGTGGAGG GTGCCCAGCA GACGTGGTAT GAGAGGGATG 5700 
GCTCACGAGG CTTGGACCTG GTGGAGGGTG CCCGGCAGAC GTGTGAGAGG GACGGTTCAC 5760 
AAGGCTTGGA CCTGCCATGG AGGGTGCCCA GCAGACGTGG TGTGAGAGGG ACAGCTCACG 5820 
AGGCTTGGAC CTGCCGTGGA GGGTGCCCAG CAGGGGGCTG AGCTCTGAGG GGTGGGTGCT 5880 
CAGTGCACGG GTGCCCCCAG TGTCCTCTGA TCCTGTCCGG TGCCTCCCCC AACCCCCACA 5940 
CCCATGCAGA ACTCCCAGGT CACATGCACG TATGTCCAGG GCATGGGGGT GGCGTGAAGA 6000 
GGCCTGGTCA GGGCCTTTAG GGGCTGCAGG ACGGAATGGC CACCTGGGGA GCCTGTGTGG 6060 
CTGTGCCGGG CAGCCATCCT GCATTCCCAC CCAGCGCGCA GTCTCCACCT CGGCCCCAGC 6120 
AAAGCGCTAA GCAGCCGGAG AGACAGCCAG GGCGGCTTCC TGAAGGATGT GGGATGGTGG 6180 
ACTCCGGGGT CGAGGGAATA CGCAGGTTCC TGTCCTCCGG GAGACCTAGA GAAGCTGCAC 6240
ACCCAGGAGC TTTCCATGAC CCGGGAGCAT GAGTGAATGG GGGGTTCCAG TTTGCTGAAC 6300 
TTTGCTGTCT TGTAAGGGTG GGGGCTGACG GCCGACCCTG GGAGGAGGTG ACACCGCAGG 6360 
GGGAGGTTGT GGGCAACGGT GGAGGAGGAG AGACGGGAGG GGACCATTTG GGATGGAGGG 6420 
GCCTCTTCAG AGTTTTAAAA GGCGTTTGTG GGGTGGAGTT GAGTGTGCTC TGGGCTTGGA 6480 
CACTTGCCGT GGTGCCCCTG GCTGGCCGAG GAGACTGGCT CTGGCCAGGG CCCCGTCCTG 6540 
AGAGGTCCTC AGCGTCTGAC TCTCGGCCAG GCGCCAGCAA GGAGGGGCCG GTCCCCGGGG 6600 
CTACCAGGCA GGCACGTGCA CATCGCCATC GCCACACGCC AACTCCGCCT GGGTTTTACA 6660 
AAGTCGTTGC CTTAATGCAT GTGGACAGGA ACTCCCTGAG GTCGCCCCAT GCCCCCTGGC 6720 
TGTGCCAGGT ACGGACGCCC TGGACCCTGC GAACAGGTGG GGCGGGCGAG GGGCCCAAGG 6780 
GACGGGCTCC AGAGACACGC GCAGGGCAGG AGGGGTCTCA CGGAGGGGTC TCGCACTGAG 6840 
GCGCCCAGAG CTGGTGGTCC CGCTGGACGC CATCCCTCTG CCCGGGATCC ACACGGCCCA 6900 
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CGTGTGCCCG CCATGCCCGC GCCCCACGCC ATTGCAGTCT TCCATCCTCT GGCCGTGACG 6960 

GTGGCTGCAG CTTCCCCATT TGCGCCGTTG CCTCTGGCTG TCTGCACTTT TGTTCATGCT 7020 

CCAAAGAACA TTTCATAATG CCTTCAGTAC CGACGTACAC TTCTGACCAT TTTGTATGTG 7080

TCCTTGTGCC GTAGTGACCA GGCCTTTTTT TGGTGGATGT GTTACCCCGC ACACTTCAAT 7140 

CTCAACTTTG TGCACCGTCC ATTTTCTAGG GATAGACGCC CAGGGAATGA ACTCTAGTTT 7200 

TCTAACAGAT TAGCTGAGAT ATTAACTTAC TCACACGGAC AGGTTGATGC CAGAGCCGTA 7260 

AGAATGCGCC AGTGCGGGTT TGCGGGGGAC TTCGGGTGTG GGGTCCTGCG GCCGCGATGG 7320 

CCGTGGAAGG TTCTGGGGAT CCCTGCTGCC ACGGGGACGA GTTCGGACGC CAGGTGGACC 7380 
TGTGCACTCA GTAAAACGCA GTGATTCAAA AAAAAAAAAA 

Seq ID NO: 282 Protein sequence
Protein Accession #: NP_004509.1

1 11 21 31 41 51

MVQKSRNGGV YPGPSGEKKL KVGFVGLDPG APDSTRDGAL LIAGSEAPKR GSILSKPRAG 60 

GAGAGKPPKR NAPYRKLQNF LYNVLERPRG WAFIYHAYVF LLVFSCLVLS VPSTIKEYEK 120 

SSEGALYILE IVTIWFGVE YFVRIWAAGC CCRYRGWRGR LKFARKPFCV IDIMVLIASI 180 

AVLAAGSQGN VFATSALRSL RFIiQILRMIR MDRRGGTWKL LGSWYAHSK ELVTAWYIGF 240 

LCLILASFXiV YLAEKGENDH FDTYADALWW GLITLTTIGY GDKYPQTWNG RLLAATFTLI 300

GVSFFALPAG ILGSGFALKV QEQHRQKHFE KRRNPAAGLI QSAWRFYATN LSRTDLHSTW 360 

QYYERTVTVP MYRLIPPLNQ LELLRNLKSK SGLAFRKDPP PEPSPSQKVS LKDRVFSSPR 420 

GVAAKGKGSP QAQTVRRSPS ADQSLEDSPS KVPKSWSFGD RSRARQAFRI KGAASRONSE 480 

EASLPGEDIV DDKSCPCEFV TEDLTPGLKV SIRAVCVMRF LVSKRKFKES LRPYDVMDVI 540 

EQYSAGHLDM LSRIKSLQSR VDQIVGRGPA ITDKDRTKGP AEAELPEDPS MMGRLGKVEK 600

QVLSMEKKLD FLVNIYMQRM GIPPTETEAY FGAKEPEPAP PYHSPEDSRE HVDRHGCIVK 660 

IVRSSSSTGQ KNFSAPPAAP PVQCPPSTSW QPQSHPRQGH GTSPVGDHGS LVRIPPPPAH 720 

ERSLSAYGGG NRASMEFLRQ EDTPGCRPPE GTLRDSDTSI SIPSVDHEEL ERSFSGFSIS 780 

QSKENLDALN SCYAAVAPCA KVRPYIAEGE SDTDSDLCTP CGPPPRSATG EGPFGDVGWA 840 

GPRK



Seq ID NO: 283 DNA sequence
Nucleic Acid Accession #: AF152496.1
Coding sequence: 1..2391

1 11 21 31 41 51

| | I I I I

ATGGAGGCGG GAGGAGAGCG ATTTCTTAGA CAAAGGCAAG TCTTGCTTCT CTTTGTTTTT 60 

CTGGGAGGGT CTCTGGCTGG GTCCGAGTCA AGACGCTATT CTGTGGCTGA GGAAAAAGAG 120 

AAGGGCTTTT TAATAGCCAA CCTAGCAAAG GATCTGGGAC TAAGGGTAGA GGAACTGGCC 180

GCGAGGGGGG CCCAAGTTGT GTCCAAAGGG AACAAACAGC ATTTTCAGCT CAGTCATCAG 240 

ACAGGTGATT TGCTCCTGAA TGAGAAATTG GACCGGGAGG AGCTATGCGG CCCCACAGAA 300 

CCATGCATAC TACATTTTCA GATATTACTG CAAAACCCTT TGCAATTCGT TACAAACGAG 360 

CTCCGTATCA TAGATGTAAA TGACCATTCT CCGGTATTCT TTGAAAATGA AATGCATCTG 420 

AAAATCCTAG AAAGCACTCT GCCAGGAACA GTAATTCCTT TGGGAAATGC TGAGGACTTG 480

GATGTGGGAA GAAACAGCCT CCAAAACTAC ACTATCACTC CGAATTCCCA CTTCCACGTA 540 

CTCACTCGCA GTCGTAGGGA CGGAAGGAAG TACCCGGAAC TAGTACTGGA TAAAGCGCTC 600 

GATCGGGAGG AGCAGCCGGA ACTCAGCTTA ACGCTCACCG CGCTGGACGG CGGCTCTCCC 660 

CCTCGGTCTG GGACAGCCCA GATAAACATC CAGGTCTTAG ATATAAACGA CAATGCACCA 720 

GAATTTGCAC AGCCGCTCTA TGAGGTTGCA GTTCTAGAGA ATACCCCCGT TAACTCTGTC 780

ATTGTCACTG TCTCGGCTTC TGACTTAGAT ACAGGAAGTT TTGGGACAAT ATCATATGCA 840 

TTTTTTCATG CTTCTGAAGA AATTCGCAAA ACTTTTCAGC TAAATCCAAT TACTGGTGAT 900 

ATGCAACTGG TCAAATATTT GAATTTTGAA GCGATTAATA GTTATGAAGT CGACATCGAG 960 

GCCAAGGATG GCGGAGGCCT ATCCGGAAAG TCTACAGTCA TAGTCCAGGT GGTTGATGTC 1020 

AACGACAACC CACCGGAACT GACCTTGTCT TCAGTAAACA GCCCTATTCC TGAGAACTCG 1080

GGAGAGACTG TACTGGCTGT TTTCAGTGTT TCTGATCTAG ACTCTGGAGA CAACGGAAGA 1140 

GTGATGTGTT CCATTGAGAA CAATCTCCCC TTCTTCCTGA AACCATCTGT AGAGAATTTT 1200 

TACACCCTAG TGTCAGAAGG CGCGCTGGAC AGAGAGACCA GATCCGAGTA CAACATTACC 1260 

ATCACTATCA CTGACCTGGG GACACCCAGG CTGAAAACCA AGTACAACAT AACCGTGCTG 1320 

GTCTCCGACG TCAATGACAA CGCCCCCGCC TTCACCCAAA TCTCCTACAC CCTGTTCGTC 1380

CGCGAGAACA ACAGCCCCGC CCTGCACATC GGCAGTGTCA GCGCCACAGA CAGAGACTCA 1440 

GGCACCAACG CCCAGGTAAC CTACTCGCTG CTGCCGCCCC AGGACCCGCA CCTGCCCCTC 1500 

TCTTCCCTGG TCTCCATCAA CGCGGACAAC GGCCACCTGT TTGCCCTCAG GTCGCTGGAC 1560 

TACGAGGCCC TGCAGGCGTT CGAGTTCCGC GTGGGCGCCA CAGACCGTGG CTCCCCGGCT 1620 

TTGAGCAGCG AGGCGCTGGT GCGCGTGCTG GTGCTGGACG CCAACGACAA CTCGCCCTTC 1680

GTGCTGTACC CGCTGCAGAA CGGCTCCGCG CCCTGCACCG AGCTGGTGCC CCGGGCGGCT 1740 

GAGCCGGGCT ACCTGGTGAC CAAGGTGGTG GCGGTGGACG GCGACTCGGG CCAGAACGCC 1800

TGGCTGTCGT ACCAGCTGCT CAAGGCCACG GAGCCCGGGC TGTTCGGCGT GTGGGCGCAC 1860 

AATGGCGAAG TGCGCACCGC CAGGCTGCTG AGCGAGCGCG ACGCGGCCAA GCACAGGCTG 1920 

GTGGTGCTGG TCAAGGACAA TGGCGAGCCT CCGCGCTCGG CXACCGCCAC GCTGCATGTG 1980

CTCCTGGTGG ACGGCTTCTC CCAGCCCTAC CTGCCTCTCC CGGAGGCGGC ACCGGCCCAG 2040 

GCCCAGGCCG ACTTGCTCAC CGTCTACCTG GTGGTGGCAT TGGCCTCGGT GTCTTCGCTC 2100 

TTCCTCTTTT CGGTGCTCCT GTTCGTGGCG GTGCGGCTGT GCAGGAGGAG CAGGGCGGCC 2160 

TCGGTGGGTC GCTGCTCGGT GCCCGAGGGC CCCTTTCCAG GGCAGATGGT GGACGTGAGC 2220 

GGCACCGGGA CCCTGTCCCA GAGCTACCAG TACGAGGTGT GTCTGACTGG AGGCTCCGGG 2280

ACAAATGAGT TCAAGTTCCT GAAGCCAATT ATCCCCAACT TCGTTGCTCA GGGTGCAGAG 2340 
AGGGTTAGCG AGGCAAATCC CAGTTTCAGG AAGAGCTTTG AATTCAGTTA A 



Seq ID NO: 284 Protein sequence
Protein Accession #: AAD43757.1



1 11 21 31 41 51 

I I | 1 I I 

MEAGGERFLR QRQVLLLFVF LGGSIAGSES RRYSVAEEKE KGFLIANLAK DLGLRVEELA 60

ARGAQVVSKG NKQHFQLSHQ TGDLLLNEKL DREELCGPTE PCILHFQILL QNPLQFVTNE 120

LRIIDVNDHS PVFFENEMHL KILESTLPGT VIPLGNAEDL DVGRNSLQNY TITPNSHFHV 180

LTRSRRDGRK YPELVLDKAL DREEQPELSL TLTALDGGSP PRSGTAQINI QVLDINDNAP 240

EFAQPLYEVA VLENTPVNSV IVTVSASDLD TGSFGTISYA FFHASEEIRK TFQLNPITGD 300

MQLVKYLNFE AINSYEVDIE AKDGGGLSGK STVIVQVVDV NDNPPELTLS SVNSPIPENS 360

GETVLAVFSV SDLDSGDNGR VMCSIENNLP FFLKPSVENF YTLVSEGALD RETRSEYNIT 420

ITITDLGTPR LKTKYNITVL VSDVNDNAPA FTQISYTLFV RENNSPALHI GSVSATDRDS 480

GTNAQVTYSL LPPQDPHLPL SSLVSINADN GHLFALRSLD YEALQAFEFR VGATDRGSPA 540

LSSEALVRVL VLDANDNSPF VLYPLQNGSA PCTELVPRAA EPGYLVTKVV AVDGDSGQNA 600

WLSYQLLKAT EPGLFGVWAH NGEVRTARLL SERDAAKHRL VVLVKDNGEP PRSATATLHV 660

LLVDGFSQPY LPLPEAAPAQ AQADLLTVYL VVALASVSSL FLFSVLLFVA VRLCRRSRAA 720

SVGRCSVPEG PFPGQMVDVS GTGTLSQSYQ YEVCLTGGSG TNEFKFLKPI IPNFVAQGAE 780
RVSEANPSFR KSFEFS

Seq ID NO: 285 DNA sequence

Nucleic Acid Accession #: NM_001794.2 

Coding sequence: 15..2765

1 11 21 31 41 51 

| | | | | |

CGCCGGCGGG GAAGATGACC GCGGGCGCCG GCGTGCTCCT TCTGCTGCTC TCGCTCTCCG 60 

GCGCGCTCCG GGCCCATAAT GAGGATCTTA CAACTAGAGA GACCTGCAAG GCTGGGTTCT 120 

CTGAAGATGA TTACACGGCA TTAATCTCCC AAAATATTCT AGAAGGGGAA AAGCTACTTC 180 

AAGTCAAGTT CAGCAGCTGT GTGGGGACCA AGGGGACACA ATATGAGACC AACAGCATGG 240 

ACTTCAAAGT TGGGGCAGAT GGGACAGTCT TCGCCACCCG GGAGCTGCAG GTCCCCTCCG 300 

AGCAGGTGGC GTTCACGGTG ACTGCATGGG ACAGCCAGAC AGCAGAGAAA TGGGACGCCG 360 

TGGTGCGGTT GCTGGTGGCC CAGACCTGGT CCCCGCACTC TGGACACAAG CCGCAGAAAG 420 

GAAAGAAGGT CGTGGCTCTG GACCCCTCTC CGCCTCCGAA GGACACCCTG CTGCCGTGGC 480 

CCCAGCACCA GAACGCCAAC GGGCTGAGGC GGCGCAAACG GGACTGGGTC ATCCCGCCCA 540 

TCAACGTGCC CGAGAACTCG CGCGGGCCCT TCCCGCAGCA GCTCGTGAGG ATCOGGTCCG 600 

ACAAAGACAA TGACATCCCC ATCCGGTACA GCATCACGGG AGTGGGCGCC GACCAGCCCC 660 

CCATGGAGGT CTTCAGCATT GACTCCATGT CCGGCCGGAT GTACGTCACA AGGCCCATGG 720 

ACCGGGAGGA GCACGCCTCT TACCACCTCC GAGCCCACGC TGTGGACATG AATGGCAACA 780 

AGGTGGAGAA CCCCATCGAC CTGTACATCT ACGTCATCGA CATGAATGAC AACCGCCCTG 840 

AGTTCATCAA CCAGGTCTAC AACGGCTCOG TGGACGAGGG CTCCAAGCCA GGCACCTACG 900 

TGATGACCGT CACGGCCAAC GATGCTGACG ACAGCACCAC GGCCAACGGG ATGGTGCGGT 960 

ACCGGATCGT GACCCAGACC CCACAGAGCC CGTCCCAGAA TATGTTCACC ATCAACAGCG 1020 

AGACTGGAGA TATCGTCACA GTGGCGGCTG GCCTGGACCG AGAGAAAGTT CAGCAGTACA 1080 

CAGTCATCGT TCAGGCCACA GATATGGAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 1140 

CCACAGCCAT CATCACGGTG ACAGATGTGA ATGACAACCC GCCAGAATTT ACOGCCAGCA 1200 

CGTTTGCAGG GGAGGTCCCC GAAAACCGCG TGGAGACCGT GGTCGCAAAC CTCACGGTGA 1260 

TGGACCGAGA TCAGCCCCAC TCTCCAAACT GGAATGCCGT TTACCGCATC ATCAGTGGGG 1320 

ATCCATCCGG GCACTTCAGC GTCOGCACAG ACCCCGTAAC CAAOGAGGGC ATGGTCACCG 1380 

TGGTGAAGGC AGTCGACTAC GAGCTCAACA GAGCTTTCAT GCTGACAGTG ATGGTGTCCA 1440 

ACCAGGCGCC CCTGGCCAGC GGAATCCAGA TGTCCTTCCA GTCCACGGCA GGGGTGACCA 1500 

TCTCCATCAT GGACATCAAC GAGGCTCCCT ACTTCCCCTC AAACCACAAG CTGATCCGCC 1560 

TGGAGGAGGG CGTGCCCCCC GGCACCGTGC TGACCACGTT TTCAGCTGTG GACCCTGACC 1620 

GGTTCATGCA GCAGGCTGTG AGATACTCAA AGCTGTCAGA CCCAGCGAGC TGGCTGCACA 1680 

TCAATGCCAC CAACGGCCAG ATCACCACGG CGGCAGTGCT GGACCGTGAG TCCCTCTACA 1740

CCAAAAACAA CGTCTACGAG GCCACCTTCC TGGCAGCTGA CAATGGGATA CCCCCGGCCA 1800 

GCGGCACCGG GACCCTCCAG ATCTATCTCA TTGACATCAA CGACAACGCC CCTGAGCTGC 1860 

TGCCCAAGGA GGCGCAGATC TGCGAGAAGC CCAACCTGAA CGCCATCAAC ATCACGGCGG 1920 

CCGACGCTGA CGTCGACCCC AACATCGGCC CCTACGTCTT CGAGCTGCCC TTTGTCCCGG 1980 

CGGCCGTGCG GAAGAACTGG ACCATCACCC GCCTGAACGG TGACTATGCC CAACTCAGCT 2040 

TGCGCATCCT GTACCTGGAG GCCGGGATGT ATGACGTCCC CATCATCGTC ACAGACTCTG 2100 

GAAACCCTCC CCTGTCCAAC ACGTCCATCA TCAAAGTCAA GGTGTGCCCA TGTGATGACA 2160 

ACGGGGACTG CACCACCATT GGCGCAGTGG CAGCGGCTGG TCTGGGCACC GGTGCCATCG 2220 

TGGCCATCCT CATCTGCATC CTCATCCTGC TGACCATGGT CCTGCTGTTT GTCATGTGGA 2280 

TGAAGCGGCG AGAGAAGGAG CGCCACACGA AGCAGCTGCT CATTGACCCC GAGGACGACG 2340 

TCCGCGACAA CATCCTCAAG TATGACGAGG AAGGCGGTGG CGAGGAGGAC CAGGACTACG 2400 

ACCTCAGCCA GCTGCAGCAG CCGGAAGCCA TGGGGCACGT GCCAAGCAAA GCCCCTGGCG 2460 

TGCGTCGCGT GGATGAGCGG CCGGTGGGCG CTGAGCCCCA GTACCOGATC AGGCCCATGG 2520 

TGCCGCACCC AGGCGACATC GGTGACTTCA TCAATGAGGG ACTCCGCGCT GCTGACAACG 2580 

ACCCCACGGC ACCCCCCTAT GACTCCCTGC TGGTCTTCGA CTACGAGGGG AGCGGCTCCA 2640 

CCGCAGGCTC CGTCAGCTCC CTGAACTCAT CCAGTTCCGG GGACCAAGAC TACGATTACC 2700 

TCAACGACTG GGGGCCCAGA TTCAAGAAGC TGGCGGACAT GTATGGAGGT GGTGAAGAGG 2760 

ATTGACTGAC CTCGCATCTT CGGACCGAAG TGAGAGCCGT GCTCGGACGC CGGAGGAGCA 2820 

GGACTGAGCA GAGGCGGCCG GTCTTCCCGA CTCCCTGCGG CTGTGTCCTT AGTGCTGTTA 2880 

GGAGGCCCCC CAATCCCCAC GTTGAGCTGT CTAGCATGAG CACCCACCCC CACAGCGCCC 2940 

TGCACCCGGC CGCTGCCCAG CACCGCGCTG GCTGGCACTG AAGGACAGCA AGAGGCACTC 3000 

TGTCTTCACT TGAATTTCCT AGAACAGAAG CACTGTTTTT AAAAAAAAAA AAAAAAAAAG 3060 
AAG 

Seq ID NO: 286 Protein sequence 
Protein Accession #: NP_001785.2 

1 11 21 31 41 51 

I 1 I I I I 

MTAGAGVLLL LLSLSGALRA HNEDLTTRET CKAGFSEDDY TALISQNILE GEKLLQVKFS 60 
SCVGTKGTQY ETNSMDFKVG ADGTVFATRE LQVPSEQVAF TVTAWDSQTA EKWDAVVRLL 120
VAQTSSPHSG HKPQKGKKVV ALDPSPPPKD TLLPWPQHQN ANGLRRRKRD WVIPPINVPE 180
NSRGPFPQQL VRIRSDKDND IPIRYSITGV GADQPPMEVF SIDSMSGRMY VTRPMDREEH 240
ASYHLRAHAV DMNGNKVENP IDLYIYVIDM NDNRPEFINQ VYNGSVDEGS KPGTYVMTVT 300
ANDADDSTTA NGMVRYRIVT QTPQSPSQNM FTINSETGDI VTVAAGLDRE KVQQYTVIVQ 360
ATDMEGNLNY GLSNTATAII TVTDVNDNPP EFTASTFAGE VPENRVETVV ANLTVMDRDQ 420
PHSPNWNAVY RIISGDPSGH FSVRTDPVTN EGMVTVVKAV DYELNRAFML TVMVSNQAPL 480
ASGIQMSFQS TAGVTISIMD INEAPYFPSN HKLIRLEEGV PPGTVLTTFS AVDPDRFMQQ 540
AVRYSKLSDP ASWLHINATN GQITTAAVLD RESLYTKNNV YEATFLAADN GIPPASGTGT 600
LQIYLIDIND NAPELLPKEA QICEKPNLNA INITAADADV DPNIGPYVFE LPFVPAAVRK 660
NWTITRLNGD YAQLSLRILY LEAGMYDVPI IVTDSGNPPL SNTSIIKVKV CPCDDNGDCT 720
TIGAVAAAGL GTGAIVAILI CILILLTMVL LFVMWMKRRE KERHTKQLLI DPEDDVRDNI 780
LKYDEEGGGE EDQDYDLSQL QQPEAMGHVP SKAPGVRRVD ERPVGAEPQY PIRPMVPHPG 840
DIGDFINEGL RAADNDPTAP PYDSLLVFDY EGSGSTAGSV SSLNSSSSGD QDYDYLNDWG 900
PRFKKLADMY GGGEED



Seq ID NO: 287 DNA sequence 

Nucleic Acid Accession #: AF152495.1

Coding sequence: 1..2397 



ATGGAGGCGG 
GTTTTGCTGG 
ACGGAGAGTG 
CTTGCTGTGA 
AGGCAGACCG 
ACAGAGCCCT 
GCGGAGCTAC 
CTTTTGAAAA 
GACTTGGATG 
CATCTTAATT 
GCCCTGGATC 
AGTCCACCCA 
GTCCCAGAGT 
TCCCAGGTTG 
TATGCATTTT 
GGAGAACTGC 
ATTCAGGCGA 
GATTTGAATG 
AACTTGCAGG 
GGAAGGATGG 
AACTTTTACA 
ATCACCATCA 
GTGCTGGTCT 
TTCGTCCGCG 
GACTOGGGCA 
CCCCTCGCCT 
CTGGACTACG 
CCGGCGTTGA 
CCCTTCGTGC 
GCGGCCGAGC 
AACGCCTGGC 
GCGCACAATG 
AGGCTGGTGG 
CACGTGCTCC 
GCCCAGGCCC 
TCGCTCTTCC 
GCGGCCTCGG 
GTGAGCGGCA 
TCCGGGACAA 
GCAGAGAGGG 



11 

I 

GAGAGGGGAA 
GCATAGCTCA 
GCTCCTTTGT 
GGGGGGCCAG 
GGGATTTGTT 
GTGTCCTACC 
GGATTAGGGA 
TTCCAGAAAG 
TAGGAACCAA 
TACAAGACAG 
GCGAGGAGCA 
GGTCCGGCAC 
TTGCAAAGCT 
CCATCGTCTC 
CCCAAGCATC 
TTTTAAGACA 
CAGATGGTGG 
ACAATCCTCC 
ACACCCTCAT 
TGTGCTCCAT 
CTCTGGTGAT 
CCGTCACCGA 
CCGACGTCAA 
AGAACAACAG 
CCAACGCCCA 



AGGCCCTGCA 
GCAGCGAGGC 
TGTACCCGCT 
CGGGCTACCT 
TGTCGTACCA 
GCGAGGTGCG 
TGCTGGTCAA 
TGGTGGACGG 
AGGCCGACTT 
TCTTCTCGGT 
TGGGTCGCTG 
CCGGGACCCT 
ATGAGTTCAA 
TTAGCGAGGC 



21 
I 

GGAGCGCGTT 
GGCTAGTTGC 
GGCCAATTTG 
GGTCGTTTCC 
GTTAAATGAG 
TTTCCAGGTG 
CGTAAATGAT 
TATCACTCCT 
CAGTCTCCAA 
TCTCGATGGC 
GCCTGAGATC 
GGCCCTGGTA 
GCTCTATGAG 
TGCCAGGGAT 
TGAAGACATT 
GAAACTGGAT 
GGGCCTATCT 
GGAACTAACT 
TGCTGTATTC 
CCAAGATGAT 
AAGCACGGCC 
CTTCGGGACA 
TGACAACGCC 
CCCCGCCCTG 
GGTCACCTAC 
CATCAACGCG 
GGCGTTCGAG 
GCTGGTGCGC 
GCAGAACGGC 
GGTGACCAAG 
GCTGCTCAAG 
CACCGCCAGG 
GGACAATGGC 
CTTCTCCCAG 
GCTCACCGTC 
GCTCCTGTTC 
CTCGGTGCCC 
GTCCCAGAGC 
GTTCCTGAAG 
AAATCCCAGT 




CTGGACCGGG 
CCCAGGCTGA 
CCCGCCTTCA 
CACATCGGCA 
TCGCTGCTGC 
GACAAOGGCC 
TTCCGCGTGG 
GTGCTGGTGC 
TCCGCGCCCT 
GTGGTGGCGG 
GCCACGGAGC 
CTGCTGAGGG 
GAGCCTCCGC 
CCCTACCTGC 
TACCTGGTGG 
GTGGCGGTGC 
GAGGGCCCCT 
TACCAGTACG 
CCAATTATCC 
TTCAGGAAGA 



41 
I 

GGCAAGTCCT 
ACTATTCAGT 
TGGGGCTGGA 
AAATGCATTT 
GGGAGGAGCT 
ATCCCTTGCA 
TTTTCCTAGA 
TCTTAATAGA 
TCAGTCCCAA 
CACAGCTGGT 
TCACAGCGCT 
TTGTGGACAT 
CGGAGGACAG 
GAACTAATGG 
TTCGATTAAG 
TCCAGACATA 
TGGTATTTGT 
TTATCAATCA 
ATCCTGACTC 
TCTTGAAACC 
AGACCAGATC 
AAACCGAGCA 
CCCAAACCTC 
GCGTCAGCGC 
CGCCCCAGGA 
ACCTGTTCGC 
GCGCCGCAGA 
TGGACGCCAA 
GCACCGAGCT 
TGGACGGCGA 
CCGGGCTGTT 
AGCGCGACGC 
GCTCGGCCAC 
TGCTCCCGGA 
TGGCGTTGGC 
GGCTGTGCAG 
TTCCAGGGCA 
AGGTGTGTCT 
CCAACTTCGT 
GCTTTGAATT 



51 
I 

GATATTCTTT 
GGCCGAGGAA 
GAT AGGAGAA 
GCAGTTCGAT 
GTGCGGCCCC 
GTTTTTTCAG 
CAAAGAAATA 
ACGTGCCCAG 
TTTCCACTTT 
GCTGAACAGA 
AGATGGCGGG 
CAATGACAAC 
CCCCGTTGGA 
AGAAATATCT 
TGCAAAATCG 
CACAGTAAAT 
CCAAGTGATG 
GATCCCAGAA 
CGGAGACAAC 
TTCTGTTGAG 
CGAATACAAC 
CAACATAACC 
CTACACCCTG 
CACAGACAGA 
CCCGCACCTG 
TCTCCAGTCG 
CCGCGGCTCC 
CGACAACTCG 
GGTGCCCCGG 
CTCGGGCCAG 
CGGCGTGTGG 
TGCCAAGGAG 
CGCCACGCTG 
GGCGGCACCG 
CTCGGTGTCT 
GAGGAGCAGG 
GATGGTGGAC 
GACTGGAGGC 
TGCTCAGGGT 
CACTTAA 



Seq ID NO: 288 Protein sequence 
Protein Accession #: AAD43756.1



LAVRGARWS 
AELRIRDVND 
HLNLQDSLDG 
VPEFAKLLYE 
GELLLRQKLD 
NLQDTLIAVF 
ITITVTDFGT 
DSGTNAQVTY 
PALS SEAL VR 
NAWLSYQLLK 
HVLLVDGFSQ 
AASVGRCSVP 
AERVSEANPS 



11 
1 

PKQRQVLIFF 
KGKKMHLQFD 
HSPVFLDKEI 
IILPQLVLNR 
VQIPEDSFVG 
FESIQTYTVN 



PRLKTEHNIT 
SLLPPQDPHL 
VLVLDANDNS 
ATEPGLFGVW 
PYLLLPEAAP 
EGPFPGQMVD 
FRKSFEPT 



21 
1 

VLLGIAQASC 
RQTGDLLLNE 
LLKIPESITP 
ALDREEQPEI 
SQVAIVSARD 
IQATDGGGLS 
GRMVCSIQDD 
VLVSDVNDNA 
PLASLVSINA 
PFVLYPLQNG 
AHNGEVRTAR 
AQAQADLLTV 
VSGTGTLSQS 



31 
I 

QPRHYSVAEE 
KLDREELCGP 
GTTFLIERAQ 
RLTLTALDGG 
LDIGTNGEIS 
GTCWFVQVM 
LPFFLKPSVE 
PAFTQTSYTL 
DNGHLFALQS 
SAPCTELVPR 
LLRERDAAKQ 
YLWALASVS 
YQYEVCLTGG 



41 
I 

TESGSFVANL 
TEPCVLPFQV 
DLDVGTNSLQ 
SPPRSGTALV 
YAFSQASEDI 
DLNDNPPELT 
NFYTLVISTA 
FVRENNSPAL 
LDYEALQAFE 
AAEPGYLVTK 
RLWLVKDNG 
SLFLFSVLLF 
SGTNEFKFLK 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



Seq ID NO: 289 DNA sequence 

Nucleic Acid Accession #: NM_018674.1 

Coding sequence: 390..2009



11 
I 



21 



31 
1 



41 



51 
I 

LKDLGLEIGE 
LLENPLQFFQ 
NYTISPNPHF 
RIEWDINDN 
RKTFRLSAKS 
MSTLINQIPE 
LDRETRSEYN 
HIGSVSATDR 
FRVGAADRGS 
WAVDGDSGQ 
EPPRSATATL 
VAVRLCRRSR 
PIIPNFVAQG 



51 
I 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 




WO 03/042661 



PCT/US02/36810 




CGGAGCACAT GCTGAGCGGA GCGGCTGGGG CTGCGCGGCG TGGCGGAGCA GCGCTCGCTC 60 

CCTCGCTCAC TCGCTOGCTC GCAGGGACAC ACGCAGGGGC TGACAGCTGT GCTGGTGCTG 120 

ATAAGGGAAG CCACAAGGAG ACGATCGAGG AGAGAGACAA GCGGCAGCAG AGGCAGCAGC 180 

GGCAGAGGCA GCACCAGGGC TGCGGAGCTG CTGGGAGTGG GAGTGACTCC CCCACCTCGG 240 

GCCCCCACCC TGTCCCTGTC CTCTTCCCGC TTGCCCTGAG TTTAGAAGAG CAGCCGCTGC 300 

CACCACTGCC ACTCGGGAGG GCACCAGGGC TGCTGGCTAG GGAGGGACAG GGCAGGGAGG 360 

CTCTGGCCAG TCCCAGCAGC CGGGGACAGA TGCCGATCGA GATTGTGTGC AAAATCAAAT 420 

TTGCTGAGGA GGATGCGAAA CCCAAGGAGA AGGAGGCAGG GGATGAGCAG AGCCTCCTCG 480 

GGGCTGTTGC CCCTGGAGCA GCCCCCCGAG ACCTGGCCAC CTTTGCCAGC ACCAGCACCC 540 

TGCATGGACT GGGCCGGGCC TGTGGCCCAG GCCCCCACGG ACTGCGCAGA ACCCTGTGGG 600 

CACTGGCCCT ACTCACCTCG CTGGCTGCCT TCCTGTACCA GGCGGCTGGC CTGGCCCGGG 660 

GCTACCTGAC CCGGCCTCAC CTGGTGGCAA TGGACCCCGC TGCCCCAGCC CCAGTGGCGG 720 

GCTTCCCGGC TGTCACCCTC TGCAATATCA ACCGCTTCCG GCATTCGGCA CTCAGCGATG 780 

CCGACATCTT CCACCTGGCC AATCTGACAG GGCTGCCCCC CAAAGACCGG GATGGGCACC 840 

GTGCGGCTGG CCTGCGCTAC CCAGAGCCTG ACATGGTAGA CATCCTCAAC CGCACTGGCC 900 

ACCAGCTCGC CGACATGCTT AAGAGCTGCA ACTTCAGTGG GCATCACTGC TCCGCCAGCA 960 

ACTTCTCTGT GGTCTATACT CGCTATGGGA AGTGTTACAC CTTCAACGCG GACCCGCGGA 1020 

GCTCGCTGCC CAGCCGGGCA GGGGGCATGG GCAGTGGCCT GGAGATCATG CTGGACATCC 1080 

AGCAGGAGGA GTACCTGCCC ATCTGGAGGG AGACAAATGA GACGTCGTTT GAGGCAGGTA 1140 

TTCGGGTGCA GATCCACAGC CAGGAGGAGC CGCCCTACAT CCACCAGCTG GGGTTCGGGG 1200 

TGTCCCCAGG CTTCCAGACC TTTGTGTCCT GCCAGGAACA GCGGCTGACC TACCTGCCCC 1260 

AGCCCTGGGG CAACTGCCGC GCAGAGAGTG AGCTCAGGGA GCCTGAGCTT CAGGGCTACT 1320 

CGGCCTACAG TGTGTCTGCC TGCCGGCTGC GCTGTGAAAA GGAGGCCGTG CTTCAGCGCT 1380 

GCCACTGCCG GATGGTGCAC ATGCCAGGCA ATGAGACCAT CTGCCCACCA AATATCTACA 1440 

TCGAGTGTGC AGACCACACA CTGGACTCCC TGGGTGGGGG CCCTGAGGGC CCGTGCTTCT 1500 

GCCCCACCCC CTGCAACCTG ACACGCTATG GGAAAGAGAT CTCCATGGTC AGGATCCCCA 1560 

ACAGGGGCTC AGCCCGGTAC CTGGCGAGGA AGTACAACCG CAACGAGACC TACATACGGG 1620 

AGAACTTCCT GGTCCTAGAT GTCTTCTTTG AGGCCCTGAC CTCTGAAGCC ATGGAGCAGC 1680 

GAGCAGCCTA TGGCCTGTCA GCCCTGCTGG GAGACCTCGG GGGACAGATG GGCCTGTTCA 1740 

TTGGGGCCAG CATCCTCACG TTGCTGGAGA TCCTCGACTA CATCTATGAG GTGTCCTGGG 1800 

ATCGACTGAA GCGGGTATGG AGGCGTCCCA AGACCCCCCT GCGGACCTCC ACTGGGGGCA 1860 

TCTCCACTTT GGGGCTTCAG GAGCTGAAGG AACAGAGTCC CTGCCCGAGC CTGGGCCGAG 1920 

CGGAGGGTGG GGGGGTCAGC AGTCTGCTCC CCAATCACCA CCACCCCCAC GGTCCCCCAG 1980 

GAGGTCTCTT TGAAGATTTT GCTTGCTAGG ACGGTGCTGT GACTGAAAGG ACCCAGGAGT 2040 

CTGGGACCCC TCCTGGGATC CCCAGCACAT TCTCCTGCTC CTGGGAGAGG CCTGGGGGCG 2100 

GTGCTCACTG GGAGGGCCAG GACTCAGTTC CTGCTCTCAT CCTCCCCTGC CCTGATGTCA 2160 

GCTGCTTTGC ACAAAGGTCC TTCTTGTCCA CACCCCTTAT CCCCAGGCTG GTGCCCCGGG 2220 

AGGGCTGGAG ACCAGGCCAT GGGCCCTCAC GGAGAGGAAG GGAAGGAAGG AGAGGGAGGG 2280 

GGAGGATAGA GCCCATCCCA GCCGGGGAGG GGGAGCCCTC TGTACATTTG TAAATATTTA 2340 

GGGAAAGCCG GGTGGGGGGA GGGGATACAG ATGTAGAAGG TGGGTAGGGC TACAGGGGTG 2400 

GGTGATTTAG GGACAGCCAG GGTCCCAGCC CCAATGTCAG CAGGATAGGG AGAGCCCCAG 2460 

GACTCAGGAG TGCTGGGCTG GTCCTACTTC CTGCCCCTCT CCAGGCCCAG CTCCCCTCTT 2520 

GGCAGGGGGA GAGGATGGCC CAGCAGGCCT GGCCCAGCTC CCAGTTCCCC CTGCACCAGC 2580 

CCCACCCCTA GAGTCCCTTC TATAGGGAGG GGGCAGGAGA CCTTCCAGAC TTCGGCTGAG 2640 

CTTGGAGGGT GGGAAGGGAG CCTTCTCAGT CCTCTCTCCC TCCAGTCTGA TTTTATAAAG 2700 
TGCTGACGAG 

Seq ID NO: 290 Protein sequence 
Protein Accession #: NP_061144.1



1 11 21 31 41 51

| | | | | |

MPIEIVCKIK FAEEDAKPKE KEAGDEQSLL GAVAPGAAPR DLATFASTST LHGLGRACGP 60
GPHGLRRTLW ALALLTSLAA FLYQAAGLAR GYLTRPHLVA MDPAAPAPVA GFPAVTLCNI 120
NRPRHSALSD ADIFHLANLT GLPPKDRDGH RAAGLRYPEP DMVDILNRTG HQLADMLKSC 180
NFSGHHCSAS NFSVVYTRYG KCYTFNADPR SSLPSRAGGM GSGLEIMLDI QQEEYLPIWR 240
ETNETSPEAG IRVQIHSQEE PPYIHQLGFG VSPGFQTFVS CQEQRLTYLP QPWGNCRAES 300
ELREPELQGY SAYSVSACRL RCEKEAVLQR CHCRMVHMPG NETICPPNIY IECADHTLDS 360
LGGGPBGPCP CPTPCNLTRY GKEISMVRIP NRGSARYLAR KVNRNETYIR ENFLVLDVFF 420
EALTSEAMEQ RAAYGLSALL GDLGGQMGLF IGASILTLLE ILDYIYEVSW DRLKRVWRRP 480
KTPLRTSTGG ISTLGLQELK EQSPCPSLGR AEGGGVSSLL PNHHHPHGPP GGLFEDFAC



Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 62..895



1 11 21 31 41 51

| | | | | |

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA CAACTGACGA 60

TATGATCATC TTAATTTACT TATTTCTCTT GCTATGGGAA GACACTCAAG GATGGGGATT 120

CAAGGATGGA ATTTTTCATA ACTCCATATG GCTTGAACGA GCAGCCGGTG TGTACCACAG 180

AGAAGCACGG TCTGGCAAAT ACAAGCTCAC CTACGCAGAA GCTAAGGCGG TGTGTGAATT 240

TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA TTGGATTTCA 300

TGTCTGTGCT GCTGGATGGA TGGCTAAGGG CAGAGTTGGA TACCCCATTG TGAAGCCAGG 360

GCCCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCGTC TCAATAGGAG 420

TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG GAGTGTGGTG GCGTCTTTAC 480

AGATCCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT GAGTACGAAG ATAACCAAAT 540

CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGCGTATT CACCTGAGTT TTTTAGATTT 600

TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 660

TGATGTCCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACATCATCAG 720

TACAGGAAAT GTCATGACCT TGAAGTTTCT AAGTGATGCT TCAGTGACAG CTGGAGGTTT 780

CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 840

TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA TTTAGCCACT TATAAAAAAA 900

AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT 960




CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA TGTGAAAGCA ATACATAATT 1020 

TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA ATGAAACCTC TCATAATCCC 1080 

ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT TTCTTTCAGT CATTTTTCTA 1140 

TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GGAATCCTGC 1200 

TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC TTTATAAACA TTTTCTGAAA 1260 

TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT AAAATATTCT ATGATATGAA 1320 

TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT TTCAGGTCAT TTTCATAAAT 1380
ATTGTTGCAA TAAATATCCT TGAACACACA AAAAAAAAAA AA 

Seq ID NO: 292 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I 1 I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60 

EGGHLATYRQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 120 

ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 

DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 

Seq ID NO: 293 DNA sequence 

Nucleic Acid Accession #: NM_007115.1 

Coding sequence: 69.. 902 

1 11 21 31 41 51 

1 I I 1 I I 

GAATTCGCAC TGCTCTGAGA ATTTGTGAGC AGCCCCTAAC AGGCTGTTAC TTCACTACAA 60 

CTGACGATAT GATCATCTTA ATTTACTTAT TTCTCTTGCT ATGGGAAGAC ACTCAAGGAT 120

GGGGATTCAA GGATGGAATT TTTCATAACT CCATATGGCT TGAACGAGCA GCCGGTGTGT 180 

ACCACAGAGA AGCACGGTCT GGCAAATACA AGCTCACCTA CGCAGAAGCT AAGGCGGTGT 240

GTGAATTTGA AGGCGGCCAT CTCGCAACTT ACAAGCAGCT AGAGGCAGCC AGAAAAATTG 300 

GATTTCATGT CTGTGCTGCT GGATGGATGG CTAAGGGCAG AGTTGGATAC CCCATTGTGA 360

AGCCAGGGCC CAACTGATGA TTTGGAAAAA CTGGCATTAT TGATTATGGA ATCCGTCTCA 420

ATAGGAGTGA AAGATGGGAT GCCTATTGCT ACAACCCACA CGCAAAGGAG TGTGGTGGCG 480 

TCTTTACAGA TCCAAAGCGA ATTTTTAAAT CTCCAGGCTT CCCAAATGAG TACGAAGATA 540 

ACCAAATCTG CTACTGGCAC ATTAGACTCA AGTATGGTCA GCGTATTCAC CTGAGTTTTT 600 

TAGATTTTGA CCTTGAAGAT GACCCAGGTT GCTTGGCTGA TTATGTTGAA ATATATGACA 660 

GTTACGATGA TGTCCATGGC TTTGTGGGAA GATACTGTGG AGATGAGCTT CCAGATGACA 720 

TCATCAGTAC AGGAAATGTC ATGACCTTGA AGTTTCTAAG TGATGCTTCA GTGACAGCTG 780 

GAGGTTTCCA AATCAAATAT GTTGCAATGG ATCCTGTATC CAAATCCAGT CAAGGAAAAA 840 

ATACAAGTAC TACTTCTACT GGAAATAAAA ACTTTTTAGC TGGAAGATTT AGCCACTTAT 900 

AAAAAAAAAA AAGGATGATC AAAACACACA GTGTTTATGT TGGAATCTTT TGGAACTCCT 960 

TTGATCTCAC TGTTATTATT AACATTTATT TATTATTTTT CTAAATGTGA AAGAAATACA 1020 

TAATTTAGGG AAAATTGGAA AATATAGGAA ACTTTAAACG AGAAAATGAA ACCTCTCATA 1080 

ATCCCACTGC ATAGAAATAA CAAGCGTTAA CATTTTCATA TTTTTTTCTT TCAGTCATTT 1140 

TTGTATTTGT GGTATATGTA TATATGTACC TATATGTATT TGCATTTGAA ATTTTGGAAT 1200 

CCTGCTCTAT GTACAGTTTT GTATTATACT TTTTAAATCT TGAACTTTAT GAACATTTTC 1260 

TGAAATCATT GATTATTCTA CAAAAACATG ATTTTAAACA GCTGTAAAAT ATTCTATGAT 1320 
ATGAATGTTT TATGCATTAT TTAAGCCTGT CTCTATTGTT GGAATTTCAG GTCATTTTCA 1380 
TAAATATTGT TGCAATAAAT ATCCTTCGGA ATTC 

Seq ID NO: 294 Protein sequence 
Protein Accession #: NP_009046.1 

1 11 21 31 41 51 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 120 
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 



Seq ID NO: 295 DNA sequence 

Nucleic Acid Accession #: NM_001218.2

Coding sequence: 116..1180

1 11 21 31 41 51 

I I 1 I I I 

GTACTCGCCA CGGCACCCAG GCTGCGCGCA CGCGGTCCCG GTGTGCAGCT GGAGAGCGAG 60 

CGGCCACCGG GAGCCCCCGG CACAGCCCGC GCCCGCCCCG CAGGAGCCCG CGAAGATGCC 120 

CCGGCGCAGC CTGCACGCGG CGGCCGTGCT CCTGCTGGTG ATCTTAAAGG AACAGCCTTC 180 

CAGCCCGGCC CCAGTGAACG GTTCCAAGTG GACTTATTTT GGTCCTGATG GGGAGAATAG 240 

CTGGTCCAAG AAGTACCCGT CGTGTGGGGG CCTGCTGCAG TCCCCCATAG ACCTGCACAG 300 

TGACATCCTC CAGTATGACG CCAGCCTCAC GCCCCTCGAG TTCCAAGGCT ACAATCTGTC 360 

TGCCAACAAG CAGTTTCTCC TGACCAACAA TGGCCATTCA GTGAAGCTGA ACCTGCCCTC 420 

GGACATGCAC ATCCAGGGCC TCCAGTCTOG CTACAGTGCC ACGCAGCTGC ACCTGCACTG 480 

GGGGAACCCG AATGACCCGC AOGGCTCTGA GCACACCGTC AGCGGACAGC ACTTCGCCGC 540 

CGAGCTGCAC ATTGTCCATT ATAACTCAGA CCTTTATCCT GACGCCAGCA CTGCCAGCAA 600 

CAAGTCAGAA GGCCTCGCTG TCCTGGCTGT TCTCATTGAG ATGGGCTCCT TCAATCCGTC 660 

CTATGACAAG ATCTTCAGTC ACCTTCAACA TGTAAAGTAC AAAGGCCAGG AAGCATTCGT 720 

CCCGGGATTC AACATTGAAG AGCTGCTTCC GGAGAGGACC GCTGAATATT ACCGCTACCG 780 

GGGGTCCCTG ACCACACCCC CTTGCAACCC CACTGTGCTC TGGACAGTTT TCCGAAACCC 840 

CGTGCAAATT TCCCAGGAGC AGCTGCTGGC TTTGGAGACA GCCCTGTACT GCACACACAT 900 

GGACGACCCT TCCCCCAGAG AAATGATCAA CAACTTCCGG CAGGTCCAGA AGTTCGATGA 960

GAGGCTGGTA TACACCTCCT TCTCCCAAGT GCAAGTCTGT ACTGCGGCAG GACTGAGTCT 1020

GGGCATCATC CTCTCACTGG CCCTGGCTGG CATTCTTGGC ATCTGTATTG TGGTGGTGGT 1080

GTCCATTTGG CTTTTCAGAA GGAAGAGTAT CAAAAAAGGT GATAACAAGG GAGTCATTTA 1140

CAAGCCAGCC ACCAAGATGG AGACTGAGGC CCACGCTTGA GGTCCCCGGA GCTCCCGGGC 1200

ACATCCAGGA AGGACCTTGC TTTGGACCCT ACACACTTCG GCTCTCTGGA CACTTGCGAC 1260

ACCTCAAGGT GTTCTCTGTA GCTCAATCTG CAAACATGCC AGGCCTCAGG GATCCTCTGC 1320

TGGGTGCCTC CTTGCCTTGG GACCATGGCC ACCCCAGAGC CATCCGATCG ATGGATGGGA 1380

TGCACTCTCA GACCAAGCAG CAGGAATTCA AAGCTGCTTG CTGTAACTGT GTGAGATTGT 1440

GAAGTGGTCT GAATTCTGGA ATCACAAACC AAGCCATGCT GGTGGGCCAT TAATGGTTGG 1500

AAAACACTTT CATCCGGGGC TTTGCCAGAG CGTGCTTTCA AGTGTCCTGG AAAGTCTGCT 1560

GCTTCTCCAA GCTTTCAGAC AAGAATGTGC ACTCTCTGCT TAGGTTTTGC TTGGGAAACT 1620

CAACTTCTTT CCTCTGGAGA CGGGGCATCT CCCTCTGATT TCCTTCTGCT ATGACAAAAC 1680

CTTTAATCTG CACCTTACAA CTCGGGGACA AATGGGGACA GGAAGGATCA AGTTGTAGAG 1740

AGAAAAAAGA AAACAAGAGA TATACATTGT GATATATTAG GGACACTTTC ACAGTCCTGT 1800

CCTCTGGATC ACAGACACTG CACAGACCTT AGGGAATGGC AGGTTCAAGT TCCACTTCTT 1860

GGTGGGGATG AGAAGGGAGA GAGAGCTAGA GGGACAAAGA GAATGAGAAG ACATGGATGA 1920

TCTGGGAGAG TCTCACTTTG GAATCAGAAT TGGAATCACA TTCTGTTTAT CAAGCCATAA 1980

TGTAAGGACA GAATAATACA ATATTAAGTC CAAATCCAAC CTCCTGTCAG TGGAGCAGTT 2040

ATGTTTTATA CTCTACAGAT TTTACAAATA ATGAGGCTGT TCCTTGAAAA TGTGTTGTTG 2100

CTGTGTCCTG GAGGAGACAT GAGTTCCGAG ATGACCCAAT CTGCCTTTGA ATCTGGAGGA 2160

AATAGGCAGA AACAAAATGA CTGTAGAACT TATTCTCTGT AGGCCAAATT TCATTTCAGC 2220

CACTTCTGCA GGATCCCTAC TGCCAACCTG GAATGGAGAC TTTTATCTAC TTCTCTCTCT 2280

CTGAAGATGT CAAATCGTGG TTTAGATCAA ATATATTTCA AGCTATAAAA GCAGGAGGTT 2340

ATCTGTGCAG GGGGCTGGCA TCATGTATTT AGGGGCAAGT AATAATGGAA TGCTACTAAG 2400

ATACTCCATA TTCTTCCCCG AATCACACAG ACAGTTTCTG ACAGGCGCAA CTCCTCCATT 2460

TTCCTCCCGC AGGTGAGAAC CCTGTGGAGA TGAGTCAGTG CCATGACTGA GAAGGAACCG 2520

ACCCCTAGTT GAGAGCACCT TGCAGTTCCC CGAGAACTTT CTGATTCACA GTCTCATTTT 2580

GACAGCATGA AATGTCCTCT TGAAGCATAG CTTTTTAAAT ATCTTTTTCC TTCTACTCCT 2640

CCCTCTGACT CTAAGAATTC TCTCTTCTGG AATCGCTTGA ACCCAGGAGG CGGAGGTTGC 2700

AGTAAGCCAA GGTGATGCCA CTGCACTCTA GCCTGGGTGA CAGAGCGAGA CTCCATCTCA 2760

AAAAAAAAAA AAAAA





Seq ID NO: 296 Protein sequence 
Protein Accession #: NP_001209.1 



1 11 21 31 41 51

| | | | | |

MPRRSLHAAA VLLLVILKEQ PSSPAPVNGS KWTYFGPDGE NSWSKKYPSC GGLLQSPIDL 60
HSDILQYDAS LTPLEFQGYN LSANKQFLLT NNGHSVKLNL PSDMHIQGLQ SRYSATQLHL 120
HWGNPNDPHG SEHTVSGQHF AAELHIVHYN SDLYPDASTA SNKSEGLAVL AVLIEMGSFN 180
PSYDKIFSKL QHVKYKGQEA FVPGFNIEEL LPERTAEYYR YRGSLTTPPC NPTVLWTVFR 240
NPVQISQEQL LALETALYCT HMDDPSPREM INNFRQVQKF DERLVYTSFS QVQVCTAAGL 300
SLGIILSLAL AGILGICIVV VVSIWLFRRK SIKKGDNKGV IYKPATKMET EAHA



Seq ID NO: 297 DNA sequence 

Nucleic Acid Accession #: NM_006632.1

Coding sequence: 377..1582



1 11 21 31 41 51

| | | | | |

ACGCGTCCGC CCACGCGTCC GCCCACGCGT CCGGTCGGGG CCAGAGCGCA GGTGTACCTG 60

GCGGCCGTGC TGGAGCACCT GACCGCCGAG ATCCTGGAGC TGGCTGGCAA CCCGGCCCGC 120

GACAAGAAGA CCCGCATCAT CCTGCGCCAC CTGTAGCTGG CCATTCGCAA CGGCGAGGAG 180

CTTAACAAGC TGCTGGGCGA AGTCACCATC GCGCAGGGCG GTGTCCTGCC CAACATTCAG 240

GGCGTGCTTC TGCCCCAGAA GACCAAGAGC CACCACAAGG CCAAGGGTGA AAACCATTCA 300

CTAGGAGAGG AGAAACACAA TGGCCACCAA GACAGAGTTG AGTCCCACAG CAAGGGAGAG 360

CAAGAACGCA CAAGATATGC AAGTGGATGA GACACTGATC CCCAGGAAAG GTCCAAGTTT 420

ATGTTCTGCT CGCTATGGAA TAGCCCTCGT CTTACATTTC TGCAATTTCA CAACGATAGC 480

ACAAAATGTC ATCATGAACA TCACCATGGT AGCCATGGTC AACAGCACAA GCCCTCAATC 540

CCAGCTCAAT GATTCCTCTG AGGTGCTGCC TGTTGACTCA TTTGGTGGCC TAAGTAAAGC 600

CCCAAAGAGT CTTCCTGCAA AGTCCTCAAT ACTTGGGGGT CAGTTTGCAA TTTGGGAAAA 660

GTGGGGCCCT CCACAAGAAC GAAGCAGACT CTGCAGCATT GCTTTATCAG GAATGTTACT 720

GGGATGCTTT ACTGCCATCC TCATAGGTGG CTTCATTAGT GAAAGCCTTG GGTGGCCCTT 780

TGTCTTCTAT ATCTTTGGAG GTGTTGGCTG TGTCTGCTGC CTTCTCTGGT TTGTTGTGAT 840

TTATGATGAC CCCTTTTCCT ATCCATGGAT AAGCACCTCA GAAAAAGAAT ACATCATATC 900

CTCCTTGAAA CAACAGGTCG GGTCTTCTAA GCAGCCTCTT CCCATCAAAG CTATGCTCAG 960

ATCTCTACCC ATTTGGTCCA TATGTTTAGG CTGTTTCAGC CATCAATGGT TAGTTAGCAC 1020

AATGGTTGTA TACATACCAA CTTACATCAG CTCTGTGTAC CATGTTAACA TCAGAGACAA 1080

TGGACTTCTA TCTGCCCTTC CTTTTATTGT TGCCTGGGTC ATAGGCATGG TGGGAGGCTA 1140

TCTGGCAGAT TTCCTTCTAA CCAAAAAGTT TAGACTCATC ACTGTGAGGA AAATTGCCAC 1200

AATTTTAGGA AGTCTCCCCT CTTCAGCACT CATTGTGTCT CTGCCTTACC TCAATTCCGG 1260

CTATATCACA GCAACTGCCT TGCTGACGCT CTCTTGCGGA TTAAGCACAT TGTGTCAGTC 1320

AGGGATTTAT ATCAATGTCT TAGATATTGC TCCAAGGTAT TCCAGTTTTC TCATGGGAGC 1380

ATCAAGAGGA TTTTCGAGCA TAGCACCTGT CATTGTACCC ACTGTCAGCG GATTTCTTCT 1440

TAGTCAGGAC CCTGAGTTTG GGTGGAGGAA TGTCTTCTTC TTGCTGTTTG CCGTTAACCT 1500

GTTAGGACTA CTCTTCTACC TCATATTTGG AGAAGCAGAT GTCCAAGAAT GGGCTAAAGA 1560

GAGAAAACTC ACTCGTTTAT GAAGTTATCC CACCTTGGAT GGAAAAGTCA TTAGGCACCG 1620

TATTGCATAA AATAGAAGGC TTCCGTGATG AAAATACCAG TGAAAAGATT TTTTTTTCCT 1680

GTGGCTCTTT TCAATTATGA GATCAGTTCA TTATTTTATT CAGACTTTTT TTTGAGAGAA 1740

ATGTAAGATG AATAAAAATT CAAATAAAAT GATAACTAAG AAAAAAAAAA



Seq ID NO: 298 Protein sequence 
Protein Accession #: NP_006623.1 




1 11 21 31 41 51 

I I I I I I 

MQVDETLIPR KGPSLCSARY GIALVLHFCN FTTIAQNVIM NITMVAKVNS TSPQSQLNDS 60

SEVLPVDSFG GLSKAPKSLP AKSSILGGQF AIWEKWGPPQ ERSRLCSIAL SGMLLGCFTA 120

ILIGGFISET LGWPFVFYIF GGVGCVCCLL WFVVIYDDPF SYPWISTSEK EYIISSLKQQ 180

VGSSKQPLPI KAMLRSLPIW SICLGCFSHQ WLVSTMVVYI PTYISSVYHV NIRDNGLLSA 240

LPFIVAWVIG MVGGYLADFL LTKKFRLITV RKIATILGSL PSSALIVSLP YLNSGYITAT 300

ALLTLSCGLS TLCQSGIYIN VLDIAPRYSS FLMGASRGFS SIAPVIVPTV SGFLLSQDPE 360
FGWRNVFFLL FAVNLLGLLF YLIFGEADVQ EWAKERKLTR L



Seq ID NO: 299 DNA sequence 

Nucleic Acid Accession #: NM_003058.1

Coding sequence: 145..1812

1 11 21 31 41 51 

I I I 1 1 I 

GGCCCTGCCC TGAAGGCTGG TCACTTGCAG AGGTAAACTC CCCTCTTTGA CTTCTGGCCA 60 

GGGTTTGTGC TGAGCTGGCT GCAGCCGCTC TCAGCCTCGC TCCGGGCACG TCGGGCAGCC 120 

TCGGGCCCTC CTGCCTGCAG GATCATGCCC ACCACCGTGG ACGATGTCCT GGAGCATGGA 180

GGGGAGTTTC ACTTTTTCCA GAAGCAAATG TTTTTCCTCT TGGCTCTGCT CTCGGCTACC 240 

TTCGCGCCCA TCTACGTGGG CATCGTCTTC CTGGGCTTCA CCCCTGACCA CCGCTGCCGG 300 

AGCCCCGGAG TGGCCGAGCT GAGTCTGCGC TGCGGCTGGA GTCCTGCAGA GGAACTGAAC 360 

TACACGGTGC CGGGCCCAGG ACCTGCGGGC GAAGCCTCCC CAAGACAGTG TAGGCGCTAC 420 

GAGGTGGACT GGAACCAGAG CACCTTTGAC TGCGTGGACC CCCTGGCCAG CCTGGACACC 480

AACAGGAGCC GCCTGCCACT GGGCCCCTGC CGGGACGGCT GGGTGTACGA GACGCCTGGC 540 

TCGTCCATCG TCACCGAGTT TAACCTGGTA TGTGCCAACT CCTGGATGTT GGACCTATTC 600 

CAGTCATCAG TGAATGTAGG ATTCTTTATT GGCTCTATGA GTATCGGCTA CATAGCAGAC 660 

AGGTTTGGCC GTAAGCTCTG CCTCCTAACT ACAGTCCTCA TAAATGCTGC AGCTGGAGTT 720 

CTCATGGCCA TTTCCCCAAC CTATACGTGG ATGTTAATTT TTCGCTTAAT CCAAGGACTG 780

GTCAGCAAAG CAGGCTGGTT AATAGGCTAC ATCCTGATTA CAGAATTTGT TGGGCGGAGA 840 

TATCGGAGAA CAGTGGGGAT TTTTTACCAA GTTGCCTATA CAGTTGGGCT CCTGGTGCTA 900 

GCTGGGGTGG CTTACGCACT TCCTCACTGG AGGTGGTTGC AGTTCACAGT TGCTCTGCCC 960 

AACTTCTTCT TCTTGCTCTA TTACTGGTGC ATACCTGAGT CTCCCAGGTG GCTGATCTCC 1020 

CAGAATAAGA ATGCTGAAGC CATGAGAATC ATTAAGCACA TCGCAAAGAA AAATGGAAAA 1080

TCTCTACCCG CCTCCCTTCA GCGCCTGAGA CTTGAAGAGG AAACTGGCAA GAAATTGAAC 1140 

CCTTCATTTC TTGACTTGGT CAGAACTCCT CAGATAAGGA AACATACTAT GATATTGATG 1200 

TACAACTGGT TCACGAGCTC TGTGCTCTAC CAGGGCCTCA TCATGCACAT GGGCCTTGCA 1260 

GGTGACAATA TCTACCTGGA TTTCTTCTAC TCTGCCCTGG TTGAATTCCC AGCTGCCTTC 1320 

ATGATCATCC TCACCATCGA CCGCATCGGA CGCCGTTACC CTTGGGCTGC ATCAAATATG 1380

GTTGCAGGGG CAGCCTGTCT GGCCTCAGTT TTTATACCTG GTGATCTACA ATGGCTAAAA 1440 

ATTATTATCT CATGCTTGGG AAGAATGGGG ATCACAATGG CCTATGAGAT AGTCTGCCTG 1500 

GTCAATGCTG AGCTGTACCC CACATTCATT AGGAATCTTG GCGTCCACAT CTGTTCCTCA 1560 

ATGTGTGACA TTGGTGGCAT CATCACGCCA TTCCTGGTCT ACCGGCTCAC TAACATCTGG 1620 

CTTGAGCTCC CGCTGATGGT TTTCGGCGTA CTTGGCTTGG TTGCTGGAGG TCTGGTGCTG 1680

TTGCTTCCAG AAACTAAAGG GAAAGCTTTG CCTGAGACCA TCGAGGAAGC CGAAAATATG 1740 

CAAAGACCAA GAAAAAATAA AGAAAAGATG ATTTACCTCC AAGTTCAGAA ACTAGACATT 1800 

CCATTGAACT AAGAAGAGAG ACCGTTGCTG CTGTCATGAC CTAGCTTTGA TGGCAGGAAG 1860 

ACCAAAAGTA GAAATCCCTG CACTCATCAC AAAGCCCATA CAACTCAACC AAACTTACCC 1920 

CTGAGCCCTA TCAACCTAGG TCTACAGCCA GTGGAGTCTA TTGTACACTG TGGAAAAATA 1980

CCCATGGGAC CAGATCCTGC CAAATTCTTC CAGCTCACTT TATTCTCAGC ATTCCTAGGA 2040 

CATTGGACAT TGGTTTTCTG GAGGGTTTTT TTTCCGATCT TTGTATTTTT TTAAATTTGA 2100 

TTCTTTTCTT TGCAATGCTA GCAACCAGAA TACATAGGGG AACTGTGGGC TAGGCAAANA 2160 

AAATAGAAAA AGTGTGAAAA ACAGTAAAGT TGGGAGAGGA GCATCTATTT TCTTAAAGAA 2220 
ATAAAACACC NAAAACAAAA AAAAAAAAAA AAAAAAA



Seq ID NO: 300 Protein sequence 
Protein Accession #: NP_003049.1

1 11 21 31 41 51

I I I 1 1 I 

MPTTVDDVLE HGGEFHFFQK QMFFLLALLS ATFAPIYVGI VFLGFTPDHR CRSPGVAELS 60 

LRCGWSPAEE LNYTVPGPGP AGEASPRQCR RYEVDWNQST FDCVDPLASL DTNRSRLPLG 120 

PCRDGWVYET PGSSIVTEFN LVCANSWMLD LFQSSVNVGF FIGSMSIGYI ADRFGRKLCL 180 

LTTVLINAAA GVLMAISPTY TWMLIFRLIQ GLVSKAGWLI GYILITEFVG RRYRRTVGIF 240

YQVAYTVGLL VLAGVAYALP HWRWLQFTVA LPNFFFLLYY WCIPESPRWL ISQNKNAEAM 300 

RIIKHIAKKN GKSLPASLQR LRLEEETGKK LNPSFLDLVR TPQIRKHTMI LMYNWFTSSV 360 

LYQGLIMHMG LAGDNIYLDF FYSALVEFPA AFMIILTIDR IGRRYPWAAS NMVAGAACLA 420 

SVFIPGDLQW LKIIISCLGR MGITMAYEIV CLVNAELYPT FIRNLGVHIC SSMCDIGGII 480

TPFLVYRLTN IWLELPLMVF GVLGLVAGGL VLLLPETKGK ALPETIEEAE NMQRPRKNKE 540
KMIYLQVQKL DIPLN






Seq ID NO: 301 DNA sequence

Nucleic Acid Accession #: NM_012206.1

Coding sequence: 52..1131



1 11 21 31 41 51 

I I I I I 1 

OA GTTACCCAGC ATTGTGAGTG ACAGAGCCTG GATCTGAACG CTGATCCCAT AATGCATCCT 60 

CAAGTGGTCA TCTTAAGCCT CATCCTACAT CTGGCAGATT CTGTAGCTGG TTCTGTAAAG 120

GTTGGTGGAG AGGCAGGTCC ATCTGTCACA CTACCCTGCC ACTACAGTGG AGCTGTCACA 180 

TCAATGTGCT GGAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGG CATTGTCTGG 240 

ACCAATGGAA CCCACGTCAC CTATCGGAAG GACACACGCT ATAAGCTATT GGGGGACCTT 300 

TCAAGAAGGG ATGTCTCTTT GACCATAGAA AATACAGCTG TGTCTGACAG TGGCGTATAT 360 


WO 03/042661 



TGTTGCCGTG TTGAGCACCG TGGGTGGTTC AATGACATGA AAATCACCGT ATCATTGGAG
ATTGTGCCAC CCAAGGTCAC GACTACTCCA ATTGTCACAA CTGTTCCAAC CGTCACGACT 
GTTCGAACGA GCACCACTGT TCCAACGACA ACGACTGTTC CAACGACAAC TGTTCCAACA 
ACAATGAGCA TTCCAACGAC AACGACTGTT CCGACGACAA TGACTGTTTC AACGACAACG 
AGCGTTCCAA CGACAACGAG CATTCCAACA ACAACAAGTG TTCCAGTGAC AACAACGGTC 
TCTACCTTTG TTCCTCCAAT GCCTTTGCCC AGGCAGAACC ATGAACCAGT AGCCACTTCA 
CCATCTTCAC CTCAGCCAGC AGAAACCCAC CCTACGACAC TGCAGGGAGC AATAAGGAGA 
GAACCCACCA GCTCACCATT GTACTCTTAC ACAACAGATG GGAATGACAC CGTGACAGAG 
TCTTCAGATG GCCTTTGGAA TAACAATCAA ACTCAACTGT TCCTAGAACA TAGTCTACTG 
ACGGCCAATA CCACTAAAGG AATCTATGCT GGAGTCTGTA TTTCTGTCTT GGTGCTTCTT 
GCTCTTTTGG GTGTCATCAT TGCCAAAAAG TATTTCTTCA AAAAGGAGGT TCAACAACTA 
AGTGTTTCAT TTAGCAGCCT TCAAATTAAA GCTTTGCAAA ATGCAGTTGA AAAGGAAGTC 
CAAGCAGAAG ACAATATCTA CATTGAGAAT AGTCTTTATG CCACGGACTA AGACCCAGTG 
GTGCTCTTTG AGAGTTTACG CCCATGACTG CAGAAGACTG AACAGGTATC AGCACATCAG 
ATGTCTTTTA GACTCCAAGA CAATTTTTCT GTTTCAGTTT CATCTGGCAT TCCAACATGT 
CAGTGATACT GGGTAGAGTA ACTCTCCCAC TCCAAACTGT GTATAGTCAA CCTCATCATT 
AATGTAGTCC TAATTTGTTT TGCTAAAACT GGCTCAATCC TTCTGATCAT TGCAGAGTTT 
TCTCTCAAAC ATGAACACTT TAGAATTGTA TGTTCTCTTT AGACCCCATA AATCCTGTAT 

Seq ID NO: 302 Protein sequence 
Protein Accession #: NP_036338.1 

1 11 21 31 41 51 

I I 1 I I I 

MHPQVVILSL ILHLADSVAG SVKVGGEAGP SVTLPCHYSG AVTSMCWNRG SCSLFTCQNG 60

IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL TIENTAVSDS GVYCCRVEHR GWFNDMKITV 120

SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV PTTTTVPTTT VPTTMSIPTT TTVPTTMTVS 180

TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM PLPRQNHEPV ATSPSSPQPA ETHPTTLQGA 240

IRREPTSSPL YSYTTDGNDT VTESSDGLWN NNQTQLFLEH SLLTANTTKG IYAGVCISVL 300
VLLALLGVII AKKYFFKKEV QQLSVSFSSL QIKALQNAVE KEVQAEDNIY IENSLYATD

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: NM_001044.1

Coding sequence: 129..1991

1 11 21 31 41 51 

1 I I 1 I I 

ACCGCTCCGG AGCGGGAGGG GAGGCTTCGC GGAAOGCTCT CGGCGCCAGG ACTCGCGTGC 60 

AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 120 

GTGTGCCCAT GAGTAAGAGC AAATGCTCCG TGGGACTCAT GTCTTCOGTG GTGGCCCCGG 180 

CTAAGGAGCC CAATGCCGTG GGCCCGAAGG AGGTGGAGCT CATCCTTGTC AAGGAGCAGA 240 

ACGGAGTGCA GCTCACCAGC TCCACCCTCA CCAACCCGCG GCAGAGCCCC GTGGAGGCCC 300 

AGGATCGGGA GACCTGGGGC AAGAAGATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360 

TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATGGT GGCGGTGCCT 420 

TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 600 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 660

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCCGCGGT 840 

GGCAGCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 1020 

TGAGCGTTGA CTTCTACCGG CTCTGCGAGG CGTCTGTTTG GATTGACGCG GCCACCCAGG 1080 

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200 

TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 1380 

TGGGTATCGA CAGCGCCATG GGTGGTATGG AGTCAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500

TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 1620 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACCGGGCAG CGGCCCAGCC 1680 

TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACCCG 1920

AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC 2100 

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCCGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 

GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGAOGC ACCTGCTGAG AATCCCCGTG 2400 

CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 

GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520 

GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA GTTGACACAT TGCACACACA 2580 

GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 2640 

ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 2700 

GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG 2760 

GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2820 

AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC [illegible] 3180

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCGTGCCA 3240

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300

ATCAAAACAA ATTCAAGAAT GCAGTATCOG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 3360

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 3420

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA ATGCTCGGTG GGAGCCGTCA 3600

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3660

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840

AGCAACCCAG [illegible] TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3900

TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC



Seq ID NO: 304 Protein sequence 
Protein Accession #: NP_001035.1

1 11 21 31 41 51

I I I I I I

MSKSKCSVGL MSSVVAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60
ETWGKKIDFL LSVIGFAVDL ANVWRFPYLC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 120
GQFNREGAAG VWKICPILKG VGFTVILISL YVGFFYNVII AWALHYLFSS FTTELPWIHC 180
NNSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYFERG VLHLHQSHGI DDLGPPRWQL 240
TACLVLVIVL LYFSLWKGVK TSGKVVWITA TMPYVVLTAL LLRGVTLPGA IDGIRAYLSV 300
DFYRLCEASV WIDAATQVCF SLGVGFGVLI AFSSYNKFTN NCYRDAIVTT SINSLTSFSS 360
GFVVFSFLGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAVV FFIMLLTLGI 420
DSAMGGMESV ITGLIDEFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 480
GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFVVVVSI 540
VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVPIYAAYKF CSLPGSFREK LAYAIAPEKD 600
RELVDRGEVR QFTLRHWLKV



Seq ID NO: 305 DNA sequence 

Nucleic Acid Accession #: NM_001216.1

Coding sequence: 43..1422

1 11 21 31 41 51

I I I I I I

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240
AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300
GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360
TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420
AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480
COGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540
CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600
CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780
CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960
CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020
TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080
CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140
GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200
GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260
AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320
ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380
GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440
TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT



Seq ID NO: 306 Protein sequence 
Protein Accession #: NP_001207.1

1 11 21 31 41 51

I I I I I I

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60
GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120
DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180
ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240
VEGHRFPAEI HVVHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300
EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360
DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA

Seq ID NO: 307 DNA sequence

Nucleic Acid Accession #: NM_003039.1 

Coding sequence: 76..1581

1 11 21 31 41 51 

I I I I I I 

CTTCTCTCTC CATTCAGTGC ACGCGTTACT TTGGCTAAAA GGAGGTGAGC GGCACTCTGC 60 

CCTTCCAGAG CAAGCATGGA GCAACAGGAT CAGAGCATGA AGGAAGGGAG GCTGACGCTT 120 

GTGCTTGCCC TGGCAACCCT GATAGCTGCC TTTGGGTCAT CCTTCCAGTA TGGGTACAAC 180 

GTGGCTGCTG TCAACTCCCC AGCACTGCTC ATGCAACAAT TTTACAATGA GACTTACTAT 240 

GGTAGGACCG GTGAATTCAT GGAAGACTTC GCCTTGACGT TGCTGTGGTC TGTAACCGTG 300 

TCCATGTTTC CATTTGGAGG GTTTATCGGA TCCCTCCTGG TCGGCCCCTT GGTGAATAAA 360 

TTTGGCAGAA AAGGGGCCTT GCTGTTCAAC AACATATTTT CTATCGTGCC TGCGATCTTA 420 

ATGGGATGCA GCAGAGTCGC CACATCATTT GAGCTTATCA TTATTTCCAG ACTTTTGGTG 480 

GGAATATGTG CAGGTGTATC TTCCAACGTG GTCCCCATGT ACTTAGGGGA GCTGGCCCCT 540 

AAAAACCTGC GGGGGGCTCT CGGGGTGGTG CCCCAGCTCT TCATCACTGT TGGCATCCTT 600 

GTGGCCCAGA TCTTTGGTCT TCGGAATCTC CTTGCAAACG TAGATGGCTG GCCGATCCTG 660 

CTGGGGCTGA CCGGGGTCCC CGCGGCGCTG CAGCTCCTTC TGCTGCCCTT CTTCCCCGAG 720 

AGCCCCAGGT ACCTGCTGAT TCAGAAGAAA GACGAAGCGG CCGCCAAGAA AGCCCTACAG 780 

ACGCTGCGCG GCTGGGACTC TGTGGACAGG GAGGTGGCCG AGATCCGGCA GGAGGATGAG 840 

GCAGAGAAGG CCGCGGGCTT CATCTCCGTG CTGAAGCTGT TCCGGATGCG CTCGCTGCGC 900 

TGGCAGCTGC TGTCCATCAT CGTCCTCATG GGCGGCCAGC AGCTGTCGGG CGTCAACGCT 960 

ATCTACTACT ACGCGGACCA GATCTACCTG AGCGCCGGCG TGCCGGAGGA GCACGTGCAG 1020 

TACGTGACGG CCGGCACCGG GGCCGTGAAC GTGGTCATGA CCTTCTGCGC CGTGTTCGTG 1080

GTGGAGCTCC TGGGTCGGAG GCTGCTGCTG CTGCTGGGCT TCTCCATCTG CCTCATAGCC 1140 

TGCTGCGTGC TCACTGCAGC TCTGGCACTG CAGGACACAG TGTCCTGGAT GCCATACATC 1200 

AGCATCGTCT GTGTCATCTC CTACGTCATA GGACATGCCC TCGGGCCCAG TCCCATACCC 1260 

GCGCTGCTCA TCACTGAGAT CTTCCTGCAG TCCTCTCGGC CATCTGCCTT CATGGTGGGG 1320 

GGCAGTGTGC ACTGGCTCTC CAACTTCACC GTGGGCTTGA TCTTCCCGTT CATCCAGGAG 1380 

GGCCTCGGCC CGTACAGCTT CATTGTCTTC GCCGTGATCT GCCTCCTCAC CACCATCTAC 1440 

ATCTTCTTGA TTGTCCCGGA GACCAAGGCC AAGACGTTCA TAGAGATCAA CCAGATTTTC 1500 

ACCAAGATGA ATAAGGTGTC TGAAGTGTAC CCGGAAAAGG AGGAACTGAA AGAGCTTCCA 1560 

CCTGTCACTT CGGAACAGTG ACTCTGGAGA GGAAGCCAGT GGAGCTGGTC TGCCAGGGGC 1620 

TTCCCACTTT GGCTTATTTT TCTGACTTCT AGCTGTCTGT GAATATCCAG AAATAAAACA 1680 

ACTCTGATGT GGAATGCAGT CCTCATCTCC AGCCTCCCCA CCCCAGTGGG AACTGTGCAA 1740 

AGGGCTGCCT TGCTGTTCTT GAAGCTGGGC TGTCTCTCTC CATGTTGGCC TGTCACCAGA 1800 

CCCGAGTCAA TTAAACAGCT GGTCCTCCAC TTTGCTGGTT CAGCCTTCGT GTGGCTCCTG 1860 

GTAACGTGGC TCCACCTTGA TGGGTCAACC TTTGTGTGGC TCCTGGTAAC ATAACAACAA 1920 

CAGTTACTAT AGTGGTGAGA TGGAAGGAAT CAAATTTTGC CAGAGAAACT AACTCGGTGG 1980 

CCCCAACAGG TCTTCCGGGG CCATGGGCAT TTGTTTAGAG CCAAATTCAT CCTCTTACCA 2040 

GATCCTTTTC CAGAAATACC TGTCTAGGAA GGTGTGATGT CAGAAACAAT GACATCCAGA 2100 

AAGCTGAGGA ACAGGTTCCT GTGGAGACAC TGAGTCAGAA TTCTTCATCC AAATTATTTT 2160 
GTTAGTGGAA AATGGAATTG CTTCTGTGTA GTCAATAAAA TGAACCTGAT CACTTTTC 

Seq ID NO: 308 Protein sequence 
Protein Accession #: NP_003030.1

1 11 21 31 41 51

I I I I I I

MEQQDQSMKE GRLTLVLALA TLIAAFGSSF QYGYNVAAVN SPALLMQQFY NETYYGRTGE 60
FMEDFPLTLL WSVTVSMFPF GGFIGSLLVG PLVNKFGRKG ALLFNNIFSI VPAILMGCSR 120
VATSFELIII SRLLVGICAG VSSNVVPMYL GELAPKNLRG ALGVVPQLFI TVGILVAQIF 180
GLRNLLANVD GWPILLGLTG VPAALQLLLL PFFPESPRYL LIQKKDEAAA KKALQTLRGW 240
DSVDREVAEI RQEDEAEKAA GFISVLKLFR MRSLRWQLLS IIVLMGGQQL SGVNAIYYYA 300
DQIYLSAGVP EEHVQYVTAG TGAVNVVMTF CAVFVVELLG RRLLLLLGFS ICLIACCVLT 360
AALALQDTVS WMPYISIVCV ISYVIGHALG PSPIPALLIT EIFLQSSRPS AFMVGGSVHW 420
LSNFTVGLIF PFIQEGLGPY SFIVFAVICL LTTIYIFLIV PETKAKTFIE INQIFTKMNK 480
VSEVYPEKEE LKELPPVTSE Q



Seq ID NO: 309 DNA sequence

Nucleic Acid Accession #: NM_001252.1 

Coding sequence: 138..719

1 11 21 31 41 51

I I I I I I

GGCTGGTCCC CTGACAGGTT GAAGCAAGTA GACGCCCAGG AGCCCCGGGA GGGGGCTGCA 60
GTTTCCTTCC TTCCTTCTCG GCAGCGCTCC GCGCCCCCAT CGCCCCTCCT GCGCTAGCGG 120
AGGTGATCGC CGCGGCGATG CCGGAGGAGG GTTCGGGCTG CTCGGTGCGG CGCAGGCCCT 180
ATGGGTGCGT CCTGCGGGCT GCTTTGGTCC CATTGGTCGC GGGCTTGGTG ATCTGCCTCG 240
TGGTGTGCAT CCAGCGCTTC GCACAGGCTC AGCAGCAGCT GCGGCTCGAG TCACTTGGGT 300
GGGACGTAGC TGAGCTGCAG CTGAATCACA CAGGACCTCA GCAGGACCCC AGGCTATACT 360
GGCAGGGGGG CCCAGCACTG GGCCGCTCCT TCCTGCATGG ACCAGAGCTG GACAAGGGGC 420
AGCTACGTAT CCATCGTGAT GGCATCTACA TGGTACACAT CCAGGTGACG CTGGCCATCT 480
GCTCCTCCAC GACGGCCTCC AGGCACCACC CCACCACCCT GGCCGTGGGA ATCTGCTCTC 540
CCGCCTCCCG TAGCATCAGC CTGCTGCGTC TCAGCTTCCA CCAAGGTTGT ACCATTGCCT 600
CCCAGCGCCT GACGCCCCTG GCCCGAGGGG ACACACTCTG CACCAACCTC ACTGGGACAC 660
TTTTGCCTTC CCGAAACACT GATGAGACCT TCTTTGGAGT GCAGTGGGTG CGCCCCTGAC 720
CACTGCTGCT GATTAGGGTT TTTTAAATTT TATTTTATTT TATTTAAGTT CAAGAGAAAA 780
AGTGTACACA CAGGGGCCAC CCGGGGTTGG GGTGGGAGTG TGGTGGGGGG TAGTGGTGGC 840
AGGACAAGAG AAGGCATTGA GCTTTTTCTT TCATTTTCCT ATTAAAAA



Seq ID NO: 310 Protein sequence 
Protein Accession #: NP_001243.1 

1 11 21 31 41 51

I I I I i I 

MPEEGSGCSV RRRPYGCVLR AALVPLVAGL VICLVVCIQR FAQAQQQLPL ESLGWDVAEL 60

QLNHTGPQQD PRLYWQGGPA LGRSFLHGPE LDKGQLRIHR DGIYMVHIQV TLAICSSTTA 120 

SRHHPTTLAV GICSPASRSI SLLRLSFHQG CTIASQRLTP LARGDTLCTN LTGTLLPSRN 180 
TDETFFGVQW VRP

Seq ID NO: 311 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3978 

1 11 21 31 41 51 

I I I I I I 

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GOGGAGATCC 60 

TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300 

GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360 

ACACGCGTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540 

GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGOGAGGTG 660 

CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900

ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020 

TTTGTCCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080 

ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560 

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCT GGTTGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640 

ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760 

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880 

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940 

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060 

TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CGAACGGGAA CAGAGAGGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240 

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360 

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACGGC 3840 

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3960 
GCAGAAGTCA GATTGTAG 

Seq ID NO: 312 Protein sequence 
Protein Accession #: Eos sequence 




1 11 21 31 41 51 

I I I 1 I I 

MVGEGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL
TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHVVWKFQR
TRVLMDIVAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCIALF ATEFTKVFFW
ALAWAINYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL
PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAFRRS AILVTDKRVQ
TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVL
TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP
PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK
GATGPEEQSD SLKSVLHSIS FVVRKLCRYP EAQLLAWRWP AVFVGRIIRG YRPHGFSAKD
KDESRRLLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGVVAVNG
TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL
NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTVVLVTHQ
LQFLESCDEV ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA
FKESPAEREE DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT
MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS
PMSFFDTTPT GRLMNRFSKD MDELDVRLPF HAENFLQQFF MVVFILVILA AVFPAVLLVV
ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGIIHAYG KKESCITYTS
SKGLSLSYII QLSGLLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW
PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA
SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF
MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKIILLDEAT ASMDSKTDTL
VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA
AEVRL

Seq ID NO: 313 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: 1-966 

1 11 21 31 41 51 

I 1 I I I I 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATOGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCGG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCXXTC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCGG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 314 Protein sequence 
Protein Accession #: CAA83435

1 11 21 31 41 51 

| | I I I I 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240

LGSMGSVVKS EASSSPPVVT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300
HYQSGPVPGT AINGTLPLSH M 

Seq ID NO: 315 DNA sequence 
Nucleic Acid Accession #: U91618
Coding sequence: 29..541

1 11 21 31 41 51 

| 1 I 1 I I 

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTC 540 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 




PCT/US02/36810 



Seq ID NO: 316 Protein sequence 
Protein Accession #: AAB50564

1 11 21 31 41 51

| 1 I I I I 

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 
VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY

Seq ID NO: 317 DNA sequence 

Nucleic Acid Accession #: NM_006536.2

Coding sequence: 109..2940

1 11 21 31 41 51

| | I i I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60
ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120
AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180
GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240
ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300
ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360
ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420
TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480
TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540
TTCCTACTGA ATGATAACTT AACAGCTGGC TAOGGATCAC GAGGCCGAGT GTTTGTCCAT 600
GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660
ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720
GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780
GGATCCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840
AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900
CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960
TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020
GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080
GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140
ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200
CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260
TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320
AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGOGG AGATGATAAG 1380
CTTCTTGGCA ATTGCXTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440
CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500
TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560
TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620
AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680
ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740
GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800
TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860
TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920
GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980
TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040
GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100
GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160
TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220
CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280
GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340
AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400
CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460
TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520
AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580
AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640
ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700
GCAATAOGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760
CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820
GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880
CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940
ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000
CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060
ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120
CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180
GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240
AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300
TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360
TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420
CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480
TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540
TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600
TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCOT TATAATGCCT 3660
TACCTAGGAA A

Seq ID NO: 318 Protein sequence 
Protein Accession #: NP_006527.1

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQI/QDNGY NGLLIAINPQ VPENQNLISN 60

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120

GDDPYTLQYR GCGKEGKYIH FTPNFLUJDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSVV EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300

TFSLVQAGDK VVCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EVVEKLNGKA YGSVMILVTS 420

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900
LILKGVLTAM GLIGIICLII VVTHHTLSRK KRADKKENGT KLL

Seq ID NO: 319 DNA sequence 

Nucleic Acid Accession #s NM_000228.1 

Coding sequence: 82.. 3600 

1 11 21 31 41 51

| | | | | |

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 
AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 
AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 
GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 
GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 
GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540 
GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600
TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3660 
GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 
GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780
GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 
GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G 

Seq ID NO: 320 Protein sequence 
Protein Accession #: NP_000219.1

1 11 21 31 41 51 

| | | | | |

MRPFFLLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRF LRASSTCGLT KPETYCTQYG
EWQMKCCKCD SRQPHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQLDL DRRFQLQEVM
MEFQGPMPAG MLIERSSDFG KTWRVYQYLA ADCTSTFPRV RQGRPQSWQD VRCQSLPQRP
NARLNGGKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN FTRLAPVPQR GYHPPSAYYA
VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR
PWRPAEGQDA HECQRCDCNG HSETCHFDPA VFAASQGAYG GVCDNCRDHT EGKNCERCQL
HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQGERC DLCKPGFTGL
TYANPQGCHR CDCNILGSRR DMPCDEESGR CLCLPNVVGP KCDQCAPYHW KLASGQGCEP
CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDFRG
TEGPGCDKAS GRCLCRPGLT GPRCDQCQRG YCNRYPVCVA CHPCFQTYDA DLREQALRFG
RLRNATASLW SGPGLEDRGL ASRILDAKSK IEQIRAVLSS PAVTEQEVAQ VASAILSLRR 
TLQGLQLDLP LEEETLSLPR DLESLDRSFN GLLTMYQRKR EQFEKISSAD PSGAFRMLST 
AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDL 
TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTACGSRCRG VLPRAGGAFL MAGQVAEQLR 
GFNAQLQRTR QMIRAAEESA SQIQSSAQRL ETQVSASRSQ MEEDVRRTRL LIQQVRDFLT 
DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQDIARAR 
RLQAEAEEAR SRAHAVEGQV EDVVGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ
VLRPAEKLVT SMTKQLGDFW TRMEELRHQA RQQGAEAVQA QQLAEGASEQ ALSAQEGFER 
IKQKYAELKD RLGQSSMLGE QGARIQSVKT EAEELFGETM EMMDRMKDME LELLRGSQAI 
MLRSADLTGL EKRVEQIRDH INGRVLYYAT CK 

Seq ID NO: 321 DNA sequence 
Nucleic Acid Accession #: NM_001944.1 
Coding sequence: 84.. 3083 

1 11 21 31 41 51 

| | | | | |

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 
CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300

TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC



Seq ID NO: 322 Protein sequence 
Protein Accession #: NP_001935.1 



1 11 21 31 41 51

| | | | | |

MMGLFPRTTG ALAIFVVVIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60
GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF VVDKNTGDIN ITAIVDREET 120
PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180
ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLVVSGA 240
DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300
WLAVYFFTSG NEGNWFEIQT DPRTNEGILK VVKALDYEQL QSVKLSIAVK NKAEFHQSVI 360
SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420
SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480
VYVRVPDFND NCPTAVLEKD AVCSSSPSVV VSARTLNNRY TGPYTFALED QPVKLPAVWS 540
ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600
TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660
GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720
MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780
DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840
LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900
LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI



Seq ID NO: 323 DNA sequence 

Nucleic Acid Accession #: XM_058069.2

Coding sequence: 1..1413



1 11 21 31 41 51

| | | | | |

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60
AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120
TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180
AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240
ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300
AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360
TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420
TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480
GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540
CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600
GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660
GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720
AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780
CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840
CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900
TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960
ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020
AGAAATCAAG ---------- TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080
GAGCCAAATT ---------- CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140
GATGCAGCTG ---------- ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200
TGGAGGTATG ---------- ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260
AACTTCCAAG ---------- TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320
TATTTCTTCC ---------- CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380
ACACTGAAAA ---------- GTTTGGTTGT TGA



Seq ID NO: 324 Protein sequence 
Protein Accession #: P39900



1 11 21 31 41 51

| | | | | |

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FHAFDGKGGI 180
LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360
EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC



Seq ID NO: 325 DNA sequence 

Nucleic Acid Accession #: NM_024423.1 

Coding sequence: 64.. 2590 



1 11 21 31 41 51




WO 03/042661 



GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 
CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 
TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 
ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 
CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080
ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 
TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 
CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 
CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 
ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 
TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 
TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 
AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000
TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 
GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 
ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 
AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 
ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 
TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 
GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 
GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 
GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600
TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 
CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 
ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 
ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 
TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 
AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 
CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 
CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 
AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 
TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 
AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 
CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 
TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 
AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 
CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740
TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 
TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 
GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980






TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 
TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 
GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 
TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 
GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 
TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 
CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 
TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 
TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 
AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 
TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 
AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 
TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 
TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 
CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 
AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 
TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 
TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 
GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT
TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 
TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 
GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 
ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG
CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 
GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 
GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 
AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 
ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 
ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 
GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT
ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 
ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 
ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 
ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 326 Protein sequence 
Protein Accession #: NP_077741.1

1 11 21 31 41 51 

| | | | | |

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300

TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NO: 327 DNA sequence 

Nucleic Acid Accession #: NM_001941.2 

Coding sequence: 64.. 2754 

1 11 21 31 41 51 

| | | | | |

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 






GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 
GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 
AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 
AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 
TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 
AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 
ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 
ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 
ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 
ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 
GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 
CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 
GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 
GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 
AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 
GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 
GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 
CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 
GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 
GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 
TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 
GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 
ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 
TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 
TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 
CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 
CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 
TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 
TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 
TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 
TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 
AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 
ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 
GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 
AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 

ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
TATATATAAT CCCGAAACAT G 



Seq ID NO: 328 Protein sequence 
Protein Accession #: NP_001932.1 

1 11 21 31 41 51 

| | | | | | 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300 

TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHPKISTDK ETNEGVLSVV 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600 

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 

MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCKQNE 840 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR 

Seq ID NO: 329 DNA sequence 
Nucleic Acid Accession #: NM_016583.2 
Coding sequence: 72..842 

1 11 21 31 41 51 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 
GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 330 Protein sequence 
Protein Accession #: NP_057667.1 

1 11 21 31 41 51 

| | | | | | 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60 
SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 120 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 180 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240 
DIVNMLIHGL QFVIKV 






Seq ID NO: 331 DNA sequence 

Nucleic Acid Accession #: NM_004363.1 

Coding sequence: 115..2223 

1 11 21 31 41 51 

| | | | | | 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180 

TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 240 

TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360 

GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420 

CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480 

CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTCCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 332 Protein sequence 
Protein Accession #: NP_004354.1 

1 11 21 31 41 51 

| | | | | | 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120 

TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360 

QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420 

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 

Seq ID NO: 333 DNA sequence 

Nucleic Acid Accession #: NM_006952.1 

Coding sequence: 11..793 

1 11 21 31 41 51 

| | | | | | 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 



Seq ID NO: 334 Protein sequence 
Protein Accession #: NP_008883.1 

1 11 21 31 41 51 

| | | | | | 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 
LCWTFWVLLG TMFYWSRIEY 

Seq ID NO: 335 DNA sequence 

Nucleic Acid Accession #: NM_002638.1 

Coding sequence: 120.. 473 



1 11 21 31 41 51 

| | | | | | 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 
TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 
[illegible] CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 



Seq ID NO: 336 Protein sequence 
Protein Accession #: NP_002629.1 

1 11 21 31 41 51 

| | | | | | 

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 337 DNA sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71..2560 



1 11 21 31 41 51 

| | | | | | 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 
CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 
[illegible] CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 
AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 
CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 
GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 
CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 
CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 
GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 
TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 
GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 
ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 
CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 
CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 
GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 
CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 
GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 
CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 
GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 
CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 
TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 
CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG [illegible] TGCAGGGCTG 2580 
GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 
GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 
ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 
AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 
TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 
TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 
GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 
TAAAGAAACT TTTCCCAGAA AAAAA 

Seq ID NO: 338 Protein sequence 
Protein Accession #: NP_001784.2 

1 11 21 31 41 51 

| | | | | | 
MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 
VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 339 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..672 

1 11 21 31 41 51 

| | | | | | 

ATGAGGCTCC AAAGACCCCG ACAGGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 
CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 120 
AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 180 
CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240 
GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 300 
TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360 
ACAGAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420 
AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 480 
CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540 
TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600 
AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660 
AGCCTGTCTT GA 

Seq ID NO: 340 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | | 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60 
LLALLLVVAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 120 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 180 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 

Seq ID NO: 341 DNA sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53..1576 

1 11 21 31 41 51 

| | | | | | 

GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

TGCGGGCCCG AAGCGGCGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180 

CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240 

TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300 

GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360 

CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420 

CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480 

ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540 

CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600 

GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660 

CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GAAAGGGTGA 720 

TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 780 

TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 840 

CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 900 

CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960 

GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTTCGGG AACTATCACC TGGGCGTCAT 1020 
GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 1080 

GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 1140 

CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 1200 

GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 1260 

CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 1320 

TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 1380 

CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 1440 

CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 1500 

GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 

Seq ID NO: 342 Protein sequence 
Protein Accession #: XP_035292.2 

1 11 21 31 41 51 

| | | | | | 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60 
GTIIGSGIFV TPTGVLKEAG SPGLALVVWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 
LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 
LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 
GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 300 
STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 
SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 
RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 
KPKWLLQGIF STTVLCQKLM QVVPQET 

Seq ID NO: 343 DNA sequence 

Nucleic Acid Accession #: NM_005268.1 

Coding sequence: 168.. 989 

1 11 21 31 41 51 

| | | | | | 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180 

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360 

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 

Seq ID NO: 344 Protein sequence 
Protein Accession #: NP_005259.1 

1 11 21 31 41 51 

| | | | | | 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL VVMHVAYREV QEKRHREAHG ENSGRLYLNP 120 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP 180 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: 345 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 26.. 457 

1 11 21 31 41 51 

| | | | | | 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
TAATAT 

Seq ID NO: 346 Protein sequence 
Protein Accession #: NP_002382.1 

1 11 21 31 41 51 

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120 
RVTKPCTPKT KAKAKAKKGK GKD 






Seq ID NO: 347 DNA sequence 

Nucleic Acid Accession #: NM_006783. 

Coding sequence: 1..786 



1 11 21 31 41 51 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 

GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 

CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 

AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 

CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 

GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TGGCTGTGGT GGACGTACAC CAGCAGCATC 420 

TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 

TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 

TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 

ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 

AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 

CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780 
AGCTAA 



Seq ID NO: 348 Protein sequence 
Protein Accession #: NP_006774.1 



1 11 21 31 41 51 

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILVVAA QEVWGDEQED FVCNTLQPGC 60 

KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120 

IKKHKVRIEG WLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 

FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 
QNEMNELISD SGQNAITGFP S 

Seq ID NO: 349 DNA sequence 
Nucleic Acid Accession #: NM_002571.1 
Coding sequence: 99..587 

1 11 21 31 41 51 

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60 

TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120 

AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 180 

ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240 

CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300 

AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 360 

TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420 

AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480 

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540 

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 600 

CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660 

TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720 

TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 

Seq ID NO: 350 Protein sequence 
Protein Accession #: NP_002562.1 

1 11 21 31 41 51 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 60 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 351 DNA sequence 
Nucleic Acid Accession #: NM_006500.1 
Coding sequence: 27..1967 

1 11 21 31 41 51 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 
TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 
GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 
TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 
GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 
GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 
AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 
TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 
CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 
GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 
TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACOGTTTCC AGTTGGCAGC 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 
CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 
Seq ID NO: 352 Protein sequence 
Protein Accession #: NP_006491.1 



1          11         21         31         41         51
|          |          |          |          |          |
GLPRLVCAFL LAACCCCPRV AGVPGEAEQP APELVEVEVG STALLKCGLS QSQGNLSHVD
WFSVEKEKRT LIPRVRQGQG QSEPGEYEQR LSW2DRGATL ALTQVTPQDE RIFLOQGKRP
RSQEYRIQLR VYKAPKEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL
KEEKNRVHIQ SSQTVESSGL YTLQSILKAQ LVKEDKDAQF YCELNYRLPS
TVPVFYPTEK VWLEVEPVGM LKEGDRVEIR CLADGNPPPH FSISKQNPST
NGVLVLEPAR KEHSGRYECQ AWNLDTMISL LSEPQELLVN YVSDVRVSPA APERQEGSSL
TLTCEAESSQ DLEFQWLREE TDQVLERGPV LQLHDLKREA GGGYRCVASV PSIPGLNRTQ
LVKLAIFGPP WMAFKERKVW VKENMVLNLS CEASGHPRPT ISWNVNGTAS EQDQDPQRVL
STLNVLVTPE LLETGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG LSTSTASPHT
RANSTSTERK LPEPESRGVV IVAVIVCILV LAVLGAVLYF LYKKGKLPCR RSGKQEITLP
PSRKTELVVE VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRH
Seq ID NO: 353 DNA sequence 

Nucleic Acid Accession #: NM_003183.3 

Coding sequence: 165..2639 

1          11         21         31         41         51

|          |          |          |          |          |

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 
GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG 
CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC 
TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCCGGAT GACCCGGGCT 
TCGGCCCCCA CCAGAGACTC GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 

CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATG 
TAGAAACACT ACTAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 
GTACTGAACG TTTTTCACAA AATTTCAAGG TCGTGGTGGT GGATGGTAAA AACGAAAGCG 
AGTACACTGC AAAATGGCAG GACTTCTTCA CTGGACACGT GGTTGGTGAG CCTGACTCTA 
GGGTTCTAGC CCACATAAGA GATGATGATG TTATAATCAG AATCAACACA GATGGGGCCG 
AATATAACAT AGAGCCACTT TGGAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 
TTTATAAATC TGAAGATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 
ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGGGTT AGTAGACAGA GAACCACCTG 
AAGAGCTTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACACGTGTA 
AATTATTGGT GGTAGCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGGG GAAGAGAGTA 
CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 
CATGGGATAA TGCAGGTTTT AAAGGCTATG GAATACAGAT AGAGCAGATT CGCATTCTCA 
AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AGCACTACAA CATGGCAAAA AGTTACCCAA 
ATGAAGAAAA GGATGCTTGG GATGTGAAGA TGTTGCTAGA GCAATTTAGC TTTGATATAG 
CTGAGGAAGC ATCTAAAGTT TGCTTGGCAC ACCTTTTCAC ATACCAAGAT TTTGATATGG 
GAACTCTTGG ATTAGCTTAT GTTGGCTCTC CCAGAGCAAA CAGCCATGGA GGTGTTTGTC 
CAAAGGCTTA TTATAGCCCA GTTGGGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 
GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGGAAGC TGACCTGGTT ACAACTCATG 
AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG TCTAGCAGAA TGTGCCCCGA 
ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGTGAGTGGC GATCACGAGA 
ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT GAAAGTAAGG 
CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG GAACTCGAGG GTGGATGAAG 
GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA CACCTGCTGC AACAGCGACT 
GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT AAAAACTGTC 
AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA TGCTACTTGC AAAGGCGTGT 
CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA AATGACACTG 
TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC GAGAGGGAAC 
AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG TGCTGCAGGG 
ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA AAAGAACTTA TTTTTGAGGA 
AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAG AAACGAGTAC 
AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA 
AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT CTCCTTGATA TTTTGGATTC 
CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA TAAACAGTAT GAATCTCTGT 
CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTGCA TCGGTTCGCA 
TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT GCAGCCTGCC CCTGTGATCC 
CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT GGACACCATC CAGGAAGACC 
CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA GGACCCCTTC CCAAATAGCA 
GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC GGTCGCCAGA AGTGAAAAGG 
CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG CAAAGAAACA GAGTGCTAAT 
TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT TTATAGATTT GACCTACAAA 
TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC TGGTCATGTG 
TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTCCT TTTGAAAAGG 
TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAATATCTT 
TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG GTTATGAATA TTTACGTTTT 
TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG GAAATGATCA GTTTTTTTTT 
ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA GAAAAGTGGA 
ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA GATAAATTTA 
GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT TTTTATGTAG 
CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC TTCATAATTC 
TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA TGGTAGCCAG 
TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG TTTTTCTGTA 
TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT AGAAAATTCA 
CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG GCTGAGGTTG 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 
Seq ID NO: 354 Protein sequence 
Protein Accession #: NP_003174.2 

1          11         21         31         41         51
|          |          |          |          |          |
MRQSLLFLTS VVPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL
QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKVVVVD GKNESEYTAK WQDFFTGHVV
GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS
PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLVV ADHRFYRYMG
RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM
AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS
HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL
AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN
SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA
TCKGVSYCTG NSSECPPPGN AENDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC
KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS
INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCVDKKLDK QYESLSLFHP SNVEMLSSMD
SASVRIIKPF PAPQTPGRLQ PAPVIPSAPA APKLDHQRMD TIQEDPSTDS HMDEDGFEKD
PFPNSSTAAK SFEDLTDHPV ARSEKAASFK LQRQNRVNSK ETEC

Seq ID NO: 355 DNA sequence 

Nucleic Acid Accession #: NM_021832.1 

Coding sequence: 164..2248 

1          11         21         31         41         51
|          |          |          |          |          |

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGAAAAGAG GATTGAGGGG 
CTAGGCCGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 
GAAGGCTGCC CAGAGAGGTG GAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 

ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT 240 
CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG ATATTCTCTC 300 
TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT 360 
AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 
TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA ACGAAAGCGA 480 
GTACACTGTA AAATGGCAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC CTGACTCTAG 540 
GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA ATCAACACAG ATGGGGCCGA 600 
ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 
TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 
TTTAAAAGTG GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 
AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840 
ATTATTGGTG GTAGCAGATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900 
AACTACAAAT TACTTAATAG AGCTAATTGA CAGAGTTGAT GACATCTATC GGAACACTTC 960 
ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020 
GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 
TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 
TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT TTGATATGGG 1200 
AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 
AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 
CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 1380 
ATTGGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 
TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500 
CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 
CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 
AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 
CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 
GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 
CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 1860 
TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 
GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 
CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040 
AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 
GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 
GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220 
TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 
CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 
CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 
AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 
CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 
GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 
AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 
ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 2700 
GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 
TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820 
AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880 
TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 
GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000 
GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060 
GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120 
TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 
TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 
TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 
TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 3360 
AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 



Seq ID NO: 356 Protein sequence 
Protein Accession #: NP_068604.1 

1          11         21         31         41         51
MRQSLLFLTS VVPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60
QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKVVVVD GKNESEYTVK WQDFFTGHVV 120
GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180
PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLVV ADHRFYRYMG 240
RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 300
AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 360
HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 420
AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 480
SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540
TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC 600
KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660
INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCV 



Seq ID NO: 357 DNA sequence 

Nucleic Acid Accession #: NM_004994.1 

Coding sequence: 20.. 2143 

1          11         21         31         41         51
AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60
GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120
CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180
CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240
CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300
GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360
CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420
GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480
CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540
GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600
TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660
GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720
CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780
CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840
GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900
CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960
CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020
CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080
GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140
TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200
TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260
GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320
CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380
AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440
TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500
AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560
TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620
CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680
CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740
GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800
GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920
GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980
CCCGGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040
CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100
GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160
GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 



Seq ID NO: 358 Protein sequence 
Protein Accession #: NP_004985.1 

1          11         21         31         41         51
|          |          |          |          |          |
MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60
RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120
ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180
FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVVVPT RFGNADGAAC HFPFIFEGRS 240
YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300
ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 360
CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420
PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480
PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540
RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600
LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED 



Seq ID NO: 359 DNA sequence 

Nucleic Acid Accession #: NM_000213.1 

Coding sequence: 127.. 5385 



1          11         21         31         41         51
|          |          |          |          |          |
CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60
CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120
AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180
AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240
ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300
CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360
GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420
AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480
GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540
TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600
GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660
AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720
CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780
AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840
ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900
CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960
GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020
TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080
ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140
TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200
CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260
CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320
CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380
GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440



WO 03/042661 



PCT/US02/36810 



TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 
CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 
TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 
GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 
CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 
GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 
ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 
AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 
GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 
TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 
GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 
GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 
CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 
CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 
TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 
GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 
CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 
CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 
GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 
GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 
TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 
CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 
CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 
GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 
GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 
CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 
GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 
TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 
GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 
TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 
ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 
TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 
GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 
CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 
TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 
GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 
AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 
CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 
AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 
CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 
AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 
GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 
TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 
ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 
GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 
GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 
CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 
GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 
GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 
GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 
TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 
TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 
GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 
GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 
AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 
GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 
AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 
GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 
GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 
CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 
CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 
TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 
AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 
TACTG 






Seq ID NO: 360 Protein sequence 
Protein Accession #: NP_000204.1 

1          11         21         31         41         51
|          |          |          |          |          |
MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60
CNTQAELLAA GCQRESIVVM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120
EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180
PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240
QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300
TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360
EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420
THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NGDFVCGQCV 480
CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540
FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600
CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660
LKRAEEVVVR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLL 720
LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780
SGNLKGRDVV RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840
AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900
LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960
LVEAIDVPAG TATLGRRLVN ITIIKEQARD VVSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020
KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080
VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140
RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200
AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260
NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320
KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380
FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440
STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500
ELHRLNIPNP AQTSVVVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560
LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620
RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680
LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740
LSTHMDQQFF QT 



Seq ID NO: 361 DNA sequence 

Nucleic Acid Accession #: NM_013332.1 

Coding sequence : 1 . . 63 



1          11         21         31         41         51
|          |          |          |          |          |
GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60
AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 120
CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180
GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240
GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300
GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360
CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420
ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480
TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 540
CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 600
CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660
CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720
GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCOGGGGACG 780
GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840
CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960
TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020
TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080
TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 1140
CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200
TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260
GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA



Seq ID NO: 362 Protein sequence 
Protein Accession #: NP_037464.1 

1          11         21         31         41         51
|          |          |          |          |          |
MKHVLNLYLL GVVLTLLSIF VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS 60
RSM 



Seq ID NO: 363 DNA sequence 

Nucleic Acid Accession #: NM_023915.1 

Coding sequence: 250.. 1326 



1          11         21         31         41         51
|          |          |          |          |          |
GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660
GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780
ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960
AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080
AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140
ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200
TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260
ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320
GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380
TTCATTATCC TTAAAAAAAA AA



Seq ID NO: 364 Protein sequence 
Protein Accession #: NP_076404 

1          11         21         31         41         51
|          |          |          |          |          |
MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60
GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120
FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180
NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240
ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 



Seq ID NO: 365 DNA sequence 

Nucleic Acid Accession #: NM_005365.1 

Coding sequence: 1..948 



1          11         21         31         41         51
|          |          |          |          |          |
ATGTCTCTCG AGCAGAGGAG TCCGCACTGC AAGCCTGATG AAGACCTTGA AGCCCAAGGA 60
GAGGACTTGG GCCTGATGGG TGCACAGGAA CCCACAGGCG AGGAGGAGGA GACTACCTCC 120
TCCTCTGACA GCAAGGAGGA GGAGGTGTCT GCTGCTGGGT CATCAAGTCC TCCCCAGAGT 180
CCTCAGGGAG GCGCTTCCTC CTCCATTTCC GTCTACTACA CTTTATGGAG CCAATTCGAT 240
GAGGGCTCCA GCAGTCAAGA AGAGGAAGAG CCAAGCTCCT CGGTCGACCC AGCTCAGCTG 300
GAGTTCATGT TCCAAGAAGC ACTGAAATTG AAGGTGGCTG AGTTGGTTCA TTTCCTGCTC 360
CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA TGCTGGAGAG CGTCATCAAA 420
AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTGATC 480
TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 540
CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA GCATGCCCAA GGCCGCCCTC 600
CTGATCATTG TCCTGGGTGT GATCCTAACC AAAGACAACT GCGCCCCTGA AGAGGTTATC 660
TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG AGCACATGTT CTACGGGGAG 720
CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT ACCTGGAGTA CCGGCAGGTG 780
CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT CCAAGGCCCA CGCTGAAACC 840
AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAGAGCC CATCTGCTAC 900
CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG GAGTCTGA



Seq ID NO: 366 Protein sequence 
Protein Accession #: NP_005356.1 

1          11         21         31         41         51
MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60
PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120
HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180
LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240
PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300
PSLYEEVLGE EQEGV 

Seq ID NO: 367 DNA sequence 
Nucleic Acid Accession #: NM_014400 
Coding sequence: 86..1126

1 11 21 31 41 51 

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540 

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600 

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720 

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840 

CACATCTGTC ACCACTTCTA CCTCGGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960 

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080 

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260 

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 




WO 03/042661 



PCT/US02/36810 



GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380
TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440
AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500
GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560
ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620
CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680
TTGTATAGTG AAAAAAAA

Seq ID NO: 368 Protein sequence
Protein Accession #: NP_055215

1 11 21 31 41 51
| | | | | |
MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60
CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120
SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPVVS CYNASDHVYK GCFDGNVTLT 180
AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240
LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL



Seq ID NO: 369 DNA sequence 

Nucleic Acid Accession #: NM_005329.1
Coding sequence: 1..1662

1 11 21 31 41 51
| | | | | |
ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60
GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120
CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180
CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240
TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 300
TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360
GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420
GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480
GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540
AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600
TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660
GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720
GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780
GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840
CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900
GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960
ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020
ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080
TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140
TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200
TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260
ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320
CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380
ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440
CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 1500
TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560
GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA



Seq ID NO: 370 Protein sequence 
Protein Accession #: NP_005320.1

1 11 21 31 41 51
| | | | | |
MPVQLTTALR VVGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60
LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKVVM 120
VVDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDVVRA 180
STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240
VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300
DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360
YFREWLYNSL WFHKHHLWMT YESVVTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420
IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIVVNFIG 480
LIPVSIWVAV LLEGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540
KKPEQYSLAF AEV



Seq ID NO: 371 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 148-7095

1 11 21 31 41 51
| | | | | |
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA TCTACTGCTT TCATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 
TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAACTG AAATAATATA TGGAAATGAG 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 
CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 
GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 
AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 
CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 
CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 
TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 
TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 
CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 
GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 
GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 
CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 
AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 
GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 
TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 
GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 
TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 
ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 
TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 
CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 
GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA
ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 
TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 
TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 
TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 
GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 
TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 
GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 
CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 
TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA
TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 
GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 
AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 
AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 
TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 
GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA
TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 
GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 
AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAA 
Seq ID NO: 372 Protein sequence

Protein Accession #: built from XP_031379

1 11 21 31 41 51

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI
LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ
ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY
KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG
SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS
PVSVAEFTYT TSVFGDDNKA LSKTEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK
LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL
KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP
AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL
TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID
EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS
PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH
KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR
SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN
ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS
HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR
VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI
TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA
QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ
VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY
AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE
QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY
SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL
HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS
NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI
VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV



Seq ID NO: 373 DNA sequence

Nucleic Acid Accession #: built from NM_002851
Coding sequence: 148-4518

1 11 21 31 41 51

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 
GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG
TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 
TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC
TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 
CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 
TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 
GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 
AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 
CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 
TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 
TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 
GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 
AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 
ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 
GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGGCA 
GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 
GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 
CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 
CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 
GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 
GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 
AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 
AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG
AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 
GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 
TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 
GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT
ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 
TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 
GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 
GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 
GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 
CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 374 Protein sequence

Protein Accession #: built from XP_031379

1 11 21 31 41 51
| | | | | |
MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI
PLVIVSALTF ICLVVLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI
PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI
VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN
VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK
GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR
TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE
VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP
VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV
MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY
VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL
MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG
AALPDGNIAE SLESLV

Seq ID NO: 375 DNA sequence

Nucleic Acid Accession #: EOS sequence

Coding sequence: 148-4494

1 11 21 31 41 51
| | | | | |
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600
GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 
ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880
CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 
AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 
CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 
GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 
TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 
AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 
GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 
GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 
GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 
CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 
AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 
CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 
GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 
CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 
TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 
ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 
CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320
AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 
ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 
GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500
AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 
ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 
GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 
TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 
GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 
AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920
GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 
GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 
ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 
CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 
AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 
TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 376 Protein sequence:
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

| | | | | |

MRILKRPLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRPSSFEEAV KGKGKLRALS 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPPILLNL LPNSTDKYYI YNGSLTSPPC
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT ENPELDLFPE
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYKEAKTN 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI
TAQPDVGSGR ESFLGTNYTE IRVDESEKTT RSFSAGPVMS QGPSVTDLEM PHYSTFAYFP
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP
LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP
IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV
KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT
NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 






WO 03/042661 



PCT/US02/36810 



GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVVVHCSAGV GRTGTYIVLD 1080
SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140
YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200
SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260
AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320
CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380
SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440
AESLESLV 


Seq ID NO: 377 DNA sequence

Nucleic Acid Accession #: EOS sequence

Coding sequence: 501-4514

1 11 21 31 41 51

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 
TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 
GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 
ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 
TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840
ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 
CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 
CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 
GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 
AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 
GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 
GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 
ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 
ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 
AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 
CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 
AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 
CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 378 Protein sequence: 
Protein Accession #: EOS sequence

1 11 21 31 41 51 

| | | | | |

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA
LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP
PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS
SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRVV YDTMIEKFAV LYQQLDGEDQ
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF
PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK
TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL
NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF
ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST
DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY
FPTEVTPHAF TPSSRQQDLV STVNVVYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV
IPLVIVSALT FICLVVLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA
IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN
IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH
NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK
KGSQKGRPSG RVVTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VVVHCSAGVG
RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET
EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII
PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV
VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD
YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT
LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN
GAALPDGNIA ESLESLV 

Seq ID NO: 379 DNA sequence 
Nucleic Acid Accession #: EOS sequence 
Coding sequence: 148-4632

1 11 21 31 41 51 

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TCTCTGCCTT 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA A 
Seq ID NO: 380 Protein sequence:
Protein Accession #: EOS sequence

1 11 21 31 41 51
| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP
LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP
IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV
AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV
EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG
SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT
GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV
LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA
SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS
YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP
INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT
FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV

Seq ID NO: 381 DNA sequence

Nucleic Acid Accession #: NM_002851.1

Coding sequence: 148..7092

1 11 21 31 41 51

| | | | | |
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA
TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 
GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 
GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 
GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 
ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 
GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 
CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 
CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 
CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 
ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 
TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 
GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 
TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA
GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 
TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 
TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 
GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 
AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 
AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 
GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 
AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 
AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 
GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 
CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 
AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 
CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 
ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 
TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 
ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 
AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 
ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 
CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA A 






Seq ID NO: 382 Protein sequence 
Protein Accession #: NP_002842.1 


1 11 21 31 41 51
| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780
LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840
ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900
KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960
SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020
PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080
LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140
KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200
AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260
TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320
EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380
PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440
CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500
PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLNE 1560
NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620
ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LVVLVGILIY WRKCFQTAHF YLEDSTSPRV 1680
ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740
ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800
GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV 1860
LAYYTVRNFT LRNTKIKKGS QKGRPSGRVV TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920
KRHAVGPVVV HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980
YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040
AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100
TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL MAEEHKCLSN 2160
EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIKEE AANRDGPMIV 2220
HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV



Seq ID NO: 383 DNA sequence

Nucleic Acid Accession #: NM_005688.1

Coding sequence: 126..4439

1 11 21 31 41 51

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660
TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720
TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780
TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840
CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900
TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960
TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020
GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080
GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140
TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200
ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260
AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320
AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380
CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440
TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500
AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560
TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620
GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680
ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740
AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800
CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860
ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920
GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980
TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040
TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100
ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160
GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220
TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280
TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340
TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400
GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460
CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCX3CCAGT TGAGATCAAT TCAAAAAAGG 2520
AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700
TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760
GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820
CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 
CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000
CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 
TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180
GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360
AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 
CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 
TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 
CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660
AGAACAAGGG TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 
CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 
CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 
TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 
GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 
GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 
TCCTCCCTCT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 
GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740
TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 
TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 
CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 
ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460
CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 
CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 
GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 
ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 384 Protein sequence 
Protein Accession #: NP_005679.1

1 11 21 31 41 51

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120
VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180
LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240
ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300
PVVAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360
VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIVV VIASVVTFSV 420
HMTLGFDLTA AQAFTVVTVF NSMTFALKVT PFSVKELSEA SVAVDRFKSL FLMEEVHMIK 480
NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540
VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600
KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660
CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720
NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780
FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840
VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900
VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGVVFVKGT LRASSRLHDE LFRRILRSPM 960
KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020
LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080
LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140
GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200
RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260
IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320
SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380
MLTIAHRLHT VLGSDRIMVL AQGQVVEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG

Seq ID NO: 385 DNA sequence

Nucleic Acid Accession #: NM_001327.1

Coding sequence: 89..631

1 11 21 31 41 51

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180 

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480 

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 



Seq ID NO: 386 Protein sequence 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 



Seq ID NO: 387 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 52..459

1 11 21 31 41 51

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 

CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 

CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 



Seq ID NO: 388 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120
FLPVFLAQAP SGQRR 



Seq ID NO: 389 DNA sequence 
Nucleic Acid Accession #: NM_005562.1 
Coding sequence: 90..3671

1 11 21 31 41 51 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 
GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380
GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 
ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 
CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 
CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620
TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 
GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 
ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 
GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 
GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920
CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 
AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 
GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 
AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 
ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220
AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520
CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 
CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760
AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820
AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 
AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 
AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 
AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 
GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 
TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 
CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420
AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 
AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 
AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 
AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 
AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720
GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 
TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 
TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 
ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 
ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020
ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 
ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 
ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 
ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320
GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 
ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 
ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620
CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 
GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 
AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 
GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920
CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 
TGGTGCTGCC TTGCTTCTGT ATTXCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC

Seq ID NO: 390 Protein sequence 
Protein Accession #: NP_005553.1 

1 11 21 31 41 51

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60
DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240
SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360
DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEVV 480
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGVVPDTELE GRMQQAEQAL 660
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720
RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AVVQGLVEKL EKTKSLAQQL TREATQAEIE 840
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ

Seq ID NO: 391 DNA sequence 
Nucleic Acid Accession #: AF101051.1
Coding sequence: 221..856

1 11 21 31 41 51 

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60
CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120 
GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180 
ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 240 
GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 300 
GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360
CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420 
TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 480 
CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 540 
CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 
TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660
ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 
TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780 
CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 
GAAAGACTAC GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 900 
GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960
GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 
AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 
TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 
TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 
CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260

CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 
CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC 1380 
CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 
TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 
TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560
AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 
GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680 
TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740
CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800
ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860
TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920
GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 
TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 
AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGGCT ATGTAGTGTC 2100 
TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160
CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 
ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280 
AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 
ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 
TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460
GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 
AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 
CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 
GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 
CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760
GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 
AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 
ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 
TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 
TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060
CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120
TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 
ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 
ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300
TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360
TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 392 Protein sequence 
Protein Accession #: AAD16433.1

1 11 21 31 41 51

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60
QIQCKVFDSL LNLSSTLQAT RALMVVGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120
IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

Seq ID NO: 393 DNA sequence

Nucleic Acid Accession #: NM_006180.1 

Coding sequence: 352..2820

1 11 21 31 41 51 

I I I I I I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCGATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO: 394 Protein sequence 
Protein Accession #: NP_006171.1

1 11 21 31 41 51

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60
NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120
NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180
SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240
NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420
DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480
LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540
KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600
VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660
AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720
PPESIMYRKF TTESDVWSLG VVLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG

Seq ID NO: 395 DNA sequence 
Nucleic Acid Accession #: AF410899
Coding sequence: 483..2999

1 11 21 31 41 51 

I I I I I I 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 
GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 
TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180 
CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 
AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 
CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 
CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 
GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 
GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 
GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 
CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 
CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 
AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 
ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 
TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 
TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960 
AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020
GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 
ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 
TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 
TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 
GTGGGAAGCA GATCTCTTGT GTGGGGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 
ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 
ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 
ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 
CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 
CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 
GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 
ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 
CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 
CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 
CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 
GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980 
TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 
GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 
TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 
TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 
AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2280 
ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 
ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 
ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 
CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 
TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 
TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640 
CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 
GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 
GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 
AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 
AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 
ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 
GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 
ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 
TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 
TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 
TTCTTTTTTT AAATTTTCTT TTTCXTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 
CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 
TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 
GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 
AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 
TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 
ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660 
TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 
TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780 
CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 
CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3900 
ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 
TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 396 Protein sequence 
Protein Accession #: AAL67965.1 

1 11 21 31 41 51 

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKDFSWF GFGKVKSRQG 480 

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540 

KPDTFVQHIK RHNIVLKREL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600 

FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD LNKFLRAHGP DAVLMAEGNP 660 

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720 

DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGVVLW EIFTYGKQPW YQLSNNEVIE 780 
CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG 

Seq ID NO: 397 DNA sequence 
Nucleic Acid Accession #: AB052906 
Coding sequence: 74..814 

1 11 21 31 41 51 

| | | | | | 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60 

CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120 

GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240 

GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360 

GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600 

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840 

AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900 

CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080 

TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200 

TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 398 Protein sequence 
Protein Accession #: BAB61048.1 

1 11 21 31 41 51 

| | | | | | 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 60 

FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT 120 

LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKVVAM 180 

SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240 
FILPGI 

Seq ID NO: 399 DNA sequence 

Nucleic Acid Accession #: NM_001898.1 

Coding sequence: 57..482 

1 11 21 31 41 51 

| | | | | | 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60 

CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180 

AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240 

ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300 

ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360 

ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420 

TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480 

AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540 

CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600 

GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660 

CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720 
AAACAGTAGC ATCGCC 

Seq ID NO: 400 Protein sequence 
Protein Accession #: NP_001889.1 

1 11 21 31 41 51 

| | | | | | 

MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60 

DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120 



EIYEVPWENR RSLVKSRCQE S 



Seq ID NO: 401 DNA sequence 

Nucleic Acid Accession #: NM_003976.2 

Coding sequence: 299..961 



1 11 21 31 41 51 
| | | | | | 
CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 
CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 
CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 
GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 
CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 
GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 
TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420 
GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 
CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 
GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 
CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 
GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 
CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 
CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840 
GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 
CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 
AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 
CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 
AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 
CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 
GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 
CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320 
ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380 
CCTGTACTCA CTCATGGGAG CTGGCCCC 



Seq ID NO: 402 Protein sequence 
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

| | | | | | 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



Seq ID NO: 403 DNA sequence 
Nucleic Acid Accession #: NM_057091.1 
Coding sequence: 783..1445 

1 11 21 31 41 51 

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60 
GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120 
TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180 
CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240 
CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300 
CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360 
CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420 
CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480 
CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540 
CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 
TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 
GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 
TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 
AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 
AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 
CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 
CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 
GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 
CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 
CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 
GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 
CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 
CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380 
ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440 
GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 
GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 
TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 
CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 
CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 
GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 
AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 404 Protein sequence 
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

| | | | | | 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



Seq ID NO: 405 DNA sequence 

Nucleic Acid Accession #: NM_057160.1 

Coding sequence: 1..714 

1 11 21 31 41 51 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 

Seq ID NO: 406 Protein sequence 
Protein Accession #: NP_476501.1 

1 11 21 31 41 51 

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60 

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 

RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 



Seq ID NO: 407 DNA sequence 

Nucleic Acid Accession #: NM_057090.1 

Coding sequence: 29..715 

1 11 21 31 41 51 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 408 Protein sequence 
Protein Accession #: NP_476431.1 

1 11 21 31 41 51 

| | | | | | 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60 
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120 
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 180 
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG 



Seq ID NO: 409 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1746 

1 11 21 31 41 51 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 
GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120 
GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 
CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 
GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 
GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 
TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 
CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 
GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 
TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 
GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 
GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 
TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 
TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920 
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 
GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 
AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 
TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 
CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 
AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 
CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 
AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 
CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 
GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 
CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 
GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 
TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 
AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 
TCOGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 
ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 
GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 
TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 
CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 
CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660 
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720 
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 
CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCGAGAGGGC 3900 
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 
ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 
CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 
GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 
GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 
CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 
GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800 
TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860 
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920 
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980 
TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040 
TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100 
GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160 
TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220 
AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280 
GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340 
AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400 
GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460 
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520 
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580 
CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640 
GCCCCAGTGC TTGGCGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700 
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760 
GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT 



Seq ID NO: 410 Protein sequence 
Protein Accession #: BAB84587.1 



1 11 21 31 41 51 
| | | | | | 
MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60 
LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120 
FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180 
GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240 
FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300 
YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360 
FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420 
YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480 
VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540 
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C 



Seq ID NO: 411 DNA sequence 
Nucleic Acid Accession #: XM_098151 
Coding sequence: 1..447 



1 11 21 31 41 51 
| | | | | | 
ATGATGCATT TGCTCAATTC TCAGGGCTGG AATGAGCCGG CTGGTCCCCC AGAAAGCTGG 60 
AGTGGGGTAC AGAGTTCAGT TTTCCTCTCT GTTTACAGCT CCTTGACAGT CCCACGCCCA 120 
TCTGGAGTGG GAGCTGGGAG TCAGTGTTGG AGAAGAAACA ACAAAAGCCA ATTAGAACCA 180 
CTATTTTTAA AAAGTGCTTA CTGTGCACAG ATACTCTTCA AGCACTGGAC GTGGATTCTC 240 
TCTCTAGCCC TCAGCACCCC TGCGGTAGGA GTGCCGCCTC TACCCACTTG TGATGGGGTA 300 
CAGAGGCACT TGCTCTTCTG CATGGTGTTC AATAGGCTGG GAGTTTTATT TATCTCTTCA 360 
AACTTTGTAC AAGAGCTCAT GGCTTGTCTT GGGCTTTCGT CATTAAACCA AAGGAAATGG 420 
AAGCCATTCC CCTGTTGCTC TCCTTAG 



Seq ID NO: 412 Protein sequence 
Protein Accession #: XP_098151 

1 11 21 31 41 51 

| | | | | | 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 60 
LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 120 
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 413 DNA sequence 

Nucleic Acid Accession #: NM_002658.1 

Coding sequence: 77..1372 



1 11 21 31 41 51 
| | | | | | 
GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 
CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 
GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 
TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 
GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 
TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 
CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 
CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 
GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 
AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 
CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 
CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 
CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 
CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 
GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 
CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 
ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 
CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 
TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 
CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCGAGGGAGA 1200 
CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 
CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 
CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 
AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 
TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 
CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 
CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 
TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 
GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 
AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 
AATGTGGGAG CAGOGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 
ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 
GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 
AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 
CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 
ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 
ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 
ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 
ATCAATAAAA TGTGATTTTT CTGA 



Seq ID NO: 414 Protein sequence 
Protein Accession #: NP_002649.1 

1 11 21 31 41 51 
| | | | | | 
MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 
HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 
YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180 
GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 
RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 
PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360 
CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L 



Seq ID NO: 415 DNA sequence 

Nucleic Acid Accession #: NM_024422.1 

Coding sequence: 202..2907 

1 11 21 31 41 51 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 
CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 
GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 
GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 
CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 
AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 
CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 
TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 
TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 
GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 
AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 
CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 
AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 
AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 
TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 
GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 
GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 
GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 
CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 
GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 
GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 
ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 
TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 
ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 
GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 
CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 
AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 
CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 
AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 
CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 
GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 
CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 
AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 
ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 
AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 
CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 
GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 
GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 
AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG
CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 
CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 
AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA
TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580
TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640
ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700
CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760
GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820
GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880
CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940
TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000
TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060
AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120
TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180
ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240
TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300
AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360
ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT



Seq ID NO: 416 Protein sequence
Protein Accession #: NP_077740.1 



1 11 21 31 41 51

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60
ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120
KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180
EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY 240
PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300
TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360
VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV 420
KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480
KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540
KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600
EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS 660
VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720
TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780
QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL 840
CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTLAEACMK 900
R



Seq ID NO: 417 DNA sequence 

Nucleic Acid Accession #: NM_004949.1 

Coding sequence: 202.. 2745 

1 11 21 31 41 51 

I I I I I I 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 
CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 
GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 
GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 
CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 
AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 
CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 
TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT
TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 
GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC
AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 
CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 
AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 
AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 
TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA
GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 
GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 
GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 
CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 
GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 
GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 
ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 
TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 
ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 
GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 
CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 
AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 
CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 
AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 
CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 
GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 
CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC
AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG
ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG
AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 
CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 
GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 
GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 
AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG
CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400
CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460
AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520
TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580
TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640
ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700
CTTGGTGAAG AATCCATTAG AGGACACACT CTGATTAAAA ATTAAACAAT GAAAGAAAGT 2760
GTATCTGTGT AATCAAGATG AAAATCACAA GCATGCCCAA GACTATGTCC TGACATATAA 2820
CTATGAAGGA AGAGGATCGG TGGCTGGGTC TGTAGGTTGT TGCAGTGAAC GACAAGAAGA 2880
AGATGGGCTT GAATTTTTGG ATAATTTGGA GCCCAAATTT AGGACACTAG CAGAAGCATG 2940
CATGAAGAGA TGAGTGTGTT CTAATAAGTC TCTGAAAGCC AGTGGCTTTA TGACTTTTAA 3000
AAAAAATTAC AAACCAAGAA TTTTTTAAAG CAGAAGATGC TATTTGTGGG GGTTTTTCTC 3060
TCATTATTTG GATGGAATCT CTTTGGTCAA ATGCACATTT ACAGAGAGAC ACTATAAACA 3120
AGTACACAAA TTTTTCAATT TTTACATATT TTTAAATTAC TTATCTTCTA TCCAAGGAGG 3180
TCTACAGAGA AATTAAAGTC TGCCTTATTT GTTACATTTG GGTATAATGA CAACAGCCAA 3240
TTTATAGTGC AATAAAATGT AATTAATTCA AGTCCTTATT ATAGACTATT TGAAGCACAA 3300
CCTAATGGAA AATTGTAGAG ACCTTGCTTT AACATTATCT CCAGTTAATT AAGTGTTCAT 3360
GTGGTGCTTG GAAACTGTTG TTTTCCTGAA CATCTAAAGT GTGTAGACTG CATTCTTGCT 3420
ATTATTTTAT TCTTGTAATG TGACCTTTTC ACTGTGCAAA GGGAGATTTC TAGCCAGGCA 3480
TTGACTATTA CAATTTCATT



Seq ID NO: 418 Protein sequence 
Protein Accession #: NP_004940.1 

1 11 21 31 41 51

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120 

KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY 240

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360 

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV 420

KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS 660

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEESIR 840
GHTLIKN 



Seq ID NO: 419 DNA sequence
Nucleic Acid Accession #: NM_002722.1
Coding sequence: 14..301

1 11 21 31 41 51

ACTCTGGACT CCGGATGGCT GCCGCACGCC TCTGCCTCTC CCTGCTGCTC CTGTCCACCT 60 

GCGTGGCTCT GTTACTACAG CCACTGCTGG GTGCCCAGGG AGCCCCACTG GAGCCAGTGT 120

ACCCAGGGGA CAATGCCACA CCAGAGCAGA TGGCCCAGTA TGCAGCTGAT CTCCGTAGAT 180 

ACATCAACAT GCTGACCAGG CCTAGGTATG GGAAAAGACA CAAAGAGGAC ACGCTGGCCT 240 

TCTCGGAGTG GGGGTCCCCG CATGCTGCTG TCCCCAGGGA GCTCAGCCCG CTGGACTTAT 300 

AATGCCACCT TCTGTCTCCT ACGACTCCAT GAGCAGCGCC AGCCCAGCTC TCCCCTCTGC 360 

ACCCTTGGCT CTGGCCAAAG CTTGCTCCCT GCTCCCACAC AGGCTCAATA AAGCAAGTCA 420
AAGCC 






Seq ID NO: 420 Protein sequence 
Protein Accession #: NP_002713.1

1 11 21 31 41 51 

MAAARLCLSL LLLSTCVALL LQPLLGAQGA PLEPVYPGDN ATPEQMAQYA ADLRRYINML 60 
TRPRYGKRHK EDTLAFSEWG SPHAAVPREL SPLDL 

Seq ID NO: 421 DNA sequence 
Nucleic Acid Accession #: NM_032545.1
Coding sequence: 46..718

1 11 21 31 41 51

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480

CCTGCACTCC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 720

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780
TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840
AAAAAA

Seq ID NO: 422 Protein sequence
Protein Accession #: NP_115934.1

1 11 21 31 41 51

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE 60
VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120
SECGALEHGA HTLRACHLCR CIFGALHCLP LQTPDRCDPK DPLASHAHGP SAGGAPSLLL 180
LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL



Seq ID NO: 423 DNA sequence 

Nucleic Acid Accession #: NM_006533.1 

Coding sequence: 72.. 467 



1 11 21 31 41 51

AGGGAGAGAG GGAGGGGAGG AAATTGGAGA CCCCAGCACC CCCTTGCTCA CTCTCTTGCT 60
CACAGTCCAC GATGGCCCGG TCCCTGGTGT GCCTTGGTGT CATCATCTTG CTGTCTGCCT 120
TCTCCGGACC TGGTGTCAGG GGTGGTCCTA TGCCCAAGCT GGCTGACCGG AAGCTGTGTG 180
CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTCAGGAC TACATGGCCC 240
CCGACTGCCG ATTCCTGACC ATTCACCGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 300
AGGGCCGTGG GCGGCTCTTC TGGGGAGGCA GCGTTCAGGG AGATTACTAT GGAGATCTGG 360
CTGCTCGCCT GGGCTATTTC CCCAGTAGCA TTGTCCGAGA GGACCAGACC CTGAAACCTG 420
GCAAAGTCGA TGTGAAGACA GACAAATGGG ATTTCTACTG CCAGTGAGCT CAGCCTACCG 480
CTGGCCCTGC CGTTTCCCCT CCTTGGGTTT ATGCAAATAC AATCAGCCCA GTGCAAAC



Seq ID NO: 424 Protein sequence 
Protein Accession #: NP_006524.1

1 11 21 31 41 51

MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR
FLTIHRGQVV YVFSKLKGRG RLFWGGSVQG DYYGDLAARL GYFPSSIVRE DQTLKPGKVD
VKTDKWDFYC Q 

Seq ID NO: 425 DNA sequence 

Nucleic Acid Accession #: NM_080870.1

Coding sequence: 3..710

1 11 21 31 41 51 

AGATGACACA AGTCACAGAA AAGTCCACAG AACACCCAGA AAAGACCACG TCAACCACAG 
AGAAAACCAC AAGAACCCCA GAAAAGCCTA CGCTATACTC AGAGAAGACC ATATGCACCA 
AAGGGAAAAA CACACCAGTC CCAGAAAAGC CTACAGAAAA CCTGGGGAAC ACCACACTGA 
CCACTGAGAC CATAAAAGCC CCAGTAAAGT CCACAGAAAA CCCAGAAAAA ACAGCAGCAG 
TCACAAAGAC TATAAAACCT TCAGTCAAGG TCACAGGAGA CAAATCTCTC ACTACTACCT 
CTTCTCATCT AAATAAAACT GAAGTTACTC ATCAGGTGCC CACTGGTTCT TTCACCCTCA 
TTACATCTAG AACGAAGCTG AGTTCTATCA CATCAGAAGC CACAGGAAAC GAGAGCCATC 
CATACCTCAA TAAAGATGGC TCACAGAAAG GTATCCACGC TGGACAGATG GGAGAGAATG 
ATTCATTCCC TGCATGGGCC ATAGTTATTG TGGTCCTGGT GGCTGTGATT CTCCTCCTGG 
TGTTCCTTGG CCTGATCTTC TTGGTCTCCT ATATGATGCG GACACGCCGC ACACTAACCC 
AGAACACCCA GTACAATGAT GCAGAGGATG AGGGTGGCCC CAATTCCTAC CCGGTCTACC 
TGATGGAGCA GCAGAATGTT GGCATGGGCC AGATCCCTTC CCCACGGTGA TCTTGGAGTA 
GGCGCCCAGC CCTGGCTCTT CCATGCTCTG CCCCTTTCCT GGATGAGGAA CCGGACTCAC 
AATTTCTATT TCCGGGACTA CAGGAAGGGC AGAGAATACT GACGGTTACC AGTATTAACC 
CTTCATCTGT TCTTGAAACT GGTTGGGGAA TGAGGTGATA AGCAAGGAGG GTGTAAGTTT 
AGGGGACAAA GAAGAAAGAA TGAATAATAC GAGCAGACAT TCTCTGTAGA AGGTAATGGT 
CTGAGAATGA AAAGGTGTTT GATGGACATG TTGTGGGGGC ACCAATGCAG AACACTGCAC 
TGAGTCCTAA AGGAAGGACA GGAGCCTTAT AGGCAATGCC CCAGACTGAC TTGTGAGTGG 
GGTTTATGGG GAAAGGGAGG GACTGAGGGC AGAGTCTCTG GGTTTCAGGA CAGCATTATG 
TTATTTCCAT TCACTATTAC TTAAGAGTTT GTGTGTAAAC AGGCTCATCT CTGAGTTCTC 
AGGACCCTTG CCCCCACCCC CATTTTTTTA ATGAAAAAAA AAAACAAAAA AAACGGATCC 
AAGAAGAAAA GAGAATTTAT TTCCTTCTCC ACTCTCTCCA TGCCCTGGAG AAAAAAAAGT 
CCAGAAGAAA TCATAAATAT CTCTCATCTA CATGGTTGCT TCCTCTTCCT CCCAAATCCC 
TTAGTTTTCC TAAATGTCTA CAGTGGACGC CCTGTTGGTT TGGCTTGCTG GGTTGTGGGT 
GGACACGCAA GGAGGGGATT TTTATTTGGC CAGCAGTCTC ACCCACTGAT CTCCACCCCA 
GACCTTCCCT GATTGGTGTC TCAGCATTTA TTTTCCTGTC TCTTCCACCA AAAGCCAGCT 
GTAGCTTTAT CTCGTAAAAG TTACCCATCT TCTCTACTGT CCCCATTCTC TCTCCTCCCA 
CCTTCACCCC AGATTCAAGT TTTCCTCCTT GTAGGCATTT CATCTGTGTG TGTTTTCTGG 
ATTTTCTCTC TCTCTTCTTA TGGCCATTTC ACCTTATTAC TGATTGGGTA GAGGGGGAAA 
AGGAGAATGA TGATGATAGT TTCCTTCTGT CTATTGACCT TTTTTATAAT AAAGTATAAC 
ATGTT

Seq ID NO: 426 Protein sequence
Protein Accession #: NP_543146.1

1 11 21 31 41 51 

MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 
TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 120 
TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180
FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR

Seq ID NO: 427 DNA sequence 
Nucleic Acid Accession #: XM_069480.1
Coding sequence: 1..4383

1 11 21 31 41 51

ATGGACACTG TGCTGGTGCT GCTCCTGGGC CTGCAGGCCT TGGCCGGACC CAGTCCGAAG 60 
CCCCAGAAGG ACTCTGTCTC AGACTGGGCC ATTGTGTTGA TCACTCTCAC TTTGGTGGCA 120
GCAATTGTCA GCCTAATGTA CGGTATCAAG AAGGCCTGCC AGTTCCGGAG GGAGATGAGT 180
CTGGGGTGTG GCTGTGGCTC TGTGACCCCT TACAGCAGCC ACCATGAGGG GGAGGCTGCC 240 
AGCCAGCGCT ACTCTTGTCA AATGAAAGCT TCTTGGGGGG CAGGTGCTAC TACATTCCAA 300 
GAATATCAGA AAACTGGGGA ACTCTCAACA TCCGATCACA TATTTCCCCT CACTCCAGGC 360 
CTTGTTTATA GTATCCCTTT TGATCACATT GTTCTGCATT CAGGACAAAG ACCTCCAGAG 420
CTCCCTAAAT CTACAGAAAT CCATGAGCAA AAACGCCACT GCAACACCAC ACGCCATTCT 480 
AAGCCAACTG ACAAGCCTAC AGGCAACTCC AAAACTATAG ACCACAAAAG CTCTACAGAT 540 
AATCATGAGG CTCCTCCCAC TTCTGAAGAA AACTCCAGCA ACCAAGGGAA AGACCCAATG 600 
ATCCGGAACC AGCGCTCTGT TGATCCTGCT GACTCCACTA CCACACATAA AGAATCCGCT 660 
GGAAAAAAAC ATATAACGCC AGCACCCAAG AGCAAAATAA ACTGTCGTAA GTCCACAACA 720
GGCAAATCAA CGGTAACAAG AAAATCAGAT AAAACTGGAA GACCTTTGGA AAAGTCCATG 780 
AGTACTTTGG ATAAGACAAG TACCAGCTCA CATAAGACTA CAACTTCCTT CCACAACTCA 840 
GGCAATTCAC AGACCAAGCA AAAAAGCACA TCTTTTCCAG AAAAAATCAC AGCAGCCTCA 900 
AAAACAACAT ACAAGACCAC AGGAACCCCA GAAGAGTCAG AAAAAACTGA AGATTCCAGA 960 
ACAACAGTTG CCTCAGACAA GCTCCTGACA AAAACTACAA AAAACATACA AGAGACCATA 1020

TCAGCCAATG AGCTCACACA ATCTCTAGCA GAGCCTACAG AACATGGAGG AAGGACAGCC 1080 
AATGAGAACA ACACACCATC CCCAGCAGAG CCTACAGAAA ATAGAGAAAG GACAGCCAAT 1140 
GAGAACACCA CACTATCCCC AGCAGAGCCT ACAGAAAATA GAGAAAGGAC AGCCAATGAG 1200 
AACACCGCAC CATTCCCAGC AGGGCCTACA GAAAATAGAG AAATGACAGC CAATGAGAAT 1260 
ACCACACTAT TCCCAGCAGA GCCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAACACC 1320
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CAGCCAATGA GAACACTACA 1380 
CCATCCCCAG CAGAGCCTAC AGAACATGGA GAAAGGACCC CATTTGCCAA TGACAAAACC 1440 
ACATCATCCT CAGCAGAGTC TACAGAACAT GGAGAAAGGA CCCCACTGGC CAACGAGAAC 1500 
ACCACACCAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAACACC 1560 
ACACCATCCC CAGCAGGGCC TACAGAAAAC AGAGAAACGA CAGCCAACGA GAAGACCACA 1620

CTATCCCCAG TAGAGCCTAC AGAAAATAGA GAAACAACAG CCAATGAGAA GACCACACCA 1680 
TCCCCAGCAG AGCCTACAGA AAATGGACAA AGGACCCCAT TTGCCAATGA GAAAACCACA 1740 
TCATCCTCAG CAGAGCCTAC AGAACACGGA GAAAGGACCC CACTGGCCAA TGAGAACACC 1800 
ACACCATCCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA 1860
CCATCCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CTTTGGCCAA TGAGAAGACC 1920
ACGCCATCTC TAGCAGAGCC TACAGAAAAT GGACAAAGGA CCCCATTTGC CAATGAGAAG 1980 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGAAGAAA GGACTCCACT GGCCAATGAG 2040 
AACACCACAC CATCCCCGGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAC 2100 
ACCACACCAT CCCCAGCAGG GCCTACAGAA AATAGAGAAA TGACAGCCAA CGAGAAGACC 2160 
ACACTATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA 2220
TCATCCCCAG CAGAGCCTAC AGAAAATGGA CAAAGGACCC CATTTGCCAA TGAGAAAACC 2280 
ACATCATCCC CAGCAGAGCC TACAGAACAC GGAGAAAGGA CCCCACTGGC CAATGAGAAC 2340 
ACCACACTAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAAGACC 2400 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAACACCACA 2460 
CCATCCCCAG CACAGCCTAC AGAAAATGGA GACAGGACTC CATTGGCCAA TGAGAAGACC 2520
ACACCATCTC TAGCAGAGCC TACAGAAAAT GGAAAAAGGA CCCCATTTGC CAATGAGAAG 2580 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGCAGAAA GGACTCCACT GGCCAATGAG 2640 
AACACCACAT CATCCCCAGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAG 2700 
ACCACACAAT TCCCAGCAGA GCCTACAGAA AATAGAGAAA GCACAGCCAA TGAGAAGACC 2760 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAATGGA CAGCCAATGA GAACACCACA 2820
CTATCCCCAG CAGAGCCTAC AGAACATGAA GAAATGACCC CATTGGCCAA TGAGAAGACC 2880 
ACACTATCCC CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCATTTAC CAATGAGAAG 2940 
ACCACACCAT CCTCAGCAGA GCCTACAGAA CATGGAGAAA GGACCCCACT GGCCAATGAG 3000 
ATCACCACAC CATCCCGAGC AGAGCCTACA GAACATGGAG AAAGGATAGC CAATGAGAAG 3060 
GCCACACCAT CCCCAGCAAA GCCTACAGAA CATGGAGAAA CGACAGTCAA TGAGGACACC 3120
ACACCATCCT CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCACTGGC CAATGAGAAC 3180 
ACCACAACAT CCCCAACAGA GTCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAAGACC 3240 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CACCATCAGC CAATGAGAAG 3300 
ACCATACCAT CTCCAGCAAA GCCTACAGAA CACGAAGAAA TGACCCCATC GGCCAATGAG 3360 
AACACCACAC CATCCCCAGT AAAGCCTACA GAACATGGAG AAAAGACTAC ATTGGCCAAT 3420
GAGAAGATCA CACTATCCCC AGAAGGGCCT ACAGAACATG GAGCAAAAAC TACGTCGGCC 3480 
AATGAGAAGA TCACACCATC CCTAGCAAAG CCTACAGAAC ATGGAGAAAG GACCACATCA 3540 
CCCAATGACA AGATCACCTC ATCTGCAGCA GAGTCTACAG AACATAGAGA TAGGGCTACA 3600 
TCAGCCAATG TGATCACACC AGCCCCAGCA GAGCCTATAA AACATGCAAA AAGGACCACA 3660 
TTGGCCCATG AGAAGATGAC ACAAGTCACA GAAAAGTCCA CAGAACACCC AGAAAAGACC 3720
ACGTCAACCA CAGAGAAAAC CACAAGAACC CCAGAAAAGC CTACGCTATA CTCAGAGAAG 3780 
ACCATATGCA CCAAAGGGAA AAACACACCA GTCCCAGAAA AGCCTACAGA AAACCTGGGG 3840 
AACACCACAC TGACCACTGA GACCATAAAA GCCCCAGTAA AGTCCACAGA AAACCCAGAA 3900 
AAAACAGCAG CAGTCACAAA GACTATAAAA CCTTCAGTCA AGGTCACAGG AGACAAATCT 3960 
CTCACTACTA CCTCTTCTCA TCTAAATAAA ACTGAAGTTA CTCATCAGGT GCCCACTGGT 4020
TCTTTCACCC TCATTACATC TAGAACGAAG CTGAGTTCTA TCACATCAGA AGCCACAGGA 4080 
AACGAGAGCC ATCCATACCT CAATAAAGAT GGCTCACAGA AAGGTATCCA CGCTGGACAG 4140 
ATGGGAGAGA ATGATTCATT CCCTGCATGG GCCATAGTTA TTGTGGTCCT GGTGGCTGTG 4200 
ATTCTCCTCC TGGTGTTCCT TGGCCTGATC TTCTTGGTCT CCTATATGAT GCGGACACGC 4260 
CGCACACTAA CCCAGAACAC CCAGTACAAT GATGCAGAGG ATGAGGGTGG CCCCAATTCC 4320
TACCCGGTCT ACCTGATGGA GCAGCAGAAT CTTGGCATGG GCCAGATCCC TTCCCCACGG 4380 
TGA 






Seq ID NO: 428 Protein sequence 
Protein Accession #: XP_069480.1

1 11 21 31 41 51

MDTVLVLLLG LQALAGPSPK PQKDSVSDWA IVLITLTLVA AIVSLMYGIK KACQFRREMS
LGCGCGSVTP YSSHHEGEAA SQRYSCQMKA SWGAGATTFQ EYQKTGELST SDHIFPLTPG
LVYSIPFDHI VLHSGQRPPE LPKSTEIHEQ KRHCNTTRHS KPTDKPTGNS KTIDHKSSTD
NHEAPPTSEE NSSNQGKDPM IRNQRSVDPA DSTTTHKESA GKKHITPAPK SKINCRKSTT
GKSTVTRKSD KTGRPLEKSM STLDKTSTSS HKTTTSFHNS GNSQTKQKST SFPEKITAAS
KTTYKTTGTP EESEKTEDSR TTVASDKLLT KTTKNIQETI SANELTQSLA EPTEHGGRTA
NENNTPSPAE PTENRERTAN ENTTLSPAEP TENRERTANE NTAPFPAGPT ENREMTANEN
TTLFPAEPTE HGERTANEKT TPSPAEPTEH GERTANENTT PSPAEPTEHG ERTPFANDKT
TSSSAESTEH GERTPLANEN TTPSPAEPTE NRERTANENT TPSPAGPTEN RETTANEKTT
LSPVEPTENR ETTANEKTTP SPAEPTENGQ RTPFANEKTT SSSAEPTEHG ERTPLANENT
TPSPAEPTEN RERTANEKTT PSPAEPTENG DRTPLANEKT TPSLAEPTEN GQRTPFANEK
TTSSSAEPTE HEERTPLANE NTTPSPAEPT ENRERTANEN TTPSPAGPTE NREMTANEKT
TLFPAEPTEN RERTANEKTT SSPAEPTENG QRTPFANEKT TSSPAEPTEH GERTPLANEN
TTLSPAEPTE NRERTANEKT TPFPAEPTEN RERTANENTT PSPAQPTENG DRTPLANEKT
TPSLAEPTEN GKRTPFANEK TTSSSAEPTE HAERTPLANE NTTSSPAEPT ENRERTANEK
TTQFPAEPTE NRESTANEKT TPFPAEPTEN REWTANENTT LSPAEPTEHE EMTPLANEKT
TLSPAEPTEN GERTPFTNEK TTPSSAEPTE HGERTPLANE ITTPSRAEPT EHGERIANEK
ATPSPAKPTE HGETTVNEDT TPSSAEPTEN GERTPLANEN TTTSPTESTE HGERTANEKT
TPSPAEPTEH GERTPSANEK TIPSPAKPTE HEEMTPSANE NTTPSPVKPT EHGEKTTLAN
EKITLSPEGP TEHGAKTTSA NEKITPSLAK PTEHGERTTS PNDKITSSAA ESTEHRDRAT
SANVITPAPA EPIKHAKRTT LAHEKMTQVT EKSTEHPEKT TSTTEKTTRT PEKPTLYSEK
TICTKGKNTP VPEKPTENLG NTTLTTETIK APVKSTENPE KTAAVTKTIK PSVKVTGDKS
LTTTSSHLNK TEVTHQVPTG SFTLITSRTK LSSITSEATG NESHPYLNKD GSQKGIHAGQ
MGENDSFPAW AIVIVVLVAV ILLLVFLGLI FLVSYMMRTR RTLTQNTQYN DAEDEGGPNS
YPVYLMEQQN LGMGQIPSPR

Seq ID NO: 429 DNA sequence

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..10674

1 11 21 31 41 51

ATGTGGCCTC GCCTGGCCTT TTGTTGCTGG GGTCTGGCGC TCGTTTCGGG CTGGGCGACC 
TTTCAGCAGA TGTCCCCGTC GCGCAATTTC AGCTTCCGCC TCTTCCCCGA GACCGCGCCC 
GGGGCCCCCG GGAGTATCCC CGCGCCGCCC GCTCCTGGCG ACGAAGCGGC GGGGAGCAGA 
GTGGAGCGGC TGGGCCAGGC GTTCCGGCGA CGCGTGCGGC TGCTGCGGGA GCTCAGCGAG 
CGCCTGGAGC TTGTCTTCCT GGTGGATGAT TCGTCCAGCG TGGGCGAAGT CAACTTCCGC 
AGCGAGCTCA TGTTCGTCCG CAAGCTGCTG TCCGACTTCC CCGTGGTGCC CACGGCCACG 
CGCGTGGCCA TCGTGACCTT CTCGTCCAAG AACTACGTGG TGCCGGGCGT CGATTACATC 
TCCACCCGCC GCGCGCGCCA GCACAAGTGC GCGCTGCTCC TCCAAGAGAT CCCTGCCATC 
TCCTACCGAG GTGGCGGCAC CTACACCAAG GGCGCCTTCC AGCAAGCCGC GCAAATTCTT 
CTTCATGCTA GAGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 
GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGCGAGATT CAGGAGTGGA GATCTTCACT 
TTTGGCATAT GGCAAGGGAA CATTCGAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG
GAGCACTGTT ACCTGCTACA CAGTTTTGAA GAATTTGAGG CTTTAGCTCG CCGGGCATTG
CATGAAGATC TACCTTCTGG GAGTTTTATT CAAGATGATA TGGTCCACTG CTCATATCTT 
TGTGATGAAG GCAAGGACTG CTGTGACCGA ATGGGAAGCT GCAAATGTGG GACACACACA 
GGCCATTTTG AGTGCATCTG TGAAAAGGGG TATTACGGGA AAGGTCTGCA GTATGAATGC 
ACAGCTTGCC CATCGGGGAC ATACAAACCT GAAGGCTCAC CAGGAGGAAT CAGCAGTTGC 
ATTCCATGTC CTGATGAAAA TCACACCTCT CCACCTGGAA GCACATCCCC TGAAGACTGT 
GTCTGCAGAG AGGGATACAG GGCATCTGGC CAGACCTGTG AACTTGTCCA CTGCCCTGCC 
CTGAAGCCTC CCGAAAATGG TTACTTTATC CAAAACACTT GCAACAACCA CTTCAATGCA 
GCCTGTGGGG TCCGATGTCA CCCTGGATTT GATCTTGTGG GAAGCAGCAT CATCTTATGT 
CTACCCAATG GTTTGTGGTC CGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT
CTCCGCCAGC CGAAACATGG CCACATCAGC TGTTCTACAA GGGAAATGTT ATATAAGACA 
ACATGTTTGG TTGCCTGTGA TGAAGGGTAC AGACTAGAAG GCAGTGATAA GCTTACTTGT 
CAAGGAAACA GCCAGTGGGA TGGGCCAGAA CCCCGGTGTG TGGAGCGCCA CTGTTCCACC 
TTTCAGATGC CCAAAGATGT CATCATATCC CCCCACAACT GTGGCAAGCA GCCAGCCAAA 
TTTGGGACGA TCTGCTATGT AAGTTGCCGC CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 
ATGCTGAGAT GTACCACTTC TGGAAAATGG AATGTCGGAG TTCAGGCAGC TGTGTGTAAA 
GACGTGGAGG CTCCTCAAAT CAACTGTCCT AAGGACATAG AGGCTAAGAC TCTGGAACAG 
CAAGATTCTG CCAATGTTAC CTGGCAGATT CCAACAGCTA AAGACAACTC TGGTGAAAAG 
GTGTCAGTCC ACGTTCATCC AGCTTTCACC CCACCTTACC TTTTCCCAAT TGGAGATGTT 
GCTATCGTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT TTTCCATATC 
AAGGTTATTG ATGCAGAACC ACCTGTCATA GACTGGTGCA GATCTCCACC TCCCGTCCAG 
GTCTCGGAGA AGGTACATGC CGCAAGCTGG GATGAGCCTC AGTTCTCAGA CAACTCAGGG 
GCTGAATTGG TCATTACCAG AAGTCATACA CAAGGAGACC TTTTCCCTCA AGGGGAGACT 
ATAGTACAGT ATACAGCCAC TGACCCCTCA GGCAATAACA GGACATGTGA TATCCATATT 
GTCATAAAAG GTTCTCCCTG TGAAATTCCA TTCACACCTG TAAATGGGGA TTTTATATGC 
ACTCCAGATA ATACTGGAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 
GAAGGGTCTA CTGACAAGTA TTATTGTGCT TATGAAGATG GCGTCTGGAA ACCAACATAT 
ACCACTGAAT GGCCAGACTG TGCCAAAAAA CGTTTTGCAA ACCACGGGTT CAAGTCCTTT 
GAGATGTTCT ACAAAGCAGC TGGTTGTGAT GACACAGATC TGATGAAGAA GTTTTCTGAA 
GCATTTGAGA CGACCCTGGG AAAAATGGTC CCATCATTTT GTAGTGATGC AGAGGACATT 
GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 
GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 
TACGATGACT TCCTGGACAC TGTGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 
TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 
ACAGCTAGTG TGCCATTACC CGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA
CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC
CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 
ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGCG TATGTGTGTC 
AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCCGGATC 
GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 
ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 
TACTCATACA GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA
TTTGGTTCCC GGAGCTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 
GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 
ATGCCCTGTC ACCCATGTCC TCGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 
CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT CCAGATCCAT CACAGAATGT 
TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 
TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 
CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 
GAAACAGACA TCGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 
CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA
AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 
GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA AACAGAAGTC 
AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 
TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCCGAT GTGGAAAGAA CGTCGATGAG 
TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG ACGGTGCCAA TAGCTTCAGA 
TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 
TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 
TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTACAGG CTTTAACCTG 
GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 
CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 
ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 
TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 
AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 
GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 
GGTGCGTTAG TTCTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 
TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 
GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 
CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 
TCTGATTGCC CACGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 
AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 
GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTCCTCACTG TGAACGCATT 
AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCCGATGA CTTCTATGCT 
GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 
TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG 
TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 
ATATGTTCAT GTGTCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 
TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 
GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 
TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 
AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 
GTGACATATA GGTGTAATAA AGGATATACT CTGGCCGGTG ATAAAGAATC ATCCTGTCTT 
GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 
GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 
TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CACGGCTTCT 
GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 
ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 
TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC CGACGGCAAG 
TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 
CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 
GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 
CCCCCAGAAG GTCAAGACAT GCCCCGTTGT ATAGCTCATT TCTGTGAAAA ACCTCCATCG 
GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 
AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 
GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCG GTGTGGAGAG 
CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 
GCTTACAGCT GCAACAAGGG GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC
ACAGGGCAGT GGAGTAGTCC TATACCGACG TGCCACCCGG TATCTTGTGG TGAACCACCT
AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 
TATCAGTGTA ACCCGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG CCAAGCCAAT 
CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TCGACTGTGG AAAACCTCCC 
CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 
TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 
AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCCGCCCCTC 
TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 
TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCCCAGCAA 
TGGAATGACT CTTTCCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTCC CCTAATTTCC 
TTTGGTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 
GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 
AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCCCC AACCTGAGGA AATCCCCAAT 
GGAATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 
GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 
GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 
TTCTCTTACA CGGACCTACA CTATGGACAG ACCGTTACCT ACTCTTGCAA CCGAGGCTTT 
CGGCTCGAAG GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 
CCATCTTGCA ATGCCATCCA CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 
GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 
GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 
ATGCCAATAG ACTGTGGCCT CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA
GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 7920
CACCCTCCTT ATCATTTGGG AGCAGTGGCT AAAACCTGGG AAAATACAAA GGAGTCTCCT 7980 
GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 8040 
TATGAACTTC TGGGGAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 8100 
GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 8160
CGTTTTACAG AGACTAGCAT GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 8220 
CTAGCAGGCT CTGACTTAAG GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 8280 
CGCTGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 8340 
AGCAACTACA CATACCTGAG CACGTTGTAC TATGAGTGTG ACCCCGGATA TGTGCTGAAT 8400 
GGCACTGAGA GGAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 8460
ATTCCTGTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 8520 
TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 8580 
AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 8640 
GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 8700 
TTCATGAAGG AAGTAACATT CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 8760
CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 8820
TGTGGACCTC CTGAAGATCT TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 8880 
GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 8940 
TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 9000 
ACACCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 9060
CGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 9120 
GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 9180 
ATGATACCAA ATGCGTTCAT CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 9240 
AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 9300 
GTATGGAGCC AGCCTTATCC AGTCTGTGAG CCCTTGTCCT GTGGGTCCCC ACCGTCTGTC 9360
GCCAATGCAG TGGCAACTGG AGAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 9420
CTGGAAGGTT ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 9480 
TGGTTCCCTG AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 9540 
CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 9600 
GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 9660
CCACCATTCT CCGATGAATC TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 9720 
CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 9780 
CCTGGCTATG AACTAGAGGG GAACAGGGAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840 
GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 9900 
AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 9960
TACAGTCTTG AAGGGCCATC TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 10020
GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATGCTCTG 10080 
CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 10140 
CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCCGACG AGACGTGGAC ACAGACAAGC 10200 
GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 10260
GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 10320 
GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 10380 
AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 10440 
TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 10500 
CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 10560
TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 10620 
CCAAACCGAT GTCACTGTCT TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 

Seq ID NO: 430 Protein sequence
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

| | | | | |
MWPRLAFCCW GLALVSGWAT FQQMSPSRNF SFRLFPETAP GAPGSIPAPP APGDEAAGSR 

VERLGQAFRR RVRUjRELSE RLELVFLVDD SSSVGEVNFR SELMFVRKLL SDFPWPTAT
RVAIVTPSSK NYWPRVDYI STRRARQHKC ALLLQEIPAI SYRGGGTYTK GAFQQAAQIL 
LHARENSTKV VFLITDGYSN GGDPRPIAAS LRDSGVEIFT FGIWQGNIRE UIDMASTPKE 
EHCYLLHSFE EFEALARRAI* HEDLPSGSFI QDDMVHCSYL CDEGKDCCDR KGSCKCGTHT 
GHFECICEKG YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 
VCREGYRASG QTCELVHCPA LKPPENGYFI QNTCNNHFNA ACGVRCHPGF DLVGSSIILC
LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREMLYKT TCLVACDEGY RLEGSDKLTC 
QGNSQWDGPE PRCVERHCST FQMPKDVIIS PHNCGKQPAK FGTICYVSCR QGFILSGVKE 
MLRCTTSGKW NVGVQAAVCK DVEAPQINCP KDIEAKTLEQ QDSANVTWQI PTAKDNSGEK 
VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDAEPPVI DWCRSPPPVQ 
VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDLFPQGET IVQYTATDPS GNNRTCDIHI
VIKGSPCEIP FTPVNGDFIC TPDNTGVNCT LTCLEGYDFT EGSTDKYYCA YEDGVWKPTY 
TTEWPDCAKK RFANHGFKSP EMFYKAARCD DTDLMKKPSE AFETTLGKMV PSFCSDAEDI 
DCRLEENLTK KYCLEYNYDY EMGFAIGPGG WGAANRLDYS YDDFLDTVQE TATSIGNAKS 
SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTLEWENQQ RLLQTLETIT NKLKRTLNKD 
PMYSFQLASE ILIADSNSLE TKKASPPCRP GSVLRGRMCV NCPLGTYYNI EHFTCESCRI
GSYQDEEGQL ECKLCPSGMY TEYIHSRNIS DCKAQCKQGT YSYSGLETCE SCPLGTYQPK 
FGSRSCLSCP ENTSTVKRGA VNISACGVPC PEGKFSRSGL MPCHPCPRDY YQPNAGKAFC 
LACPFYGTTP FAGSRSITEC STSVLNITIF GGFGHLELLN CPSEVFHECF FNPCHNSGTC 
QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCLNNGVCKD LVGEFICECP SGYTGQRCEE 
NINECSSSPC LNKGICVDGV AGYRCTCVKG FVGLHCETEV NECQSNPCLN NAVCEDQVGG
FLCKCPPGFL GTRCGKNVDE CLSQPCKNGA TCKDGANSFR CLCAAGFTGS HCELNINECQ 
SNPCRNQATC VDEUJSYSCK CQPGFSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSIi 
HALTCTFWMK SSDDMNYGTP ISYAVDNGSD NTLLLTDYNG WVLYVNGREK ITNCPSVNDG 
RWKHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KKGEGFSPAE 
SFVGSISQLN LWDYVLSPQQ VKSIATSCPE ELSKGNVLAW PDFLSGXVGK VKIDSKSIFC
SDCPRLGGSV PHLRTASEDL KPGSKVNLFC DPGFQLVGNP VQYCLNQGQW TQPLPHCERI 
SCGVPPPLEN GPHSADDPYA GSTVTYQCNN GYYLLGDSRM FCTDNGSWNG VSPSCLDVDE 
CAVGSDCSEH ASCLNVDGSY ICSCVPPYTG DGKNCAEPIK CKAPGNPENG HSSGEIYTVG 
AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSCG KPAIPENGCI EELAPTFGSK 
VTYRCNKGYT LAGDKESSCL ANSSWSHSPP VCEPVKCSSP ENINNGKYIL SGLTYLSTAS
YSCDTGYSLQ GPSIIECTAS GIWDRAPPAC HLVFCGEPPA IKDAVITGNN FTFRNTVTYT
CKEGYTLAGL DTIECLADGK WSRSDQQCLA VSCDEPPIVD HASPETAHRL FGDIAFYYCS
DGYSLADNSQ LLCNAQGKWV PPEGQDMPRC IAHFCEKPPS VSYSILESVS KAKFAAGSW
SFKCMEGFVL NTSAKIECMR GGQWNPSPMS IQCIPVRCGE PPSIMNGYAS GSNYSFGAMV
AYSCNKGFYI KGEKKSTCEA TGQWSSPIPT CHPVSCGEPP KVENGFLEHT TGRIFESEVR
YQCNPGYKSV GSPVFVCQAN RHWESESPLM CVPLDOGKPP PIQNGFMKGE NFEVGSKVQF
FCNEGYELVG DSSWTCQKSG KWNKKSNPKC MPAKCPEPPL LENQLVLKEL TTEVGVVTFS
CKEGHVLQGP SVLKCLPSQQ WNDSFPVCKI VLCTPPPLIS FGVPIPSSAL HFGSTVKYSC
VGGFFLRGNS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN GIIDVQGLAY LSTALYTCKP
GFELVGNTTT LCGENGHWLG GKPTCKAIEC LKPKEILNGK FSYTDLHYGQ TVTYSCNRGF
RLEGPSALTC LETGDWDVDA PSCNAIHCDS PQPIENGFVE GADYSYGAII IYSCFPGFQV
AGHAMQTCEE SGWSSSIPTC MPIDCGLPPH IDFGDCTKLK DDQGYFEQED DMMEVPYVTP
HPPYHLGAVA KTWENTKESP ATHSSNFLYG TMVSYTCNPG YELLGNPVLI CQEDGTWNGS
APSCISIECD LPTAPENGFL RFTETSMGSA VQYSCKPGHI LAGSDLRLCL ENRKWSGASP
RCEAISCKKP NPVMNGSIKG SNYTYLSTLY YECDPGYVLN GTERRTCQDD KNWDEDEPIC
IPVDCSSPPV SANGQVRGDE YTFQKEIEYT CNEGFLLEGA RSRVCLANGS WSGATPDCVP
VRCATPPQLA NGVTEGLDYG FMKEVTFHCH EGYILHGAPK LTCQSDGNWD AEIPLCKPVN
CGPPEDLAHG FPNGFSFIHG GHIQYQCFPG YKLHGNSSRR CLSNGSWSGS SPSCLPCRCS
TPVIEYGTVN GTDFDCGKAA RIQCFKGFKL LGLSEITCEA DGQWSSGFPH CEHTSCGSLP
MIPNAFISET SSWKENVITY SCRSGYVIQG SSDLICTEKG VWSQPYPVCE PLSCGSPPSV
ANAVATGEAH TYESEVKLRC LEGYTMDTDT DTFTCQKDGR WFPERISCSP KKCPLPENIT
HILVHGDDFS VNRQVSVSCA EGYTFEGVNI SVCQLDGTWE PPFSDESCSP VSCGKPESPE
HGFVVGSKYT FESTIIYQCE PGYELEGNRE RVCQENRQWS GGVAICKETR CETPLEFLNG
KADIENRTTG PNVVYSCNRG YSLEGPSEAH CTENGTWSHP VPLCKPNPCP VPFVIPENAL
LSEKEFYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR
GVHYQYGDMI TYSCYSGYML EGFLRSVCLE NGTWTSPPIC RAVCRFPCQN GGICQRPNAC
SCPEGWMGRL CEEPICILPC LNGGRCVAPY QCDCPPGWTG SRCHTAVCQS PCLNGGKCVR
PNRCHCLSSW TGHNCSR

Seq ID NO: 431 DNA sequence
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..390

1 11 21 31 41 51

| | | | | |

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60
TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120
GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180
CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240
TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300
TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360
AGTAAATGTG AAAGAGGCCG GATATGTTAG

Seq ID NO: 432 Protein sequence
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FVVKLKVQGV NSQCHSSPIS
SKCERGRIC

Seq ID NO: 433 DNA sequence

Nucleic Acid Accession #: NM_007231.1

Coding sequence: 89.. 2017

1 11 21 31 41 51

TAGGAACAGG GGAGAGTGCA CCTGCTACCA GTCAAGCTCA GCCAGACTGC AAGAGGAGGC 
GAGGCGGAGC CAGCCGAGGG AGTGAACCAT GGACAAGTTG AAATGCCOGA GTTTCTTCAA 
GTGCAGGGAG AAGGAGAAAG TGTCGGCTTC ATCAGAGAAT TTCCATGTTG GTGAAAATGA 
TGAGAATCAG GACCGTGGTA ACTGGTCCAA AAAATCGGAT TATCTTCTAT CTATGATTGG 
ATACGCAGTG GGATTAGGAA ATGTGTGGAG ATTTCCATAT CTGACCTACA GCAATGGTGG 
AGGCGCCTTC TTGATACCTT ATGCAATTAT GTTAGCATTG GCTGGTTTAC CTTTGTTCTT 
TCTGGAGTGT TCACTGGGAC AATTTGCTAG CTTAGGTCCA GTTTCAGTTT GGAGGATTCT 
TCCATTGTTT CAAGGTGTGG GAATTACAAT GGTCCTGATC TCCATTTTTG TGACAATCTA
TTACAATGTC ATAATTGCCT ATAGTCTTTA CTACATGTTT GCTTCTTTTC AAAGTGAACT 
ACCATGGAAA AATTGTTCTT CGTGGTCAGA TAAAAACTGT AGCAGATCAC CAATAGTAAC 
TCACTGTAAT GTGAGTACAG TGAATAAAGG AATACAAGAG ATCATCCAAA TGAATAAAAG 
CTGGGTAGAC ATCAACAATT TTACCTGCAT CAACGGCAGT GAAATTTATC AGCCAGGGCA 
GCTTCCCAGT GAACAATATT GGAATAAAGT GGCGCTCCAA CGGTCAAGTG GAATGAATGA 
GACTGGAGTA ATTGTTTGGT ATTTAGCACT TTGTCTTCTT CTGGCTTGGC TCATAGTTGG 
AGCAGCACTA TTTAAAGGAA TCAAATCGTC TGGCAAGGTG GTATATTTTA CAGCTCTTTT 
CCCCTATGTG GTCCTACTCA TCCTGTTAGT ACGAGGTGCA ACTCTGGAGG GTGCTTCAAA 
AGGCATTTCA TACTATATTG GAGCCCAGTC AAATTTTACA AAACTTAAGG AAGCTGAGGT
ATGGAAAGAT GCTGCCACTC AGATATTTTA CTCCCTTTCA GTGGCTTGGG GTGGCTTAGT 
TGCTCTATCA TCTTACAATA AGTTCAAAAA CAACTGCTTC TCTGATGCCA TTGTGGTTTG 
TTTGACAAAC TGTCTCACTA GCGTGTTTGC TGGATTTGCT ATTTTTTCTA TATTGGGACA 
CATGGCCCAT ATATCTGGAA AGGAAGTTTC TCAAGTTGTA AAATCAGGTT TTGATTTGGC 
ATTCATTGCC TATCCAGAGG CTCTAGCCCA ACTCCCAGGT GGTCCATTTT GGTCCATATT 
ATTTTTTTTC ATGCTTTTAA CTTTGGGTCT CGATTCTCAG TTTGCTTCGA TTGAAACGAT 
CACAACAACA ATTCAAGATT TATTTCCCAA AGTGATGAAG AAAATGAGGG TTCCCATAAC 
TTTGGGCTGC TGCTTGGTTT TGTTTCTCCT TGGTCTCGTC TGTGTGACTC AGGCTGGAAT 
TTACTGGGTT CATCTGATTG ACCACTTCTG TGCTGGATGG GGCATTTTAA TTGCAGCTAT 1560
ACTGGAGCTA GTTGGAATCA TCTGGATTTA TGGAGGGAAC AGATTCATTG AGGATACAGA 1620
AATGATGATT GGAGCAAAGA GGTGGATATT CTGGCTATGG TGGAGAGCTT GCTGGTTTGT 1680
AATTACGCCT ATCCTTTTGA TTGCAATATT TATCTGGTCA TTGGTGCAAT TTCATAGACC 1740
TAATTATGGC GCAATTCCAT ACCCTGACTG GGGAGTTGCT TTAGGCTGGT GTATGATTGT 1800
TTTCTGCATT ATTTGGATAC CAATTATGGC TATCATAAAA ATAATTCAGG CTAAAGGAAA 1860
CATCTTTCAA CGCCTTATAA GTTGCTGCAG ACCAGCTTCT AACTGGGGTC CATACCTGGA 1920
ACAACATCGT GGGGAAAGAT ATAAAGACAT GGTAGATCCT AAAAAAGAGG CTGACCATGA 1980
AATACCTACT GTTAGTGGCA GCAGAAAACC GGAATGAGAT CTCATTGAAA AAAATATATG 2040
ATTGTATAAT GTGATTTTTT TTAGAATAGG GGGAACCTTA TTTATTTGTG TGTTAACTGA 2100
ATAGGAAAAT GTACATACTA TGTTCATGAT AGTGTGATTT TTTTCACATT TAAGCAGGAA 2160
TGCAATATAA AAATGTGAAT CTCTTAATTC TCAGCCATGT GCTTATTATA TTTCTTTTTA 2220
GATTGTCTAT CTGTATAACA CACACACACA CACCTAAGAG TCTCTATTTC ACAATTATAT 2280
TTTTGTAAAT AGTATATGCA TTTTTAATAC ATTGGAGGCT TTATTTTGAA CTAATTTCTT 2340
AGAGAATAGT TATATTTTCT ATTACACAAG TTTAAAAATA TTATTAACTT GTATTTTCTT 2400
AATATACAAT CTATCTTTTC CACAAATATC AGTGGGAAAT AAATCAGCAC ATTTGAAAGA 2460
AAGTGTTAAA ACTGAAGGCC TCACTTAATT AGAAACGTGA TAAATATATG GACAAATGGA 2520
CTATACATAC TATAAGAGGA CTGTAGTTTA ATACTTTTTA CCCAAATATG TTTAAAAACA 2580
TCGTGCATTT GTTACAGCTC ATGTTTTCTA TATGAACTTA GTCATTAATG TTCTTTATAA 2640
AAAGTGAAAT AAGATGGAAA AATTAGGATC CTACAGCCAG TACGTGATAA ATCTAGAAAA 2700
TTGAGTTTTG AGTACCTCTT TTCCCATATA CAATCTTCCT TCCTTAGGTA ATTTGGAAGA 2760
AAACTATGAC CCATTTAATT TCTATTGTGT TTCACCAAAT TCAGTGTTGT TCATTATACC 2820
TCTCTGAAAT ATAGGTTTAA TTTCAAATAG AATATGGACT TAAATGTTAA TGAGAAACTG 2880
GCTTTAATCA ATTCTAGCAT TTTATTACTG TAATACAGGG CTGATAGAGT GATTTTGTCT 2940
TATATGAGTC AGTTACTACT TACAGGTGAT AACTTGCATA CTATTGGAAG ATAAAGTTGT 3000
CAAACTTGTC AAGAATGAGA AAAGCCAAAT TAGAAAATCC TATGTCCTAG TTTCCTTACC 3060
AAGGATAATT AAATATATCA CTAAGAGCTT TATATATTGA TTATATATTG TTGACAACTG 3120
GTTTAAGCAT CATAGCCTAT GATGATAAAC ACTGCCTATA TATGTAAATA GCTTTTCATC 3180
AATTCTTAAA TTTCTTAACC TAGGCTTCAG GGAGCATATG AAACCAAAAT TATATGGAAC 3240
ATTTTCTGTG TGTACATGTA CATGCATTTT TCTAGGGAGA GAGTCCGTAG GTTTATCAGA 3300
ATATCAAGGA AAACTGTGAC CCAAAGAAGT TTAAGAATCA CATACAGTGC TGCTGGCTTT 3360
TTGTGCTTGG CAAATGAGTG ACAATAGAAG AAATAATTTT TCTTACACAT TTTAAAACGT 3420
TTTCTCTTCC TTGTGATTGA AGATGAAAGG AGTAAGAAAT TAAGGCATTT GTTTAATTTA 3480
TACTGGTAAC TTATTTAGGG GGGAGGGGAC ATGAAGGTAG GTAAATAGGT AGGCCTCTAA 3540
TTGAACCACC TCTCTAAGTT ATGTACGTAT ATATAAGCTG AAATTGTGTT TGACATTCTG 3600
AGGGTTTTCT TTTTCTTTTT CCTTTTTTTT TTTTTTTGGT GGGGGGCTGG GGGTCAGAGT 3660
CTTGTTCTGT TGCCTGGGCT GGAGTGCAGT GGCATGATCT CAGCTCACTG CAACCTCTGC 3720
CTTCTGGATT CAAGTGATTC TCCTGCCTCA GCCTCTTGAG TAGCTGGGAC TACAGGTGCC 3780
CGCCACCACA CCAGCTAATT TTTGTATTTT TAGTAGAGGC GAAGTTTCCC CATGTTGGCC 3840
AGGCTGGTCT TGAACTCCCG ACCTCAAGTG ATCTGTCTAC CTCGGCCTCC TAAAGTGCTG 3900
AGATTACAGG TGTGAGCCAC CGTGCCCGGC CCATTCTAAG GGTTTTCTTT GAAGACAGGT 3960
CAAATGCTGT TAGTAAGTTT CAGGAGATTG TTAATTCCTC AGTTATACCA GATTTTATAA 4020
AATATTTGAG AATAGATGGC TAACAAGAGG TTAGAAATAC TTTTCCTTAA TTTTAATCCA 4080
CAGTATGTTA CATGCATTCT ACCACTACAT TTTGGTGCTA TTTAAGGTGT GCAATTTTCT 4140
ATAGGTGACT TTTGCAATTC AGGGAAGATT TGGGCATATT AAATGAAAGA ATATCTAATT 4200
GGGGGAGGTG TGAAGGGAAA GAAATTCTTT TCAAAAGCTG ACCACAAAGA GTAGTTAAAA 4260
GTTTTTGTCA CTATCTTCAC AAGTGTGTAA AGCACAGATT TCAACAGAGT GCTTGGCATA 4320
TTGTAGGGTG CTCAATGGTG GTTTTTATTA TTATTACTCA GATTCCACAG TGGCAAGAAA 4380
CATCATTCTA CATAATGGAA AACATTTACA TCAAATCCCA CTTACTTTAA TGCGAACTTG 4440
GAGATAATTT ATGGTATTGT ATTGTAAACC ATTAATGAAA ACTTTTTCAC AGTTGAGTGA 4500
AATTAAAATC ACTATATCTC

Seq ID NO: 434 Protein sequence
Protein Accession #: NP_009162.1

1 11 21 31 41 51

| | | | | |

MDKLKCPSFF KCREKEKVSA SSENFHVGEN DENQDRGNWS KKSDYLLSMI GYAVGLGNVW 60
RFPYLTYSNG GGAFLIPYAI MLALAGLPLF FLECSLGQFA SLGPVSVWRI LPLFQGVGIT 120
MVLISIFVTI YYNVIIAYSL YYMFASFQSE LPWKNCSSWS DKNCSRSPIV THCNVSTVNK 180
GIQEIIQMNK SWVDINNFTC INGSEIYQPG QLPSEQYWNK VALQRSSGMN ETGVIVWYLA 240
LCLLLAWLIV GAALFKGIKS SGKVVYFTAL FPYVVLLILL VRGATLEGAS KGISYYIGAQ 300
SNFTKLKEAE VWKDAATQIF YSLSVAWGGL VALSSYNKFK NNCFSDAIVV CLTNCLTSVF 360
AGFAIFSILG HMAHISGKEV SQVVKSGFDL AFIAYPEALA QLPGGPFWSI LFFFMLLTLG 420
LDSQFASIET ITTTIQDLFP KVMKKMRVPI TLGCCLVLFL LGLVCVTQAG IYWVHLIDHF 480
CAGWGILIAA ILELVGIIWI YGGNRFIEDT EMMIGAKRWI FWLWWRACWF VITPILLIAI 540
FIWSLVQFHR PNYGAIPYPD WGVALGWCMI VFCIIWIPIM AIIKIIQAKG NIFQRLISCC 600
RPASNWGPYL EQHRGERYKD MVDPKKEADH EIPTVSGSRK PE



Seq ID NO: 435 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 51..1085

1 11 21 31 41 51

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60
CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120
TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180
ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240
GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300
TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360
ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420
TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480
TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

WO 03/042661

PCT/US02/36810

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600
GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660
GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960
CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020
TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140
GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200
CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260
ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320
GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380
AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440
TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560
AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620
AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680
TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740
GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800
GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860
CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920
CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980
CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040
GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100
ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160
TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220
GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280
ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340
TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400
TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460
CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT



Seq ID NO: 436 Protein sequence
Protein Accession #: AAA59907.1

1 11 21 31 41 51

| | | | | |

MGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 60
NRIGYSWYKG ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120
TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV 180
NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 240
TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQELFIPNIT VNNSGSYMCQ 300
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI



Seq ID NO: 437 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 1355.. 1657

1 11 21 31 41 51

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 






CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 



Seq ID NO: 438 Protein sequence
Protein Accession #: AAA59908.1

1 11 21 31 41 51

| | | | | |

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SFAWQDDAVI SISQEVASEG NLTECQIYLV
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDF



Seq ID NO: 439 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 2370.. 2501 

1 11 21 31 41 51 

I I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 



Seq ID NO: 440 Protein sequence 
Protein Accession #: AAA59909.1 



1 11 21 31 41 51 

I I I I I I 

MLTNVFISVV LFPCSNLTKP TVLVLYCPGG AITVLVEWCC FNS



Seq ID NO: 441 DNA sequence

Nucleic Acid Accession #: NM_002381.2

Coding sequence: 64.. 1524

1 11 21 31 41 51

| | | | | |

AAATCCGAGC CTCGCGTGGG CTCCTGGCCC CCGACGGACA CCACCAGGCC CACGGAGCCC 60 

ACCATGCCGC GCCCGGCCCC CGCGCGCCGC CTCCCGGGAC TCCTCCTGCT GCTCTGGCCG 120 
CTGCTGCTGC TGCCCTCCGC CGCCCCCGAC CCCGTGGCCC GCCCGGGCTT CCGGAGGCTG 180 

GAGACCCGAG GTCCCGGGGG CAGCCCTGGA CGCCGCCCCT CTCCTGCGGC TCCCGACGGC 240 
GCGCCCGCTT CCGGGACCAG CGAGCCTGGC CGCGCCCGCG GTGCAGGTGT TTGCAAGAGC 300 
AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GCGTACGGCC CCTGGAATTC 360 
ACCAAAGTGA AAACTTTTGT CTCCCGGATA ATCGACACTC TGGACATTGG GCCAGCCGAC 420 
ACGOGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 480 
TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 540
ACCATGTCAG GCCTAGCCAT CCAGACAGCA ATGGACGAAG CCTTCACAGT GGAGGCAGGG 600 
GCTCGAGAGC CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 
CAGGACCAGG TGAATGAAGT GGCGGCTCGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 
GTGGGCGTGG ACCGGGCAGA CATGGCGTCC CTCAAGATGA TGGCCAGTGA GCCCCTAGAG 780 
GAGCATGTTT TCTACGTGGA GACCTATGGG GTCATTGAGA AACTTTCCTC TAGATTCCAG 840 
GAAACCTTCT GTGCGCTGGA CCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCACGTCTGC 900 
ATCAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAGCC AAGGATACAC CTTGAATGCC 960 

GACAAGAAAA CGTGTTCAGC TCTTGATAGG TGTGCTCTTA ACACCCACGG ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAGAAG TGGCTCTTAT CATTGTGAGT GCTATGAAGG TTATACCTTG 1080 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CCATGGGTGT 1140 

CAGCACATTT GTGTGAATGA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATG CAGATAAAAA AACATGTTCA GTCCGTGACA AGTGTGCCCT AGGCTCTCAT 1260

GGTTGCCAGC ACATTTGTGT GAGTGATGGG GCCGCATCCT ACCACTGTGA TTGCTATCCT 1320 

GGCTACACCT TAAATGAGGA CAAGAAAACA TGTTCAGCCA CTGAGGAAGC ACGAAGACTT 1380 

GTTTCCACTG AAGATGCTTG TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT GATGACATTT TGGAGAAGTT GAAAATAAAT 1500

GAATATGGAC AAATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560

TGGTGTACTT AATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1620 

TAATTTGCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCAGAATC AGCATGATTT TTCCAAGGAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

GTGTTTCTTT ATCTACTAAT TGAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTGT TGGAGTATTA 2040 

TTGAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

TGAACATTAC ATTGCCATTT TTAATTCATT CTGGTCTTTG AAAGAAATGT ACTACTAAAG 2160 

AGCACTAGTT GTGAATTTAG GGTGTTAAAC TTTTTACCAA GTACAAAAAT CCCAAATTCA 2220 

CTTTATTATT TTGCTTCAGG ATCCAAGTGA CAAAGTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGACAAAA TCTAATGTTG TCTTTTTAAT GTTAGTGATC CACCTGCCTC AGCCTCCCAA 2340 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTGAAAC TTGCAACTTT GAGAACAAAA CAGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCGT TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATG TCTATTTTTG ATGACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 
TTAAAAAAAA AAAAAAAAA 

Seq ID NO: 442 Protein sequence
Protein Accession #: NP_002372.1

1 11 21 31 41 51

| | | | | |

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 60
PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTFVSRII DTLDIGPADT 120
RVAVVNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGLAIQTAM DEAFTVEAGA 180
REPSSNIPKV AIIVTDGRPQ DQVNEVAARA QASGIELYAV GVDRADMASL KMMASEPLEE 240
HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQCQHVCI SDGEGKHHCE CSQGYTLNAD 300
KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 360
HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG CQHICVSDGA ASYHCDCYPG 420
YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVSS YLQRLNTKLD DILEKLKINE 480
YGQIHR



Seq ID NO: 443 DNA sequence

Nucleic Acid Accession #: NM_016639.1

Coding sequence: 40.. 429



1 
I 

CGGTTGCTGC 
GAGCAAGCGC 
AAGTGCATGG 
GCTGCAGCAC 
CTGACCTTCG 
GAGAAGTTCA 
ATCCAGTGAC 
TTCTAGAGCC 
GGGGTGGGGG 
AGGTGTCTGG 
ACAAAACAGC 
CCTTCCTTAG 
TCACTCAGAT 
TTAACACTAG 
CCCAAAGCGG 




TGACACTGAC 
GACCTGGGGG 
GTCCTGAAAT 
GGGCTGGCCC 
GGAGGAGATA 



21 
I 

CGGGCGCAGG 
GCTGGGGCTC 
CCCCTGCTCC 
TTGCAGGGCG 
CTTCCGGCTG 
GCTTTCTGGC 
AGAGGAGACC 
CTGCCAGCCG 
TCCCAGACGC 
ACCTCTGAGG 
TCTGGCTCCA 
TAAGGAACTG 
CCAGGCTGAC 
TCCACCACGG 
ACTAGGAGGG 
TTTATTTTGG 



31 
I 

ACGTGCACTA 
TGGCTGGCGT 
CGCGGCAGCT 
CGACCGCACA 
CTTTGGCCCA 
TTTTTGGTCT 
GGCGGAGAGG 
GGGCTCGCCC 
GGGGGGAGCC 
CCTGGGCCCA 
GAACAGAAAG 
CAGCATTTGC 
TTGGGGGGCA 
GGGTCACCCT 
CTGGCCCTAA 
GGAGAGTTTG 




CCTGGAGCGC 
GCGACTTCTG 

GGAGACGATG 
GCTGCCCAGC 
ACTCATCATT 
AAGCTCCTCC 
GGGTTCAGGG 
GGAGCCTCAC 
ACAGGGGAGG 
GACTTGACAC 
GGGGGGTTAG 
GATACAGACC 
GAGGGGAGGG 



CGTGGCCGGG 
GGACCTGGAC 
CCTGGGCTGC 
CGCTCTGAGC 
CCGCAGGAGA 
TGTGGCGCTG 
CATTCATCCA 
AACCACAAGG 
GAACCTTCCA 
GCTGGCTCAC 
GGGGTGCCCT 
TAGGCCCCAC 
GGACCTATTT 
CCCCCAACTC 
AGAATTTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA

Seq ID NO: 444 Protein sequence 
Protein Accession #: NP_057723.1

1 11 21 31 41 51 

| I I I I I 

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
GCPAVALIQ 

Seq ID NO: 445 DNA sequence 

Nucleic Acid Accession #: AF322916.1 

Coding sequence: 50.. 4300 

1 11 21 31 41 51 

I I I I I I 

GCACTCCGCA GCCTTTAAGG TTCGCGCGGG GGCCAGGCAA GAGTTAGCCA TGAAGAGCCT 60 
CAAGTCCCGC CTGAGGAGGC AGGACGTGCC CGGCCCCGCG TOGTCTGGCG CCGCCGCCGC 120 
CAGCGGGCAT GCAGCAGATT GGAATAAATA TGATGACCGA TTGATGAAAG CAGCAGAAAG 180 
GGGGGATGTA GAAAAAGTGA CCTCAATCCT TGCTAAAAAG GGGGTCAATC CAGGCAAACT 240 
AGATGTGGAA GGCAGATCTG TCTTCCATGT TGTGACCTCA AAGGGGAATC TTGAGTGTTT 300 
GAATGCCATC CTTATACATG GAGTTGATAT TACAACCAGT GACACTGCAG GGAGAAATGC 360 
TCTTCACCTG GCTGCTAAGT ATGGACATGC ATTGTGCCTA CAAAAACTTC TACAGTACAA 420 
TTGTCCCACT GAGCATGCAG ACCTGCAGGG AAGAACTGCA CTTCAAAAAA AAGCAATGGC 480 
AGATTGTCCT TCTAGCATAC AGCTGCTTTG TGACCATGGG GCCTCTGTGA ATGCCAAAGA 540 
TGTAAAAGGG CGGACACCAC TTGTTCTGGC TACTCAGATG AGTAGGCCAA CAATATGTCA 600
ACTGCTGATA GATAGAGGAG CGGATGTTAA TTCCAGAGAC AAACAAAACA GAACTGCCCT 660 
CATGCTAGGT TGCGAATATG GTTGCAGAGA TGCAGTAGAA GTCTTAATTA AAAATGGTGC 720 
TGATATAAGC TTGCTGGATG CGCTTGGCCA TGATAGTTCT TACTATGCAA GAATTGGTGA 780 
CAATCTGGAC ATTCTAACCT TGTTGAAGAC TGCATCGGAA AATACCAACA AAGGGAGAGA 840 
ACTTTGGAAG AAAGGACCAT CTTTGCAACA GCGAAATTTG ACACACATGC AAGATGAAGT 900 
AAATGTGAAG TCACATCAGA GGGAGCATCA AAATATTCAG GATTTGGAGA TTGAAAATGA 960 
AGATTTGAAA GAGAGGTTGA GAAAAATTCA GCAAGAACAA AGAATACTTT TGGATAAAGT 1020 
CAATGGTTTA CAGTTACAGC TGAATGAGGA AGTTATGGTT GCTGATGATC TGGAAAGCGA 1080 
GAGAGAAAAG CTGAAGTCCC TTTTGGCAGC TAAAGAAAAG CAACATGAAG AAAGCTTAAG 1140 
GACTATTGAG GCTCTGAAAA ATAGATTTAA ATATTTTGAG AGTGATCATT TAGGATCAGG 1200 
AAGTCATTTC AGTAACGGAA AAGAAGATAT GCTTCTTAAA CAAGGTCAGA TGTATATGGC 1260 
AGACTCACAG TGTACTTCCC CAGGTATACC AGCCCATATG CAAAGCAGAT CTATGTTAAG 1320
ACCTCTGGAA CTATCTTTAC CCAGTCAAAC GTCATACTCT GAAAATGAAA TTTTAAAGAA 1380 
AGAGTTAGAA GCAATGCGAA CTTTCTGTGA GTCAGCAAAA CAAGACCGAC TGAAGCTCCA 1440 
AAATGAACTG GCACACAAAG TGGCAGAATG CAAAGCTTTA GCATTAGAAT GTGAAAGGGT 1500 
CAAGGAGGAT TCAGATGAAC AGATAAAGCA ATTAGAAGAT GCATTAAAAG ATGTGCAGAA 1560 
GAGGATGTAT GAGTCAGAAG GTAAAGTTAA ACAAATGCAG ACCCATTTTC TTGCCCTTAA 1620 
AGAACACTTA AGAAGTGAAG CAGCCTCAGG GAATCACAGA CTAACCGAGG AACTGAAGGA 1680 
TCAGTTGAAA GACTTGAAAG TAAAATATGA AGGTGCTTCA GCAGAAGTGG GGAAATTAAG 1740 
AAACCAAATC AAACAAAATG AGATGATAGT AGAAGAGTTT AAGAGGGATG AAGGCAAGCT 1800 
GATAGAAGAA AATAAGCGAT TACAGAAGGA ACTTAGTATG TGTGAAATGG AGCGAGAGAA 1860 
GAAAGGAAGA AAGGTCACAG AGATGGAAGG CCAGGCAAAA GAATTGTCAG CGAAGTTGGC 1920 
CCTTTCCATT CCAGCTGAAA AATTTGAAAA CATGAAGAGC TCATTATCAA ATGAAGTGAA 1980 
TGAGAAAGCA AAAAAATTAG TAGAAATGGA AAGAGAACAT GAAAAATCAC TTAGTGAAAT 2040 
TAGACAGTTA AAGAGAGAAC TTGAGAATGT TAAGGCCAAG CTTGCTCAGC ACGTCAAACC 2100 
AGAGGAACAT GAACAGGTTA AGAGCAGATT AGAACAGAAA TCAGGAGAAC TTGGGAAGAA 2160
GATCACTGAG TTAACATTGA AAAATCAGAC ACTACAAAAG GAAATTGAAA AAGTTTATTT 2220 
GGATAATAAG CTCCTCAAGG AGCAAGCACA TAACTTAACA ATTGAAATGA AAAATCATTA 2280 
TGTTCCTTTA AAAGTAAGTG AAGACATGAA AAAGTCACAT GATGCAATTA TTGATGATCT 2340 
TAATAGAAAG CTTTTAGATG TAACACAAAA ATATACAGAA AAGAAGTTGG AAATGGAGAA 2400 
ATTGCTACTG GAAAATGACA GCTTAAGTAA GGATGTAAGC CGCCTAGAAA CTGTGTTTGT 2460 
ACCTCCTGAG AAACATGAAA AAGAGATAAT AGCTCTGAAA TCCAATATTG TTGAACTTAA 2520 
GAAACAGCTG TCTGAACTTA AGAAAAAATG TGGTGAAGAC CAGGAGAAAA TACACGCTCT 2580 
CACATCTGAA AACACTAACT TGAAGAAGAT GATGAGTAAT CAGTATGTGC CAGTTAAAAC 2640 
CCATGAAGAG GTTAAAATGA CACTGAATGA CACGTTAGCC AAAACTAACA GAGAATTATT 2700 
AGATGTGAAG AAAAAATTTG AAGATATAAA TCAGGAATTT GTAAAAATAA AAGATAAGAA 2760 
TGAAATATTA AAAAGAAACC TGGAAAACAC TCAGAACCAA ATAAAAGCTG AGTACATCAG 2820 
CCTGGCAGAG CACGAGGCAA AGATGAGCTC GCTAAGTCAG AGCATGAGAA AGGTGCAGGA 2880 
TAGTAATGCT GAAATCTTGG CCAACTACAG AAAAGGCCAA GAAGAGATTG TGACACTGCA 2940 
TGCCGAAATT AAAGCCCAGA AGAAGGAGCT CGACACAATA CAAGAATGCA TTAAGGTAAA 3000 
ATATGCCCCA ATTGTCAGCT TTGAGGAGTG CGAGAGAAAA TTTAAAGCAA CAGAGAAAGA 3060 
ACTAAAAGAC CAGTTATCAG AGCAGACACA AAAGTATAGT GTCAGTGAAG AAGAAGTCAA 3120 
GAAAAACAAG CAAGAGAATG ACAAGTTAAA GAAGGAGATT TTTACCCTTC AGAAAGATTT 3180 
GAGAGATAAG ACAGTTCTCA TTGAGAAGTC TCATGAAATG GAAAGAGCAT TAAGCAGAAA 3240 
AACAGACGAG CTAAACAAAC AGTTAAAAGA CTTGTCACAG AAATACACGG AAGTAAAGAA 3300 
TGTGAAAGAG AAGCTAGTAG AAGAAAATGC CAAACAGACT TCTGAGATAC TTGCAGTGCA 3360 
AAATCTTTTG CAAAAACAAC ATGTTCCATT GGAACAGGTT GAGGCTCTGA AAAAATCTCT 3420 
TAATGGCACA ATTGAAAATC TAAAGGAAGA ACTGAAGAGT ATGCAAAGGT GTTACGAGAA 3480 
AGAGCAGCAG ACAGTGACCA AACTGCATCA ATTGTTGGAG AATCAAAAGA ACTCTTCTGT 3540 
ACCCCTGGCA GAGCATTTGC AGATTAAAGA AGCATTTGAG AAAGAAGTTG GAATCATAAA 3600 
AGCCAGCTTG AGAGAAAAGG AAGAAGAAAG CCAAAACAAA ATGGAAGAAG TCTCCAAACT 3660 
TCAGTCGGAG GTTCAGAATA CTAAACAAGC ATTAAAAAAA TTAGAGACTA GAGAGGTAGT 3720 
TGACTTGTCT AAATATAAAG CAACAAAAAG TGATTTGGAG ACACAGATTT CTAGCTTAAA 3780 
TGAAAAATTG GCCAATCTGA ATAGAAAGTA TGAGGAAGTA TGTGAGGAAG TTTTGCATGC 3840 
CAAAAAGAAG GAAATATCTG CAAAAGATGA GAAGGAATTA CTGCATTTCA GCATTGAGCA 3900 
AGAAATTAAG GATCAGAAGG AACGATGTGA TAAGTCCTTA ACAACAATCA CAGAGTTACA 3960 
AAGAAGAATA CAAGAATCTG CTAAACAAAT AGAAGCAAAA GATAATAAGA TAACTGAACT 4020 



GCTTAATGAT GTGGAAAGAT TAAAACAGGC ACTCAATGGC CTTTCCCAAC TCACCTACAC 4080
AAGTGGGAAC CCCACCAAGA GGCAGAGCCA GCTGATTGAC ACTCTGCAGC ACCAAGTGAA 4140
ATCTCTGGAG CAACAGCTGG CCGATGCTGA CAGACAGCAC CAAGAAGTAA TTGCAATTTA 4200
TCGGACACAC CTTCTTAGTG CTGCACAGGG TCACATGGAT GAAGATGTTC AGGAGGCTCT 4260
GCTCCAGATC ATACAAATGC GGCAGGGGCT TGTGTGCTAG CCGTTAGCAC TGACTGCCAG 4320
TATCTGTTTT ATCTTGCTGG TGCTGAACAT TCTTTGTGCA ACTCCATGGT CTTTCTGGGC 4380
CTTACTGTGC TGGTATAATT AAAATAAAAT ATATTTTGTT CTGGGTGT

Seq ID NO: 446 Protein sequence 
Protein Accession #: AAG49577.1

1 11 21 31 41 51

| | | | | |

MKSLKSRLRR QDVPGPASSG AAAASAHAAD WNKYDDRLMK AAERGDVEKV TSILAKKGVN 60 

PGKLDVEGRS VFHVVTSKGN LECLNAILIH GVDITTSDTA GRNALHLAAK YGHALCLQKL 120

LQYNCPTEHA DLQGRTALQK KAMADCPSSI QLLCDHGASV NAKDVKGRTP LVLATQMSRP 180 

TICQLLIDRG ADVNSRDKQN RTALMDGCEY GCRDAVEVLI KNGADISLLD ALGHDSSYYA 240 

RIGDNLDILT LLKTASENTN KGRELWKKGP SLQQRNLTHM QDEVNVKSHQ REHQNIQDLE 300 

IENEDLKERL RKIQQEQRIL LDKVNGLQLQ LNEEVMVADD LESEREKLKS LLAAKEKQHE 360 
ESLRTIEALK NRPKYFESDH LGSGSHFSNR KEDMLLKQGQ MYMADSQCTS PGIPAHMQSR 420 
SMLRPLELSL PSQTSYSEME ILKKELEAMR TFCESAKQDR LKLQNELAHK VAECKALALE 480
CERVKEDSDE QIKQLEDALK DVQKRMYESE GKVKQMQTHF LALKEHLTSE AASGNHRLTE 540
ELKDQLKDLK VKYEGASAEV GKLRNQIKQN EMIVEEPKRD EGKLIEENKR LQKELSMCEM 600 
EREKKGRKVT EMEGQAKELS AKLALSIPAE KFENMKSSLS NEVNEKAKKL VEMEREHEKS 660 
LSEIRQLKRE LENVKAKLAQ HVKPEEHEQV KSRLEQKSGE LGKKITELTL KNQTLQKEIE 720 
KVYLDNKLLK EQAHNLTIEM KNHYVPLKVS EDMKKSHDAI IDDLNRKLLD VTQKYTEKKL 780
EMEKLLLEND SLSKDVSRLE TVFVPPEKHE KEIIALKSNI VELKKQLSEL KKKCGEDQEK 840 
IHALTSENTN LKKMMSNQYV PVKTHEEVKM TLNDTLAKTN RELLDVKKKF EDINQEPVKI 900 
KDKNEILKRN LENTQNQIKA EYISLAEHEA KMSSLSQSMR KVQDSNAEIL ANYRKGQEEI 960 

VTLHAEIKAQ KKELDTIQEC IKVKYAPIVS PEECERKFKA TEKELKDQLS EQTQKYSVSE 1020 

EEVKKNKQEN DKLKKEIFTL QKDLRDKTVL IEKSHEMERA LSRKTDELNK QLKDLSQKYT 1080 

EVKNVKEKLV EENAKQTSEI LAVQNLLQKQ HVPLEQVEAL KKSLNGTIEN LKEELKSMQR 1140 

CYEKEQQTVT KLHQLLENQK NSSVPLAEHL QIKEAFEKEV GIIKASLREK EEESQNKMEE 1200

VSKLQSEVQN TKQALKKLET REVVDLSKYK ATKSDLETQI SSLNEKLANL NRKYEEVCEE 1260

VLHAKKKEIS AKDEKELLHF SIEQEIKDQK ERCDKSLTTI TELQRRIQES AKQIEAKDNK 1320 

ITELLNDVER LKQALNGLSQ LTYTSGNPTK RQSQLIDTLQ HQVKSLEQQL ADADRQHQEV 1380 
IAIYRTHLLS AAQGHMDEDV QEALLQIIQM RQGLVC

Seq ID NO: 447 DNA sequence 
Nucleic Acid Accession #: NM_003020.1
Coding sequence: 29..664

1 11 21 31 41 51

| | | | | |

CGCTCCTCGG GCTGCCCCTC GGTTGACAAT GGTCTCCAGG ATGGTCTCTA CCATGCTATC 60 

TGGCCTACTG TTTTGGCTGG CATCTGGATG GACTCCAGCA TTTGCTTACA GCCCCCGGAC 120 

CCCTGACCGG GTCTCAGAAG CAGATATCCA GAGGCTGCTT CATGGTGTTA TGGAGCAATT 180 

GGGCATTGCC AGGCCCCGAG TGGAATATCC AGCTCACCAG GCCATGAATC TTGTGGGCCC 240 

CCAGAGCATT GAAGGTGGAG CTCATGAAGG ACTTCAGCAT TTGGGTCCTT TTGGCAACAT 300 

CCCCAACATC GTGGCAGAGT TGACTGGAGA CAACATTCCT AAGGACTTTA GTGAGGATCA 360 

GGGGTACCCA GACCCTCCAA ATCCCTGTCC TGTTGGAAAA ACAGATGATG GATGTCTAGA 420 

AAACACCCCT GACACTGCAG AGTTCAGTCG AGAGTTCCAG TTGCACCAGC ATCTCTTTGA 480 

TCCGGAACAT GACTATCCAG GCTTGGGCAA GTGGAACAAG AAACTCCTTT ACGAGAAGAT 540 

GAAGGGAGGA GAGAGACGAA AGCGGAGGAG TGTCAATCCA TATCTACAAG GACAGAGACT 600 

GGATAATGTT GTTGCAAAGA AGTCTGTCCC CCATTTTTCA GATGAGGATA AGGATCCAGA 660 

GTAAAGAGAA GATGCTAGAC GAAAACCCAC ATTACCTGTT AGGCCTCAGC ATGGCTTATG 720 

TGCACGTGTA AATGGAGTCC CTGTGAATGA CAGCATGTTT CTTACATAGA TAATTATGGA 780 

TACAAAGCAG CTGTATGTAG ATAGTGTATT GTCTTCACAC CGATGATTCT GCTTTTTGCT 840 

AAATTAGAAT AAGAGCTTTT TTGTTTCTTG GGTTTTTAAA ATGTGAATCT GCAATGATCA 900 

TAAAAATTAA AATGTGAATG TCAACAATAA AAAGCAAGAC TATGAAAGGC TCAGATTTCT 960 

TGCAGTTTAA AATGGTGTCT GAGGTTGTAC TATTTTGGCC AAGTCTGTAG AAAGCTGTCA 1020 

TTTGATTTTG ATTATGTAGT TCATCCAGCC CTTGGGCATT GTTATACACC AGTAAAGAAG 1080 

GCTGTACTCA AGAGGAGGAG CTGACACATT TCACTTGGCT GCGTCTTAAT AAACATGAAT 1140 
GCAAGCATTG GC 

Seq ID NO: 448 Protein sequence 
Protein Accession #: NP_003011.1 

1 11 21 31 41 51 

| | | | | |

MVSRMVSTML SGLLPWIASG HTPAFAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 60

PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDFSED QGYPDPPNPC 120

PVGKTDDGCL ENTPDTAEFS REPQLHQHLF DPEHDYPGLG KWNKKLLYEK MKGGERRKRR 180 
SVNPYLQGQR LDNVVAKKSV PHFSDEDKDP E

Seq ID NO: 449 DNA sequence 

Nucleic Acid Accession #: NM_003816.1

Coding sequence: 79..2538

1 11 21 31 41 51

| | | | | |

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 60 

CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 120

CGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 180 
CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240
AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 
AAAGAGCATA TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 
TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420 
CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480
GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 
AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 
TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 
CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC CCGGTATGTG 720 
GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780
GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840
ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900
GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960
CGTCGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020
ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080
CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 
ATGAATCACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 1200 
GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 
TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 
CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380
GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440
TGTGCATATG GTGACTGTTG TAAAGACTGT CGGTTCCTTC CAGGAGGTAC TTTATGCCGA 1500
GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560
CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620
GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680
GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740
TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800
TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860
AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920
GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980
GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2040 
GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100 
ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 
TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 
TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCCGGG GAGTGTTCCT 2340 
CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 
GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 
CCGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520 
TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580
CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640 
GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAQTGTGAG 2700 
AGTTGTGAAA TACAAGGAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760 
CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 
TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880
AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 
AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 
TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 
CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 
TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180
TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 
GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300 
TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 
GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 
CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480
AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 
CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 
TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 
AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720 
AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 3780
TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840 
AAAAAAAAAA AAAAAAAAAA AAAAA 

Seq ID NO: 450 Protein sequence 
Protein Accession #: NP_003807.1

1 11 21 31 41 51

| | | | | |

MGSGARFPSG TLRVRWLLLL GLVGPVLGAA RPGFQQTSHL SSYEIITPWR LTRERREAPR 60

PYSKQVSYVI QAEGKEHIIH LERNKDLLPE DFVVYTYNKE GTLITDHPNI QNHCHYRGYV 120

EGVHNSSIAL SDCFGLRGLL HLENASYGIE PLQNSSHFEH IIYRMDDVYK EPLKCGVSNK 180

DIEKETAKDE EEEPPSMTQL LRRRRAVLPQ TRYVELFIVV DKERYDMMGR KQTAVREEMI 240

LLANYLDSMY IMLNIRIVLV GLEIWTNGKL INIVGGAGDV LGNFVQWREK FLTTRRRHDS 300

AQLVLKKGFG GTAGMAFVGT VCSRSHAGGI NVFGQITVET FASIVAHELG HNLGMNHDDG 360

RDCSCGAKSC IMNSGASGSR HPSSCSAEDP EKLTLNKGGN CLLNIPKPDE AYSAPSCGNK 420

LVDAGEECDC GTPKECELDP CCEGSTCKLK SFAECAYGDC CKDCRFLPGG TLCRGKTSEC 480

DVPEYCNGSS QFCQPDVFIQ NGYPCQNNKA YCYNGMCQYY DAQCQVIFGS KAKAAPKDCP 540

IEVNSKGDRF GNCGFSGNEY KKCATGNALC GKLQCENVQE IPVFGIVPAI IQTPSRGTKC 600

WGVDFQLGSD VPDPGMVNEG TKCGAGKICR NFQCVDASVL NYDCDVQKKC HGHGVCNSNK 660

NCHCENGWAP PNCETKGYGG SVDSGPTYNE MNTALRDGLL VFFFLIVPLI VCAIFIFIKR 720

DQLWRSYFRK KRSQTYESDG KNQANPSRQP GSVPRHVSPV TPPREVPIYA NRFAVPTYAA 780
KQPQQFPSRP PPPQPKVSSQ GNLIPARPAP APPLYSSLT 






WO 03/042661 



PCT/US02/36810 



Seq ID NO: 451 DNA sequence

Nucleic Acid Accession #: NM_016650.1

Coding sequence: 196..789

1 11 21 31 41 51

| | | | | |

GGTTTCAATA TATGCAGATG TCTCGATATA GGAATGAAAT TAOGTCTTTG GAACAACTTA 60 

AATAAGTCAA ATATACTTGG AGCTTTAAAA ATTAAAAGGA GAGAGATTCG AGCACCTTTT 120 

CTGCTGCCAT GACAACCATG CAAGGAATGG AACAGGCCAT GCCAGGGTTG GCCTGGTGTG 180 

CCCCAGCTGG GAAACATGGC TGTCATACAT TCACATCTGT GGAAAGGATT GCAAGAGAAG 240

TTCTTGAAGG GAGAACCCAA AGTCCTTGGG GTTGTGCAGA TTCTGACTGC CCTGATGAGC 300 

CTTAGCATGG GAATAACAAT GATGTGTATG GCATCTAATA CTTATGGAAG TAACCCTATT 360 

TCCGTGCATA TCGGGTACAC AATTTGGGGG TCAGTAATGT TTATTATTTC AGGATCCTTG 420 

TCAATTGCAG CAGGAATTAG AACTACAAAA GGCCTGGTCC GAGGTAGTCT AGGAATGAAT 480

ATCACCAGCT CTGTACTGGC TGCATCAGGG ATCTTAATCA ACACATTTAG CTTGGCGTTT 540

TATTCATTCC ATCACCCTTA CTGTAACTAC TATGGCAACT CAAATAATTG TCATGGGACT 600 

ATGTCCATCT TAATGGGTCT GGATGGCATG GTGCTCCTCT TAAGTGTGCT GGAATTCTGC 660 

ATTGCTGTGT CCCTCTCTGC CTTTGGATGT AAAGTGCTCT GTTGTACCCC TGGTGGGGTT 720 

GTGTTAATTC TGCCATCACA TTCTCACATG GCAGAAACAG CATCTCCCAC ACCACTTAAT 780 

GAGGTTTGAG GCCAACAAAA GATCAACAGA CAAATGCTCC AGAAATCTAT GCTGACTGTG 840 

ACACAAGAGC CTCACATGAG AAATTACCAG TATCCAACTT CGATACTGAT AGACGTGTTG 900 

ATATTATTAT TATATGTAAT CCAATTATGA ACTGTGTGTG TATAGAGAGA TAATAAATTC 960 

AAAATTATGT TCTCATTTTT TTCCCTGGAA CTCAATAACT CACTTCACTG GCTCTTTATC 1020 
GAGAGTACTA GGAGTTAAAT TAATAAATAA TGCATTTAAT GAGGCCACAG GAAAAA 
Seq ID NO: 452 Protein sequence
Protein Accession #: NP_057734.1

1 11 21 31 41 51

| | | | | |

MAVIHSHLWK GLQEKFLKGE PKVLGVVQIL TALMSLSMGI TMMCMASNTY GSNPISVHIG 60
YTIWGSVMFI ISGSLSIAAG IRTTKGLVRG SLGMNITSSV LAASGILINT FSLAFYSFHH 120
PYCNYYGNSN NCHGTMSILM GLDGMVLLLS VLEFCIAVSL SAFGCKVLCC TPGGVVLILP 180
SHSHMAETAS PTPLNEV 



Seq ID NO: 453 DNA sequence

Nucleic Acid Accession #: NM_002091.1

Coding sequence: 56..503

1 11 21 31 41 51 

I I I I I I 

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 60 

CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 120 

AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC OGTGCTGACC AAGATGTACC CGCGCGGCAA 180 

CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 240 

TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300 

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360

GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 420 

AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 480 

CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540 

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 660 

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT 720 

TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780 
TAAAAGCTTA AACACAT 

Seq ID NO: 454 Protein sequence 
Protein Accession #: NP_002082.1 

1 11 21 31 41 51 

I I I I I I 

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 60 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 

Seq ID NO: 455 DNA sequence

Nucleic Acid Accession #: NM_016522.1

Coding sequence: 265..1299

1 11 21 31 41 51

| | | | | |

GCGGAAGCAG CGAGGAGGGA GCCCCCTTTG GCCGTCCTCC GTGGAACCGG TTTTCCGAGG 60 

CTGGCAAAAG CCGAGGCTGG ATTTGGGGGA GGAATATTAG ACTCGGAGGA GTCTGCGCGC 120 

TTTTCTCCTC CCCGCGCCTC CCGGTCGCCG CGGGTTCACC GCTCAGTCCC CGCGCTCGCT 180 

CCGCACCCCA CCCACTTCCT GTGCTCGCCC GGGGGGCGTG TGCCGTGCGG CTGCCGGAGT 240 

TCGGGGAAGT TGTGGCTGTC GAGAATGGGG GTCTGTGGGT ACCTGTTCCT GCCCTGGAAG 300 

TGCCTCGTGG TCGTGTCTCT CAGGCTGCTG TTCCTTGTAC CCACAGGAGT GCCCGTGCGC 360 

AGCGGAGATG CCACCTTCCC CAAAGCTATG GACAACGTGA CGGTCCGGCA GGGGGAGAGC 420 

GCCACCCTCA GGTGCACTAT TGACAACCGG GTCACCCGGG TGGCCTGGCT AAACCGCAGC 480 

ACCATCCTCT ATGCTGGGAA TGACAAGTGG TGCCTGGATC CTCGCGTGGT CCTTCTGAGC 540 

AACACCCAAA CGCAGTACAG CATCGAGATC CAGAACGTGG ATGTGTATGA CGAGGGCCCT 600 

TACACCTGCT CGGTGCAGAC AGACAACCAC CCAAAGACCT CTAGGGTCCA CCTCATTGTG 660 

CAAGTATCTC CCAAAATTGT AGAGATTTCT TCAGATATCT CCATTAATGA AGGGAACAAT 720 

ATTAGCCTCA CCTGCATAGC AACTGGTAGA CCAGAGCCTA CGGTTACTTG GAGACACATC 780 
TCTCCCAAAG
CGGGAACAGT 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCG 

CGGGAAAGGC 
CCAATCAGAT 
GGGAGGGGAA 
CCTTGCAGAT 
CCCGGCTTGG 
GGCTCAGCCT 
AAATTCAATC 
CCGGCCCAAG 
GAAACGTGAA 



CAGGGGACTA 
AGGTCACCGT 
GACAAAAGGG 
ACAAGGATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA 
TGCCGCCACC 
ATATACAAAT 
CAAAGAATAC 
ATTTAGGTAC 
ACCCACTGCA 
CTCTGCCCAC 
AGTCCATAGA 
CGTGGCGCTG 
ATAAAAAGAG 



TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACTCATCTTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 

AATGGAGTTT 
AGCTGCATCG 
AGACTGCCCC 
GACGAACAGA 
CGGGCACTTT 
CAAAAAAAAA 



GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGAAGCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTCGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 
TGCAACCTCT 
CACGTGGAAC 
ATGAGACCTT 
GGTAGACTGT 



TGGAAATTCA 
ACGTGGCCGC 
CAGAAGCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 
TTGGTGCCAG 
ATTCTGGAGC 
CCGGCCCAAG 
GCCACCAGGG 



GGGCATCACC 
GCCCGTGGTA 
GGGTACAGGT 
CTCAGCAGAA 
GAAAGTGGAA 
CTATGGGAAC 
GCTATTTGGT 
CGTCTGGCTG 
CTTCCCCACC 
CCGACAGCAA 
AGAAATTTGA 
TTGAAAATTG 
ACACAGCACA 
TGTGGGCAAG 
TGGCCATCCC 
CGTGGCGCTT 
CGTGTGTTGT 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



Seq ID NO: 456 Protein sequence
Protein Accession #: NP_057606.1

1 11 21 31 41 51

| | | | | |

MGVCGYLFLP WKCLVVVSLR LLFLVPTGVP VRSGDATPPK AMDNVTVRQG ESATLRCTID 60
NRVTRVAWLN RSTILYAGND KWCLDPRVVL LSNTQTQYSI EIQNVDVYDE GPYTCSVQTD 120
NHPKTSRVHL IVQVSPKIVE ISSDISINEG NKISLTCIAT GRPEPTVTWR HISPKAVGFV 180
SEDEYLEIQG ITREQSGDYE CSASNDVAAP VVRRVKVTVN YPPYISEAKG TGVPVGQKGT 240
LQCEASAVPS AEFQWYKDDK RLIEGKKGVK VENRPFLSKL IFFNVSEHDY GNYTCVASNK 300
LGHTNASIML FGPGAVSEVS NGTSRRAGCV WLLPLLVLHL LLKF



Seq ID NO: 457 DNA sequence 

Nucleic Acid Accession #: NM_012261.1 

Coding sequence: 203..1045



GATTTGCTCT 
ACAGAATACG 
CACTCCAGCG 
CCTCATTCGG 
ACTTCGAGTT 
GGAAAATCTC 
TGGGACGACG 
GGCCAGCAAC 
TGAGGTGAAG 
CGCATATGCA 
GGCGACTTGG 
CAAAGACGCA 
CACCCCCGCT 
TGATCCGCAG 
TATCTCAGAT 
GGAAGAAACC 
CGCGATTTAC 
ATCCCAGTAT 
CCAACTGGAT 
CATAGCTACA 
AACCCACGGA 
ATGCTGGGGA 
TGACTCTCCA 
TTGAAAACAT 
TGCTCCCTTG 
TCATGCTCCC 
GTTTAGTGAT 
AAAACGACTA 
GGGGGACCTG 
TTCTCTGGC 



11 
I 

GCCAGCAGCT 
CGCTCCCTCC 
GCGACTTTGA 
GGCACTGCGA 
CTCCTGATGT 
TCAGGCCTTT 
TGTCTCATGG 
TACGTAGATC 
GGCCGCTGTG 
CTCAAAATGC 
AGGCTGAGCA 
GTCAGTGCTG 
GGGAAGTCCT 
AAGACGGTCA 
TTTGTCTTCA 
TTGCCCCTGA 
CACGTCCACC 
AAGCACATGG 
CAGGTAGAAC 
ATCAAACAGG 
AGGGGGAGAC 
GGAGGGGAGG 
AAGAGCAATA 
GCTTCTTTGA 
GACACAGCTG 
TGCAGCAAGA 
TGTCTTGGGA 
ATGTAACTAT 
AAGAATCAAT 



21 
I 

GTCGGTGCCG 
CTCCCCCTTC 
GGGATTCCCT 
GTATGGATCT 
TGTTCCATAC 
CCACTAACCC 
CAGAGTTTGC 
TGATCACAGA 
GCCACAGCCA 
TCTTTGTAAA 
AAGTGCAGTT 
GGAAGCACAC 
ATGAGTGTCA 
CCATGATCCT 
GTGAAGAGCA 
TTTTGGGGCT 
ACAAAATGAC 
GCTAGAGGCC 
AACAAAAGCA 
CCTGGGTATC 
TCTTTCGGAT 
AGGGTCTCAG 
AATGCCACTT 
GGAGGAAACC 
GCTTATCCTA 
CCCCTGAAAG 
ATGTTTCACT 
GCAGAGTTGT 
CTGTGTGAGT 



31 
I 

OGCTCGACAC 
TCTGTCCCCC 
CTCTGGCGGC 
CCAAGGAAGA 
AATGGCTCAA 
TGAAAAAGAT 
AGCCAAATTT 
ACAGGCCGAT 
GTCGGAGCTG 
GGAAAGCCAC 
TGTCTACGAC 
AGCCAACTCG 
AGCTCAACAA 
GTCTGOGGTC 
TAAATGCCCA 
CATCTTGGGC 
TGCCAACCAG 
GTTAGGCAGG 
CTTTTCCATC 
TGAGGCTTGC 
TTGTAGGGTG 
ACAGCTTTCG 
GGAGCTGTAT 
CCTTTAGGTT 
TACAGTTGTC 
TGATTCATGC 
GCTACCCGCA 
TTGGACTTCT 
CTGTTTTTCA 




Seq ID NO: 458 Protein sequence
Protein Accession #: NP_036393.1

1 11 21 31 41 51

| | | | | |

MDLQGRGVPS IDRLRVLLML PHTMAQIMAE QEVENLSGLS TNPEKDIFVV RENGTTCLMA 60
EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVPW VDRAYALKML 120
FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180
ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 240
LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG

Seq ID NO: 459 DNA sequence

Nucleic Acid Accession #: NM_001169.1

Coding sequence: 85..870

1 11 21 31 41 51

| | | | | |

TAGGAGATAA GAGTATCTTG CACAGCAGGT GCAGGTTTCC CAGCAGCTCA GGCAAGAGTC 60 

CGATGTTTGT GCCATCTGAT CCTGATGTCT GGAGAGATAG CCATGTGTGA GCCTGAATTT 120 

GGCAATGACA AGGCCAGGGA GCCGAGCGTG GGTGGCAGGT GGCGAGTGTC CTGGTACGAA 180 

CGGTTTGTGC AGCCATGTCT GGTCGAACTG CTGGGCTCTG CTCTCTTCAT CTTCATCGGG 240 

TGCCTGTCGG TCATTGAGAA TGGGACGGAC ACTGGGCTGC TGCAGCCGGC CCTGGCCCAC 300 

GGGCTGGCTT TGGGGCTCGT GATTGCCACG CTGGGGAATA TCAGTGGTGG ACACTTCAAC 360 

CCTGCGGTGT CCCTGGCAGC CATGCTGATC GGAGGCCTCA ACCTGGTGAT GCTCCTCCCG 420 

TACTGGGTCT CACAGCTGCT CGGGGGGATG CTCGGGGCTG CCTTGGCCAA GGTGGTGAGT 480 

CCTGAGGAGA GGTTCTGGAA TGCATCTGGG GCGGCCTTTG TGACAGTCCA GGAGCAGGGG 540 

CAGGTGGCAG GGGCGTTGGT GGCAGAGATC ATCCTGACGA CGCTGCTGGC CCTGGCTGTA 600

TGCATGGGTG CCATCAATGA GAAGACAAAG GGCCCTCTGG CCCCGTTCTC CATCGGCTTT 660 

GCCGTCACCG TGGATATCCT GGCTGGGGGC CCTGTGTCTG GAGGCTGCAT GAATCCCGCC 720 

CGTGCTTTTG GACCTGCGGT GGTGGCCAAC CACTGGAACT TCCACTGGAT CTACTGGCTG 780 

GGCCCACTCC TGGCTGGCCT GCTTGTTGGA CTGCTCATTA GGTGCTTCAT TGGAGATGGG 840 

AAGACCCGCC TCATCCTGAA GGCTCGGTGA GCAGAGCTCG TGGGATTCCT GCTGCTCCAG 900 

GTGTCCTCAG CTCACCTGTC CCAGACTGAG GACAGGGGAG TTCCTGCATT TCCTGCCAGG 960 

GCAGAGGCCC AGAGGAGCGA CCCCCTGCTT CCACTGCTTG GGCCTGCTTT CTCAGATAGA 1020 

CTGACTGCTG AGGAGGCTCT AGGTTCTTGG AATTCCTTTG TGCTCATCAG AGACCCCAGC 1080 

CTGGGGAACA CGCTGCCCGC ACTGCCCAGA GAGCAGTGCA AACACCACAA CACGAGCGTG 1140 

TTTCTTGAGA GGAATGTCCC CGAGTTGGAC AAGGAGGCTG TTTCTGCACA TCAGCTCATT 1200 

TCCCGCACCC CATTTCTTGC TTGATTGCTT TGTTGGGGGC CTGGCCACTT CCTTGCTTCT 1260 
CAAGCTGACA ATTCTCACTT TGCAATAAAT AGTCCAGTGT TTCCTTCAT 



Seq ID NO: 460 Protein sequence
Protein Accession #: NP_001160.1

1 11 21 31 41 51

| | | | | |

MSGEIAMCEP EFGNDKAREP SVGGRWRVSW YERFVQPCLV ELLGSALFIF IGCLSVIENG 60
TDTGLLQPAL AHGLALGLVI ATLGMISGGH FNPAVSLAAM LIGGLNLVML LPYWVSQLLG 120
GMLGAALAKV VSPEERFWNA SGAAFVTVQE QGQVAGALVA EIILTTLLAL AVCMGAINEK 180
TKGPLAPFSI GFAVTVDILA GGPVSGGCMN PARAFGPAVV ANHWNFHWIY WLGPLLAGLL 240
VGLLIRCFIG DGKTRLILKA R 



Seq ID NO: 461 DNA sequence
Nucleic Acid Accession #: NM_003226.1
Coding sequence: 2..226

1 11 21 31 41 51

| | | | | |

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 60 
GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 120 
CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180 
TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240 
CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300 
CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 360 
CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCG 

Seq ID NO: 462 Protein sequence 
Protein Accession #: NP_003217.1

1 11 21 31 41 51

| | | | | |

MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 60 
WCFKPLTRKT ECTF 

Seq ID NO: 463 DNA sequence 

Nucleic Acid Accession #: NM_002993.1 

Coding sequence: 64..408

1 11 21 31 41 51

| | | | | |

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60 

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAC 240

CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300 

GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360 

AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420 

ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480 

CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540 

GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600

CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 660

CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 720 

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780 

ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840 

ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080 

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 






ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320

CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 1380 

TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 1500 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 464 Protein sequence
Protein Accession #: NP_002984.1

1 11 21 31 41 51

| | | | | |

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 60
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN



Seq ID NO: 465 DNA sequence
Nucleic Acid Accession #: NM_002038.2
Coding sequence: 108..500

1 11 21 31 41 51

| | | | | |

GAACCGTTTA CTCGCTGCTG TGCCCATCTA TCAGCAGGCT CCGGGCTGAA GATTGCTTCT 60 

CTTCTCTCCT CCAAGGTCTA GTGACGGAGC CCGCGCGCGG CGCCACCATG CGGCAGAAGG 120 

CGGTATCGCT TTTCTTGTGC TACCTGCTGC TCTTCACTTG CAGTGGGGTG GAGGCAGGTA 180 

AGAAAAAGTG CTCGGAGAGC TCGGACAGCG GCTCCGGGTT CTGGAAGGCC CTGACCTTCA 240 

TGGCCGTCGG AGGAGGACTC GCAGTCGCCG GGCTGCCCGC GCTGGGCTTC ACCGGCGCCG 300 

GCATCGCGGC CAACTCGGTG GCTGCCTCGC TGATGAGCTG GTCTGCGATC CTGAATGGGG 360 

GCGGCGTGCC CGCCGGGGGG CTAGTGGCCA CGCTGCAGAG CCTCGGGGCT GGTGGCAGCA 420 

GCGTCGTCAT AGGTAATATT GGTGCCCTGA TGGGCTACGC CACCCACAAG TATCTCGATA 480 

GTGAGGAGGA TGAGGAGTAG CCAGCAGCTC CCAGAACCTC TTCTTCCTTC TTGGCCTAAC 540 

TCTTCCAGTT AGGATCTAGA ACTTTGCCTT TTTTTTTTTT TTTTTTTTTT TTTGAGATGG 600 

GTTCTCACTA TATTGTCCAG GCTAGAGTGC AGTGGCTATT CACAGATGCG AACATAGTAC 660 

ACTGCAGCCT CCAACTCCTA GCCTCAAGTG ATCCTCCTGT CTCAACCTCC CAAGTAGGAT 720 

TACAAGCATG CGCCGACGAT GCCCAGAATC CAGAACTTTG TCTATCACTC TCCCCAACAA 780 
CCTAGATGTG AAAACAGAAT AAACTTCACC CAGAAAA 

Seq ID NO: 466 Protein sequence 
Protein Accession #: NP_002029.3

1 11 21 31 41 51 

MRQKAVSLFL CYLLLFTCSG VEAGKKKCSE SSDSGSGFWK ALTFMAVGGG LAVAGLPALG 60 
FTGAGIAANS VAASLMSWSA ILNGGGVPAG GLVATLQSLG AGGSSVVIGN IGALMGYATH 120
KYLDSEEDEE 



Seq ID NO: 467 DNA sequence 

Nucleic Acid Accession #: NM_003469.2

Coding sequence: 92..1945

1 11 21 31 41 51 

GAAACGGCCC GAGAAGCTCG CCCGGAGAAC GGGGAGGAAT ATGCTGTGGA GCTCCTCTGC 60 

CATATAAACA AAAAGAGGAA ATCTTTCAAA CATGGCTGAA GCAAAGACCC ACTGGCTTGG 120 

AGCAGCCCTG TCTCTTATCC CTTTAATTTT CCTCATCTCT GGGGCTGAAG CAGCTTCATT 180 

TCAGAGAAAC CAGCTGCTTC AGAAAGAACC AGACCTCAGG TTGGAAAATG TCCAAAAGTT 240 

TCCCAGTCCT GAAATGATCA GGGCTTTGGA GTACATAGAA AACCTCCGAC AACAAGCTCA 300 

TAAGGAAGAA AGCAGCCCAG ATTATAATCC CTACCAAGGT GTCTCTGTCC CCCTTCAGCA 360 

AAAAGAAAAT GGCGATGAAA GCCACTTGCC CGAGAGGGAT TCACTGAGTG AAGAAGACTG 420 

GATGAGAATA ATACTCGAAG CTTTGAGACA GGCTGAAAAT GAGCCTCAGT CTGCACCAAA 480 

AGAAAATAAG CCCTATGCCT TGAATTCAGA AAAGAACTTT CCAATGGACA TGAGTGATGA 540 

TTATGAGACA CAGCAGTGGC CAGAAAGAAA GCTTAAGCAC ATGCAATTCC CTCCTATGTA 600 

TGAAGAGAAT TCCAGGGATA ACCCCTTTAA ACGCACAAAT GAAATAGTGG AGGAACAATA 660 

TACTCCTCAA AGCCTTGCTA CATTGGAATC TGTCTTCCAA GAGCTGGGGA AACTGACAGG 720 

ACCAAACAAC CAGAAACGTG AGAGGATGGA TGAGGAGCAA AAACTTTATA CGGATGATGA 780 

AGATGATATC TACAAGGCTA ATAACATTGC CTATGAAGAT GTGGTCGGGG GAGAAGACTG 840 

GAACCCAGTA GAGGAGAAAA TAGAGAGTCA AACCCAGGAA GAGGTGAGAG ACAGCAAAGA 900 

GAATATAGGA AAAAATGAAC AAATCAACGA TGAGATGAAA CGCTCAGGGC AGCTTGGCAT 960 

CCAGGAAGAA GATCTTCGGA AAGAGAGTAA AGACCAACTC TCAGATGATG TCTCCAAAGT 1020 

AATTGCCTAT TTGAAAAGGT TAGTAAATGC TGCAGGAAGT GGGAGGTTAC AGAATGGGCA 1080 

AAATGGGGAA AGGGCCACCA GGCTTTTTGA GAAACCTCTT GATTCTCAGT CTATTTATCA 1140 

GCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCCCCAGAA GACTTAATTG AGATGCTCAA 1200 

AACTGGGGAG AAGCCGAATG GATCAGTGGA ACCGGAGCGG GAGCTTGACC TTCCTGTTGA 1260 

CCTAGATGAC ATCTCAGAGG CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 1320 

CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 1380

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGGAG AGTGCAGCAA ATCAGAAAAC 1440 

GTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500 

TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1560 

ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620 

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680 

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740 

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800 

TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 1860 

AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 1920 




TAAGAGAGCA ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980 

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTG 2040 

TAATATCTTT AAATGATGTA CAGGCAGATG AAACCAGGTC ACTGGGGAGT CTGCTTCATT 2100 

TCCTCTGAGC TGTTATCTTG TGTATGGATA TGTGTAAATG TTATGACTCC TTGATAAAAA 2160 

ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAG TATATCTAAT 2220 

GGCTGTGGCA TTGTTGATGC TCACATATGA TAAAAAAGTG TCCTATAATT CTATTGAAAG 2280 

TTTTTAATAT TTATTGAATT ATTTTGTTAC TGTCTGTAGC GTTTTGTGGA GTACTGGACC 2340 
AAAAAAATAA AGCATTATAA ATATA 



Seq ID NO: 468 Protein sequence 
Protein Accession #: NP_003460.1

1 11 21 31 41 51 

MAEAKTHWLG AALSLIPLIF LISGAEAASF QRNQLLQKEP DLRLENVQKF PSPEKIRALE 60 

YIENLRQQAH KEESSPDYNP YQGVSVPLQQ KENGDESHLP ERDSLSEEDW MRIILEALRQ 120

AENEPQSAPK ENKPYALNSE KNFPMDMSDD YETQQWPERK LKHMQFPPMY EENSRDNPFK 180 

RTNEIVEEQY TPQSLATLES VFQELGKLTG PNNQKRERMD EEQKLYTDDE DDIYKANNIA 240 

YEDWGGEDW NPVEEKIESQ TQEEVRDSKE NIGKNEQIND EMKRSGQLGI QEEDLRKESK 300 

DQLSDDVSKV IAYLKRLVNA AGSGRLQNGQ NGERATRLFE KPLDSQSIYQ LIEISRNLQI 360 

PPEDLIEMLK TGEKPNGSVE PERELDLPVD LDDISEADLD HPDLFQNRML SKSGYPKTPG 420 

RAGTEALPDG LSVEDILNLL GMESAANQKT SYFPNPYNQE KVLPRLPYGA GRSRSNQLPK 480 

AAWIPHVENR QMAYENLNDK DQELGEYLAR MLVKYPEIIN SKQVKRVPGQ GSSEDDLQEE 540 

EQIEQAIKEH LNQGSSQETD KLAPVSKRFP VGPPKNDDTP NRQYWDEDLL MKVLEYLNQE 600 
KAEKGREHIA KRAMENM 



Seq ID NO: 469 DNA sequence 

Nucleic Acid Accession #: NM_006398.1 

Coding sequence: 19..516 

1 11 21 31 41 51 

| | | | | | 

GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCA TGTCCGTTCC 60 

GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGCGTGAA AAAAATCAAA 120 

GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 180 

AAGATCTTAA AGCCACGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 240 

CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 300 

TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTGGCACAA 360 

GTGAAAGCAA TGATCGAGAC TAAGACGGGT ATAATCCCTG AGACCCAGAT TGTGACTTGC 420 

AATGGAAAGA GACTGGAAGA TGGGAAGATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC 480 

TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATG GGGTGTTGGC 540 

AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 600 

TCCCAAAATT AATGAGAATG AGATGAGTAG AGTAAGATTT GGGTGGGATG GGTAGGATGA 660 

AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 720 
TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 



Seq ID NO: 470 Protein sequence 
Protein Accession NP_006389.1 

1 11 21 31 41 51 

MAPNASCLCV HVRSEEWDLM TFDANPYDSV KKIKEHVRSK TKVPVQDQVL LLGSKILKPR 60 
RSLSSYGIDK EKTIHLTLKV VKPSDEELPL FLVESGDEAK RHLLQVRRSS SVAQVKAMIE 120 
TKTGIIPETQ IVTCNGKRLE DGKMMADYGI RKGNLLFLAS YCIGG 



Seq ID NO: 471 DNA sequence 

Nucleic Acid Accession #: XM_094741.1 

Coding sequence: 1..948 

1 11 21 31 41 51 

| | | | | | 

ATGAAGGCCA ACTACAGCGC AGAGGAGCGC TTTCTCCTGC TGGGTTTCTC CGACTGGCCT 60 

TCCCTGCAGC CGGTCCTCTT CGCCCTTGTC CTCCTGTGCT ACCTCCTGAC CTTGACGGGC 120 

AACTCGGCGC TGGTGCTGCT GGCGGTGCGC GACCCGCGCC TGCACACGCC CATGTACTAC 180 

TTCCTCTGCC ACCTGGCCTT GGTAGACGCG GGCTTCACTA CTAGCGTGGT GCCGCCGCTG 240 

CTGGCCAACC TGCGCGGACC AGCGCTCTGG CTGCCGCGCA GCCACTGCAC GGCCCAGCTG 300 

TGCGCATCGC TGGCTCTGGG TTCGGCCGAA TGCGTCCTCC TGGCGGTGAT GGCTCTGGAC 360 

CGCGCGGCCG CAGTGTGCCG CCCGCTGCGC TATGCGGGGC TCGTCTCCCC GCGCCTATGT 420 

CGCACGCTGG CCAGCGCCTC CTGGCTAAGC GGCCTCACCA ACTCGGTTGC GCAAACCGCG 480 

CTCCTGGCTG AGCGGCCGCT GTGCGCGCCC CGCCTGCTGG ACCACTTCAT CTGTGAGCTG 540 

CCGGCGTTGC TCAAGCTGGC CTGCGGAGGC GACGGAGACA CTACCGAGAA CCAGATGTTC 600 

GCCGCCCGCG TGGTCATCCT GCTGCTGCCG TTTGCCGTCA TCCTGGCCTC CTACGGTGCC 660 

GTGGCCCGAG CTGTCTGTTG CATGCGGTTC AGCGGAGGCC GGAGGAGGGC GGTGGGCACG 720 

TGTGGGTCCC ACCTGACAGC CGTCTGCCTG TTCTACGGCT CGGCCATCTA CACCTACCTG 780 

CAGCCCGCGC AGCGCTACAA CCAGGCACGG GGCAAGTTCG TATCGCTCTT CTACACCGTG 840 

GTCACACCTG CTCTCAACCC GCTCATCTAC ACCCTCAGGA ATAAGAAAGT GAAGGGGGCA 900 
GCGAGGAGGC TGCTGCGGAG TCTGGGGAGA GGCCAGGCTG GGCAGTGA 



Seq ID NO: 472 Protein sequence 
Protein Accession #: XP_094741.1 

1 11 21 31 41 51 

MKANYSAEER FLLLGFSDWP SLQPVLFALV LLCYLLTLTG NSALVLLAVR DPRLHTPMYY 
FLCHLALVDA GFTTSVVPPL LANLRGPALW LPRSHCTAQL CASLALGSAE CVLLAVMALD 
RAAAVCRPLR YAGLVSPRLC RTLASASWLS GLTNSVAQTA LLAERPLCAP RLLDHFICEL 
PALLKLACGG DGDTTENQMF AARVVILLLP FAVILASYGA VARAVCCMRF SGGRRRAVGT 
CGSHLTAVCL FYGSAIYTYL QPAQRYNQAR GKFVSLFYTV VTPALNPLIY TLRNKKVKGA 
ARRLLRSLGR GQAGQ 

Seq ID NO: 473 DNA sequence 
Nucleic Acid Accession #: NM_001062.1 
Coding sequence: 76..1380 

1 11 21 31 41 51 

| | | | | | 

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAGCCAG OCAATTCATC AACATTCTGG 
TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 
TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 
CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 
AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 
ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 
GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 
TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 
CACAATGGCA CTCCCCTGAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTGT 
CTGTTCAATG GGAACTACTC AACCGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 
AACTATTATT TTGGTAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 
ACCTGTGTGA AGAAGAGTCT AATAAATGGG CAGATCAAAG CAGATGAAGG CAGTTTAAAG 
AACATCAGTA TTTATACAAA GTCACTGGTA GAAAAGATTC TGTCTGAGAA AAAAGAAAAT 
GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC AGGCCCTCTT TGTATCATCA 
GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 
ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGCCC AGGTCTTACC TGCCCTGATG 
GGAAAGACCT TCTTGGATAT TAACAAAGAC TCTTCTTGCG TCTCTGCTTC AGGTAACTTC 
AACATCTCCG CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC ATATATCTCC 
GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATGTCACTGT GCTAAATGGT 
TCTGTCTTCC TCAGTGTGAT GGAGAAAGCC CAGAAAATGA ATGATACTAT ATTTGGTTTC 
ACAATGGAGG AGCGCTCATG GGGGCCCTAT ATCACCTGTA TTCAGGGCCT ATGTGCCAAC 
AATAATGACA GAACCTACTG GGAACTTCTG AGTGGAGGCG AACCACTGAG CCAAGGAGCT 
GGTAGTTACG TTGTCCGCAA TGGAGAAAAC TTGGAGGTTC GCTGGAGCAA ATACTAATAA 
GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 
TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TCCCCTTCTC 
TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 

Seq ID NO: 474 Protein sequence 
Protein Accession #: NP_001053.1 

1 11 21 31 41 51 

I I I I I I 

MRQSHQLPLV GLLLFSFIPS QLCEICEVSE ENYIRLKPLL NTMIQSNYNR GTSAVNVVLS 60 

LKLVGIQIQT LMQKMIQQIK YNVKSRLSDV SSGELALIIL ALGVCRNAEE NLIYDYHLTD 120 

KLENKFQAEI ENMEAHNGTP LTNYYQLSLD VLALCLFNGN YSTAEVVNHF TPENKNYYFG 180 

SQFSVDTGAM AVLALTCVKK SLINGQIKAD EGSLKNISIY TKSLVEKILS EKKENGLIGN 240 

TFSTGEAMQA LFVSSDYYNE NDWNCQQTLN TVLTEISQGA FSNPNAAAQV LPALMGKTFL 300 

DINKDSSCVS ASGNFNISAD EPITVTPPDS QSYISVNYSV RINETYFTNV TVLNGSVFLS 360 

VMEKAQKMND TIFGFTMEER SWGPYITCIQ GLCANNNDRT YWELLSGGEP LSQGAGSYVV 420 
RNGENLEVRW SKY 

Seq ID NO: 475 DNA sequence 

Nucleic Acid Accession #: NM_004852.1 

Coding sequence: 89..1546 

1 11 21 31 41 51 

| | | | | | 

GCCCCCGCCC GCCCCGGGCC CTGATGGACT GAATGAAGGC TGCCTACACC GCCTATCGAT 60 

GCCTCACCAA AGACCTAGAA CGTGCGCCAT GAACCCGGAG CTGACAATGG AAAGTCTGGG 120 

CACTTTGCAC GGCGCGCGCG GCGGCGGCAG TGGCGGGGGC GGCGGCGGGG GCGGCGGGGG 180 

CGGCGGCGGG GGCCCGGGCC ATGAGCAGGA GCTGCTGGCC AGCCCCAGCC CCCACCACGC 240 

GCGCCGCGGC CCGCGTGGCT CGCTGCGGGG CCCTCCGCCG CCTCCAACCG CGCACCAGGA 300 

GCTGGGCACG GCGGCAGCGG CGGCAGCGGC GGCGTCGCGC TCGGCCATGG TCACCAGCAT 360 

GGCCTCGATC CTGGACGGCG GCGACTACCG GCCCGAGCTC TCCATCCCGC TGCACCACGC 420 

CATGAGCATG TCCTGCGACT CGTCTCCGCC TGGCATGGGC ATGAGCAACA CCTACACCAC 480 

GCTGACACCG CTCCAGCCGC TGCCACCCAT CTCCACCGTG TCTGACAAGT TCCACCACCC 540 

TCACCCGCAC CACCATCCGC ACCACCACCA CCACCACCAC CACCAGCGCC TGTCCGGCAA 600 

CGTCAGCGGC AGCTTCACCC TCATGCGCGA CGAGCGCGGG CTCCCGGCCA TGAACAACCT 660 

CTACAGTCCC TACAAGGAGA TGCCCGGCAT GAGCCAGAGC CTGTCCCCGC TGGCCGCCAC 720 

GCCGCTGGGC AACGGGCTAG GCGGCCTCCA CAACGCGCAG CAGAGTCTGC CCAACTACGG 780 

TCCGCCGGGC CACGACAAAA TGCTCAGCCC CAACTTCGAC GCGCACCACA CTGCCATGCT 840 

GACCCGCGGT GAGCAACACC TGTCCCGCGG CCTGGGCACC CCACCTGCGG CCATGATGTC 900 

GCACCTGAAC GGCCTGCACC ACCCGGGCCA CACTCAGTCT CACGGGCCGG TGCTGGCACC 960 

CAGTCGCGAG CGGCCACCCT CGTCCTCATC GGGCTCGCAG GTGGCCACGT CGGGCCAGCT 1020 

GGAAGAAATC AACACCAAAG AGGTGGCCCA GCGCATCACA GCGGAGCTGA AGCGCTACAG 1080 

TATCCCCCAG GCGATCTTTG CGCAGAGGGT GCTGTGCCGG TCTCAGGGGA CTCTCTCCGA 1140 

CCTGCTCCGG AATCCAAAAC CGTGGAGTAA ACTCAAATCT GGCAGGGAGA CCTTCCGCAG 1200 

GATGTGGAAG TGGCTTCAGG AGCCCGAGTT CCAGCGCATG TCCGCCTTAC GCCTGGCAGC 1260 

GTGCAAACGC AAAGAGCAAG AACCAAACAA AGACAGGAAC AATTCCCAGA AGAAGTCCCG 1320 

CCTGGTGTTC ACTGACCTCC AACGCCGAAC ACTCTTCGCC ATCTTCAAGG AGAACAAACG 1380 

CCCGTCAAAG GAGATGCAGA TCACCATTTC CCAGCAGCTG GGCCTGGAGC TCACAACCGT 1440 

CAGCAACTTC TTCATGAACG CCCGGCGCCG CAGCCTGGAG AAGTGGCAAG ACGATCTGAG 1500 
CACAGGGGGC TCCTCGTCCA CCTCCAGCAC GTGTACCAAA GCATGATGGA AGGACTCTCA 1560 
CTTGGGCACA AGTCACCTCC AAATGAGGAC AACAGATACC AAAAGAAAAC AAAGGAAAAA 1620 
GACACCGGAT TCCTAGCTGG GGCCCTTCAC TGGTG 

Seq ID NO: 476 Protein sequence 
Protein Accession #: NP_004843.1 

1 11 21 31 41 51 

| | | | | | 

MNPELTMESL GTLHGARGGG SGGGGGGGGG GGGGGPGHEQ ELLASPSPHH ARRGPRGSLR 60 
GPPPPPTAHQ ELGTAAAAAA AASRSAMVTS MASILDGGDY RPELSIPLHH AMSMSCDSSP 120 
PGMGMSNTYT TLTPLQPLPP ISTVSDKFHH PHPHHHPHHH HHHHHQRLSG NVSGSFTLMR 180 
DERGLPAMNN LYSPYKEMPG MSQSLSPLAA TPLGNGLGGL HNAQQSLPNY GPPGHDKMLS 240 
PNFDAHHTAM LTRGEQHLSR GLGTPPAAMM SHLNGLHHPG HTQSHGPVLA PSRERPPSSS 300 
SGSQVATSGQ LEEINTKEVA QRITAELKRY SIPQAIFAQR VLCRSQGTLS DLLRNPKPWS 360 
KLKSGRETFR RMWKWLQEPE FQRMSALRLA ACKRKEQEPN KDRNNSQKKS RLVFTDLQRR 420 
TLFAIFKENK RPSKEMQITI SQQLGLELTT VSNFFMNARR RSLEKWQDDL STGGSSSTSS 480 
TCTKA 



Seq ID NO: 477 DNA sequence 

Nucleic Acid Accession #: NM_013271.1 

Coding sequence: 27.. 809 



TCCGGAGCCA 
CCGGGGGCGT 
TCTGCGCGCG 
AGACTGGCGC 
TGCAGGAGCT 
GGGCCGAGGC 
TCTGGGGCGC 
CTGCAGCGCA 
CCCAGCTTGT 
ACGACGGCCC 
CCGAGCTGTT 
TGGCAGCCCC 
CTGAGGGCGT 
TGCCTGCACG 
CAGAAGTGCC 
TTACCCCGGC 
GATCTGAGC 



11 
I 

GGCTCGCTGG 
CGGCCTTTTG 
GCCGGTAAAG 
TCCTCGCCGC 
GGCGCGGGCG 
GCAGGAGGCT 
CCCCCGCAAC 
GCTCGCTCGC 
CCCCGCGCCC 
CGCGGGCCCG 
GAGGTACTTG 
GCGCCGCCTC 
GCTGGGGGCG 



CCCGCCATCC 
CAGCCAGCCC 



21 
I 

GGCAGCATGG 
GTGCTGCTGC 
GAACCCCGCG 
TTCCGGCGGT 
CTGGCGCATC 
GAGGATCAGC 
TCTGATCCGG 
GCTCTGCTCC 
GTCCCCGCCG 
GATGCTGAGG 
CTGGGACGGA 
CGCCGTGCCG 
CTGCTGCGTG 
CCACCCTGAG 
CGCCACCAGG 
TCTCACCCGA 



31 
I 

CGGGGTOGCC 
TGCTCGGCCT 
GCCTAAGCGC 
CAGTGCCCCG 
TGCTGGAGGC 
AGGCGCGCGT 
CTCTGGGCCT 
GCGCCCGCCT 
CGGCGCTCCG 
AGGCAGGCGA 
TTCTTGCGGG 
CCGACCACGA 
TGAAACGCCT 
CACTGCCCGG 
ACTTCTCCCC 
GGATCCCTAC 



41 
I 

GCTGCTCTGG 
GTTTCGGCCG 
AGCGTCTCCG 
AGGTGAGGCG 
CGAACGTCAG 
CCTGGCGCAG 
GGACGACGAC 
TGACCCTGCC 
ACCCCGGCCC 
CGAGACACCC 
AAGCGCGGAC 
TGTGGGCTCT 
AGAGACCCCG 
ATCCCGTGCA 
GCCAGCACGT 



GGGCCGGGGG 

CCCTTGGCTG 
GCGGGGGCGG 
GAGCGGGCGC 
CTGCTGCGCG 
CCCGACGCGC 
GCCCTAGCAG 
CCGGTCTACG 
GACGTGGACC 
TCCGAGGGGG 
GAGCTGCCCC 
GCGCCCCAGG 
CCCTGGGACC 
CCAGAGCAAC 
ACAATAACAT 



Seq ID NO: 478 Protein sequence 
Protein Accession #: NP_037403.1 

1 11 21 31 41 51 

| | | | | | 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 
RVKRLETPAP QVPARRLLPP 

Seq ID NO: 479 DNA sequence 
Nucleic Acid Accession #: NM_002214 
Coding sequence: 681..2990 



1 
I 

CCCAGAGCCG 
CTGCCGACTT 
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 
CACTTGTTCT 
CCTGTGCCAG 
TTTCAGGTGG 
GCTCAGTTGA 
TTAATACCCA 
ATTTTATGCT 
ATGTCTCAGC 
CTAGAAAAAT 
AAACAGTTTC 
ACAATTTAGA 
TCACTGAGTT 



11 
I 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 
TGGACTGGGC 
GTGCCTTGCG 
ATCAAGAAGT 
TTCAATAGAA 
GGTGACACCA 
GAAAGTTCAT 
ATCAATGCAC 
GGCATTTTTC 
ACCATACATT 
CTGCATGCCT 
TGAGAAAGCA 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 
CAAGGTGAAG 
CTGGGTCCAG 
GAACGTTGTG 
TACCCATCTG 
GGAGAAGTGT 
CCTCTGAAGA 
AATAATATAG 
TCCCGTGACT 
AGCATCCACC 
CCCCATGGAT 
GTTCATAGAC 



31 
1 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 
ACAATAGATG 
AATGTGGATG 
ATATTGTTTC 
TGCATGTTAT 
CTATCCAGCT 
AATATCCTGT 
AAAAATTAAA 
TTCGTCTTGG 
CCGAAAGGAT 
ACATCCATGT 
AGAAGATCTC 



41 
I 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGOGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 



GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTT ACC 
CTGGGCAGCC 
TGCATCTTCA 
GTGTGTTCAA 
CAATTTAATA 
AATACCCACT 
GCGTCCAGGA 
GGATCTTTAT 
TTCCGTTGGA 
ATTTGGCTCA 
TCATAATCAA 
GCTGTCTTTG 
TGGAAACATA 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 

CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 
AATGCAGCAT 
GAGGATTTCA 
AGCAAAGGCT 
GAAAATGAAA 
GCCGAAGCTA 
TATCTTGTTG 
AACGATTTAT 
TACGTTGATA 
TGCAGTGACT 
ACAGAGAACA 
GATACACCAG 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 
AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 
GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 
ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 
ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 
ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 
ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 
CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 
TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 
GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 
ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 
GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 
ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 
ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 
CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 
GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 
TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 
GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 
GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 
TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 
CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 
TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 
ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 
TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 
TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 
TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 
AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 
GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 
AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 
ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 
TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 
CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 
TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 
ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 
AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 
TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 
TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 
TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 

Seq ID NO: 480 Protein sequence 
Protein Accession #: NP_002205 

1 11 21 31 41 51 

| | | | | | 

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 
LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 120 
GEVSIQLRPG AEANFMLKVH PLKKYFVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 
SRDPRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 240 
VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300 
VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 360 
TIAGEIESKA ANLNNLVVEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420 
NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480 
CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 
VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 
CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 
CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVTLQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

Seq ID NO: 481 DNA sequence 

Nucleic Acid Accession #: NM_003318.1 

Coding sequence: 1..2574 

1 11 21 31 41 51 

| | | | | | 

ATGGAATCCG AGGATTTAAG TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTG 60 

AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACTG ATGAACTAAG CTTGAATAAA 120 

ATTTCTGCTG ATACTACAGA TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

AACCCAGAGG ACTGGTTGAG TTTGTTGCTC AAACTAGAGA AAAACAGTGT TCCGCTAAGT 240 

GATGCTCTTT TAAATAAATT GATTGGTCGT TACAGTCAAG CAATTGAAGC GCTTCCCCCA 300 

GATAAATATG GCCAAAATGA GAGTTTTGCT AGAATTCAAG TGAGATTTGC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 480 

AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 

GAAATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 

AAGAATTTAT CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTCCGG TTCACTTGGG 660 

CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 780 

CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCTTCTAAAT 840 

AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT GAAAAGACAA 900 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 

TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC ATTTCAAGGA ACCTCTGGTG 1020 


GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320 

ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 

GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 

AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 



Seq ID NO: 482 Protein sequence 
Protein Accession #: NP_003309.1 

1 11 21 31 41 51 

| | | | | | 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180 

EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SVVPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420 

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 480 

LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSVVK DSQVGTVNYM PPEAIKDMSS SRENGKSKSK ISPKSDVWSL 720 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CLKRDPKQRI 780 

SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKKRGKK 

Seq ID NO: 483 DNA sequence 
Nucleic Acid Accession #: NM_003667.1 
Coding sequence: 1..2651 

1 11 21 31 41 51 

| | | | | | 

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120 

GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180 

CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300 

GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420 

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660 

AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720 

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1260 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1320 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 


PCT/US02/36810 



ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG            CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 
CCATGTCTTA A 



Seq ID NO: 484 Protein sequence 
Protein Accession #: NP_003658.1 

1 11 21 31 41 51 

| | | | | | 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 
PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 
LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 
AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLVVLHLHNN RIHSLGKKCF DGLHSLETLD 240 
LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDL 300 
TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 
EIYEIKVDTF QQLLSLRSLN LAWNKIAIIH PNAFSTLPSL IKLDLSSNLL SSFPITGLHG 420 
LTHLKLTGNH ALQSLISSEN FPELKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 480 
DLHKKDAGMF QAQDERDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 540 
WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 600 
ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGF SVKYSAKFET KAPFSSLKVI 660 
ILLCALLALT MAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 
TKLYCNLDKG DLENIWDCSM VKHIALLLFT NCILNCPVAF LSFSSLINLT FISPEVIKFI 780 
LLVVVPLPAC LNPLLYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS 840 
TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCHLSSVAFV PCL 



Seq ID NO: 485 DNA sequence 

Nucleic Acid Accession #: NM_005756.1 

Coding sequence: 73..3117 



1 
1 

AGCCAGCCCG 
CTCGCGGTCA 
GTTTTACTGA 
TCCCTGGAAG 
GTCAGTTTTG 
AGCTTACTCC 
GCTTCAGGCG 
GCATTTTTTA 
CAACATATAA 
GAGCTCAACA 
GAGGCCCAAA 
TGTGCTGCAA 
TCTGTCAGGA 
CAGGATCCCA 
TCCATCCCAG 
GCTGAGCCTC 
CAACCCCTTT 
CCACAGTCTG 
CCTGTGAAAG 
AGCGCACCTC 
CAAGTGTTGC 
GAAATGATCA 
GCTCAAAGAT 
ACTATAAGTC 
TTCAACACAA 
CAAGCTCCTG 
CCAGCTCATG 
TTGTTTCAGG 
GTTGCAAACC 
AACCCGAGCC 
GGCAGAGGAG 
ATCTGTACCT 
GTGCTGCCTG 
TCAATTTTTC 
TACCCTTCCA 
CTCCTGGACT 
TTTCTTCATT 
TACCTGGCCC 
ATTGTCGGTT 
AACTATGGGC 
ATCAACAACA 
CTGAACGTCA 



11 
I 

AGGACGCGAG 
GGATGGTTTT 
CGTTCAAGAT 
AAGATACTGA 
CCCCCTCCTC 
CTTCAAACGA 
TCAAACCCCA 
GAGGTGAGAT 
CGAATGGCAC 
AAACCCTGCA 
GCACATTAAA 
TAGCCGCTTT 
TACCCTGCCC 
TTGTCTGTCT 
TGGTGCCTCG 
CAGATTATTC 
CACCCCAGCC 
AAACGATCTC 
CCTCATTTTC 
CTGTCCAGAC 
AGATGGAGAA 
ACCAAGTCAG 
TGCTGAAAGT 
TAACCTCCCC 
CTACCTTTGT 
AGAACAGTAT 
ACATGGAGCT 
ATCCTTCCCT 
TGACCGTCAG 
AGGATGAGTT 
GCTGGTCAGA 
GTAGCCATCT 
CTCAAATGAT 
TGTCAGTGAC 
AAATCCTCAT 
CGTGGATTGC 
ATTTTCTCTT 
TTGTCAAAGT 
GGGGGGTACC 
TTGGATCCTA 
ATGCAGTATT 
GCATGTTCAT 




TGCTGACCAT 
GGCCACTGTG 
ACCTGTGACC 
TTCAGCTCCC 
TTCCCCTATG 
CTCTCCCACC 
AGACATCGTC 
GGCTCTGTCC 
CAGACTCCTT 
AGTGGATGAC 
TTCTTTGGCT 
GGCCCAAGAC 
TGGCACAATT 
AGCTTCCAGG 
GGAGAACCTC 
GAACTTGACA 
AACAGTGAGA 
CAATGGCTGC 
AACAAGCTTC 
GGCTCTGACG 
TCTTGTAACC 
CCAGCTGTGT 
TCTGTATAAG 
GGTCTCATTC 
ATTTAATACT 
AGCTGTGGTT 
TGGGAAATTC 
CTACATTACG 
TGTGGTCCTG 



31 
I 

GCACAGAGGT 
CAGTGTGGCC 
ATCATTTGTC 
TTGTCACCAC 
GAAACAACAA 
ACTAAAATCA 
TGCAATTTGT 
TATGATAAAG 
GTCCTGTCTC 
GAGACTTACT 
ACAATAAAAC 
AAGATTCGAC 
GAAGAGTTGG 
CCACGTGGCC 
CTTTCCCAGG 
CACAATGTTC 
ATAGCTTCCA 
CCCCAAACCC 
GTGTCTGCCC 
AACACCAGCA 
TTGGGCAGCC 
CATTCCCCGC 
ATTGGCCTAC 
CTGGCTGTGA 
CCTGCAAATC 
ACTCTTCCTT 
GTTCAGTTCA 
TCTCTGATCA 
AGAAACGTGA 
TGTGTATTTT 
TCTGTCAAAG 
GGCGTTCTGC 
TTCATTACAT 
TACATAGCTT 
GCTGCTCTGC 
ATGCAAGGCC 
ACATGGATGG 
TACATCCGAA 
GTGACCATCA 
CCCAATGGTT 
GTGGTGGGAT 
GTTCAGCTCT 



41 



TCTCCACTTT 
ATGTTGGCAG 
TTCATGTOGT 
CACCTGCTAA 
GCCTCAATGA 
CTATAGTAAA 
CATCTATTTG 
AAAGCACTGT 
TAAGTGAATT 
TTATAATGTG 
TGAATAATAC 
CAATGGAACA 
GAAAGCTTCA 
CACCATTTTC 
TCCCCAAAGC 
CCTCTCCAAT 
GCCCTGCCAT 
ATGTCTCCGG 
CTGCGAATGT 
GTATTTCTGA 
TGGAGCCTAA 
CTGACATGCT 
AGCTGAACTT 
TCAGAGTGAA 
TTCAGGTTTC 
CATCGCTGAT 
ATTTTTTTGA 
GCTACGTCAT 
CAGTCACATT 
GGGACTTGGG 
ACAGGAGATT 
TGGACCTATC 
ATATTGGTTG 
TTGAAAAGAT 
TTCTGCTGAA 
TCTGCATCTC 
GCCTAGAAGC 
AATACATCCT 
TCCTGACTAT 
CACCGGATGA 
ATTTCTGTGT 
GTCGAATTAA 



51 
I 

GTTTTCTGAA 
AACTGAAGAA 
TCTGGTAACA 
ATTATCTGTT 
TGTTACTTTA 
AACCTTCAAT 
CAATGACTCA 
TCCCCAGAAT 
AAAACGCTCA 
TGCTACAGCA 
AATGAATGCA 
CTGCTGCTGT 
GTGTGACCTG 
TTCCAGCCAA 
TACCTCTTTT 
AGGGGAGATT 
TGACATGCCC 
CACCCCACCT 
CAACACTACC 
TCTTGAGAAC 
CCTCGCAGGA 
GGCCCCTCTG 
TTCAAACACG 
TGCCAGTAGT 
TCTGGAAACC 
GAATAATTTA 
AACACCTGCT 
ATCATCGAGT 
AAAGCACATC 
CAGAAATGGT 
GAATGAAACC 
TAGGACATCT 
TGGGCTTTCA 
CCGGAGGGAT 
CCTGGTCTTC 
AGTGGCTGTA 
ATTCCATATG 
TAAATTCTGC 
ATCCCCAGAT 
CTTCTGCTGG 
GATATTTTTG 
AAAGAAGAAG 
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WO 03/042661 
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CAACTGGGAG 
TTTTTACTGG 
TTCATGTATC 
TGTGTGGCCA 
OGGCTGGCTG 
GTAAACCAAG 
TCCACCACAC 
TCTACAGAGA 
TTCACTGGAA 
ATGGCTCTCA 
CTTTCTTCTA 
CACAATGTGA 
AAGCTAATTA 
TTTAGACATT 
ATACACTAAG 
TTTAAAGAGG 
TTGGTGAGTT 
GTCAAAAATC 
TTGTAGAATC 
AAATCCTAAA 
TCAAGAAATA 
TCATACAGTG 
TGGGTCATGG 
GTGCACGACC 
CCCCAGTAGT 
AGAGGGATGA 
ATACACTGCC 
CTTCTCTTAA 
ACTGCATGGT 
TTGTTTTTTA 
GGGAACTGTC 
TATACAGGGT 
TTATTAGGAA 
ATTTCAAGTG 
CTGACTTGTC 
AAAATCAAAA 



CCCAGCGAAA 
GAATAACTTG 
TGTTTGCCAT 
AAGAAAATGT 
AAAATTCTGA 
GAGTGTCCAG 
TGCTAGTGAA 
GGAATGGGGT 
AACAGCACAT 
GAAGGACTTC 
AAATCAAAGC 
GATGTATGAA 
AGGGCGATGA 
TCTGATTTGG 
AATGACTCCT 
CTAAGTTATC 
TAGTTGTGCA 
TTACTTCTAC 
ATGGTTGTTT 
GCTCCTTTTG 
ATGATCCCAG 
CCTTTGAGCA 
TTTGACAAAC 
TGTACAGCCA 
CAGATTATAG 
ACTGTCATCC 
GCTTCTCAAA 
AAAGGTACAT 
ACATTGTTGA 
AATGTGTCTT 
CTACACTGCT 
CTATCTTGCT 
CATTTCAAAC 
AATGTTGGAT 
TTTGCAATAT 
ATGTTAAAAT 



AACCAGTATT 
GGGCTTTGCC 
CTTTAATACC 
CAGGAAGCAA 
CTGGAGTAAA 
CTCTTCAAAT 
TAATGATTGC 
CTCTTTTAGT 
GTTTAACGAG 
AAAGCGGGGA 
ATGATGCTTG 
AATCAACTCA 
TTATTATTAC 
TTTCTTATCT 
ATAAAGAAAA 
TTTGATAACA 
TGCCTTTGTT 
ATTTTTTTGT 
TGTCTCACGT 
GAGATGATAT 
CCAGACTGAG 
AGTTAGGAAA 
AGAGTGAGAG 
AACACAGCAT 
AATCTGCACC 
AGACCATGTG 
TCCCCAGAGC 
ATATATGGAA 
TTGTTATGAC 
GAAGAATGCA 
ATTGTTG CT A 
TCCTACCTAC 
CCCTTTTAGT 
CTCAGACTAA 
TTCTTTTCTG 
CAATGAAATA 



CAAGACCTCA 
TTCTTTGCCT 
TTACAAGGAT 
TGGAGGCGGT 
ACTGCTACTA 
TCCTTACAGT 
TCAGTACACG 
GTTCAGAATG 
AAGGAAGATT 
AGCTTACACT 
ACAGTGTGAA 
TTTTATTCTC 
AAGAAGAAAC 
TTCATTTTAT 
CAAAAAAAGG 
TCATATAAAG 
GTATATAAGC 
ATTTATTTTC 
GATAATTCAG 
AGGATGTGAA 
AAAATGTAAG 
AGATGCCCCC 
ACCATATTTT 
CCAATATGAA 
AAGATGTTTA 
TCAGGAAAAT 
CTTTAGGAAC 
AAAAATCATA 
TGGTACACTC 
CAGTGACAAG 
CATGTATCGA 
ATCTGCTTGA 
TAAGTCTTTC 
CCATAGTAAT 
ATTTATTTAA 
AATTTGCAGT 



GGAGTATCGC 
GGGGACCAGT 
TTTTCATATT 
ATCTTTGTTG 
ATGGTTTAAA 
CAAGCAGTAA 
CAAGCGGGAA 
GAGATGTGTG 
CCTGCAATGG 
TTATTGAGCA 
ATGTCCAATT 
GGCAACATCT 
CAAGACATTA 
AAGAAGGTTG 
TAGTGAACTT 
CAACTGTTGA 
TAAATTCTAG 
TACTGTGTAA 
AAAATCCTTG 
ATACAGAAAC 
CAGACAGTGC 
ACTGGGCAGA 
AGCCCCACTC 
TACCCATCCC 
GCTTTATACC 
TGTGAACGTA 
AGGAGAGTAG 
TTGCOGTTCT 
TGGCCCAGCC 
GGGAGTAGCT 
GCCTTGATTG 
GCAGTGCCTC 
ACTAAGGTTC 
AATACACATT 
TTTTCTTGTA 
TAAGA 



TGGCCTTACA 
TAACGTGACC 
CATCTTTTAC 
TGGAAAGTTA 
GAAGCAGACT 
CTCCACTAAC 
TGGAAATGCT 
CCTTCACGAT 
GAAAGGCCGT 
AATGTGATTC 
TTACCTTTTA 
GGAGAAGCAT 
CACCATGGTT 
GTTTTAAACA 
TCAGCTACCT 
CTTCAGCCTG 
TGACCCATGT 
ATGTATTCCT 
CTCGTTCCGC 
CTCAGTGAAA 
CACAGTTAGC 
CACAGCCCTA 
ACCCTCTTGG 
CTGACCGCAT 
TTGGCCACAG 
GATGAGGTAC 
ACTAGGATTC 
TTAAAAGGCA 
AGAGCTATAA 
ATTGGGAACA 
CTCCTAGTTA 
AAGTACATCC 
TCTTGCATAT 
TCTGTGAGTG 
TTTATATGTT 



2580 

2640 

2700 

2760 

2820 

2880 

2 940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 
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3480 

3540 
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3660 
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3840 

3900 
4020

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 



Seq ID NO: 486 Protein sequence 
Protein Accession #: NP_005747.1 



MVFSVRQCGH 
PSSNEVETTS 
QNQHITNGTL 
NACAVIAALE 
SQSIPVVPRA
MPPQSETISS 
ENQVLQMEKA 
NTTISLTSPS 
NLPAHDMELA 
HINPSQDELT 
TSVLPAQMMA 
VFLLDSWIAL 
FCIVGWGVPA 
FLLNVSMFIV 
VTFMYLFAIF 
QTVNQGVSSS 
HDFTGKQHMF 



11 

I 

VGRTEEVLLT 
LNDVTLSLLP 
TGVLSLSELK 
RVKIRPMEHC 
TVLSQVPKAT 
PMPQTHVSGT 
LSLGSLEPNL 
LALAVIRVNA 
SRVQFNFFET 
VRCVFWDLGR 
LTFITYIGCG 
YKMQGLCISV 
VVVTIILTIS
VLVQLCRIKK 
NTLQGFFIFI 
SNSLQSSSNS 



21 
I 

FKIFLVIICL 
SNETGVKPQR 
RSELNKTLQT
CCSVRIPCPS 
SFAEPPDYSP 
PPPVKASFSS 
AGEMINQVSR 
SSFNTTTFVA 
PALFQDPSLE
NGGRGGWSDN 
LSSIFLSVTL 
AVFLHYFLLV 
PDNYGLGSYG 
KKQLGAQRKT
FYCVAKEMVR 
TNSTTLLVNN 
GRMALRRTSK 



31 
I 

HVVLVTSLEE
NICNLSSICN 
LSETYFIMCA 
SPEELEKLQC 
VTHNVPSPIG 
PTVSAPANVN 
LLHSPPDMLA 
QDPANLQVSL 
NLSLISYVIS 
GCSVKDRRLN 
VTYIAFEKIR 
SFTWMGLEAF 
KFPNGSPDDF 
SIQDLRSIAG 
KQWRRYLCCG
DCSVHASGNG 
RGSLHFIEQM 



41 
I 

DTDNSSLSPP 
DSAFFRGEIM 
TAEAQSTLNC 
DLQDPIVCLA 
EIQPLSPQPS 
TTSAPPVQTD 
PLAQRLLKVV
ETQAPENSIG 
SSVANLTVRN 
ETICTCSHLT 
RDYPSKILIQ
HMYLALVKVF 
CWINNNAVFY 
LTFLLGITWG 
KLRLAENSDW 
NASTERNGVS 



51 
I 

PAKLSVVSFA
FQYDKESTVP 
TFTIKLNNTM 
DHPRGPPFSS 
APIASSPAID 
IVNTSSISDL 
DDIGLQLNFS
TITLPSSLMN 
LTRNVTVTLK 
SFGVLLDLSR 
LCAALLLLNL
NTYIRKYILK 
ITVVGYFCVI
FAFFAWGPVN 
SKTATNGLKK 
FSVQNGDVCL 



60 
120 
180 
240 
300 
420
Seq ID NO: 487 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2904 



1 
I 

ATGGTTTTCT 
TTCAAGATAT 
GATACTGATA 
AACGAAACAG 
CCCCAGAGAA 
GAGATCATGT 
GGCACCTTAA 
ATAAAACTGA 
ATTCGACCAA 
GAGTTGGAAA 
CGTGGCCCAC 
TCCCAGGTCC 
AATGTTCCCT 
GCTTCCAGCC 
CAAACCCATG 
TCTGCCCCTG 
ACCAGCAGTA 
GGCAGCCTGG 



11 
I 

CTGTCAGGCA 
TCCTTGTCAT 
ATTCCAGTTT 
AAAAAACTAA 
ATATCTGCAA 
TTCAATATGA 
CTGGAGTCCT 
ATAATACAAT 
TGGAACACTG 
AGCTTCAGTG 
CATTTTCTTC 
CCAAAGCTAC 
CTCCAATAGG 
CTGCCATTGA 
TCTCCGGCAC 
CGAATGTCAA 
TTTCTGATCT 
AGCCTAACCT 



21 
I 

GTGTGGCCAT 
CATTTGTCTT 
GTCACCACCA 
AATCACTATA 
TTTGTCATCT 
TAAAGAAAGC 
GTCTCTAAGT 
GAATGCATGT 
CTGCTGTTCT 
TGACCTGCAG 
CAGCCAATCC 
CTCTTTTGCT 
GGAGATTCAA 
CATGCCCCCA 
CCCACCTCCT 
CACTACCAGC 
TGAGAACCAA 
CGCAGGAGAA 



31 
I 

GTTGGCAGAA 
CATGTCGTTC 
CCTGATGTTA 
GTAAAAACCT 
ATTTGCAATG 
ACTGTTCCCC 
GAATTAAACA 
GCTGTAATAG 
GTCAGGATAC 
GATCCCATTG 
ATCCCAGTGG 
GAGCCTCCAG 
CCCCTTTCAC 
CAGTCTGAAA 
GTGAAAGCCT 
GCACCTCCTG 
GTGTTGCAGA 
ATGATCAACC 



41 
i 

CTGAAGAAGT 
TGGTAACATC 
CTTTAAGCTT 
TCAATGCTTC 
ACTCAGCATT 
AGAATCAACA 
CATTAAATTG 
CTGCTTTGGA 
CCTGCCCTTC 
TCTGTCTTGC 
TGCCTCGGGC 
ATTATTCACC 
CCCAGCCTTC 
CGATCTCTTC 
CATTTTCCTC 
TCCAGACAGA 
TGGAGAAGGC 
AAGTCAGCAG 



51 
I 

TTTACTGACG 
CCTGGAAGAA 
ACTCCCTTCA 
AGGCGTCAAA 
TTTTAGAGGT 
TATAACGAAT 
TACATTCACA 
AAGAGTAAAG 
CTCCCCAGAA 
TGACCATCCA 
CACTGTGCTT 
TGTGACCCAC 
AGCTCCCATA 
CCCTATGCCC 
TCCCACCGTG 
CATCGTCAAC 
TCTGTCCTTG 
ACTCCTTCAT 
TCCCCGCCTG ACATGCTGGC CCCTCTGGCT
GGCCTACAGC TGAACTTTTC AAACACGACT 
GCTGTGATCA GAGTGAATGC CAGTAGTTTC 
GCAAATCTTC AGGTTTCTCT GGAAACCCAA 
CTTCCTTCAT CGCTGATGAA TAATTTACCA 
CAGTTCAATT TTTTTGAAAC ACCTGCTTTG 
CTGATCAGCT ACGTCATATC ATCGAGTGTT 
AACGTGACAG TCACATTAAA GCACATCAAC 
GTATTTTGGG ACTTGGGCAG AAATGGTGGC 
GTCAAAGACA GGAGATTGAA TGAAACCATC 
GTTCTGCTGG ACCTATCTAG GACATCTGTG 
ATTACATATA TTGGTTGTGG GCTTTCATCA 
ATAGCTTTTG AAAAGATCCG GAGGGATTAC 
GCTCTGCTTC TGCTGAACCT GGTCTTCCTC 
CAAGGCCTCT GCATCTCAGT GGCTGTATTT 
TGGATGGGCC TAGAAGCATT CCATATGTAC 
ATCCGAAAAT ACATCCTTAA ATTCTGCATT 
ACCATCATCC TGACTATATC CCCAGATAAC 
AATGGTTCAC CGGATGACTT CTGCTGGATC 
GTGGGATATT TCTGTGTGAT ATTTTTGCTG 
CAGCTCTGTC GAATTAAAAA GAAGAAGCAA 
GACCTCAGGA GTATCGCTGG CCTTACATTT 
TTTGCCTGGG GACCAGTTAA CGTGACCTTC 
CAAGGATTTT TCATATTCAT CTTTTACTGT 
AGGCGGTATC TTTGTTGTGG AAAGTTACGG 
GCTACTAATG GTTTAAAGAA GCAGACTGTA 
TTACAGTCAA GCAGTAACTC CACTAACTCC 
GTACACGCAA GCGGGAATGG AAATGCTTCT 
CAGAATGGAG ATGTGTGCCT TCACGATTTC 
GAAGATTCCT GCAATGGGAA AGGCCGTATG 
TTACACTTTA TTGAGCAAAT GTGA 

Seq ID NO: 488 Protein sequence 
Protein Accession #: Eos sequence 




GGATGACATT 
TTTGGCTCTG 
CCAAGACCCT 
CACAATTACT 
TTCCAGGGTT 
GAACCTCTCT 
CTTGACAAGA 
AGTGAGATGT 
TGGCTGCTCT 
AAGCTTCGGC 
TCTGACGTTC 
TGTAACCTAC 
GCTGTGTGCT 
GTATAAGATG 
CTCATTCACA 
TAATACTTAC 
TGTGGTTGTG 
GAAATTCCCC 
CATTACGGTG 
GGTCCTGGTT 
CAGTATTCAA 
CTTTGCCTTC 
TAATACCTTA 
GAAGCAATGG 
GAGTAAAACT 
TTCAAATTCC 
TGATTGCTCA 
TTTTAGTGTT 
TAACGAGAAG 
GCGGGGAAGC 



1140 
1200 
1260 
1320 
1380 
1440 
1500
1560 
1620 
1680 
1740 
2640
2700 
2760 
2820 
MVFSVRQCGH
NETEKTKITI 
GTLTGVLSLS 
ELEKLQCDLQ 
NVPSPIGEIQ 
SAPANVNTTS 
SPPDMLAPLA 
ANLQVSLETQ 
LISYVISSSV
VKDRRLNETI 
IAFEKIRRDY 
WMGLEAFHMY 
NGSPDDFCWI 
DLRSIAGLTF 
RRYLCCGKLR 
VHASGNGNAS 
LHFIEQM 



11 

1 

VGRTEEVLLT 
VKTFKASGVK 
ELNTLNCTFT 
DPIVCLADHP 
PLSPQPSAPI 
APPVQTDIVN 
QRLLKVVDDI
APENSIGTIT 
ANLTVRNLTR 
CTCSHLTSFG 
PSKILIQLCA 
LALVKVFNTY 
NNNAVFYITV 
LLGITWGFAF 
LAENSDWSKT 
TERNGVSFSV 



21 
I 

FKIFLVIICL 
PQRNICNLSS 
IKLNNTMNAC 
RGPPFSSSQS 
ASSPAIDMPP 
TSSISDLENQ 
GLQLNFSNTT 
LPSSLMNNLP 
NVTVTLKHIN 
VLLDLSRTSV 
ALLLLNLVFL 
IRKYILKFCI 
VGYFCVIFLL 
FAWGPVNVTF 
ATNGLKKQTV 
QNGDVCLHDF 



31 
1 

HVVLVTSLEE
ICMDSAFFRG 
AVIAALERVK 
IPVVPRATVL
QSETISSPMP 
VLQMEKALSL 
ISLTSPSLAL
AHDMELASRV
PSQDELTVRC 
LPAQMMALTF 
LDSWIALYKM 
VGWGVPAVVV
NVSMFIVVLV
MYLFAIFNTL 



41 
I 

DTDNSSLSPP 
EIMFQYDKES 



SQVPKATSFA 
QTHVSGTPPP 
GSLEPNLAGE 
AVIRVNASSF 
QFNFFETPAL 
VFWDLGRNGG 
ITYIGCGLSS 
QGLCISVAVF 
TIILTISPDN 
QLCRIKKKKQ 
QGFFIFIFYC 



51 
I 

PDVTLSLLPS 
TVPQNQHITN 
VRIPCPSSPE 
EPPDYSPVTH 
VKASFSSPTV 
MINQVSRLLH 
NTTTFVAQDP 



TGKQHMFNEK EDSCNGKGRM 



RGGWSDNGCS 
IFLSVTLVTY 
LHYFLLVSFT
YGLGSYGKFP 
LGAQRKTSIQ 
VAKENVRKQW 
TTLLVNNDCS 
ALRRTSKRGS 



Seq ID NO: 489 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..2811 
1 11 21

I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT 
TTCAAGATAT TCCTTGTCAT CATTTGTCTT 
GATACTGATA ATTCCAGTTT GTCACCACCA 
GTTACTTTAA GCTTACTCCC TTCAAACGAA 
AATTTGTCAT CTATTTGCAA TGACTCAGCA 
GATAAAGAAA GCACTGTTCC CCAGAATCAA 
CTGTCTCTAA GTGAATTAAA ACGCTCAGAG 
ACTTACTTTA TAATGTGTGC TACAGCAGAG 
ATAAAACTGA ATAATACAAT GAATGCATGT 
ATTCGACCAA TGGAACACTG CTGCTGTTCT 
GAGTTGGAAA AGCTTCAGTG TGACCTGCAG 
CGTGGCCCAC CATTTTCTTC CAGCCAATCC 
TCCCAGGTCC CCAAAGCTAC CTCTTTTGCT 
AATGTTCCCT CTCCAATAGG GGAGATTCAA 
GCTTCCAGCC CTGCCATTGA CATGCCCCCA 
CAAACCCATG TCTCCGGCAC CCCACCTCCT 
TCTGCCCCTG CGAATGTCAA CACTACCAGC 
ACCAGCAGTA TTTCTGATCT TGAGAACCAA 
GGCAGCCTGG AGCCTAACCT CGCAGGAGAA 
TCCCCGCCTG ACATGCTGGC CCCTCTGGCT 
GGCCTACAGC TGAACTTTTC AAACACGACT 
GCTGTGATCA GAGTGAATGC CAGTAGTTTC 
GCAAATCTTC AGGTTTCTCT GGAAACCCAA 



41 



31 

I I 
GTTGGCAGAA CTGAAGAAGT 
CATGTCGTTC TGGTAACATC 
CCTGAGGTTG AAACAACAAG 
ACAGGCGTCA AACCCCAGAG 
TTTTTTAGAG GTGAGATCAT 
CATATAACGA ATGGCACCTT 
CTCAACAAAA CCCTGCAAAC 
GCCCAAAGCA CATTAAATTG 
GCTGTAATAG CTGCTTTGGA 
GTCAGGATAC CCTGCCCTTC 
GATCCCATTG TCTGTCTTGC 
ATCCCAGTGG TGCCTCGGGC 
GAGCCTCCAG ATTATTCACC 
CCCCTTTCAC CCCAGCCTTC 
CAGTCTGAAA CGATCTCTTC 
GTGAAAGCCT CATTTTCCTC 
GCACCTCCTG TCCAGACAGA 
GTGTTGCAGA TGGAGAAGGC 
ATGATCAACC AAGTCAGCAG 
CAAAGATTGC TGAAAGTAGT 
ATAAGTCTAA CCTCCCCTTC 
AACACAACTA CCTTTGTGGC 
GCTCCTGAGA ACAGTATTGG 




ACTCCTTCAT 
GGATGACATT 
TTTGGCTCTG 
CCAAGACCCT 
CACAATTACT 
1144



WO 03/042661 



CTTCCTTCAT CGCTGATGAA TAATTTACCA GCTCATGACA TGGAGCTAGC TTCCAGGGTT 1440 

CAGTTCAATT TTTTTGAAAC ACCTGCTTTG TTTCAGGATC CTTCCCTGGA GAACCTCTCT 1500

CTGATCAGCT ACGTCATATC ATCGAGTGTT GCAAACCTGA CCGTCAGGAA CTTGACAAGA 1560

AACGTGACAG TCACATTAAA GCACATCAAC CCGAGCCAGG ATGAGTTAAC AGTGAGATGT 1620 

GTATTTTGGG ACTTGGGCAG AAATGGTGGC AGAGGAGGCT GGTCAGACAA TGGCTGCTCT 1680 

GTCAAAGACA GGAGATTGAA TGAAACCATC TGTACCTGTA GCCATCTAAC AAGCTTCGGC 1740 

GTTCTGCTGG ACCTATCTAG GACATCTGTG CTGCCTGCTC AAATGATGGC TCTGACGTTC 1800 

ATTACATATA TTGGTTGTGG GCTTTCATCA ATTTTTCTGT CAGTGACTCT TGTAACCTAC 1860 

ATAGCTTTTG AAAAGATCCG GAGGGATTAC CCTTCCAAAA TCCTCATCCA GCTGTGTGCT 1920 

GCTCTGCTTC TGCTGAACCT GGTCTTCCTC CTGGACTCGT GGATTGCTCT GTATAAGATG 1980 

CAAGGCCTCT GCATCTCAGT GGCTGTATTT CTTCATTATT TTCTCTTGGT CTCATTCACA 2040 

TGGATGGGCC TAGAAGCATT CCATATGTAC CTGGCCCTTG TCAAAGTATT TAATACTTAC 2100 

ATCCGAAAAT ACATCCTTAA ATTCTGCATT GTCGGTTGGG GGGTACCAGC TGTGGTTGTG 2160 

ACCATCATCC TGACTATATC CCCAGATAAC TATGGGCTTG GATCCTATGG GAAATTCCCC 2220 

AATGGTTCAC CGGATGACTT CTGCTGGATC AACAACAATG CAGTATTCTA CATTACGGTG 2280 

GTGGGATATT TCTGTGTGAT ATTTTTGCTG AACGTCAGCA TGTTCATTGT GGTCCTGGTT 2340 

CAGCTCTGTC GAATTAAAAA GAAGAAGCAA CTGGGAGCCC AGCGAAAAAC CAGTATTCAA 2400

GACCTCAGGA GTATCGCTGG CCTTACATTT TTACTGGGAA TAACTTGGGG CTTTGCCTTC 2460 

TTTGCCTGGG GACCAGTTAA CGTGACCTTC ATGTATCTGT TTGCCATCTT TAATACCTTA 2520 

CAAGGATTTT TCATATTCAT CTTTTACTGT GTGGCCAAAG AAAATGTCAG GAAGCAATGG 2580 

AGGCGGTATC TTTGTTGTGG AAAGTTACGG CTGGCTGAAA ATTCTGGAAA TGCTTCTACA 2640 

GAGAGGAATG GGGTCTCTTT TAGTGTTCAG AATGGAGATG TGTGCCTTCA CGATTTCACT 2700 

GGAAAACAGC ACATGTTTAA CGAGAAGGAA GATTCCTGCA ATGGGAAAGG CCGTATGGCT 2760 
CTCAGAAGGA CTTCAAAGCG GGGAAGCTTA CACTTTATTG AGCAAATGTG A 

Seq ID NO: 490 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HVVLVTSLEE DTDNSSLSPP PEVETTSLND 60

VTLSLLPSNE TGVKPQRNIC NLSSICNDSA FFRGEIMFQY DKESTVPQNQ HITNGTLTGV 120 

LSLSELKRSE LNKTLQTLSE TYFIMCATAE AQSTLNCTFT IKLNNTMNAC AVIAALERVK 180

IRPMEHCCCS VRIPCPSSPE ELEKLQCDLQ DPIVCLADHP RGPPFSSSQS IPVVPRATVL 240

SQVPKATSFA EPPDYSPVTH NVPSPIGEIQ PLSPQPSAPI ASSPAIDMPP QSETISSPMP 300 

QTHVSGTPPP VKASFSSPTV SAPANVNTTS APPVQTDIVN TSSISDLENQ VLQMEKALSL 360

GSLEPNLAGE MINQVSRLLH SPPDMLAPLA QRLLKVVDDI GLQLNFSNTT ISLTSPSLAL 420

AVIRVNASSF NTTTFVAQDP ANLQVSLETQ APENSIGTIT LPSSLMNNLP AHDMELASRV 480 

QFNFFETPAL FQDPSLENLS LISYVISSSV ANLTVRNLTR NVTVTLKHIN PSQDELTVRC 540 

VFWDLGRNGG RGGWSDNGCS VKDRRLNETI CTCSHLTSFG VLLDLSRTSV LPAQMMALTF 600 

ITYIGCGLSS IFLSVTLVTY IAFEKIRRDY PSKILIQLCA ALLLLNLVFL LDSWIALYKM 660 

QGLCISVAVF LHYFLLVSFT WMGLEAFHMY LALVKVFNTY IRKYILKFCI VGWGVPAVVV 720

TIILTISPDN YGLGSYGKFP NGSPDDFCWI NNNAVFYITV VGYFCVIFLL NVSMFIVVLV 780

QLCRIKKKKQ LGAQRKTSIQ DLRSIAGLTF LLGITWGFAF FAWGPVNVTF MYLFAIFNTL 840 

QGFFIFIFYC VAKENVRKQW RRYLCCGKLR LAENSGNAST ERNGVSFSVQ NGDVCLHDFT 900
GKQHMFNEKE DSCNGKGRMA LRRTSKRGSL HFIEQM 

Seq ID NO: 491 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3045

1 11 21 31 41 51

I I I I I I

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 60

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 120

GATACTGATA ATTCCAGTTT GTCACCACCA CCTGCTAAAT TATCTGTTGT CAGTTTTGCC 180 

CCCTCCTCCA ATGAGGTTGA AACAACAAGC CTCAATGATG TTACTTTAAG CTTACTCCCT 240 

TCAAACGAAA CAGAAAAAAC TAAAATCACT ATAGTAAAAA CCTTCAATGC TTCAGGCGTC 300 

AAACCCCAGA GAAATATCTG CAATTTGTCA TCTATTTGCA ATGACTCAGC ATTTTTTAGA 360 

GGTGAGATCA TGTTTCAATA TGATAAAGAA AGCACTGTTC CCCAGAATCA ACATATAACG 420 

AATGGCACCT TAACTGGAGT CCTGTCTCTA AGTGAATTAA AACGCTCAGA GCTCAACAAA 480 

ACCCTGCAAA CCCTAAGTGA GACTTACTTT ATAATGTGTG CTACAGCAGA GGCCCAAAGC 540 

ACATTAAATT GTACATTCAC AATAAAACTG AATAATACAA TGAATGCATG TGCTGCAATA 600 

GCCGCTTTGG AAAGAGTAAA GATTCGACCA ATGGAACACT GCTGCTGTTC TGTCAGGATA 660 

CCCTGCCCTT CCTCCCCAGA AGAGTTGGGA AAGCTTCAGT GTGACCTGCA GGATCCCATT 720 

GTCTGTCTTG CTGACCATCC ACGTGGCCCA CCATTTTCTT CCAGCCAATC CATCCCAGTG 780 

GTGCCTCGGG CCACTGTGCT TTCCCAGGTC CCCAAAGCTA CCTCTTTTGC TGAGCCTCCA 840

GATTATTCAC CTGTGACCCA CAATGTTCCC TCTCCAATAG GGGAGATTCA ACCCCTTTCA 900 

CCCCAGCCTT CAGCTCCCAT AGCTTCCAGC CCTGCCATTG ACATGCCCCC ACAGTCTGAA 960 

ACGATCTCTT CCCCTATGCC CCAAACCCAT GTCTCCGGCA CCCCACCTCC TGTGAAAGCC 1020 

TCATTTTCCT CTCCCACCGT GTCTGCCCCT GCGAATGTCA ACACTACCAG CGCACCTCCT 1080 

GTCCAGACAG ACATCGTCAA CACCAGCAGT ATTTCTGATC TTGAGAACCA AGTGTTGCAG 1140 

ATGGAGAAGG CTCTGTCCTT GGGCAGCCTG GAGCCTAACC TCGCAGGAGA AATGATCAAC 1200 

CAAGTCAGCA GACTCCTTCA TTCCCCGCCT GACATGCTGG CCCCTCTGGC TCAAAGATTG 1260 

CTGAAAGTAG TGGATGACAT TGGCCTACAG CTGAACTTTT CAAACACGAC TATAAGTCTA 1320 

ACCTCCCCTT CTTTGGCTCT GGCTGTGATC AGAGTGAATG CCAGTAGTTT CAACACAACT 1380 

ACCTTTGTGG CCCAAGACCC TGCAAATCTT CAGGTTTCTC TGGAAACCCA AGCTCCTGAG 1440 

AACAGTATTG GCACAATTAC TCTTCCTTCA TCGCTGATGA ATAATTTACC AGCTCATGAC 1500 

ATGGAGCTAG CTTCCAGGGT TCAGTTCAAT TTTTTTGAAA CACCTGCTTT GTTTCAGGAT 1560 

CCTTCCCTGG AGAACCTCTC TCTGATCAGC TACGTCATAT CATCGAGTGT TGCAAACCTG 1620

ACCGTCAGGA ACTTGACAAG AAACGTGACA GTCACATTAA AGCACATCAA CCCGAGCCAG 1680 

GATGAGTTAA CAGTGAGATG TGTATTTTGG GACTTGGGCA GAAATGGTGG CAGAGGAGGC 1740 

TGGTCAGACA ATGGCTGCTC TGTCAAAGAC AGGAGATTGA ATGAAACCAT CTGTACCTGT 1800 

AGCCATCTAA CAAGCTTCGG CGTTCTGCTG GACCTATCTA GGACATCTGT GCTGCCTGCT 1860 
CAAATGATGG CTCTGACGTT CATTACATAT ATTGGTTGTG GGCTTTCATC AATTTTTCTG 1920

TCAGTGACTC TTGTAACCTA CATAGCTTTT GAAAAGATCC GGAGGGATTA CCCTTCCAAA 1980 

ATCCTCATCC AGCTGTGTGC TGCTCTGCTT CTGCTGAACC TGGTCTTCCT CCTGGACTCG 2040 

TGGATTGCTC TGTATAAGAT GCAAGGCCTC TGCATCTCAG TGGCTGTATT TCTTCATTAT 2100 

TTTCTCTTGG TCTCATTCAC ATGGATGGGC CTAGAAGCAT TCCATATGTA CCTGGCCCTT 2160 

GTCAAAGTAT TTAATACTTA CATCCGAAAA TACATCCTTA AATTCTGCAT TGTCGGTTGG 2220 

GGGGTACCAG CTGTGGTTGT GACCATCATC CTGACTATAT CCCCAGATAA CTATGGGCTT 2280 

GGATCCTATG GGAAATTCCC CAATGGTTCA CCGGATGACT TCTGCTGGAT CAACAACAAT 2340 

GCAGTATTCT ACATTACGGT GGTGGGATAT TTCTGTGTGA TATTTTTGCT GAACGTCAGC 2400 

ATGTTCATTG TGGTCCTGGT TCAGCTCTGT CGAATTAAAA AGAAGAAGCA ACTGGGAGCC 2460 

CAGCGAAAAA CCAGTATTCA AGACCTCAGG AGTATCGCTG GCCTTACATT TTTACTGGGA 2520 

ATAACTTGGG GCTTTGCCTT CTTTGCCTGG GGACCAGTTA ACGTGACCTT CATGTATCTG 2580 

TTTGCCATCT TTAATACCTT ACAAGGATTT TTCATATTCA TCTTTTACTG TGTGGCCAAA 2640 

GAAAATGTCA GGAAGCAATG GAGGCGGTAT CTTTGTTGTG GAAAGTTACG GCTGGCTGAA 2700

AATTCTGACT GGAGTAAAAC TGCTACTAAT GGTTTAAAGA AGCAGACTGT AAACCAAGGA 2760 

GTGTCCAGCT CTTCAAATTC CTTACAGTCA AGCAGTAACT CCACTAACTC CACCACACTG 2820 

CTAGTGAATA ATGATTGCTC AGTACACGCA AGCGGGAATG GAAATGCTTC TACAGAGAGG 2880

AATGGGGTCT CTTTTAGTGT TCAGAATGGA GATGTGTGCC TTCACGATTT CACTGGAAAA 2940 

CAGCACATGT TTAACGAGAA GGAAGATTCC TGCAATGGGA AAGGCCGTAT GGCTCTCAGA 3000 
AGGACTTCAA AGCGGGGAAG CTTACACTTT ATTGAGCAAA TGTGA 

Seq ID NO: 492 Protein sequence
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HVVLVTSLEE DTDNSSLSPP PAKLSVVSFA 60

PSSNEVETTS LNDVTLSLLP SNETEKTKIT IVKTFNASGV KPQRNICNLS SICNDSAFFR 120 

GEIMFQYDKE STVPQNQHIT NGTLTGVLSL SELKRSELNK TLQTLSETYF IMCATAEAQS 180 

TLNCTFTIKL NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEELG KLQCDLQDPI 240 

VCLADHPRGP PFSSSQSIPV VPRATVLSQV PKATSFAEPP DYSPVTHNVP SPIGEIQPLS 300 

PQPSAPIASS PAIDMPPQSE TISSPMPQTH VSGTPPPVKA SFSSPTVSAP ANVNTTSAPP 360

VQTDIVNTSS ISDLENQVLQ MEKALSLGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL 420 

LKVVDDIGLQ LNFSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 480

NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVANL 540 

TVRNLTRNVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLNETICTC 600 

SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIFL SVTLVTYIAF EKIRRDYPSK 660 

ILIQLCAALL LLNLVFLLDS WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 720

VKVFNTYIRK YILKFCIVGW GVPAVVVTII LTISPDNYGL GSYGKFPNGS PDDFCWINNN 780

AVFYITVVGY FCVIFLLNVS MFIVVLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 840

ITWGFAFFAW GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKQWRRY LCCGKLRLAE 900 

NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGNASTER 960 
NGVSFSVQNG DVCLHDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF IEQM 

Seq ID NO: 493 DNA sequence 
Nucleic Acid Accession #: NM_015507
Coding sequence: 241..1902

1 11 21 31 41 51

I I I I I I

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACAGG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA



Seq ID NO: 494 Protein sequence 
Protein Accession #: NP_056322



1 11 21 31 41 51

I I I I I I

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFKTQ 240
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540
SGLCPDSLLS VDD



Seq ID NO: 495 DNA sequence 

Nucleic Acid Accession #: NM_003506.1

Coding sequence: 259..2379

1 11 21 31 41 51

I I I I I I

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60
TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120
CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180
ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240
ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300
CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360
ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420
GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480
ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540
TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600
ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660
GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720
AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780
TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840
CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900
TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960
ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020
GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080
CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140
GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200
TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260
CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320
GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380
CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440
CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500
ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560
TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620
CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680
TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740
GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800
CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860
TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920
TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980
ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040
ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100
TCTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160
GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220
AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280
TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340
AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400
CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460
TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520
TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580
AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640
CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700
TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760
GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820
ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880
ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940
GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000
TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060
GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120
ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180
GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240
AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG

Seq ID NO: 496 Protein sequence 
Protein Accession #: NP_003497.1 

1 11 21 31 41 51 

I I I I I I

MEMFTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60

FLPLANLECS PNIETFLCKA FVPTCIEQIH VVPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120

LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ 180 

CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240

YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTVVLGSQNK ACTVLFMLLY FFTMAGTVWW 300

VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360 

LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420 

SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 

LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 

KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 

KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 

Seq ID NO: 497 DNA sequence 
Nucleic Acid Accession #: NM_005046
Coding sequence: 16..777

1 11 21 31 41 51 

I I I I I I 

GGATTTCCGG GCTCCATGGC AAGATCCCTT CTCCTGCCCC TGCAGATCCT ACTGCTATCC 60 
TTAGCCTTGG AAACTGCAGG AGAAGAAGCC CAGGGTGACA AGATTATTGA TGGCGCCCCA 120 
TGTGCAAGAG GCTCCCACCC ATGGCAGGTG GCCCTGCTCA GTGGCAATCA GCTCCACTGC 180 
GGAGGCGTCC TGGTCAATGA GCGCTGGGTG CTCACTGCCG CCCACTGCAA GATGAATGAG 240 
TACACCGTGC ACCTGGGCAG TGATACGCTG GGCGACAGGA GAGCTCAGAG GATCAAGGCC 300 
TCGAAGTCAT TCCGCCACCC CGGCTACTCC ACACAGACCC ATGTTAATGA CCTCATGCTC 360 
GTGAAGCTCA ATAGCCAGGC CAGGCTGTCA TCCATGGTGA AGAAAGTCAG GCTGCCCTCC 420 
CGCTGCGAAC CCCCTGGAAC CACCTGTACT GTCTCCGGCT GGGGCACTAC CACGAGCCCA 480 
GATGTGACCT TTCCCTCTGA CCTCATGTGC GTGGATGTCA AGCTCATCTC CCCCCAGGAC 540 
TGCACGAAGG TTTACAAGGA CTTACTGGAA AATTCCATGC TGTGCGCTGG CATCCCCGAC 600 
TCCAAGAAAA ACGCCTGCAA TGGTGACTCA GGGGGACCGT TGGTGTGCAG AGGTACCCTG 660 
CAAGGTCTGG TGTCCTGGGG AACTTTCCCT TGCGGCCAAC CCAATGACCC AGGAGTCTAC 720 
ACTCAAGTGT GCAAGTTCAC CAAGTGGATA AATGACACCA TGAAAAAGCA TCGCTAACGC 780 
CACACTGAGT TAATTAACTG TGTGCTTCCA ACAGAAAATG CACAGGAGTG AGGACGCCGA 840 
TGACCTATGA AGTCAAATTT GACTTTACCT TTCCTCAAAG ATATATTTAA ACCTCATGCC 900 
CTGTTGATAA ACCAATCAAA TTGGTAAAGA CCTAAAACCA AAACAAATAA AGAAACACAA 960 
AACCCTCAA 

Seq ID NO: 498 Protein sequence 
Protein Accession #: NP_005037 

1 11 21 31 41 51 

I I I I I I

MARSLLLPLQ ILLLSLALET AGEEAQGDKI IDGAPCARGS HPWQVALLSG NQLHCGGVLV 60
NERWVLTAAH CKMNEYTVHL GSDTLGDRRA QRIKASKSFR HPGYSTQTHV NDLMLVKLNS 120 
QARLSSMVKK VRLPSRCEPP GTTCTVSGWG TTTSPDVTFP SDLMCVDVKL ISPQDCTKVY 180 
KDLLENSMLC AGIPDSKKNA CNGDSGGPLV CRGTLQGLVS WGTFPCGQPN DPGVYTQVCK 240 
FTKWINDTMK KHR 

Seq ID NO: 499 DNA sequence 
Nucleic Acid Accession #: NM_007196
Coding sequence: 182..962

1 11 21 31 41 51 

I I I I I I 

GTTCCCAGAA GCTCCCCAGG CTCTAGTGCA GGAGGAGAAG GAGGAGGAGC AGGAGGTGGA 60 

GATTCCCAGT TAAAAGGCTC CAGAATCGTG TACCAGGCAG AGAACTGAAG TACTGGGGCC 120 

TCCTCCACTG GGTCCGAATC AGTAGGTGAC CCCGCCCCTG GATTCTGGAA GACCTCACCA 180 

TGGGACGCCC CCGACCTCGT GCGGCCAAGA CGTGGATGTT CCTGCTCTTG CTGGGGGGAG 240

CCTGGGCAGG ACACTCCAGG GCACAGGAGG ACAAGGTGCT GGGGGGTCAT GAGTGCCAAC 300 

CCCATTCGCA GCCTTGGCAG GCGGCCTTGT TCCAGGGCCA GCAACTACTC TGTGGCGGTG 360 

TCCTTGTAGG TGGCAACTGG GTCCTTACAG CTGCCCACTG TAAAAAACCG AAATACACAG 420 

TACGCCTGGG AGACCACAGC CTACAGAATA AAGATGGCCC AGAGCAAGAA ATACCTGTGG 480 

TTCAGTCCAT CCCACACCCC TGCTACAACA GCAGCGATGT GGAGGACCAC AACCATGATC 540 

TGATGCTTCT TCAACTGCGT GACCAGGCAT CCCTGGGGTC CAAAGTGAAG CCCATCAGCC 600 

TGGCAGATCA TTGCACCCAG CCTGGCCAGA AGTGCACCGT CTCAGGCTGG GGCACTGTCA 660

CCAGTCCCCG AGAGAATTTT CCTGACACTC TCAACTGTGC AGAAGTAAAA ATCTTTCCCC 720 

AGAAGAAGTG TGAGGATGCT TACCCGGGGC AGATCACAGA TGGCATGGTC TGTGCAGGCA 780 

GCAGCAAAGG GGCTGACACG TGCCAGGGCG ATTCTGGAGG CCCCCTGGTG TGTGATGGTG 840 

CACTCCAGGG CATCACATCC TGGGGCTCAG ACCCCTGTGG GAGGTCCGAC AAACCTGGCG 900 

TCTATACCAA CATCTGCCGC TACCTGGACT GGATCAAGAA GATCATAGGC AGCAAGGGCT 960 
GATTCTAGGA TAAGCACTAG ATCTCCCTTA ATAAACTCAC AACTCTC 

Seq ID NO: 500 Protein sequence 
Protein Accession #: NP_009127



1 11 21 31 41 51 
MGRPRPRAAK TWMFLLLLGG AWAGHSRAQE DKVLGGHECQ PHSQPWQAAL FQGQQLLCGG 60

VLVGGNWVLT AAHCKKPKYT VRLGDHSLQN KDGPEQEIPV VQSIPHPCYN SSDVEDHNHD 120 

LMLLQLRDQA SLGSKVKPIS LADHCTQPGQ KCTVSGWGTV TSPRENFPDT LNCAEVKIFP 180 

QKKCEDAYPG QITDGMVCAG SSKGADTCQG DSGGPLVCDG ALQGITSWGS DPCGRSDKPG 240 
VYTNICRYLD WIKKIIGSKG 



Seq ID NO: 501 DNA sequence
Nucleic Acid Accession #: NM_006103
Coding sequence: 29..406

1 11 21 31 41 51

CACCTGCACC CCGCCCGGGC ATAGCACCAT GCCTGCTTGT CGCCTAGGCC CGCTAGCCGC 60 

CGCCCTCCTC CTCAGCCTGC TGCTGTTCGG CTTCACCCTA GTCTCAGGCA CAGGAGCAGA 120 

GAAGACTGGC GTGTGCCCCG AGCTCCAGGC TGACCAGAAC TGCACGCAAG AGTGCGTCTC 180 

GGACAGCGAA TGCGCCGACA ACCTCAAGTG CTGCAGCGCG GGCTGTGCCA CCTTCTGCCT 240 

TCTCTGCCCA AATGATAAGG AGGGTTCCTG CCCCCAGGTG AACATTAACT TTCCCCAGCT 300 

CGGCCTCTGT CGGGACCAGT GCCAGGTGGA CAGCCAGTGT CCTGGCCAGA TGAAATGCTG 360 

CCGCAATGGC TGTGGGAAGG TGTCCTGTGT CACTCCCAAT TTCTGAGGTC CAGCCACCAC 420 

CAGGCTGAGC AGTGAGGAGA GAAAGTTTCT GCCTGGCCCT GCATCTGGTT CCAGCCCACC 480 

TGCCCTCCCC TTTTTCGGGA CTCTGTATTC CCTCTTGGGC TGACCACAGC TTCTCCCTTT 540 
CCCAACCAAT AAAGTAACCA CTTTCAGCAA AAAAAAAAAA AAAA 



Seq ID NO: 502 Protein sequence 
Protein Accession #: NP_006094

1 11 21 31 41 51 

MPACRLGPLA AALLLSLLLF GFTLVSGTGA EKTGVCPELQ ADQNCTQECV SDSECADNLK 60
CCSAGCATFC LLCPNDKEGS CPQVNINFPQ LGLCRDQCQV DSQCPGQMKC CRNGCGKVSC 120 
VTPNF 



Seq ID NO: 503 DNA sequence
Nucleic Acid Accession #: NM_002407
Coding sequence: 65..352

1 11 21 31 41 51 

CCTCCACAGC AACTTCCTTG ATCCCTGCCA CGCACGACTG AACACAGACA GCAGCCGCCT 60

CGCCATGAAG CTGCTGATGG TCCTCATGCT GGCGGCCCTC CTCCTGCACT GCTATGCAGA 120 

TTCTGGCTGC AAACTCCTGG AGGACATGGT TGAAAAGACC ATCAATTCCG ACATATCTAT 180 

ACCTGAATAC AAAGAGCTTC TTCAAGAGTT CATAGACAGT GATGCCGCTG CAGAGGCTAT 240 

GGGGAAATTC AAGCAGTGTT TCCTCAACCA GTCACATAGA ACTCTGAAAA ACTTTGGACT 300 

GATGATGCAT ACAGTGTACG ACAGCATTTG GTGTAATATG AAGAGTAATT AACTTTACCC 360 

AAGGCGTTTG GCTCAGAGGG CTACAGACTA TGGCCAGAAC TCATCTGTTG ATTGCTAGAA 420 

ACCACTTTTC TTTCTTGTGT TGTCTTTTTA TGTGGAAACT GCTAGACAAC TGTTGAAACC 480 
TCAAATTCAT TTCCATTTCA ATAACTAACT GCAAATC 



Seq ID NO: 504 Protein sequence 
Protein Accession #: NP_002398

1 11 21 31 41 51 

I I I I I I

MKLLMVLMLA ALLLHCYADS GCKLLEDMVE KTINSDISIP EYKELLQEFI DSDAAAEAMG 60
KFKQCFLNQS HRTLKNFGLM MHTVYDSIWC NMKSN 



Seq ID NO: 505 DNA sequence 

Nucleic Acid Accession #: NM_014791.1

Coding sequence: 171..2126

1 11 21 31 41 51 

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTCTTAGGAA CGCCGTACCA 60

GCCGCGTCTC TCAGGACAGC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 120 

TCCGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGAGGACT ATGAAAGATT 180

ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 240 

AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA 300 

AAAACACACT AGGGAGTGAT TTGCCCCGGA TCAAAACGGA GATTGAGGCC TTGAAGAACC 360 

TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAGCCAAC AAAATATTCA 420 

TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAATTTCC CAGGATCGCC 480 

TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 540 

ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT GATGAATATC 600 

ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC AAGGATTACC 660 

ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720 

CATATCTTGG ATCAGAGGCA GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 780 

GTGGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAGAG 840 

GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT CAACAAATGC 900 

TGCAGGTGGA CCCAAAGAAA CGGATTTCTA TGAAAAATCT ATTGAACCAT CCCTGGATCA 960

TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTATT CACCTCGATG 1020 

ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080 

TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140 

AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT CTCCTGTGGA CAAGCCAGTG 1200 

CTACCCCATT CACAGACATC AAGTCAAATA ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260 
ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG TGAAGATGAT TTATCAACAG 1320

GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380 

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGA GTACTTTATG TTTCCTGAGC 1500 

CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATCGTT 1560 

ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620 

TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680 

GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740 

CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800 

GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC TATAATGTGA 1860

CTACAACTAG ATTAGTGAAT CCAGATCAAC TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920 

AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980 

TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCOGATGTGG 2040 

TGGGTATCAG GAGGCAGCGG CTTAAGGGCG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100 

ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 

GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220 

CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAAAA 2280 

GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 

TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG 2400 

AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 
TTGTGAATAT 



Seq ID NO: 506 Protein sequence 
Protein Accession #: NP_055606.1 

1 11 21 31 41 51 

| | | | | |

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK IMDKNTLGSD LPRIKTEIEA 60 

LKNLRHQHIC QLYHVLETAN KIFMVLEYCP GGELFDYIIS QDRLSEEETR VVFRQIVSAV 120

AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SLAYAAPELI 180 

QGKSYLGSEA DVWSMGILLY VLMCGFLPFD DDNVMALYKK IMRGKYDVPK WLSPSSILLL 240 

QQMLQVDPKK RISMKNLLNH PWIMQDYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT 300 

MEDLISLWQY DHLTATYLLL LAKKARGKPV RLRLSSFSCG QASATPFTDI KSNNWSLEDV 360 

TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTES NGVESKSLTP ALCRTPANKL 420 

KNKENVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI 480 

KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 540 

LTRSKRKGSA RDGPRRLKLH YNVTTTRLVN PDQLLNEIMS ILPKKHVDFV QKGYTLKCQT 600 
QSDFGKVTMQ FELEVCQLQK PDVVGIRRQR LKGDAWVYKR LVEDILSSCK V



Seq ID NO: 507 DNA sequence
Nucleic Acid Accession #: NM_000582
Coding sequence: 88..990

1 11 21 31 41 51

| | | | | |

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 

GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 

TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 

GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 508 Protein sequence 
Protein Accession #: NP_000573 

1 11 21 31 41 51 

| | | | | |

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60

PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120

DFPTDLPATE VFTPVVPTVD TYDGRGDSVV YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180

SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LVVDPKSKEE DKHLKFRISH ELDSASSEVN

Seq ID NO: 509 DNA sequence

Nucleic Acid Accession #: AB051390.1

Coding sequence: 34..2457

1 11 21 31 41 51

| | | | | |

AGCGGCCGCG GCACAAAGTT GGGGGCCGCG AAGATGAGGC TGTCCCCGGC GCCCCTGAAG 60 
CTGAGCCGGA CTCCGGCACT GCTGGCCCTG GCGCTGCCCC TGGCCGCGGC GCTGGCCTTC 120 
TCCGACGAGA CCCTGGACAA AGTGCCCAAG TCAGAGGGCT ACTGCAGCCG TATCCTGCGC 180 
GCCCAGGGCA CGCGGCGCGA GGGCTACACC GAGTTCAGCC TCCGCGTGGA GGGCGACCCC 240 
GACTTCTACA AGCCGGGAAC CAGCTACCGC GTAACACTTT CAGCTGCTCC TCCCTCCTAC 300 
TTCAGAGGAT TCACATTAAT TGCCCTCAGA GAGAACAGAG AGGGTGATAA GGAAGAAGAC 360 
CATGCTGGGA CCTTCCAGAT CATAGACGAA GAAGAAACTC AGTTTATGAG CAATTGCCCT 420 
GTTGCAGTCA CTGAAAGCAC TCCACGGAGG AGGACCCGGA TCCAGGTGTT TTGGATAGCA 480 
CCACCAGCGG GAACAGGCTG CGTGATTCTG AAGGCCAGCA TCGTACAAAA ACGCATTATT 540 
TATTTTCAAG ATGAGGGCTC TCTGACCAAG AAACTTTGTG AACAAGATTC CACATTTGAT 600 
GGGGTGACTG ACAAACCCAT CTTAGACTGC TGTGCCTGCG GAACTGCCAA GTACAGACTC 660 
ACATTTTATG GGAATTGGTC CGAGAAGACA CACCCAAAGG ATTACCCTCG TCGGGCCAAC 720 
CACTGGTCTG CGATCATCGG AGGATCCCAC TCCAAGAATT ATGTACTGTG GGAATATGGA 780 
GGATATGCCA GCGAAGGCGT CAAACAAGTT GCAGAATTGG GCTCACCCGT GAAAATGGAG 840 
GAAGAAATTC GACAACAGAG TGATGAGGTC CTCACCGTCA TCAAAGCCAA AGCCCAATGG 900 
CCAGCCTGGC AGCCTCTCAA CGTGAGAGCA GCACCTTCAG CTGAATTTTC CGTGGACAGA 960 
ACGCGCCATT TAATGTCCTT CCTGACCATG ATGGGCCCTA GTCCCGACTG GAACGTAGGC 1020 
TTATCTGCAG AAGATCTGTG CACCAAGGAA TGTGGCTGGG TCCAGAAGGT GGTGCAAGAC 1080 
CTGATTCCCT GGGACGCTGG CACCGACAGC GGGGTGACCT ATGAGTCACC CAACAAACCC 1140 
ACCATTCCCC AGGAGAAAAT CCGGCCCCTG ACCAGCCTGG ACCATCCTCA GAGTCCTTTC 1200 
TATGACCCAG AGGGTGGGTC CATCACTCAA GTAGCCAGAG TTGTCATCGA GAGAATCGCA 1260 
CGGAAGGGTG AACAATGCAA TATTGTACCT GACAATGTCG ATGATATTGT AGCTGACCTG 1320 
GCTCCAGAAG AGAAAGATGA AGATGACACC CCTGAAACCT GCATCTACTC CAACTGGTCC 1380 
CCATGGTCCG CCTGCAGCTC CTCCACCTGT GACAAAGGCA AGAGGATGCG ACAGCGCATG 1440 
CTGAAAGCAC AGCTGGACCT CAGCGTCCCC TGCCCTGACA CCCAGGACTT CCAGCCCTGC 1500 
ATGGGCCCTG GCTGCAGTGA CGAAGACGGC TCCACCTGCA CCATGTCCGA GTGGATCACC 1560 
TGGTCGCCCT GCAGCATCTC CTGCGGCATG GGCATGAGGT CCCGGGAGAG GTATGTGAAG 1620 
CAGTTCCCGG AGGACGGCTC CGTGTGCACG CTGCCCACTG AGGAAACGGA GAAGTGCACG 1680 
GTCAACGAGG AGTGCTCTCC CAGCAGCTGC CTGATGACCG AGTGGGGCGA GTGGGACGAG 1740 
TGCAGCGCCA CCTGCGGCAT GGGCATGAAG AAGCGGCACC GCATGATCAA GATGAACCCC 1800 
GCAGATGGCT CCATGTGCAA AGCCGAGACA TCACAGGCAG AGAAGTGCAT GATGCCAGAG 1860
TGCCACACCA TCCCATGCTT GCTGTCCCCA TGGTCCGAGT GGAGTGACTG CAGCGTGACC 1920 
TGCGGGAAGG GCATGCGAAC CCGACAGCGG ATGCTCAAGT CTCTGGCAGA ACTTGGAGAC 1980 
TGCAATGAGG ATCTGGAGCA GGTGGAGAAG TGCATGCTCC CTGAATGCCC CATTGACTGT 2040 
GAGCTCACCG AGTGGTCCCA GTGGTCGGAA TGTAACAAGT CATGTGGGAA AGGCCACGTG 2100 
ATTCGAACCC GGATGATCCA AATGGAGCCT CAGTTTGGAG GTGCACCCTG CCCAGAGACT 2160 
GTGCAGCGAA AAAAGTGCCG CATCCGAAAA TGCCTTCGAA ATCCATCCAT CCAAAAGCTA 2220 
CGCTGGAGGG AGGCCCGAGA GAGCCGGCGG AGTGAGCAGC TGAAGGAAGA GTCTGAAGGG 2280 
GAGCAGTTCC CAGGTTGTAG GATGCGCCCA TGGACGGCCT GGTCAGAATG CACCAAACTG 2340 
TGCGGAGGTG GAATTCAGGA ACGTTACATG ACTGTAAAGA AGAGATTCAA AAGCTCCCAG 2400 
TTTACCAGCT GCAAAGACAA GAAGGAGATC AGAGCATGCA ATGTTCATCC TTGTTAGCAA 2460 
GGGTACGAGT TCCCCAGGGC TGCACTCTAG ATTCCAGAGT CACCAATGGC TGGATTATTT 2520 
GCTTGTTTAA GACAATTTAA ATTGTGTACG CTAGTTTTCA TTTTTGCAGT GTGGTTCGCC 2580 
CAGTAGTCTT GTGGATGCCA GAGACATCCT TTCTGAATAC TTCTTGATGG GTACAGGCTG 2640 
AGTGGGGCGC CCTCACCTCC AGCCAGCCTC TTCCTGCAGA GGAGTAGTGT CAGCCACCTT 2700 
GTACTAAGCT GAAACATGTC CCTCTGGAGC TTCCACCTGG CCAGGGAGGA CGGAGACTTT 2760 
GACCTACTCC ACATGGAGAG GCAACCATGT CTGGAAGTGA CTATGCCTGA GTCCCAGGGT 2820 
GCGGCAGGTA GGAAACATTC ACAGATGAAG ACAGCAGATT CCCCACATTC TCATCTTTGG 2880 
CCTGTTCAAT GAAACCATTG TTTGCCCATC TCTTCTTAGT GGAACTTTAG GTCTCTTTTC 2940 
AAGTCTCCTC AGTCATCAAT AGTTCCTGGG GAAAAACAGA GCTGGTAGAC TTGAAGAGGA 3000 
GCATTGATGT TGGGTGGCTT TTGTTCTTTC ACTGAGAAAT TCGGAATACA TTTGTCTCAC 3060 
CCCTGATATT GGTTCCTGAT GCCCCCCCAA CAAAAATAAA TAAATAAATT ATGGCTGCTT 3120 
TATTTAAATA TAAGGTAGCT AGTTTTTACA CCTGAGATAA ATAATAAGCT TAGAGTGTAT 3180 
TTTTCCC^TG CTTTTGGGGG TTCAGAGGAG TATGTACAAT TCTTCTGGGA AGCCAGCCTT 3240 
CTGAACTTTT TGGTACTAAA TCCTTATTGG AACCAAGACA AAGGAAGCAA AATTGGTCTC 3300 
TTTAGAGACC AATTTGCCTA AATTTTAAAA TCTTCCTACA CACATCTAGA OGTTCAAGTT 3360 
TGCAAATCAG TTTTTAGCAA GAAAACATTT TTGCTATACA AACATTTTGC TAAGTCTGCC 3420 
CAAAGCCCCC CCAATGCATT CCTTCAACAA AATACAATCT CTGTACTTTA AAGTTATTTT 3480 
AGTCATGAAA TTTTATATGC AGAGAGAAAA AGTTACCGAG ACAGAAAACA AATCTAAGGG 3540 
AAAGGAATAT TATGGGATTA AGCTGAGCAA GCAATTCTGG TGGAAAGTCA AACCTGTCAG 3600 
TGCTCCACAC CAGGGCTGTG GTCCTCCCAG ACATGCATAG GAATGGCCAC AGGTTTACAC 3660 
TGCCTTCCCA GCAATTATAA GCACACCAGA TTCAGGGAGA CTGACCACCA AGGGATAGTG 3720 
TAAAAGGACA TTTTCTCAGT TGGGTCCATC AGCAGTTTTT CTTCCTGCAT TTATTGTTGA 3780 
AAACTATTGT TTCATTTCTT CTTTTATAGG CCTTATTACT GCTTAATCCA AATGTGTACC 3840 
ATTGGTGAGA CACATACAAT GCTCTGAATA CACTACGAAT TTGTATTAAA CACATCAGAA 3900 
TATTTCCAAA TACAACATAG TATAGTCCTG AATATGTACT TTTAACACAA GAGAGACTAT 3960 
TCAATAAAAA CTCACTGGGT CTTTCATGTC TTTAAGCTAA GTAAGTGTTC AGAAGGTTCT 4020 
TTTTTATATT GTCCTCCACC TCCATCATTT TCAATAAAAG ATAGGGCTTT TGCTCCCTTG 4080 
TTCTTGGAGG GACCATTATT ACATCTCTGA ACTACCTTTG TATCCAACAT GTTTTAAATC 4140 
CTTAAATGAA TTGCTTTCTC CCAAAAAAAG CACAATATAA AGAAACACAA GATTTAATTA 4200 
TTTTTCTACT TGGGGGGAAA AAAGTCCTCA TGTAGAAGCA CCCACTTTTG CAATGTTGTT 4260 
CTAAGCTATC TATCTAACTC TCAGCCCATG ATAAAGTTCC TTAAGCTGGT GATTCCTAAT 4320 
CAAGGACAAG CCACCCTAGT GTCTCATGTT TGTATTTGGT CCCAGTTGGG TACATTTTAA 4380 
AATCCTGATT TTGGAGACTT AAAACCAGGT TAATGGCTAA GAATGGGTAA CATGACTCTT 4440
GTTGGATTGT TATTTTTTGT TTGCAATGGG GAATTTATAA GAAGCATCAA GTCTCTTTCT 4500 
TACCAAAGTC TTGTTAGGTG GTTTATAGTT CTTTTGGCTA ACAAATCATT TTGGAAATAA 4560 
AGATTTTTTA CTACAAAAAT G 

Seq ID NO: 510 Protein sequence
Protein Accession #: BAB18461.1

1 11 21 31 41 51

| | | | | |

MRLSPAPLKL SRTPALLALA LPLAAALAFS DETLDKVPKS EGYCSRILRA QGTRREGYTE 60

FSLRVEGDPD FYKPGTSYRV TLSAAPPSYF RGFTLIALRE NREGDKEEDH AGTFQIIDEE 120

ETQFMSNCPV AVTESTPRRR TRIQVFWIAP PAGTGCVILK ASIVQKRIIY FQDEGSLTKK 180

LCEQDSTFDG VTDKPILDCC ACGTAKYRLT FYGNWSEKTH PKDYPRRANH WSAIIGGSHS 240

KNYVLWEYGG YASEGVKQVA ELGSPVKMEE EIRQQSDEVL TVIKAKAQWP AWQPLNVRAA 300

PSAEFSVDRT RHLMSFLTMM GPSPDWNVGL SAEDLCTKEC GWVQKVVQDL IPWDAGTDSG 360

VTYESPNKPT IPQEKIRPLT SLDHPQSPFY DPEGGSITQV ARVVIERIAR KGEQCNIVPD 420

NVDDIVADLA PEEKDEDDTP ETCIYSNWSP WSACSSSTCD KGKRMRQRML KAQLDLSVPC 480

PDTQDFQPCM GPGCSDEDGS TCTMSEWITW SPCSISCGMG MRSRERYVKQ FPEDGSVCTL 540

PTEETEKCTV NEECSPSSCL MTEWGEWDEC SATCGMGMKK RHRMIKMNPA DGSMCKAETS 600

QAEKCMMPEC HTIPCLLSPW SEWSDCSVTC GKGMRTRQRM LKSLAELGDC NEDLEQVEKC 660

MLPECPIDCE LTEWSQWSEC NKSCGKGHVI RTRMIQMEPQ FGGAPCPETV QRKKCRIRKC 720

LRNPSIQKLR WREARESRRS EQLKEESEGE QFPGCRMRPW TAWSECTKLC GGGIQERYMT 780
VKKRFKSSQF TSCKDKKEIR ACNVHPC
VKKRFKSSQF TSCKDKKEIR ACNVHPC 

Seq ID NO: 511 DNA sequence 

Nucleic Acid Accession #: NM_003108.1

Coding sequence: 76..1401

1 11 21 31 41 51

| | | | | |

GGGGTGGGAG GGGGAGGGGG ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 60
GCCCTGCAAC GGATCATGGT GCAGCAGGCG GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 120
CGGGAGGCGC TGGACACGGA GGAGGGCGAA TTCATGGCTT GCAGCCCGGT GGCCCTGGAC 180
GAGAGCGACC CAGACTGGTG CAAGACGGCG TCGGGCCACA TCAAGCGGCC GATGAACGCG 240
TTCATGGTAT GGTCCAAGAT CGAACGCAGG AAGATCATGG AGCAGTCTCC GGACATGCAC 300
AACGCCGAGA TCTCCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 360
ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 420
TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 480
CAGAGCCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 540
GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGGCA AGCTCAAGGC CCCCGCGGCC 600
GCGGGCGCCA AGGCGGGCGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGCGGGC 660
GACGACTACG TGCTGGGCAG CCTGCGCGTG AGCGGCTCGG GCGGCGGCGG CGCGGGCAAG 720
ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 780
CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 840
CTGCAGCCGC CGGGGCAGCA GCCGTCGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 900
CCCGCCAGCC CTACGCTGAG CAGCTCGGCG GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 960
GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 1020
AAGAACATCA CCAAGCAGCA CCCGCCGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 1080
TCGCGCTCGG TGTCCACCTC CTCGTCCAGC AGCAGCGGCA GCAGCAGCGG CAGCAGCGGC 1140
GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGCGCACAGC 1200
GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 1260
GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 1320
TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 1380
GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTCGCT CTTTCTCTCG GAGGGTGCAG 1440
AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 1500
ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 1560
ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 1620
TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 1680
TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 1740
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 1800
GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 1860
GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 1920
TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 1980
TTTTTAACAA AAAAAGGG




Seq ID NO: 512 Protein sequence 
Protein Accession #: NP_003099.1

1 11 21 31 41 51
| | | | | |
MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPMNAFMVWS 60
KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 120
RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA 180
GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 240
QEPDEEDEEP PHQQLLQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 300
ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGEDADD 360
LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEFPDYCTPE 420
LSEMIAGDWL EANFSDLVFT Y




Seq ID NO: 513 DNA sequence 

Nucleic Acid Accession #: CAT Cluster 

1 11 21 31 41 51 

GGTCGACCTA AATCTGATAA CTGGCTTATT ATGTAATTTA TTGGTGTTAT TATAGTAGAG 60
ATTGGTAATC TACAGTAAGA TTTTCAGTTA GGATTTGAGA TTATGATAAT AACTAATAGA 120
ATATTTCTAA ATTGGAATTA GAAGATTGTT GTATGACAGA GAGTCAGGAC TTGCCATTTG 180
GCAAACATCA AAGTCATTGT TTGGTGTGTA ATAGTACAAA ATCATCTTGC TTAACAGAGA 240
AAGGATATCT GTTGCTCCCG AATGAAACAA TTTTTCTGAA ATAGAGGGCC CAGAATTGGT 300
CTCTGACAAT TAATAAAGAC ATCAAAGATA GCAAAATGAT TTTTATATCT TAGGGCCAAT 360
ACTACCAATT TAATAATTAA AACAAGTTCT GGTGAGCTCT GAACTTGGCA GAATTGGTGG 420
CAACATAGAC TTTGGATTTT CCAAATTCCC CACATAAAAC AAAGGGGATC AACTAGATAG 480
AAAAACCAGA AACCTTTGGA AATATCTGTT TAAAAAAAAA AAAAAGTCGA CGCGGGCC




Seq ID NO: 514 DNA sequence

Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51
| | | | | |
GGAGCCACAG TGAAAGTCAA GAATGTCAGT GATTCCACAT TTAATATCTA CATTTTTGCA 60
GGGCAGTTAC TCTTTTGTAG TATAACATTG AGCTGATAGC ACATAGTGTA GACAAGTGAA 120
TACAGGATTC TCTGGGTTGT ATTCCCAGAA GTCTGGAGGT CATTTGGATA [illegible] 180
CTTGGCTTCA CTCTGACTTG TGTGACACAT AAAAATTGTG ATGAAATGTC CTATAGATGT 240
CCTGCAGGTC TTAAAAGAAC CTTTCCAAAC TATGAAACAG CCCAGCAGCA CTGAGTTAGA 300
GGTAAATTCT GAACCCTTGA ACACTAAAAC TATTCTAACT GCACATAGAA TTGGCAAGTA 360
GCATTCTATG TCTATGAACA GTATGTCTTT TCTATATAAC AGAGAAAATC TTTTTAAGCA 420
AACTACTCAG TTTAAAACCT AATTCTTCTC ATAATCTCAG TACTTTTGAA TGAAGACATA 480
TCAATGCAAC AGTACACTCT TATTCAGGCA TTTGAAAGAA AGAATTCGAG ATCTAGTTTG 540
TATCAGATAT TATAAATTAG TATGGTTTAG TCTTTGTCAT GAAATTCTAC TTAATTTTTG 600
GACTATAGGT TTAAGAATGT AAGCAGAAGT TCTGCACCAA TCAGAATAAG CTACATTATG 660
CTTGAGTGAC AACTACTGTA ATGACAAAAT ATCAGTGGCT TAATACAATG GTTTTTCTCT 720
CATACTTGTT CATAAAGAGT CAGCAAGGAC CCTGCTCATT ATGGTCCCTC AGGGACCCAG 780
GGTTGTTGGA AGCTCCACCA TTTTAGATAG CTCCCTTCAA AGTCAGCCAT CTTTGGCAGT 840
CCATGTCCCC CAACAGGCTG GCAAAATTTG GCTCTGGATG GCTTCAAGGA TTGAGCATCG 900
GGCAGTTTAA ATGCTTTCAA CATGGAAAGT GGACACCGGC CACTCCCACT CACATCCCTT 960
GGGCCAGAAC TAGGTCACTG GGCCCGGACC TAACTTOGGA GGGTTGGGGA ATTGTAATTC 1020
CTCCATGTAC CCAAAGTGGA GAGAAGCCAG ATACTGAGAA ACATCAATAA TGGCTAACAG 1080
AAATCCATTC TACCATTCCC TTTGCCTAAA GTGAAAAGAT GAGTACTTTC ATCAATTTGT 1140
AAACTGTACT TTTGAAGTAA ATCCTGGTAG CTTGCATGGG GGCTGGATTT CCAGAAAGCC 1200
ATATGTAATT TGGGAATGAC ATTCACTTAA GCTCATAGAA TATCATTATT TGATGTAAAA 1260
TGCCCTCATT TGCAATACAG GACCAAAATG CACTAACCAC AAAACCCCCC TCCCCACGGG 1320
GCCCGGGGTC CCTATTCCCC TCCATCCCTT TAAATGAGGC ATTCTATGAT TTGGAATGGA 1380
AGCCCAGTTG TAGTCGTAAG AATTTTACTT AATTCAAGAA TTATTCTCAC TGAATATGTG 1440
CCAGTTCTGA AAGGAATGCA AAGTCAAATT TTGCATCTTC TTTGCTCAAG GGCCTTTAGA 1500
TGTAACAACA CAGACATGAT ACAAGGCTGA CAATGACATT ATGATTTAAA TATGTTAAAC 1560
AACTTATTAA ATTGTGAATC AACAAAAAAT TATGTTCTTT ATTTTATGGT TTTGCATAGT 1620
CCTGACTCAC TGCCTACATA CCCCTCTTGT TCCTCAGTTC TTATCCCTGA TTTCTTACAG 1680
GATGGCCTAA GACAGCTGTA GATGTTTTTA TTTAGCAAAA AAAAAAAAAA AAAAGTCGAC 1740
GCGGCCGCGA ATTTAGTAG



Seq ID NO: 515 DNA sequence 
Nucleic Acid Accession #: NM_012427 
Coding sequence: 43..924

1 11 21 31 41 51
| | | | | |
CTTGTGGTTC CTCTCTACTT GGGGAAATCA GGTGCAGCGG CCATGGCTAC AGCAAGACCC 60
CCCTGGATGT GGGTGCTCTG TGCTCTGATC ACAGCCTTGC TTCTGGGGGT CACAGAGCAT 120
GTTCTCGCCA ACAATGATGT TTCCTGTGAC CACCCCTCTA ACACCGTGCC CTCTGGGAGC 180
AACCAGGACC TGGGAGCTGG GGCCGGGGAA GACGCCCGGT CGGATGACAG CAGCAGCCGC 240
ATCATCAATG GATCCGACTG CGATATGCAC ACCCAGCCGT GGCAGGCCGC GCTGTTGCTA 300
AGGCCCAACC AGCTCTACTG CGGGGCGGTG TTGGTGCATC CACAGTGGCT GCTCACGGCC 360
GCCCACTGCA GGAAGAAAGT TTTCAGAGTC CGTCTCGGCC ACTACTCCCT GTCACCAGTT 420
TATGAATCTG GGCAGCAGAT GTTCCAGGGG GTCAAATCCA TCCCCCACCC TGGCTACTCC 480
CACCCTGGCC ACTCTAACGA CCTCATGCTC ATCAAACTGA ACAGAAGAAT TCGTCCCACT 540
AAAGATGTCA GACCCATCAA CGTCTCCTCT CATTGTCCCT CTGCTGGGAC AAAGTGCTTG 600
GTGTCTGGCT GGGGGACAAC CAAGAGCCCC CAAGTGCACT TCCCTAAGGT CCTCCAGTGC 660
TTGAATATCA GCGTGCTAAG TCAGAAAAGG TGCGAGGATG CTTACCCGAG ACAGATAGAT 720
GACACCATGT TCTGCGCCGG TGACAAAGCA GGTAGAGACT CCTGCCAGGG TGATTCTGGG 780
GGGCCTGTGG TCTGCAATGG CTCCCTGCAG GGACTCGTGT CCTGGGGAGA TTACCCTTGT 840
GCCCGGCCCA ACAGACCGGG TGTCTACACG AACCTCTGCA AGTTCACCAA GTGGATCCAG 900
GAAACCATCC AGGCCAACTC CTGAGTCATC CCAGGACTCA GCACACCGGC ATCCCCACCT 960
GCTGCAGGGA CAGCCCTGAC ACTCCTTTCA GACCCTCATT CCTTCCCAGA GATGTTGAGA 1020
ATGTTCATCT CTCCAGCCCC TGACCCCATG TCTCCTGGAC TCAGGGTCTG CTTCCCCCAC 1080
ATTGGGCTGA CCGTGTCTCT CTAGTTGAAC CCTGGGAACA ATTTCCAAAA CTGTCCAGGG 1140
CGGGGGTTGC GTCTCAATCT CCCTGGGGCA CTTTCATCCT CAAGCTCAGG GCCCATCCCT 1200
TCTCTGCAGC TCTGACCCAA ATTTAGTCCC AGAAATAAAC TGAGAAGTGG AAAAAAAAAA



Seq ID NO: 516 Protein sequence 
Protein Accession #: NP_036559

1 11 21 31 41 51

| | | | | |

MATARPPWMW VLCALITALL LGVTEHVLAN NDVSCDHPSN TVPSGSNQDL GAGAGEDARS 60
DDSSSRIING SDCDMHTQPW QAALLLRPNQ LYCGAVLVHP QWLLTAAHCR KKVFRVRLGH 120
YSLSPVYESG QQMFQGVKSI PHPGYSHPGH SNDLMLIKLN RRIRPTKDVR PINVSSHCPS 180
AGTKCLVSGW GTTKSPQVHF PKVLQCLNIS VLSQKRCEDA YPRQIDDTMF CAGDKAGRDS 240
CQGDSGGPVV CNGSLQGLVS WGDYPCARPN RPGVYTNLCK FTKWIQETIQ ANS




Seq ID NO: 517 DNA sequence 
Nucleic Acid Accession #: NM_001719 
Coding sequence: 123..1418

1 11 21 31 41 51

| | | | | |

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA CGGCAGCCGA ATTCCGGATC TACAAGGACT 660

ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGG 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780 

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840 

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320 

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860
CAATAAAACG AATGAATG 

Seq ID NO: 518 Protein sequence 
Protein Accession #: NP_001710 

1 11 21 31 41 51 

| | | | | |

MHVRSLRAAA PHSFVALWAP LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60

ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120

LQDSHFLTDA DMVMSFVNLV EHDKEFFHPR YHHREFRFDL SKIPEGEAVT AAEFRIYKDY 180

IRERFDNETF RISVYQVLQE HLGRESDLFL LDSRTLWASE EGWLVFDITA TSNHWVVNPR 240

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 300

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360

GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LNAISVLYFD DSSNVILKKY 420
RNMVVRACGC H

Seq ID NO: 519 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 264..782

1 11 21 31 41 51

| | | | | |

CCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCCACGCA CAGCTGAAGC ATGAGGAAAC 60 

TCATCGCGGG ACTAATTTTC CTTAAAATTT AGACTTGCAC AGTAAGGACT TCAACTGACC 120 

TTCCTCAGAC TGAGAACTGT TTCCAGTATA TACATCAAGT CACTGAGATC TCCAGCACCC 180 

TGCCGGTGGC ACTACTGAGA GACGAGGTGC CAGGGTGGTT CCTGAAAGTG CCTGAGCCCC 240

AACTTATCAG CAAGGAGCTC ATCATGCTGA CAGAAGTCAT GGAGGTCTGG CATGGCTTAG 300 

TGATCGCGGT GGTGTCCCTC TTCCTGCAGG CCTGCTTCCT CACCGCCATC AACTACCTGC 360 

TCAGCAGGCA CATGGCCCAC AAGAGTGAAC AGATACTGAA AGCGGCCAGT CTCCAGGTTC 420 

CCAGGCCCAG CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG GAGACTCAGA 480 

CAGAGAGAGA CATCCCAATG TCTGATTCCC TTTACAGGCA TGACAGCGAC ACACCCTCAG 540 

ATAGCTTGGA TAGCTCCTGC AGTTCGCCTC CTGCCTGCCA GGCCACAGAG GATGTGGATT 600 

ACACACAAGT CGTCTTTTCT GACCCTGGAG AACTAAAAAA TGACTCCCCG CTGGACTATG 660 

AGAACATAAA GGAAATCACA GATTATGTCA ATGTCAATCC AGAAAGACAC AAGCCCAGTT 720 

TCTGGTATTT TGTCAACCCT GCTCTGTCTG AGCCAGCGGA ATATGATCAA GTGGCCATGT 780 

GAATTCCAAA TATTTTTAAT GGGGTCCAGT TCTCTATGGA TTCTTACATT TAATTTGTAG 840 

GGAAATGCCA TTTTTCCCCC TTAAACAAGG CATGGGGCTC ACAAGTCTAT GGAGACAGGC 900 

CAAAAAGAAT GTGGAGAAGA AAACTGATAA ATACACAGAG GTCCTCAAGA CCCATGGACT 960 

CCTGGTCTGT ACCCAAAAAA GCTGTTCGTT CCTCAAAAAC AAAAACAAGG CTTGGCTGGG 1020 

AAAACAGGCC AATGCCCCGG CAAGAAAGGT TGAGATCAGA TGTTAGGAAG AACTTTCAGG 1080 

TAAAGTATGA GAACTATGGA GTCCATCAGC AGAGATAGTA GTGAAGTCTC TCCCCAGGGA 1140 

AAATTTTAAA AAGGTTGAAT CAGCTGTTGT AGAGTTCTAT TTGGCAATCT CATGGTTAAA 1200 

TGACTTCCCT TTGAGCTCTT TAATTATTGG CAATAAACAA CTTCTTTAAA AGTTTTAAAT 1260 
AAAATAGCAA CCACCACCA 



Seq ID NO: 520 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MLTEVMEVWH GLVIAVVSLF LQACFLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 60
HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQVVFSD 120
PGELKNDSPL DYENIKEITD YVNVNPERHK PSFWYFVNPA LSEPAEYDQV AM

Seq ID NO: 521 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 107..328

1 11 21 31 41 51
| | | | | |
CTGCTCTGTC TGAGCCAGCG GAATATGATC AAGTGGCCAT GTGAATTCCA AATATTTTTA 60
ATGGGGTCCA GTTCTCTATG GATTCTTACA TTTAATTTGT AGGGAAATGC CATTTTTCCC 120
CCTTAAACAA GGCATGGGGC TCACAAGTCT ATGGAGACAG GCCAAAAAGA ATGTGGAGAA 180
GAAAACTGAT AAATACACAG AGGTCCTCAA GACCCATGGA CTCCTGGTCT GTACCCAAAA 240
AAGCTGTTCG TTCCTCAAAA ACAAAAACAA GGCTTGGCTG GGAAAACAGG CCAATGCCCC 300
GGCAAGAAAG GTTGAGATCA GATGTTAGGA AGAACTTTCA GGTAAAGTAT GAGAACTATG 360
GAGTCCATCA GCAGAGATAG TAGTGAAGTC TCTCCCCAGG GAAAATTTTA AAAAGGTTGA 420
ATCAGCTGTT GTAGAGTTCT ATTTGGCAAT CTCATGGTTA AATGACTTCC CTTTGAGCTC 480
TTTAATTATT GGCAATAAAC AACTTCTTTA AAAGTTTTAA ATAAAATAGC AACCACCACC 540
A

Seq ID NO: 522 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MPFFPLKQGM GLTSLWRQAK KNVEKKTDKY TEVLKTHGLL VCTQKSCSFL KNKNKAWLGK 
QANAPARKVE IRC 

Seq ID NO: 523 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 211..1895

1 11 21 31 41 51
| | | | | |
GGATCTGAGG GGCGCCCAGT CACTTCCTCC ACGTTCTCGT GCTGGGCGGG AGGAGCGGAT 60
GGGGCTTGGG AGGCAGCCTG CTCTCCAGTC CCTATCCACC CACAGGTTTT TTGGGTCGGA 120
GAGGAATTAT CTGATAAAAT TCCTGGGTTA ATATTTTTAA AAACGGAGAG TTTTTAAAAA 180
TGATTTTTTT CCCTCGAAAA TGACCTTTTT ATGCTTCGAA GCAGTTTGTC AACCAGCATA 240
GTGCTTTTTC TTTTCTCTTC TTTTTCTACG ATAAATGAAA GCATTTCTTC AAGAAAAAGG 300
CACAGGTTCC TTGAACAGCT GGATTCTGAT GGCACCATTA CTATAGAGGA GCAGATTGTC 360
CTTGTGCTGA AAGCGAAAGT ACAATGTGAA CTCAACATCA CAGCTCAACT CCAGGAGGGA 420
GAAGGTAATT GTTTCCCTGA ATGGGATGGA CTCATTTGTT GGCCCAGAGG AACAGTGGGG 480
AAAATATCGG CTGTTCCATG CCCTCCTTAT ATTTATGACT TCAACCATAA AGGAGTTGCT 540
TTCCGACACT GTAACCCCAA TGGAACATGG GATTTTATGC ACAGCTTAAA TAAAACATGG 600
GCCAATTATT CAGACTGCCT TCGCTTTCTG CAGCCAGATA TCAGCATAGG AAAGCAAGAA 660
TTCTTTGAAC GCCTCTATGT AATGTATACC GTTGGCTACT CCATCTCTTT TGGTTCCTTG 720
GCTGTGGCTA TTCTCATCAT TGGTTACTTC AGACGATTGC ATTGCACTAG GAACTATATC 780
CACATGCACT TATTTGTGTC TTTCATGCTG AGAGCTACAA GCATCTTTGT CAAAGACAGA 840
GTAGTCCATG CTCACATAGG AGTAAAGGAG CTGGAGTCCC TAATAATGCA GGATGACCCA 900
CAAAATTCCA TTGAGGCAAC TTCTGTGGAC AAATCACAAT ATATCGGGTG CAAGATTGCT 960
GTTGTGATGT TTATTTACTT CCTGGCTACA AATTATTATT GGATCCTGGT GGAAGGTCTC 1020
TACCTGCATA ATCTCATCTT TGTGGCTTTC TTTTCGGACA CCAAATACCT GTGGGGCTTC 1080
ATCTTGATAG GCTGGGGGTT TCCAGCAGCA TTTGTTGCAG CATGGGCTGT GGCACGAGCA 1140
ACTCTGGCTG ATGCGAGGTG CTGGGAACTT AGTGCTGGAG ACATCAAGTG GATTTATCAA 1200
GCACCGATCT TAGCAGCTAT TGGGCTGAAT TTTATTCTGT TTCTGAATAC GGTTAGAGTT 1260
CTAGCTACCA AAATCTGGGA GACCAATGCA GTTGGGCATG ACACAAGGAA GCAATACAGG 1320
AAACTGGCCA AATCGACACT GGTCCTGGTC CTAGTCTTTG GAGTGCATTA CATCGTGTTC 1380
GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 1440
TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG 1500
GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG 1560
ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 1620
AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680
AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740
TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1800
CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT 1860
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA

Seq ID NO: 524 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MLRSSLSTSI VLFLFSSFST INESISSRKR HRFLEQLDSD GTITIEEQIV LVLKAKVQCE 60
LNITAQLQEG EGNCFPEWDG LICWPRGTVG KISAVPCPPY IYDFNHKGVA FRHCNPNGTW 120
DFMHSLNKTW ANYSDCLRFL QPDISIGKQE FFERLYVMYT VGYSISFGSL AVAILIIGYF 180
RRLHCTRNYI HMHLFVSFML RATSIFVKDR VVHAHIGVKE LESLIMQDDP QNSIEATSVD 240
KSQYIGCKIA VVMFIYFLAT NYYWILVEGL YLHNLIFVAF FSDTKYLWGF ILIGWGFPAA 300
FVAAWAVARA TLADARCWEL SAGDIKWIYQ APILAAIGLN FILFLNTVRV LATKIWETNA 360
VGHDTRKQYR KLAKSTLVLV LVFGVHYIVF VCLPHSFTGL GWEIRMHCEL FFNSFQGFFV 420
SIIYCYCNGE VQAEVKKMWS RWNLSVDWKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 480
RMVLISGKAA KIASRQPDSH ITLPGYVWSN SEQDCLPHSF HEETKEDSGR QGDDILMEKP 540
SRPMESNPDT EGCQGETEDV L



Seq ID NO: 525 DNA sequence
Nucleic Acid Accession #: NM_005048
Coding sequence: 143..1795

1 11 21 31 41 51

| | | | | |

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCTC 60
TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120
TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 180
GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240
TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 300
AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360
GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420
CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480
CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540
CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 600
CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660
TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720
CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780
AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840
TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900
GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960
CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020
ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 1080
CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140
TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200
CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 1260
AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320
CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380
CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440
CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500
CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560
TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620
TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680
CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740
GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800
CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 1860
ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920
TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980
GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2040
GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100
GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160
ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220
ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280
GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340
TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400
ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460
TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520
TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580
TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640
T



Seq ID NO: 526 Protein sequence 
Protein Accession #: NP_005039 

1 11 21 31 41 51

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60 

GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120 

NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 180 

MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240

VMFIYFLATN YYWILVEGLY LHNLIFVAFF SDTKYLWGFI LIGWGFPAAF VAAWAVARAT 300

LADARCWELS AGDIKWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 360

LAKSTLVLVL VFGVHYIVFV CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 420

QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 480

IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 540
GCQGETEDVL



Seq ID NO: 527 DNA sequence 
Nucleic Acid Accession #: XM_036683
Coding sequence: 38..3655

1 11 21 31 41 51

GCTTTGCCCA GTAGTTGGAA AGTGAACTCG ACTCGTGATG GTTCTCCTGT CACTTTGGTT 60

GATAGCAGCC GCTCTGGTAG AGGTTAGGAC TTCAGCTGAT GGACAAGCTG GTAATGAAGA 120

AATGGTGCAA ATAGATTTAC CAATAAAGAG ATATAGAGAG TATGAGCTGG TGACTCCAGT 180 

CAGCACAAAT CTAGAAGGAC GCTATCTCTC CCATACTCTT TCTGCGAGTC ACAAAAAGAG 240 

GTCAGCGAGG GACGTGTCTT CCAACCCTGA GCAGTTGTTC TTTAACATCA CGGCATTTGG 300 

AAAAGATTTT CATCTGCGAC TAAAGCCCAA CACTCAACTA GTAGCTCCTG GGGCTGTTGT 360 

GGAGTGGCAT GAGACATCTC TGGTGCCTGG GAATATAACC GATCCCATTA ACAACCATCA 420
ACCAGGAAGT GCTACGTATA GAATCCGGAA AACAGAGCCT TTGCAGACTA ACTGTGCTTA 480
TGTTGGTGAC ATCGTGGACA TTCCAGGAAC CTCTGTTGCC ATCAGCAACT GTGATGGTCT 540 
GGCTGGAATG ATAAAAAGTG ATAATGAAGA GTATTTCATT GAACCCTTGG AAAGAGGTAA 600 
ACAGATGGAG GAAGAAAAAG GAAGGATTCA TGTTGTCTAC AAGAGATCAG CTGTAGAACA 660 
GGCTCCCATA GACATGTCCA AAGACTTCCA CTACAGAGAG TCGGACCTGG AAGGCCTTGA 720 
TGATCTAGGT ACTGTTTATG GCAACATCCA CCAGCAGCTG AATGAAACAA TGAGACGCCG 780 
CAGACACGCG GGAGAAAACG ATTACAATAT CGAGGTACTG CTGGGAGTGG ATGACTCTGT 840 
GGTCCGTTTC CATGGCAAAG AGCACGTCCA AAACTACCTC CTGACCCTAA TGAACATTGT 900 
GAATGAAATT TACCATGATG AGTCCCTCGG AGTGCATATA AATGTGGTCC TGGTGCGCAT 960 
GATAATGCTG GGATATGCAA AGTCCATCAG CCTCATAGAA AGGGGAAACC CATCCAGAAG 1020 
CTTGGAGAAT GTGTGTCGCT GGGCGTCCCA ACAGCAAAGA TCTGATCTCA ACCACTCTGA 1080 
ACACCATGAC CATGCAATTT TTTTAACCAG GCAAGACTTT GGACCTGCTG GAATGCAAGG 1140 
ATATGCTCCA GTCACCGGCA TGTGTCATCC AGTGAGAAGT TGTACCCTGA ATCATGAGGA 1200 
TGGTTTTTCA TCTGCTTTTG TAGTAGCCCA TGAAACGGGC CATGTGTTGG GAATGGAGCA 1260 
TGATGGACAA GGCAACAGGT GTGGTGATGA GACTGCTATG GGAAGTGTCA TGGCTCCCTT 1320 
GGTACAAGCA GCATTGCATC GTTACCACTG GTCCCGATGC AGTGGTCAAG AACTGAAAAG 1380 
ATATATCCAT TCCTATGACT GTCTCCTTGA TGACCCTTTT GATCATGATT GGCCTAAACT 1440 
CCCAGAACTT CCTGGAATCA ATTATTCTAT GGATGAGCAA TGTCGTTTTG ATTTTGGTGT 1500 
TGGCTATAAA ATGTGCACCG CGTTCCGAAC CTTTGACCCA TGTAAACAGC TGTGGTGTAG 1560 
CCATCCTGAT AATCCCTACT TTTGTAAGAC TAAAAAGGGA CCTCCACTTG ATGGGACTGA 1620 
ATGTGCTGCT GGAAAATGGT GCTATAAGGG TCATTGCATG TGGAAGAATG CTAATCAGCA 1680 
AAAACAAGAT GGCAATTGGG GGTCATGGAC TAAATTTGGC TCCTGTTCTC GGACATGTGG 1740 
AACTGGTGTT CGTTTCAGAA CACGCCAGTG CAATAATCCC ATGCCCATCA ATGGTGGTCA 1800 
GGATTGTCCT GGTGTTAATT TTGAGTACCA GCTTTGTAAC ACAGAAGAAT GCCAAAAACA 1860 
CTTTGAGGAC TTCAGAGCAC AGCAGTGTCA GCAGCGAAAC TCCCACTTTG AATACCAGAA 1920
TACCAAACAC CACTGGTTGC CATATGAACA TCCTGACCCC AAGAAAAGAT GCCACCTTTA 1980 
CTGTCAGTCC AAGGAGACTG GAGATGTTGC TTACATGAAA CAACTGGTGC ATGATGGAAC 2040 
GCACTGTTCT TACAAAGATC CATATAGCAT ATGTGTGCGA GGAGAGTGTG TGAAAGTGGG 2100
CTGTGATAAA GAAATTGGTT CTAATAAGGT TGAGGATAAG TGTGGTGTCT GTGGAGGAGA 2160 
TAATTCCCAC TGCCGAACCG TGAAGGGGAC ATTTACCAGA ACTCCCAGGA AGCTTGGGTA 2220 
CCTTAAGATG TTTGATATAC CCCCTGGGGC TAGACATGTG TTAATCCAAG AAGACGAGGC 2280 
TTCTCCTCAT ATTCTTGCTA TTAAGAACCA GGCTACAGGC CATTATATTT TAAATGGCAA 2340 
AGGGGAGGAA GCCAAGTCGC GGACCTTCAT AGATCTTGGT GTGGAGTGGG ATTATAACAT 2400 
TGAAGATGAC ATTGAAAGTC TTCACACCGA TGGACCTTTA CATGATCCTG TTATTGTTTT 2460 
GATTATACCT CAAGAAAATG ATACCCGCTC TAGCCTGACA TATAAGTACA TCATCCATGA 2520 
AGACTCTGTA CCTACAATCA ACAGCAACAA TGTCATCCAG GAAGAATTAG ATACTTTTGA 2580 
GTGGGCTTTG AAGAGCTGGT CTCAGTGTTC CAAACCCTGT GGTGGAGGTT TCCAGTACAC 2640 
TAAATATGGA TGCCGTAGGA AAAGTGATAA TAAAATGGTC CATCGCAGCT TCTGTGAGGC 2700 
CAACAAAAAG CCGAAACCTA TTAGACGAAT GTGCAATATT CAAGAGTGTA CACATCCACT 2760 
CTGGGTAGCA GAAGAATGGG AACACTGCAC CAAAACCTGT GGAAGTTCTG GCTATCAGCT 2820 
TCGCACTGTA CGCTGCCTTC AGCCACTCCT TGATGGCACC AACCGCTCTG TGCACAGCAA 2880 
ATACTGCATG GGTGACCGTC CCGAGAGCCG CCGGCCCTGT AACAGAGTGC CCTGCCCTGC 2940 
ACAGTGGAAA ACAGGACCCT GGAGTGAGTG TTCAGTGACC TGCGGTGAAG GAACGGAGGT 3000 
GAGGCAGGTC CTCTGCAGGG CTGGGGACCA CTGTGATGGT GAAAAGCCTG AGTCGGTCAG 3060 
AGCCTGTCAA CTGCCTCCTT GTAATGATGA ACCATGTTTG GGAGACAAGT CCATATTCTG 3120 
TCAAATGGAA GTGTTGGCAC GATACTGCTC CATACCAGGT TATAACAAGT TATGTTGTGA 3180 
GTCCTGCAGC AAGCGCAGTA GCACCCTGCC ACCACCATAC CTTCTAGAAG CTGCTGAAAC 3240 
TCATGATGAT GTCATCTCTA ACCCTAGTGA CCTCCCTAGA TCTCTAGTGA TGCCTACATC 3300 
TTTGGTTCCT TATCATTCAG AGACCCCTGC AAAGAAGATG TCTTTGAGTA GCATCTCTTC 3360 
AGTGGGAGGT CCAAATGCAT ATGCTGCTTT CAGGCCAAAC AGTAAACCTG ATGGTGCTAA 3420 
TTTACGCCAG AGGAGTGCTC AGCAAGCAGG AAGTAAGACT GTGAGACTGG TCACCGTACC 3480 
ATCCTCCCCA CCCACCAAGA GGGTCCACCT CAGTTCAGCT TCACAAATGG CTGCTGCTTC 3540 
CTTCTTTGCA GCCAGTGATT CAATAGGTGC TTCTTCTCAG GCAAGAACCT CAAAGAAAGA 3600 
TGGAAAGATC ATTGACAACA GACGTCCGAC AAGATCATCC ACCTTAGAAA GATGAGAAAG 3660 
TGAACCAAAA AGGCTAGAAA CCAGAGGAAA ACCTGGACAA CCTCTCTCTT CCCATGGTGC 3720 
ATATGCTTGT TTAAAGTGGA AATCTCTATA GATCGTCAGC TCATTTTATC TGTAATTGGA 3780 
AGAACAGAAA GTGCTGGCTC ACTTTCTAGT TGCTTTCATC CTCCTTTTGT TCTGCATTGA 3840 
CTCATTTACC AGAATTCATT GGAAGAAATC ACCAAAGATT ATTACAAAAG AAAAATATGT 3900 
TGCTAAGATT GTGTTGGTCG CTCTCTGAAG CAGAAAAGGG ACTGGAACCA ATTGTGCATA 3960 
TCAGCTGACT TTTTGTTTGT TTTAGAAAAG TTACAGTAAA AATTAAAAAG AGATACCAAT 4020 
GGTTTACACT TTAACAAGAA ATTTTGGATA TGGAACAAAG AATTCTTAGA CTTGTATTCC 4080 
TATTTATCTA TATTAGAAAT ATTGTATGAG CAAATTTGCA GCTGTTGTGT AAATACTGTA 4140 
TATTGCAAAA ATCAGTATTA TTTTAAGAGA TGTGTTCTCA AATGATTGTT TACTATATTA 4200 
CATTTCTGGA TGTTCTAGGT GCCTGTCGTT GAGTATTGCC TTGTTTGACA TTCTATAGGT 4260 
TAATTTTCAA AGCAGAGTAT TACAAAAGAG AAGTTAGAAT TACAGCTACT GACAATATAA 4320 
AGGGTTTTGT TGAATCAACA ATGTGATACG TAAATTATAG AAAAAGAAAA GAAACACAAA 4380 
AGCTATAGAT ATACAGATAT CAGCTTACCT ATTGCCTTCT ATACTTATAA TTTAAAGGAT 4440 
TGGTGTCTTA GTACACTTGT GGTCACAGGG ATCAACGAAT AGTAAATAAT GAACTCGTGC 4500 
AAGACAAAAC TGAAACCCTC TTTCCAGGAC CTCAGTAGGC ACCGTTGAGG TGTCCTTTGT 4560 
TTTTGTGTGT GTGTGTTCTT TTTTAATTTT CGCATTGTTG ACAGATACAA ACAGTTATAC 4620
TCAATGTACT GTAATAATCG CAAAGGAAAA AGTTTTGGGA TAACTTATTT GTATGTTGGT 4680 
AGCTGAGAAA AATATCATCA GTCTAGAATT GATATTTGAG TATAGTAGAG CTTTGGGGCT 4740
TTGAAGGCAG GTTCAAGAAA GCATATGTCG ATGGTTGAGA TATTTATTTT CCATATGGTT 4800 
CATGTTCAAA TGTTCACAAC CACAATGCAT CTGACTGCAA TAATGTGCTA ATAATTTATG 4860 
TCAGTAGTCA CCTTGCTCAC AGCAAAGCCA GAAATGCTCT CTCCAGGGAG TAGATGTAAA 4920
GTACTTGTAC ATAGAATTCA GAACTGAAGA TATTTATTAA AAGTTGATTT TTTTTTCTTG 4980 
ATAGTATTTT TATGTACTAA ATATTTACAC TAATATCAAT TACATATTTT GGTAAACTAG 5040 
AGAGACATAA TTAGAGATGC ATGCTTTGTT CTGTGCATAG AGACCTTTAA GCAAACTACT 5100 
ACAGCCAACT CAAAAGCTAA AACTGAACAA ATTTGATGTT ATGCAAACAT CTTGCATTTT 5160 
TAGTAGTTGA TATTAAGTTG ATGACTTGTT TCCCTTCAAG GAAACATTAA ATTGTATGGA 5220 
CTCAGCTAGC TGTTCAATGA AATTGTGAAT TAGAAACATT TTTAAAAGTT TTTGAAAGAG 5280 
ATAAGTGCAT CATGAATTAC ATGTACATGA GAGGAGATAG TGATATCAGC ATAATGATTT 5340 
TGAGGTCAGT ACCTGAGCTG TCTAAAAATA TATTATACAA ACTAAAATGT AGATGAATTA 5400 
ACCTCTCAAA GCACAGAATG TGCAAGAACT TTTGCATTTT AATCGTTGTA AACTAACAGC 5460
TTAAACTATT GACTCTATAC CTCTAAAGAA TTGCTGCTAC TTTGTGCAAG AACTTTGAAG
GTCAAATTAG GCAAATTCCA GATAGTAAAA CAATCCCTAA GCCTTAAGTC TTTTTTTTTT 
CCTAAAAATT CCCATAGAAT AAAATTCTCT CTAGTTTACT TGTGTGTGCA TACATCTCAT 
CCACAGGGGA AGATAAAGAT GGTCACACAA ACAGTTTCCA TAAAGATGTA CATATTCATT 
ATACTTCTGA CCTTTGGGCT TTCTTTTCTA CTAAGCTAAA AATTCCTTTT TATCAAAGTG 
TACACTACTG ATGCTGTTTG TTGTACTGAG AGCACGTACC AATAAAAATG TTAACAAAAT 
AT 

Seq ID NO: 528 Protein sequence 
Protein Accession #: XP_036683

1 11 21 31 41 51

| | | | | |

MVLLSLWLIA AALVEVRTSA DGQAGNEEMV QIDLPIKRYR EYELVTPVST NLEGRYLSHT 60

LSASHKKRSA RDVSSNPEQL FFNITAFGKD FHLRLKPNTQ LVAPGAVVEW HETSLVPGNI 120

TDPINNHQPG SATYRIRKTE PLQTNCAYVG DIVDIPGTSV AISNCDGLAG MIKSDNEEYF 180

IEPLERGKQM EEEKGRIHVV YKRSAVEQAP IDMSKDFHYR ESDLEGLDDL GTVYGNIHQQ 240

LNETMRRRRH AGENDYNIEV LLGVDDSVVR FHGKEHVQNY LLTLMNIVNE IYHDESLGVH 300

INVVLVRMIM LGYAKSISLI ERGNPSRSLE NVCRWASQQQ RSDLNHSEHH DHAIFLTRQD 360

FGPAGMQGYA PVTGMCHPVR SCTLNHEDGF SSAFVVAHET GHVLGMEHDG QGNRCGDETA 420

MGSVMAPLVQ AAFHRYHWSR CSGQELKRYI HSYDCLLDDP FDHDWPKLPE LPGINYSMDE 480

QCRFDFGVGY KMCTAFRTFD PCKQLWCSHP DNPYFCKTKK GPPLDGTECA AGKWCYKGHC 540

MWKNANQQKQ DGNWGSWTKF GSCSRTCGTG VRFRTRQCNN PMPINGGQDC PGVNFEYQLC 600

NTEECQKHFE DFRAQQCQQR NSHFEYQNTK HHWLPYEHPD PKKRCHLYCQ SKETGDVAYM 660

KQLVHDGTHC SYKDPYSICV RGECVKVGCD KEIGSNKVED KCGVCGGDNS HCRTVKGTFT 720

RTPRKLGYLK MFDIPPGARH VLIQEDEASP HILAIKNQAT GHYILNGKGE EAKSRTFIDL 780

GVEWDYNIED DIESLHTDGP LHDPVIVLII PQENDTRSSL TYKYIIHEDS VPTINSNNVI 840

QEELDTFEWA LKSWSQCSKP CGGGFQYTKY GCRRKSDNKM VHRSFCEANK KPKPIRRMCN 900

IQECTHPLWV AEEWEHCTKT CGSSGYQLRT VRCLQPLLDG TNRSVHSKYC MGDRPESRRP 960

CNRVPCPAQW KTGPWSECSV TCGEGTEVRQ VLCRAGDHCD GEKPESVRAC QLPPCNDEPC 1020

LGDKSIFCQM EVLARYCSIP GYNKLCCESC SKRSSTLPPP YLLEAAETHD DVISNPSDLP 1080

RSLVMPTSLV PYHSETPAKK MSLSSISSVG GPNAYAAFRP NSKPDGANLR QRSAQQAGSK 1140

TVRLVTVPSS PPTKRVHLSS ASQMAAASFF AASDSIGASS QARTSKKDGK IIDNRRPTRS 1200
STLER

Seq ID NO: 529 DNA sequence 
Nucleic Acid Accession #: NM_002774
Coding sequence: 246..980

1 11 21 31 41 51

| | | | | |

AGGCGGACAA AGCCCGATTG TTCCTGGGCC CTTTCCCCAT CGCGCCTGGG CCTGCTCCCC 60 

AGCCCGGGGC AGGGGCGGGG GCCAGTGTGG TGACACACGC TGTAGCTGTC TCCCCGGCTG 120 

GCTGGCTCGC TCTCTCCTGG GGACACAGAG GTCGGCAGGC AGCACACAGA GGGACCTACG 180 

GGCAGCTGTT CCTTCCCCCG ACTCAAGAAT CCCCGGAGGC CCGGAGGCCT GCAGCAGGAG 240 

CGGCCATGAA GAAGCTGATG GTGGTGCTGA GTCTGATTGC TGCAGCCTGG GCAGAGGAGC 300 

AGAATAAGTT GGTGCATGGC GGACCCTGCG ACAAGACATC TCACCCCTAC CAAGCTGCCC 360 

TCTACACCTC GGGCCACTTG CTCTGTGGTG GGGTCCTTAT CCATCCACTG TGGGTCCTCA 420 

CAGCTGCCCA CTGCAAAAAA CCGAATCTTC AGGTCTTCCT GGGGAAGCAT AACCTTCGGC 480 

AAAGGGAGAG TTCCCAGGAG CAGAGTTCTG TTGTCCGGGC TGTGATCCAC CCTGACTATG 540 

ATGCCGCCAG CCATGACCAG GACATCATGC TGTTGCGCCT GGCACGCCCA GCCAAACTCT 600 

CTGAACTCAT CCAGCCCCTT CCCCTGGAGA GGGACTGCTC AGCCAACACC ACCAGCTGCC 660 

ACATCCTGGG CTGGGGCAAG ACAGCAGATG GTGATTTCCC TGACACCATC CAGTGTGCAT 720 

ACATCCACCT GGTGTCCCGT GAGGAGTGTG AGCATGCCTA CCCTGGCCAG ATCACCCAGA 780 

ACATGTTGTG TGCTGGGGAT GAGAAGTACG GGAAGGATTC CTGCCAGGGT GATTCTGGGG 840 

GTCCGCTGGT ATGTGGAGAC CACCTCCGAG GCCTTGTGTC ATGGGGTAAC ATCCCCTGTG 900 

GATCAAAGGA GAAGCCAGGA GTCTACACCA ACGTCTGCAG ATACACGAAC TGGATCCAAA 960 

AAACCATTCA GGCCAAGTGA CCCTGACATG TGACATCTAC CTCCCGACCT ACCACCCCAC 1020 

TGGCTGGTTC CAGAAOGTCT CTCACCTAGA CCTTGCCTCC CCTCCTCTCC TGCCCAGCTC 1080 

TGACCCTGAT GCTTAATAAA CGCAGCGACG TGAGGGTCCT GATTCTCCCT GGTTTTACCC 1140 

CAGCTCCATC CTTGCATCAC TGGGGAGGAC GTGATGAGTG AGGACTTGGG TCCTCGGTCT 1200 

TACCCCCACC ACTAAGAGAA TACAGGAAAA TCCCTTCTAG GCATCTCCTC TCCCCAACCC 1260 

TTCCACACGT TTGATTTCTT CCTGCAGAGG CCCAGCCAOG TGTCTGGAAT CCCAGCTCCG 1320 

CTGCTTACTG TCGGTGTCCC CTTGGGATGT ACCTTTCTTC ACTGCAGATT TCTCACCTGT 1380 

AAGATGAAGA TAAGGATGAT ACAGTCTCCA TCAGGCAGTG GCTGTTGGAA AGATTTAAGA 1440 

TTTCACACCT ATGACATACA TGGGATAGCA CCTGGGCCGC CATGCACTCA ATAAAGAATG 1500 
TATTTT 

Seq ID NO: 530 Protein sequence
Protein Accession #: NP_002765

1 11 21 31 41 51

| | | | | |

MKKLMVVLSL IAAAWAEEQN KLVHGGPCDK TSHPYQAALY TSGHLLCGGV LIHPLWVLTA 60
AHCKKPNLQV FLGKHNLRQR ESSQEQSSVV RAVIHPDYDA ASHDQDIMLL RLARPAKLSE 120
LIQPLPLERD CSANTTSCHI LGWGKTADGD FPDTIQCAYI HLVSREECEH AYPGQITQNM 180 
LCAGDEKYGK DSCQGDSGGP LVCGDHLRGL VSWGNIPCGS KEKPGVYTNV CRYTNWIQKT 240 
IQAK 

Seq ID NO: 531 DNA sequence 
Nucleic Acid Accession #: NM_012152
Coding sequence: 43..1104

1 11 21 31 41 51

| | | | | |

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60
GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120
GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180
TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240
TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300
ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360
GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420
AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480
CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540
TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600
TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660
CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720
TCCATCAGCC GCGGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780
GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840
AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900
GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960
ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020
GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080
GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140
GTCTTAGG



Seq ID NO: 532 Protein sequence
Protein Accession #: NP_036284

1 11 21 31 41 51

| | | | | |

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60
FHFPFYYLLA NIAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 120
LVIAVERHMS IMRMRVHSNL TKKKVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSLA 180
PIYSRSYLVF WTVSNLMAFL IMVVVYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 240
VMTVLGAFVV CWTPGLVVLL LDGLNCRQCG VQHVKRWFLL LALLNSVVNP IIYSYKDEDM 300
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS


Seq ID NO: 533 DNA sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: 150..3362

1 11 21 31 41 51

| | | | | |

AACTCCCGCC TCGGGACGCC TCGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 60 

GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 120 

CCTCAGCTCC TTTTCCTGAQ CCCGCCGCGA TGGGAGCTGC GCGGGGATCC CCGGCCAGAC 180 

CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 240

CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420 

ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 

AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 540

TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTGGTTGCC 600 

ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGACGCTC CGGCCAGCTG 720 

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 

AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 

GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 1620 

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 1740 

AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 1800 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 2100 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2520 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 
TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 



Seq ID NO: 534 Protein sequence 
Protein Accession #: NP_002812 

1 11 21 31 41 51 

| | | | | |

MGAARGSPAR PRRLPLLSVL LLPLLGGTQT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 60 

VHVYWLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 120 

IKWIEAGPVV LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARVVLA PQDVVVARYE 240

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 360

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTVVWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 480

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 540 

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600 

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660 

RYTCIAGNSC NIKHTEAPLY VVDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 780 

KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840 

LDFRRELEMF GKLNHANVVR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960

YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVLADLQ 1020 
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP 

Seq ID NO: 535 DNA sequence 
Nucleic Acid Accession #: NM_013952 
Coding sequence: 161..1357

1 11 21 31 41 51

| | | | | |

TTCAGAAGGA GGAGAGACAC CGGGCCCAGG GCACCCTCGC GGGCGGGCGG ACCCAAGCAG 60 

TGAGGGCCTG CAGCCGGCCG GCCAGGGCAG CGGCAGGCGC GGCCCGGACC TACGGGAGGA 120 

AGCCCCGAGC CCTCGGCGGG CTGCGAGCGA CTCCCCGGCG ATGCCTCACA ACTCCATCAG 180 

ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG AGGGGCCTTT GTGAATGGCA GACCTCTGCC 240 

GGAAGTGGTC CGCCAGCGCA TCGTAGACCT GGCCCACCAG GGTGTAAGGC CCTGCGACAT 300 

CTCTCGCCAG CTCCGCGTCA GCCATGGCTG CGTCAGCAAG ATCCTTGGCA GGTACTACGA 360 

GACTGGCAGC ATCCGGCCTG GAGTGATAGG GGGCTCCAAG CCCAAGGTGG CCACCCCCAA 420 

GGTGGTGGAG AAGATTGGGG ACTACAAACG CCAGAACCCT ACCATGTTTG CCTGGGAGAT 480 

CCGAGACCGG CTCCTGGCTG AGGGCGTCTG TGACAATGAC ACTGTGCCCA GTGTCAGCTC 540 

CATTAATAGA ATCATCCGGA CCAAAGTGCA GCAACCATTC AACCTCCCTA TGGACAGCTG 600 

CGTGGCCACC AAGTCCCTGA GTCCCGGACA CACGCTGATC CCCAGCTCAG CTGTAACTCC 660 

CCCGGAGTCA CCCCAGTCGG ATTCCCTGGG CTCCACCTAC TCCATCAATG GGCTCCTGGG 720 

CATCGCTCAG CCTGGCAGCG ACAAGAGGAA AATGGATGAC AGTGATCAGG ATAGCTGCCG 780 

ACTAAGCATT GACTCACAGA GCAGCAGCAG CGGACCCCGA AAGCACCTTC GCACGGATGC 840 

CTTCAGCCAG CACCACCTCG AGCCGCTCGA GTGCCCATTT GAGCGGCAGC ACTACCCAGA 900 

GGCCTATGCC TCCCCCAGCC ACACCAAAGG CGAGCAGGGC CTCTACCCGC TGCCCTTGCT 960 

CAACAGCACC CTGGACGACG GGAAGGCCAC CCTGACCCCT TCCAACACGC CACTGGGGCG 1020 

CAACCTCTCG ACTCACCAGA CCTACCCCGT GGTGGCAGCT CCGCCCTTTT GGATCTGCAG 1080 

CAAGTCGGCT CCGGGGTCCC GCCCTTCAAT GCCTTTCCCC ATGCTGCCTC CGTGTACGGG 1140 

CAGTTCACGG GCCAGGCCCT CCTCTCAGGG CGAGAGATGG TGGGGCCCAC GCTGCCCGGA 1200 

TACCCACCCC ACATCCCCAC CAGCGGACAG GGCAGCTATG CCTCCTCTGC CATCGCAGGC 1260 

ATGGTGGCAG GAAGTGAATA CTCTGGCAAT GCCTATGGCC ACACCCCCTA CTCCTCCTAC 1320 

AGCGAGGCCT GGGGCTTCCC CAACTCCAGC TTGCTGAGTT CCCCATATTA TTACAGTTCC 1380 

ACATCAAGGC CGAGTGCACC GCCCACCACT GCCACGGCCT TTGACCATCT GTAGTTGCCA 1440 
TGGGGACAGT G

Seq ID NO: 536 Protein sequence
Protein Accession #: NP_039246

1 11 21 31 41 51

| | | | | |

MPHNSIRSGH GGLNQLGGAF VNGRPLPEVV RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK 60

ILGRYYETGS IRPGVIGGSK PKVATPKVVE KIGDYKRQNP TMFAWEIRDR LLAEGVCDND 120

TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSLGSTY 180 

SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 240 

ERQHYPEAYA SPSHTKGEQG LYPLPLLNST LDDGKATLTP SNTPLGRNLS THQTYPVVAA 300

PPFWICSKSA PGSRPSMPFP MLPPCTGSSR ARPSSQGERW WGPRCPDTHP TSPPADRAAM 360 
PPLPSQAWWQ EVNTLAMPMA TPPTPPTARP GASPTPAC 

Seq ID NO: 537 DNA sequence 

Nucleic Acid Accession #: NM_003466.1
Coding sequence: 11..1363

1 11 21 31 41 51

| | | | | |

GAATTCGGCG ATGCCTCACA ACTCCATCAG ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG 60 

AGGGGCCTTT GTGAATGGCA GACCTCTGCC GGAAGTGGTC CGCCAGCGCA TCGTAGACCT 120 

GGCCCACCAG GGTGTAAGGC CCTGCGACAT CTCTCGCCAG CTCCGCGTCA GCCATGGTTG 180 

CGTCAGCAAG ATCCTTGGCA GGTACTACGA GACTGGCAGC ATCCGGCCTG GAGTGATAGG 240 

GGGCTCCAAG CCCAAGGTGG CCACCCCCAA GGTGGTGGAG AAGATTGGGG ACTACAAACG 300 

CCAGAACCCT ACCATGTTTG CCTGGGAGAT CCGAGACCGG CTCCTGGCTG AGGGCGTCTG 360 

TGACAATGAC ACTGTGCCCA GTGTCAGCTC CATTAATAGA ATCATCCGGA CCAAAGTGCA 420 

GCAACCATTC AACCTCCCTA TGGACAGCTG CGTGGCCACC AAGTCCCTGA GTCCCGGACA 480 

CACGCTGATC CCCAGCTCAG CTGTAACTCC CCCGGAGTCA CCCCAGTCGG ATTCCCTGGG 540 

CTCCACCTAC TCCATCAATG GGCTCCTGGG CATCGCTCAG CCTGGCAGCG ACAAGAGGAA 600 

AATGGATGAC AGTGATCAGG ATAGCTGCCG ACTAAGCATT GACTCACAGA GCAGCAGCAG 660 

CGGACCCCGA AAGCACCTTC GCACGGATGC CTTCAGCCAG CACCACCTCG AGCCGCTCGA 720 

GTGCCCATTT GAGCGGCAGC ACTACCCAGA GGCCTATGCC TCCCCCAGCC ACACCAAAGG 780 

CGAGCAGGGC CTCTACCCGC TGCCCTTGCT CAACAGCACC CTGGACGACG GGAAGGCCAC 840 

CCTGACCCCT TCCAACACGC CACTGGGGCG CAACCTCTCG ACTCACCAGA CCTACCCCGT 900 

GGTGGCAGAT CCTCACTCAC CCTTCGCCAT AAAGCAGGAA ACCCCCGAGG TGTCCAGTTC 960 

TAGCTCCACC CCTTCCTCTT TATCTAGCTC CGCCTTTTTG GATCTGCAGC AAGTCGGCTC 1020 

CGGGGTCCCG CCCTTCAATG CCTTTCCCCA TGCTGCCTCC GTGTACGGGC AGTTCACGGG 1080

CCAGGCCCTC CTCTCAGGGC GAGAGATGGT GGGGCCCACG CTGCCCGGAT ACCCACCCCA 1140 

CATCCCCACC AGCGGACAGG GCAGCTATGC CTCCTCTGCC ATCGCAGGCA TGGTGGCAGG 1200 

AAGTGAATAC TCTGGCAATG CCTATGGCCA CACCCCCTAC TCCTCCTACA GCGAGGCCTG 1260 

GCGCTTCCCC AACTCCAGCT TGCTGAGTTC CCCATATTAT TACAGTTCCA CATCAAGGCC 1320 
GAGTGCACCG CCCACCACTG CCACGGCCTT TGACCATCTG TAGTTGAAGC TT 

Seq ID NO: 538 Protein sequence 
Protein Accession #: NP_003457

1 11 21 31 41 51

| | | | | |

MPHNSIRSGH GGLNQLGGAF VNGRPLPEVV RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK 60

ILGRYYETGS IRPGVIGGSK PKVATPKVVE KIGDYKRQNP TMFAWEIRDR LLAEGVCDND 120

TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSLGSTY 180 

SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 240 

ERQHYPEAYA SPSHTKGEQG LYPLPLLNST LDDGKATLTP SNTPLGRNLS THQTYPVVAD 300

PHSPFAIKQE TPEVSSSSST PSSLSSSAFL DLQQVGSGVP PFNAFPHAAS VYGQFTGQAL 360 

LSGREMVGPT LPGYPPHIPT SGQGSYASSA IAGMVAGSEY SGNAYGHTPY SSYSEAWRFP 420 
NSSLLSSPYY YSSTSRPSAP PTTATAFDHL 

Seq ID NO: 539 DNA sequence 
Nucleic Acid Accession #: NM_006799
Coding sequence: 19..963

1 11 21 31 41 51

| | | | | |

GCCGCGGGAG AGGAGGCCAT GGGCGCGCGC GGGGGGCTGC TGCTGGCGCT GCTGCTGGCT 60 

CGGGCTGGAC TCAGGAAGCC GGAGTCGCAG GAGGCGGCGC CGTTATCAGG ACCATGCGGC 120 

CGACGGGTCA TCACGTCGCG CATCGTGGGT GGAGAGGACG CCGAACTCGG GCGTTGGCCG 180 

TGGCAGGGGA GCCTGCGCCT GTGGGATTCC CACGTATGCG GAGTGAGCCT GCTCAGCCAC 240 

CGCTGGGCAC TCACGGCGGC GCACTGCTTT GAAACCTATA GTGACCTTAG TGATCCCTCC 300 

GGGTGGATGG TCCAGTTTGG CCAGCTGACT TCCATGCCAT CCTTCTGGAG CCTGCAGGCC 360 

TACTACACCC GTTACTTCGT ATCGAATATC TATCTGAGCC CTCGCTACCT GGGGAATTCA 420 

CCCTATGACA TTGCCTTGGT GAAGCTGTCT GCACCTGTCA CCTACACTAA ACACATCCAG 480 

CCCATCTGTC TCCAGGCCTC CACATTTGAG TTTGAGAACC GGACAGACTG CTGGGTGACT 540 

GGCTGGGGGT ACATCAAAGA GGATGAGGCA CTGCCATCTC CCCACACCCT CGAGGAAGTT 600 

CAGGTCGCCA TCATAAACAA CTCTATGTGC AACCACCTCT TCCTCAAGTA CAGTTTCCGC 660 

AAGGACATCT TTGGAGACAT GGTTTGTGCT GGCAATGCCC AAGGCGGGAA GGATGCCTGC 720 

TTCGGTGACT CAGGTGGACC CTTGGCCTGT AACAAGAATG GACTGTGGTA TCAGATTGGA 780 

GTCGTGAGCT GGGGAGTGGG CTGTGGTCGG CCCAATCGGC CCGGTGTCTA CACCAATATC 840 

AGCCACCACT TTGAGTGGAT CCAGAAGCTG ATGGCCCAGA GTGGCATGTC CCAGCCAGAC 900 

CCCTCCTGGC CGCTACTCTT TTTCCCTCTT CTCTGGGCTC TCCCACTCCT GGGGCCGGTC 960 

TGAGCCTACC TGAGCCCATG CAGCCTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC 1020 
TGTCTTGTTT GGTAATAAAC ACATTCCAGT TGATGCCTTG CAGGGCATTC TTCAAAA 

Seq ID NO: 540 Protein sequence 
Protein Accession #: NP_006790

1 11 21 31 41 51

| | | | | |

MGARGALLLA LLLARAGLRK PESQEAAPLS GPCGRRVITS RIVGGEDAEL GRWPWQGSLR
LWDSRVCGVS LLSHRWALTA AHCFETYSDL SDPSGWMVQF GQLTSMPSFW SLQAYYTRYF
VSNIYLSPRY LGNSPYDIAL VKLSAPVTYT KHIQPICLQA STFEFENRTD CWVTGWGYIK
EDEALPSPHT LQEVQVAIIN NSMCNHLFLK YSFRKDIFGD MVCAGNAQGG KDACFGDSGG
PLACNKNGLW YQIGVVSWGV GCGRPNRPGV YTNISHHFEW IQKLMAQSGM SQPDPSWPLL
FFPLLWALPL LGPV

Seq ID NO: 541 DNA sequence 
Nucleic Acid Accession #: NM_014344
Coding sequence: 131..1444

1 11 21 31 41 51

| | | | | |

GCGGCCGCGA TGGGGCCGAA GCGCCCGAAG CCCCGGAGCC CACAAACTGC CGGGCCCGCC
TCGCCGCCGG GACCCGGGTG CCTGGGCTCG GCTTGAAGCG GCGGCGGCGC ACCGGCACAG
CCGCGGGAGC ATGGGCAGGA GGATGCGGGG CGCCGCCGCC ACCGCGGGGC TCTGGCTGCT
GGCGCTGGGC TCGCTGCTGG CGCTGTGGGG AGGGCTCCTG CCGCCGCGGA CCGAGCTGCC
CGCCTCCCGG CCGCCCGAAG ACCGACTCCC ACGGCGCCCG GCCCGGAGCG GCGGCCCCGC
GCCCGCGCCT CGCTTCCCTC TGCCCCCGCC CCTGGCGTGG GACGCCCGCG GCGGCTCCCT
GAAAACTTTC CGGGCGCTGC TCACCCTGGC GGCCGGCGCG GACGGCCCGC CCCGGCAGTC
CCGGAGCGAG CCCAGGTGGC ACGTGTCAGC CAGGCAGCCC CGGCCGGAGG AGAGCGCCGC
GGTGCACGGG GGCGTCTTCT GGAGCCGCGG CCTGGAGGAG CAGGTGCCCC CGGGCTTTTC 
GGAGGCCCAG GCGGCGGCGT GGCTGGAGGC GGCTCGCGGC GCCCGGATGG TGGCCCTGGA 
GCGCGGGGGT TGCGGGCGCA GCTCCAACCG ACTGGCCCGT TTTGCCGACG GCACCCGCGC 
CTGCGTGCGC TACGGCATCA ACCCGGAGCA GATTCAGGGC GAGGCCCTGT CTTACTATCT 
GGCGCGCCTG CTGGGCCTCC AGCGCCACGT GCCGCCGCTG GCACTGGCTC GGGTGGAGGC 
TCGGGGCGCG CAGTGGGCGC AGGTGCAGGA GGAGCTGCGC GCTGCGCACT GGACCGAGGG 
CAGCGTGGTG AGCCTGACAC GCTGGCTGCC CAACCTCACG GACGTGGTGG TGCCCGCGCC 
CTGGCGCTCG GAGGACGGCC GTCTGCGCCC CCTCCGGGAT GCCGGGGGTG AGCTGGCCAA 
CCTCAGCCAG GCGGAGCTGG TGGACCTAGT ACAATGGACC GACTTAATCC TTTTCGACTA 
CCTGACGGCC AACTTCGACC GGCTCGTAAG CAACCTCTTC AGCCTGCAGT GGGACCCGCG 
CGTCATGCAG CGTGCCACCA GCAACCTGCA CCGCGGTCCG GGCGGGGCGC TGGTCTTTCT 
GGACAATGAG GCGGGCTTGG TGCACGGCTA CCGGGTAGCA GGCATGTGGG ACAAGTATAA 
CGAGCCGCTG TTGCAGTCAG TGTGCGTGTT CCGCGAGCGG ACCGCGCGGC GCGTCCTGGA 
GCTGCACCGC GGACAGGACG CCGCGGCCCG GCTGCTGCGC CTCTACCGGC GCCACGAGCC 
TCGCTTCCCC GAGCTGGCCG CCCTTGCAGA CCCCCACGCT CAGCTGCTAC AGCGCCGCCT 
CGACTTCCTC GCCAAGCACA TTTTGCACTG TAAGGCCAAG TACGGCCGCC GGTCTGGGAC 
TTAGTGTCAC CGGGAGGAAA AGAGAGAGAT CTGGGGCTGG GGTATGGATG ATGGGGGGAA 
GGGCGGTCGC CTCTGCCACT GTCAGGGACC AGCCGGCCAA CGCCCACCCG CAAAGGTGTC 
TAAAAACTTC AGCTTTTCAC CCACCTGCCC CTTTCTTTCA ATCCCACGCT GTTTCCTTTC 
AAAGTTCTGG GAGGACGAAC TCACCGAGGC GAGAAGTGTA ACATTCTCTC CACCCAGCTT 
ATAAAAGGAT TCTTTACTGT GCCAGCACGG GGATTGGATC CGAAGAAACT GGCTACTGGG 
GTTTGGCCCC CGAGTGGCCG TCCCTGTGGG AGATGCACCC CATTCTTGGG CCCCCCTCAT 
TCCCTTTCCG AAAAAGGAAA ACTTGCGTTT GAGCCGTTGA GCTAATTCTG CAATTTTCTA 
CCAAACAGAG CGCTGGTGGC CCCGGAGCAG GGCTGTGACA TTGGCTGGTG GAGCCCCTTC 
CTGTGTTCTC CCTTTGTTCC AGCGCCGCGA TGGTGAGATC ACTGTTCCAA GCAGGGGGAC 
GGCTCGCGAT AGGACAAAGA GAGCAGGACC TCCAGACTCT GGGGAGCCCT GCAGACCTTG 
ACAATTTGCC TGACTCATTC CTGACCTCTT GTCATTTTGG CCTGAAGGCT ACAAATTCAG 
GGTCAGCTGT ATGCACTAAG TCAAATAATG AATTTCTTCC TCCCTCTCGC AACCGACCAA 
AATTTTGACA ACGATGATGT TCACCAGAAG GAAAAAAAAA TCAGTTTTAT GCACTTTATT 
TTGTTTTGAT TTTCATTTTT TATTAAGAAA AAATTTTATT TTACAGAATT TACCTTCTCT 
GTATATATGT GCATAAAGTG TGGTGTAAAT ATACTAAACA AACTTATATT TCAATAAAAG 
GGAGTTTAAA ATTTAAAAAA AAAAAAA 

Seq ID NO: 542 Protein sequence 
Protein Accession #: NP_055159

1 11 21 31 41 51

| | | | | |

MGRRMRGAAA TAGLWLLALG SLLALWGGLL PPRTELPASR PPEDRLPRRP ARSGGPAPAP 60
RFPLPPPLAW DARGGSLKTF RALLTLAAGA DGPPRQSRSE PRWHVSARQP RPEESAAVHG 120
GVFWSRGLEE QVPPGFSEAQ AAAWLEAARG ARMVALERGG CGRSSNRLAR FADGTRACVR 180
YGINPEQIQG EALSYYLARL LGLQRHVPPL ALARVEARGA QWAQVQEELR AAHWTEGSVV 240
SLTRWLPNLT DVVVPAPWRS EDGRLRPLRD AGGELANLSQ AELVDLVQWT DLILFDYLTA 300
NFDRLVSNLF SLQWDPRVMQ RATSNLHRGP GGALVFLDNE AGLVHGYRVA GMWDKYNEPL 360
LQSVCVFRER TARRVLELHR GQDAAARLLR LYRRHEPRFP ELAALADPEA QLLQRRLDFL 420
AKHILHCKAK YGRRSGT


Seq ID NO: 543 DNA sequence 

Nucleic Acid Accession #: XM_007652.4

Coding sequence: 1..1290

1          11         21         31         41         51

|          |          |          |          |          |

ATGGCCGGCT CTGGCGCGTG GAAGCGCCTC AAATCTATGC TAAGGAAGGA TGATGCGCCG 60

CTGTTTTTAA ATGACACCAG CGCCTTTGAC TTCTCGGATG AGGCGGGGGA CGAGGGGCTT 120

TCTCGGTTCA ACAAACTTCG AGTTGTGGTG GCCGATGACG GTTCCGAAGC CCCGGAAAGG 180

CCTGTTAACG GGGCGCACCC GACCCTCCAG GCCGACGATG ATTCCTTACT GGACCAAGAC 240

TTACCTTTGA CCAACAGTCA GCTGAGTTTG AAGGTGGACT CCTGTGACAA CTGCAGCAAA 300

CAGAGAGAGA TACTGAAGCA GAGAAAGGTG AAAGCCAGGT TGACCATTGC TGCCGTTCTG 360

TACTTGCTTT TCATGATTGG AGAACTTGTA GGTGGATACA TTGCAAATAG CCTAGCAATC 420

WO 03/042661

ATGACAGATG CACTTCATAT GTTAACTGAC CTAAGCGCCA TCATACTCAC CCTGCTTGCT 480 

TTGTGGCTAT CATCAAAATC ACCAACCAAA AGATTCACCT TTGGATTTCA TCGCTTAGAG 540 

GTTTTGTCAG CTATGATTAG TGTGCTGTTG GTGTATATAC TTATGGGATT CCTCTTATAT 600 

GAAGCTGTGC AAAGAACTAT CCATATGAAC TATGAAATAA ATGGAGATAT AATGCTCATC 660 

ACCGCAGCTG TTGGAGTTGC AGTTAATGTA ATAATGGGGT TTCTGTTGAA CCAGTCTGGT 720 

CACCGTCACT CCCATTCCCA CTCCCTGCCT TCAAATTCCC CTACCAGAGG TTCTGGGTGT 780

GAACGTAACC ATGGGCAGGA TAGCCTGGCA GTGAGAGCTG CATTTGTACA TGCTTTGGGA 840 

GATTTGGTAC AGAGTGTTGG TGTGCTAATA GCTGCATACA TCATACGATT CAAGCCAGAA 900

TACAAGATTG CTGATCCCAT CTGTACATAC GTATTTTCAT TACTTGTGGC TTTTACAACA 960 

TTTCGAATCA TATGGGATAC AGTAGTTATA ATACTAGAAG GTGTGCCAAG CCATTTGAAT 1020 

GTAGACTATA TCAAAGAAGC CTTGATGAAA ATAGAAGATG TATATTCAGT CGAAGATTTA 1080 

AATATCTGGT CTCTCACTTC AGGAAAATCT ACTGCCATAG TTCACATACA GCTAATTCCT 1140 

GGAAGTTCAT CTAAATGGGA GGAAGTACAG TCCAAAGCAA ACCATTTATT ATTGAACACA 1200 

TTTGGCATGT ATAGATGTAC TATTCAGCTT CAGAGTTACA GGCAAGAAGT GGACAGAACT 1260 
TGTGCAAATT GTCAGAGTTC TAGTCCCTGA 

Seq ID NO: 544 Protein sequence 
Protein Accession #: XP_007652.1

1          11         21         31         41         51

|          |          |          |          |          |

MAGSGAWKRL KSMLRKDDAP LFLNDTSAFD FSDEAGDEGL SRFNKLRVVV ADDGSEAPER 60

PVNGAHPTLQ ADDDSLLDQD LPLTNSQLSL KVDSCDNCSK QREILKQRKV KARLTIAAVL 120

YLLFMIGELV GGYIANSLAI MTDALHMLTD LSAIILTLLA LWLSSKSPTK RFTFGFHRLE 180

VLSAMISVLL VYILMGFLLY EAVQRTIHMN YEINGDIMLI TAAVGVAVNV IMGFLLNQSG 240

HRHSHSHSLP SNSPTRGSGC ERNHGQDSLA VRAAFVHALG DLVQSVGVLI AAYIIRFKPE 300

YKIADPICTY VFSLLVAFTT FRIIWDTVVI ILEGVPSHLN VDYIKEALMK IEDVYSVEDL 360

NIWSLTSGKS TAIVHIQLIP GSSSKWEEVQ SKANHLLLNT FGMYRCTIQL QSYRQEVDRT 420 
CANCQSSSP 

Seq ID NO: 545 DNA sequence
Nucleic Acid Accession #: AB037765.1
Coding sequence: 1..2478

1          11         21         31         41         51

|          |          |          |          |          |

ATGTTTTCCG GCTTCAATGT CTTTAGAGTT GGGATCTCTT TTGTCATAAT GTGCATTTTT 60 

TACATGCCAA CAGTAAACTC TTTACCAGAA CTGAGTCCTC AGAAATATTT TAGTACATTG 120 

CAACCAGGAA AAGCCTCTTT AGCTTATTTT TGTCAAGCTG ATTCCCCAAG AACATCTGTA 180 

TTTCTTGAAG AACTGAATGA GGCTGTTAGA CCTCTGCAGG ACTATGGAAT TTCAGTTGCC 240 

AAGGTTAATT GTGTCAAAGA AGAAATATCA AGATACTGTG GAAAAGAAAA GGATTTGATG 300 

AAAGCATATT TATTCAAGGG CAACATATTG CTCAGAGAAT TCCCTACTGA CACCTTGTTT 360 

GATGTGAATG CCATTGTCGC CCATGTTCTC TTTGCTCTTC TTTTTAGTGA AGTGAAATAT 420 

ATTACCAACC TGGAAGACCT TCAGAACATA GAAAATGCTC TGAAAGGAAA AGCAAATATT 480 

ATATTCTCAT ATGTAAGAGC CATTGGAATA CCAGAGCACA GAGCAGTCAT GGAAGCCGCT 540 

TTTGTGTATG GGACTACATA CCAATTTGTC TTAACCACAG AAATTGCCCT TTTGGAAAGT 600 

ATTGGCTCTG AGGATGTGGA ATATGCACAT CTCTACTTTT TTCATTGTAA ACTAGTCTTG 660 

GACTTGACCC AGCAATGTAG AAGAACACTA ATGGAACAGC CATTGACTAC ACTGAACATT 720 

CACCTGTTTA TTAAGACAAT GAAAGCACCT CTGTTGACTG AAGTTGCTGA AGATCCTCAA 780 

CAAGTTTCAA CTGTCCATCT CCAACTGGGC TTACCACTGG TTTTTATTGT TAGCCAACAG 840

GCTACTTATG AAGCTGATAG AAGAACTGCA GAATGGGTTG CTTGGCGTCT TCTGGGAAAA 900 
GCAGGAGTTC TACTCTTGTT AAGGGACTCT TTGGAAGTGA ACATTCCTCA AGATGCTAAT 960 

GTGGTCTTCA AAAGAGCAGA AGAGGGAGTT CCAGTGGAAT TTTTGGTATT ACATGATGTT 1020 

GATTTAATAA TATCTCATGT GGAAAATAAT ATGCACATTG AGGAAATACA AGAAGATGAA 1080 

GACAATGACA TGGAAGGTCC AGATATAGAT GTTCAGGATG ATGAAGTGGC AGAAACTGTT 1140 

TTCAGAGATA GGAAGAGAAA ATTACCTTTG GAACTTACAG TGGAACTAAC AGAAGAAACA 1200 

TTTAATGCAA CAGTGATGGC TTCTGACAGC ATAGTACTCT TCTATGCTGG TTGGCAAGCA 1260 

GTATCCATGG CATTTTTGCA ATCCTATATT GATGTGGCAG TTAAACTGAA AGGCACATCT 1320 

ACTATGCTTC TTACTAGAAT AAACTGTGCA GATTGGTCTG ATGTATGTAC TAAGCAAAAT 1380 

GTTACTGAAT TTCCTATCAT AAAGATGTAC AAGAAAGGCG AGAACCCAGT ATCTTATGCT 1440 

GGAATGTTAG GAACCGAAGA TCTCCTAAAA TTTATCCAGC TCAACAGGAT TTCATATCCA 1500 

GTGAATATAA CATCGATCCA AGAAGCAGAA GAATATTTAA GTGGGGAATT ATATAAAGAC 1560 

CTCATCTTGT ATTCTAGTGT GTCAGTATTG GGACTATTTA GTCCAACCAT GAAAACAGCA 1620 

AAAGAAGATT TTAGTGAAGC AGGAAACTAC CTAAAAGGAT ATGTTATCAC TGGAATTTAT 1680 

TCTGAAGAAG ATGTTTTGCT ACTGTCAACC AAATATGCTG CAAGTCTTCC AGCCCTGCTG 1740 

CTTGCCAGAC ACACAGAAGG CAAAATAGAG AGCATCCCAC TAGCTAGCAC ACATGCACAA 1800 

GACATAGTTC AAATAATAAC AGATGCACTA CTGGAAATGT TTCCGGAAAT CACTGTGGAA 1860 

AATCTTCCCA GTTATTTCAG ACTTCAGAAA CCATTATTGA TTTTGTTCAG TGATGGCACT 1920 

GTAAATCCTC AGTATAAAAA AGCAATATTG ACACTGGTAA AGCAGAAATA CTTGGATTCA 1980 

TTTACTCCAT GCTGGTTAAA TCTAAAGAAT ACTCCAGTGG GGAGAGGAAT CTTGAGGGCA 2040 

TATTTTGATC CTCTGCCTCC CCTTCCTCTT CTTGTTTTGG TGAATCTGCA TTCAGGTGGC 2100 

CAAGTATTTG CATTTCCTTC AGACCAGGCT ATAATTGAAG AAAACCTTGT ATTGTGGCTG 2160 

AAGAAATTAG AAGCAGGACT AGAAAATCAT ATCACAATTT TACCTGCTCA AGAATGGAAA 2220 

CCTCCTCTTC CAGCTTATGA TTTTCTAAGT ATGATAGATG CCGCAACATC TCAACGTGGC 2280 

ACTAGGAAAG TTCCCAAGTG TATGAAAGAA ACAGATGTGC AGGAGAATGA TAAGGAACAA 2340 

CATGAAGATA AATCGGCAGT CAGAAAAGAA CCGATTGAAA CTCTGAGAAT AAAGCATTGG 2400 

AATAGAAGTA ATTGGTTTAA AGAAGCAGAA AAATCATTTA GACGTGATAA AGAGTTAGGA 2460 
TGCTCAAAAG TGAACTAA 

Seq ID NO: 546 Protein sequence 
Protein Accession #: BAA92582.1 

1          11         21         31         41         51
|          |          |          |          |          |
MFSGFNVFRV GISFVIMCIF YMPTVNSLPE LSPQKYFSTL QPGKASLAYF CQADSPRTSV 60
FLEELNEAVR PLQDYGISVA KVNCVKEEIS RYCGKEKDLM KAYLFKGNIL LREFPTDTLF 120
DVNAIVAHVL FALLFSEVKY ITNLEDLQNI ENALKGKANI IFSYVRAIGI PEHRAVMEAA 180
FVYGTTYQFV LTTEIALLES IGSEDVEYAH LYFFHCKLVL DLTQQCRRTL MEQPLTTLNI 240
HLFIKTMKAP LLTEVAEDPQ QVSTVHLQLG LPLVFIVSQQ ATYEADRRTA EWVAWRLLGK 300
AGVLLLLRDS LEVNIPQDAN VVFKRAEEGV PVEFLVLHDV DLIISHVENN MHIEEIQEDE 360
DNDMEGPDID VQDDEVAETV FRDRKRKLPL ELTVELTEET FNATVMASDS IVLFYAGWQA 420
VSMAFLQSYI DVAVKLKGTS TMLLTRINCA DWSDVCTKQN VTEFPIIKMY KKGENPVSYA 480
GMLGTEDLLK FIQLNRISYP VNITSIQEAE EYLSGELYKD LILYSSVSVL GLFSPTMKTA 540
KEDFSEAGNY LKGYVITGIY SEEDVLLLST KYAASLPALL LARHTEGKIE SIPLASTHAQ 600
DIVQIITDAL LEMFPEITVE NLPSYFRLQK PLLILFSDGT VNPQYKKAIL TLVKQKYLDS 660
FTPCWLNLKN TPVGRGILRA YFDPLPPLPL LVLVNLHSGG QVFAFPSDQA IIEENLVLWL 720
KKLEAGLENH ITILPAQEWK PPLPAYDFLS MIDAATSQRG TRKVPKCMKE TDVQENDKEQ 780
HEDKSAVRKE PIETLRIKHW NRSNWFKEAE KSFRRDKELG CSKVN



Seq ID NO: 547 DNA sequence

Nucleic Acid Accession #: NM_033102.1

Coding sequence: 1..1662

1          11         21         31         41         51
|          |          |          |          |          |
ATGGTCCAGA GGCTGTGGGT GAGCCGCCTG            GGAAAGCCCA GCTCTTGCTG 60
GTCAACCTGC TAACCTTTGG CCTGGAGGTG            CAGGCATCAC CTATGTGCGG 120
CCTCTGCTGC TGGAAGTGGG GGTAGAGGAG AAGTTCATGA CCATGGTGCT GGGCATTGGT 180
CCAGTGCTGG GCCTGGTCTG TGTCCCGCTC CTAGGCTCAG CCAGTGACCA CTGGCGTGGA 240
CGCTATGGCC GCCGCCGGCC CTTCATCTGG GCACTGTCCT TGGGCATCCT GCTGAGCCTC 300
TTTCTCATCC CAAGGGCCGG CTGGCTAGCA GGGCTGCTGT GCCCGGATCC CAGGCCCCTG 360
GAGCTGGCAC TGCTCATCCT GGGCGTGGGG CTGCTGGACT TCTGTGGCCA GGTGTGCTTC 420
ACTCCACTGG AGGCCCTGCT CTCTGACCTC TTCCGGGACC CGGACCACTG TCGCCAGGCC 480
TACTCTGTCT ATGCCTTCAT GATCAGTCTT GGGGGCTGCC TGGGCTACCT CCTGCCTGCC 540
ATTGACTGGG ACACCAGTGC CCTGGCCCCC TACCTGGGCA CCCAGGAGGA GTGCCTCTTT 600
GGCCTGCTCA CCCTCATCTT CCTCACCTGC GTAGCAGCCA CACTGCTGGT GGCTGAGGAG 660
GCAGCGCTGG GCCCCACCGA GCCAGCAGAA GGGCTGTCGG CCCCCTCCTT GTCGCCCCAC 720
TGCTGTCCAT GCCGGGCCCG CTTGGCTTTC CGGAACCTGG GCGCCCTGCT TCCCCGGCTG 780
CACCAGCTGT GCTGCCGCAT GCCCCGCACC CTGCGCCGGC TCTTCGTGGC TGAGCTGTGC 840
AGCTGGATGG CACTCATGAC CTTCACGCTG TTTTACACGG ATTTCGTGGG CGAGGGGCTG 900
TACCAGGGCG TGCCCAGAGC TGAGCCGGGC ACCGAGGCCC GGAGACACTA TGATGAAGGC 960
GTTCGGATGG GCAGCCTGGG GCTGTTCCTG CAGTGCGCCA TCTCCCTGGT CTTCTCTCTG 1020
GTCATGGACC GGCTGGTGCA GCGATTCGGC ACTCGAGCAG TCTATTTGGC CAGTGTGGCA 1080
GCTTTCCCTG TGGCTGCCGG TGCCACATGC CTGTCCCACA GTGTGGCCGT GGTGACAGCT 1140
TCAGCCGCCC TCACCGGGTT CACCTTCTCA GCCCTGCAGA TCCTGCCCTA CACACTGGCC 1200
TCCCTCTACC ACCGGGAGAA GCAGGTGTTC CTGCCCAAAT ACCGAGGGGA CACTGGAGGT 1260
GCTAGCAGTG AGGACAGCCT GATGACCAGC TTCCTGCCAG GCCCTAAGCC TGGAGCTCCC 1320
TTCCCTAATG GACACGTGGG TGCTGGAGGC AGTGGCCTGC TCCCACCTCC ACCCGCGCTC 1380
TGCGGGGCCT CTGCCTGTGA TGTCTCCGTA CGTGTGGTGG TGGGTGAGCC CACCGAGGCC 1440
AGGGTGGTTC CGGGCCGGGG CATCTGCCTG GACCTCGCCA TCCTGGATAG TGCCTTCCTG 1500
CTGTCCCAGG TGGCCCCATC CCTGTTTATG GGCTCCATTG TCCAGCTCAG CCAGTCTGTC 1560
ACTGCCTATA TGGTGTCTGC CGCAGGCCTG GGTCTGGTGG CCATTTACTT TGCTACACAG 1620
GTAGTATTTG ACAAGAGCGA CTTGGCCAAA TACTCAGCGT GA

Seq ID NO: 548 Protein sequence
Protein Accession #: NP_149093.1

1          11         21         31         41         51
|          |          |          |          |          |
MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 60
PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL FLIPRAGWLA GLLCPDPRPL 120
ELALLILGVG LLDFCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 180
IDWDTSALAP YLGTQEECLF GLLTLIFLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 240
CCPCRARLAF RNLGALLPRL HQLCCRMPRT LRRLFVAELC SWMALMTFTL FYTDFVGEGL 300
YQGVPRAEPG TEARRHYDEG VRMGSLGLFL QCAISLVFSL VMDRLVQRFG TRAVYLASVA 360
AFPVAAGATC LSHSVAVVTA SAALTGFTFS ALQILPYTLA SLYHREKQVF LPKYRGDTGG 420
ASSEDSLMTS FLPGPKPGAP FPNGHVGAGG SGLLPPPPAL CGASACDVSV RVVVGEPTEA 480
RVVPGRGICL DLAILDSAFL LSQVAPSLFM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 540
VVFDKSDLAK YSA

Seq ID NO: 549 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1389

1          11         21         31         41         51
|          |          |          |          |          |
ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TGACAGAGAA 60
ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120
GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180
GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACGGA CTTTTCCCTT 240
GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300
AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 360
ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 420
ATCCCAGGAG TTGATCCTGA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 480
ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540
TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATGGC AAGGGCAATT 600
TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660
ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720
TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 780
GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 840
TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 900
AGATTTTGTT ATGGTGTCAC TGTCATTTTG ACATACCCTA TGGAATGCTT TGTGACAAGA 960
GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020
ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080
GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140
TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200
ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260
CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 1320
AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380
TTTCAATGA

Seq ID NO: 550 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

|          |          |          |          |          |

MGYQRQEPVI PPQRDLDDRE TLVSEHEYKE KTCQSAALFN VVNSIIGSGI IGLPYSMKQA 60
GFPLGILLLF WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFPGYLL LSVLQFLYPF 120
IAMISYNIIA GDTLSKVFQR IPGVDPENVF IGRHFIIGLS TVTFTLPLSL YRNIAKLGKV 180
SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA FICHHNSFLV 240
YSSLEEPTVA KWSRLIHMSI VISVFICIFF ATCGYLTFTG FTQGDLFENY CRNDDLVTFG 300
RFCYGVTVIL TYPMECFVTR EVIANVFFGG NLSSVFHIVV TVMVITVATL VSLLIDCLGI 360
VLELNGVLCA TPLIFIIPSA CYLKLSEEPR THSDKIMSCV MLPIGAVVMV FGFVMAITNT 420
QDCTHGQEMF YCFPDNFSLT NTSESHVQQT TQLSTLNISI FQ 

Seq ID NO: 551 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1284

1          11         21         31         41         51

|          |          |          |          |          |

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 60 

AAGCAAGCTG GGTTTCCTTT GGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 120 

TTTTCCCTTG TTTTATTGAT AAAAGGAGGG GCCCTCTCTG GAACAGATAC CTACCAGTCT 180 

TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAGTTTTTG 240 

TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 300 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT 420 

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATGCCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT TTATTTGCCA CCATAACTCC 600 

TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT 660 

ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA 720 

TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780 

ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 840 

GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 

ATTGTTGTAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 960

CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 

CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATG 1080 

TCTTGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 1140 

ACAAATACTC AAGACTGCAC CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200 

TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1260 
ATTAGTATCT TTCAACTCGA GTAA 

Seq ID NO: 552 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

|          |          |          |          |          |

MGYQRQEPVI PPQRGLPYSM KQAGFPLGIL LLFWVSYVTD FSLVLLIKGG ALSGTDTYQS 60
LVNKTFGFPG YLLLSVLQFL YPFIAMISYN IIAGDTLSKV FQRIPGVDPE NVFIGRHFII 120
GLSTVTFTLP LSLYRNIAKL GKVSLISTGL TTLILGIVMA RAISLGPHIP KTEDAWVFAK 180
PNAIQAVGVM SFAFICHHNS FLVYSSLEEP TVAKWSRLIH MSIVISVFIC IFFATCGYLT 240
FTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH 300
IVVTVMVITV ATLVSLLIDC LGIVLELNGV LCATPLIFII PSACYLKLSE EPRTHSDKIM 360
SCVMLPIGAV VMVFGFVMAI TNTQDCTHGQ EMFYCFPDNF SLTNTSESHV QQTTQLSTLN 420 
ISIFQLE 

Seq ID NO: 553 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 

1 11 21 31 41 51 

|          |          |          |          |          |

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGT TTTCCCTTGT TTTATTGATA 60 
AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 
TTTCCAGGGT ATCTGCTCCT CTCTGTTCTT CAGTTTTTGT ATCCTTTTAT AGCAATGATA 180 
AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 240 
GATCCTGAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 300 
ACTCTGCCTT TATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 360 
ACAGGTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 420 
CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 480 
GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540
GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600
TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACCCAAGGG 660
GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720
GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA GGTAATTGCC 780
AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840
ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900
AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960
CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020
GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080
CATGGGCAGG AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140
TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200
TAA 

Seq ID NO: 554 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

|          |          |          |          |          |

MGYQRQEPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLLSVL QFLYPFIAMI 60
SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 120
TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAFICH HNSFLVYSSL 180
EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240
GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIVVTVMV ITVATLVSLL IDCLGIVLEL 300
NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAVVMVFGFV MAITNTQDCT 360
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE 

Seq ID NO: 555 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1140

1          11         21         31         41         51

|          |          |          |          |          |

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 

GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480 

GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540 

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 

Seq ID NO: 556 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

|          |          |          |          |          |

MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD 60
PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH 120
IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKWSRL IHMSIVISVF 180
ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240
VFFGGNLSSV FHIVVTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKL 300
SEEPRTHSDK IMSCVMLPIG AVVMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES 360
HVQQTTQLST LNISIFQLE

Seq ID NO: 557 DNA sequence 

Nucleic Acid Accession #: XM_057188.1

Coding sequence: 769..4269

1          11         21         31         41         51

|          |          |          |          |          |

ATGGGATTGC CTCTCCCTCT CACTCTGGGC TTCTGTCCCA CTCTTATCTT AGTGTCAGTC 60 

CTCCCCCAAG TCTGTGTCCC TCTCTCTCCC CTAAATCTCT GGCCCCTCCT TTCTGAGTTC 120 

CTGCCCTTCC CCCAATTCTT TGGTTTTTGC ATCCCCCTCT GCCCCTTGCC TCAGTCAAGT 180 

CTCCCCTGTG TGTCTCTCTC CCCCCGGCCC GGACCTCTGC ACCCCCCAGG TCGCTGTCCC 240 

TCTGTCCCCT TATCGCGGCC TGGGACCCGC CCTCTCCCCG CCTCCCGCTT TGGCGTCTCC 300 

AAGACTCCCC GCCCCCCAGA CCTCGCCCCG CCCCAGGCTA GGCTGGAAAG TGGAGGATCC 360

GGTTTGCTCT GGGCGGGTCT GGAAGCAGAG CCGGCGGAGG GAGCGCCGGG GCCCTGGGCT 420 

GCAGGAGGTT GCGGCGGCCG CGGCAGCATG GTGGTGCCGG AGAAGGAGCA GAGCTGGATC 480 

CCCAAGATCT TCAAGAAGAA GACCTGCACG ACGTTCATAG TTGACTCCAC AGATCCGGGG 540 

GATGGGGTCT CGCTCTATTG CCCAGGCTGT TCTCAAACTC CTGGGCTCAA GCAGTCCTTC 600 

TGCCTTGACC TCCCAAAGTG CTTGGATTGT GCCCAGCCGT CCTTGAAGTT TTGCTCAGAA 660 

GAGCAAACTT TCTGGGAAGT AGCTGCAGGT GTTGGAAGTA GCTGCAGGGG AACTAGGGGA 720 
TTCAAGGATG GAGCTGAAAT GGGTGAACGG ACAAAGTCGG TAAACTGAAT GGAGGATGCC 780
TTCGGGGCAG CCGTGGTGAC CGTGTGGGAC AGCGATGCAC ACACCACGGA GAAGCCCACC 840 
GATGCCTACG GAGAGCTGGA CTTCACGGGG GCCGGCCGCA AGCACAGCAA TTTCCTCCGG 900 
CTCTCTGACC GAACGGATCC AGCTGCAGTT TATAGTCTGG TCACACGCAC ATGGGGCTTC 960 
CGTGCCCCGA ACCTGGTGGT GTCAGTGCTG GGGGGATCGG GGGGCCCCGT CCTCCAGACC 1020 
TGGCTGCAGG ACCTGCTGCG TCGTGGGCTG GTGCGGGCTG CCCAGAGCAC AGGAGCCTGG 1080 
ATTGTCACTG GGGGTCTGCA CACGGGCATC GGCCGGCATG TTGGTGTGGC TGTACGGGAC 1140 
CATCAGATGG CCAGCACTGG GGGCACCAAG GTGGTGGCCA TGGGTGTGGC CCCCTGGGGT 1200 
GTGGTCCGGA ATAGAGACAC CCTCATCAAC CCCAAGGGCT CGTTCCCTGC GAGGTACCGG 1260 
TGGCGCGGTG ACCCGGAGGA CGGGGTCCAG TTTCCCCTGG ACTACAACTA CTCGGCCTTC 1320 
TTCCTGGTGG ACGACGGCAC ACACGGCTGC CTGGGGGGCG AGAACCGCTT CCGCTTGCGC 1380 
CTGGAGTCCT ACATCTCACA GCAGAAGACG GGCGTGGGAG GGACTGGAAT TGACATCCCT 1440 
GTCCTGCTCC TCCTGATTGA TGGTGATGAG AAGATGTTGA CGCGAATAGA GAACGCCACC 1500 
CAGGCTCAGC TCCCATGTCT CCTCGTGGCT GGCTCAGGGG GAGCTGCGGA CTGCCTGGCG 1560 
GAGACCCTGG AAGACACTCT GGCCCCAGGG AGTGGGGGAG CCAGGCAAGG CGAAGCCCGA 1620 
GATCGAATCA GGCGTTTCTT TCCCAAAGGG GACCTTGAGG TCCTGCAGGC CCAGGTGGAG 1680 
AGGATTATGA CCCGGAAGGA GCTCCTGACA GTCTATTCTT CTGAGGATGG GTCTGAGGAA 1740 
TTCGAGACCA TAGTTTTGAA GGCCCTTGTG AAGGCCTGTG GGAGCTCGGA GGCCTCAGCC 1800 
TACCTGGATG AGCTGCGTTT GGCTGTGGCT TGGAACCGCG TGGACATTGC CCAGAGTGAA 1860 
CTCTTTCGGG GGGACATCCA ATGGCGGTCC TTCCATCTCG AAGCTTCCCT CATGGACGCC 1920 
CTGCTGAATG ACCGGCCTGA GTTCGTGCGC TTGCTCATTT CCCACGGCCT CAGCCTGGGC 1980 
CACTTCCTGA CCCCGATGCG CCTGGCCCAA CTCTACAGCG CGGCGCCCTC CAACTCGCTC 2040
ATCCGCAACC TTTTGGACCA GGCGTCCCAC AGCGCAGGCA CCAAAGCCCC AGCCCTAAAA 2100 
GGGGGAGCTG CGGAGCTCCG GCCCCCTGAC GTGGGGCATG TGCTGAGGAT GCTGCTGGGG 2160 
AAGATGTGCG CGCCGAGGTA CCCCTCCGGG GGCGCCTGGG ACCCTCACCC AGGCCAGGGC 2220 
TTCGGGGAGA GCATGTATCT GCTCTCGGAC AAGGCCACCT CGCCGCTCTC GCTGGATGCT 2280 
GGCCTCGGGC AGGCCCCCTG GAGCGACCTG CTTCTTTGGG CACTGTTGCT GAACAGGGCA 2340 
CAGATGGCCA TGTACTTCTG GGAGATGGGT TCCAATGCAG TTTCCTCAGC TCTTGGGGCC 2400 
TGTTTGCTGC TCCGGGTGAT GGCACGCCTG GAGCCTGACG CTGAGGAGGC AGCACGGAGG 2460 
AAAGACCTGG CGTTCAAGTT TGAGGGGATG GGCGTTGACC TCTTTGGCGA GTGCTATCGC 2520 
AGCAGTGAGG TGAGGGCTGC CCGCCTCCTC CTCCGTCGCT GCCCGCTCTG GGGGGATGCC 2580 
ACTTGCCTCC AGCTGGCCAT GCAAGCTGAC GCCCGTGCCT TCTTTGCCCA GGATGGGGTA 2640 
CAGTCTCTGC TGACACAGAA GTGGTGGGGA GATATGGCCA GCACTACACC CATCTGGGCC 2700 
CTGGTTCTCG CCTTCTTTTG CCCTCCACTC ATCTACACCC GCCTCATCAC CTTCAGGAAA 2760 
TCAGAAGAGG AGCCCACACG GGAGGAGCTA GAGTTTGACA TGGATAGTGT CATTAATGGG 2820 
GAAGGGCCTG TCGGGACGGC GGACCCAGCC GAGAAGACGC CGCTGGGGGT CCCGCGCCAG 2880 
TCGGGCCGTC CGGGTTGCTG CGGGGGCCGC TGCGGGGGGC GCCGGTGCCT ACGCCGCTGG 2940 
TTCCACTTCT GGGGCGCGCC GGTGACCATC TTCATGGGCA ACGTGGTCAG CTACCTGCTG 3000 
TTCCTGCTGC TTTTCTCGCG GGTGCTGCTC GTGGATTTCC AGCCGGCGCC GCCCGGCTCC 3060
CTGGAGCTGC TGCTCTATTT CTGGGCTTTC ACGCTGCTGT GCGAGGAACT GCGCCAGGGC 3120 
CTGAGCGGAG GCGGGGGCAG CCTCGCCAGC GGGGGCCCCG GGCCTGGCCA TGCCTCACTG 3180 
AGCCAGCGCC TGCGCCTCTA CCTCGCCGAC AGCTGGAACC AGTGCGACCT AGTGGCTCTC 3240 
ACCTGCTTCC TCCTGGGCGT GGGCTGCCGG CTGACCCCGG GTTTGTACCA CCTGGGCCGC 3300 
ACTGTCCTCT GCATCGACTT CATGGTTTTC ACGGTGCGGC TGCTTCACAT CTTCACGGTC 3360 
AACAAACAGC TGGGGCCCAA GATCGTCATC GTGAGCAAGA TGATGAAGGA CGTGTTCTTC 3420 
TTCCTCTTCT TCCTCGGCGT GTGGCTGGTA GCCTATGGCG TGGCCACGGA GGGGCTCCTG 3480 
AGGCCACGGG ACAGTGACTT CCCAAGTATC CTGCGCCGCG TCTTCTACCG TCCCTACCTG 3540 
CAGATCTTCG GGCAGATTCC CCAGGAGGAC ATGGACGTGG CCCTCATGGA GCACAGCAAC 3600 
TGCTCGTGGG AGCCCGGCTT CTGGGCACAC CCTCCTGGGG CCCAGGCGGG CACCTGCGTC 3660 
TCCCAGTATG CCAACTGGCT GGTGGTGCTG CTCCTCGTCA TCTTCCTGCT CGTGGCCAAC 3720 
ATCCTGCTGG TCAACTTGCT CATTGCCATG TTCAGTTACA CATTCGGCAA AGTACAGGGC 3780 
AACAGCGATC TCTACTGGAA GGCGCAGCGT TACCGCCTCA TCCGGGAATT CCACTCTCGG 3840
CCCGCGCTGG CCCCGCCCTT TATCGTCATC TCCCACTTGC GCCTCCTGCT CAGGCAATTG 3900
TGCAGGCGAC CCCGGAGCCC CCAGCCGTCC TCCCCGGCCC TCGAGCATTT CCGGGTTTAC 3960
CTTTCTAAGG AAGCCGAGCG GAAGCTGCTA ACGTGGGAAT CGGTGCATAA GGAGAACTTT 4020 
CTGCTGGCAC GCGCTAGGGA CAAGCGGGAG AGCGACTCCG AGCGTCTGAA GCGCACGTCC 4080 
CAGAAGGTGG ACTTGGCACT GAAACAGCTG GGACACATCC GCGAGTACGA ACAGCGCCTG 4140 
AAAGTGCTGG AGCGGGAGGT CCAGCAGTGT AGCCGCGTCC TGGGGTGGGT GGCCGAGGCC 4200 
CTGAGCCGCT CTGCCTTGCT GCCCCCAGGT GGGCCGCCAC CCCCTGACCT GCCTGGGTCC 4260 
AAAGACTGAG CCCTGCTGGC GGACTTCAAG GAGAAGCCCC CACAGGGGAT TTTGCTCCTA 4320 
GAGTAAGGCT CATCTGGGCC TCGGCCCCCG CACCTGGTGG CCTTGTCCTT GAGGTGAGCC 4380 
CCATGTCCAT CTGGGCCACT GTCAGGACCA CCTTTGGGAG TGTCATCCTT ACAAACCACA 4440 
GCATGCCCGG CTCCTCCCAG AACCAGTCCC AGCCTGGGAG GATCAAGGCC TGGATCCCGG 4500 
GCCGTTATCC ATCTGGAGGC TGCAGGGTCC TTGGGGTAAC AGGGACCACA GACCCCTCAC 4560 
CACTCACAGA TTCCTCACAC TGGGGAAATA AAGCCATTTC AGAGGAAAAA AAAAAAAAAA 4620 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 558 Protein sequence
Protein Accession #: XP_057188.1

1          11         21         31         41         51

|          |          |          |          |          |

MEDAFGAAVV TVWDSDAHTT EKPTDAYGEL DFTGAGRKHS NFLRLSDRTD PAAVYSLVTR 60

TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL HTGIGRHVGV 120

AVRDHQMAST GGTKVVAMGV APWGVVRNRD TLINPKGSFP ARYRWRGDPE DGVQFPLDYN 180

YSAFFLVDDG THGCLGGENR FRLRLESYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 240

ENATQAQLPC LLVAGSGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 300

AQVERIMTRK ELLTVYSSED GSEEFETIVL KALVKACGSS EASAYLDELR LAVAWNRVDI 360

AQSELFRGDI QWRSFHLEAS LMDALLNDRP EFVRLLISHG LSLGHFLTPM RLAQLYSAAP 420

SNSLIRNLLD QASHSAGTKA PALKGGAAEL RPPDVGHVLR MLLGKMCAPR YPSGGAWDPH 480

PGQGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLWALL LNRAQMAMYF WEMGSNAVSS 540

ALGACLLLRV MARLEPDAEE AARRKDLAFK FEGMGVDLFG ECYRSSEVRA ARLLLRRCPL 600

WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF CPPLIYTRLI 660

TFRKSEEEPT REELEFDMDS VINGEGPVGT ADPAEKTPLG VPRQSGRPGC CGGRCGGRRC 720 
PCT/US02/36810

LRRWFHFWGA PVTIFMGNVV SYLLFLLLFS RVLLVDFQPA PPGSLELLLY FWAFTLLCEE 780

LRQGLSGGGG SLASGGPGPG HASLSQRLRL YLADSWNQCD LVALTCFLLG VGCRLTPGLY 840

HLGRTVLCID FMVFTVRLLH IFTVNKQLGP KIVIVSKMMK DVFFFLFFLG VWLVAYGVAT 900

EGLLRPRDSD FPSILRRVFY RPYLQIFGQI PQEDMDVALM EHSNCSSEPG FWAHPPGAQA 960

GTCVSQYANW LVVLLLVIFL LVANILLVNL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020

FHSRPALAPP FIVISHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTWESVH 1080

KENFLLARAR DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSRVLGW 1140

VAEALSRSAL LPPGGPPPPD LPGSKD



Seq ID NO: 559 DNA sequence

Nucleic Acid Accession #: NM_006853.1

Coding sequence: 26..874

1          11         21         31         41         51
|          |          |          |          |          |
AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTGAGG TGGCTGCGGG ACTGGAAGTC 60
ATCGGGCAGA GGTCTCACAG CAGCCAAGGA ACCTGGGGCC CGCTCCTCCC CCCTCCAGGC 120
CATGAGGATT CTGCAGTTAA TCCTGCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180
CAGGATCATC AAGGGGTTCG AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240
CGAGAAGACG CGGCTACTCT GTGGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300
AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360
GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420
CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCGCCAGT 480
CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540
CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600
CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660
CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720
GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780
CCAGGATCCG TGTGCGATCA CCCGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840
GGACTGGATC CAGGAGACGA TGAAGAACAA TTAGACTGGA CCCACCCACC ACAGCCCATC 900
ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960
CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020
AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080
GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140
TCCTGGCCAT ATATCAAGGT TTCAATAAAT ATTTGCTAAA TGAGTG


Seq ID NO: 560 Protein sequence 
Protein Accession #: NP_006844.1

1          11         21         31         41         51

|          |          |          |          |          |

MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRWLLTA 60 

AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 120 

SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 180 

NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 240 
DWIQETMKNN 

Seq ID NO: 561 DNA sequence 

Nucleic Acid Accession #: AY046419.1 

Coding sequence: 1..1743 

1 11 21 31 41 51 

|          |          |          |          |          |

ATGTTTACCT TCCTGTCATC TGTCACTGCT GCTGTCAGTG GCCTCCTGGT GGGTTATGAA 60 

CTTGGGATCA TCTCTGGGGC TCTTCTTCAG ATCAAAACCT TATTAGCCCT GAGCTGCCAT 120

GAGCAGGAAA TGGTTGTGAG CTCCCTCGTC ATTGGAGCCC TCCTTGCCTC ACTCACCGGA 180 

GGGGTCCTGA TAGACAGATA TGGAAGAAGG ACAGCAATCA TCTTGTCATC CTGCCTGCTT 240 

GGACTCGGAA GCTTAGTCTT GATCCTCAGT TTATCCTACA CGGTTCTTAT AGTGGGACGC 300 

ATTGCCATAG GGGTTTCCAT CTCCCTCTCT TCCATTGCCA CTTGTGTTTA CATCGCAGAG 360 

ATTGCTCCTC AACACAGAAG AGGCCTTCTT GTGTCACTGA ATGAGCTGAT GATTGTCATC 420 

GGCATTCTTT CTGCCTATAT TTCAAATTAC GCATTTGCCA ATGTTTTCCA TGGCTGGAAG 480 

TACATGTTTG GTCTTGTGAT TCCCTTGGGA GTTTTGCAAG CAATTGCAAT GTATTTTCTT 540 

CCTCCAAGCC CTCGGTTTCT GGTGATGAAA GGACAAGAGG GAGCTGCTAG CAAGGTTCTT 600 

GGAAGGTTAA GAGCACTCTC AGATACAACT GAGGAACTCA CTGTGATCAA ATCCTCCCTG 660 

AAAGATGAAT ATCAGTACAG TTTTTGGGAT CTGTTTCGTT CAAAAGACAA CATGCGGACC 720 

CGAATAATGA TAGGACTAAC ACTAGTATTT TTTGTACAAA TCACTGGCCA ACCAAACATA 780 

TTGTTCTATG CATCAACTGT TTTGAAGTCA GTTGGATTTC AAAGCAATGA GGCAGCTAGC 840 

CTCGCCTCCA CTGGGGTTGG AGTCGTCAAG GTCATTAGCA CCATCCCTGC CACTCTTCTT 900 

GTAGACCATG TCGGCAGCAA AACATTCCTC TGCATTGGCT CCTCTGTGAT GGCAGCTTCG 960 

TTGGTGACCA TGGGCATCGT AAATCTCAAC ATCCACATGA ACTTCACCCA TATCTGCAGA 1020 

AGCCACAATT CTATCAACCA GTCCTTGGAT GAGTCTGTGA TTTATGGACC AGGAAACCTG 1080 

TCAACCAACA ACAATACTCT CAGAGACCAC TTCAAAGGGA TTTCTTCCCA TAGCAGAAGC 1140 

TCACTCATGC CCCTGAGAAA TGATGTGGAT AAGAGAGGGG AGACGACCTC AGCATCCTTG 1200 

CTAAATGCTG GATTAAGCCA CACTGAATAC CAGATAGTCA CAGACCCTGG GGACGTCCCA 1260

GCTTTTTTGA AATGGCTGTC CTTAGCCAGC TTGCTTGTTT ATGTTGCTGC TTTTTCAATT 1320 

GGTCTAGGAC CAATGCCCTG GCTGGTGCTC AGCGAGATCT TTCCTGGTGG GATCAGAGGA 1380 

CGAGCCATGG CTTTAACTTC TAGCATGAAC TGGGGCATCA ATCTCCTCAT CTCGCTGACA 1440

TTTTTGACTG TAACTGATCT TATTGGCCTG CCATGGGTGT GCTTTATATA TACAATCATG 1500 

AGTCTAGCAT CCCTGCTTTT TGTTGTTATG TTTATACCTG AGACAAAGGG ATGCTCTTTG 1560 

GAACAAATAT CAATGGAGCT AGCAAAAGTG AACTATGTGA AAAACAACAT TTGTTTTATG 1620 

AGTCATCACC AAGAAGAATT AGTGCCAAAA CAGCCTCAAA AAAGAAAACC CCAGGAGCAG 1680

CTCTTGGAGT GTAACAAGCT GTGTGGTAGG GGCCAATCCA GGCAGCTTTC TCCAGAGACC 1740 
TAA 

Seq ID NO: 562 Protein sequence
Protein Accession #: AAL02327.1

1          11         21         31         41         51
|          |          |          |          |          |
MFTFLSSVTA AVSGLLVGYE LGIISGALLQ IKTLLALSCH EQEMVVSSLV IGALLASLTG 60
GVLIDRYGRR TAIILSSCLL GLGSLVLILS LSYTVLIVGR IAIGVSISLS SIATCVYIAE 120
IAPQHRRGLL VSLNELMIVI GILSAYISNY AFANVFHGWK YMFGLVIPLG VLQAIAMYFL 180
PPSPRFLVMK GQEGAASKVL GRLRALSDTT EELTVIKSSL KDEYQYSFWD LFRSKDNMRT 240
RIMIGLTLVF FVQITGQPNI LFYASTVLKS VGFQSNEAAS LASTGVGVVK VISTIPATLL 300
VDHVGSKTFL CIGSSVMAAS LVTMGIVNLN IHMNFTHICR SHNSINQSLD ESVIYGPGNL 360
STNNNTLRDH FKGISSHSRS SLMPLRNDVD KRGETTSASL LNAGLSHTEY QIVTDPGDVP 420
AFLKWLSLAS LLVYVAAFSI GLGPMPWLVL SEIFPGGIRG RAMALTSSMN WGIKLLISLT 480
FLTVTDLIGL PWVCFIYTIM SLASLLFVVM FIPETKGCSL EQISMELAKV NYVKNNICFM 540
SHHQEELVPK QPQKRKPQEQ LLECNKLCGR GQSRQLSPET



Seq ID NO: 563 DNA sequence 

Nucleic Acid Accession #: XM_059466.1 

Coding sequence: 1..894

1          11         21         31         41         51
|          |          |          |          |          |
ATGGAGCCGC GGGCGCTCGT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG
CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GCGCCACAAG
GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG
CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCCCGCTGG GGCGCCGGCT GCTCCCGGGC
GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTCGCTCC TGGGGCTCGG CGGGCTGGAC
GCCGAGTGCG GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC
CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG
GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC
AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC
CTCGGCATGG CCGTAGCCGT CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC
TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT
TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA
AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC
GCCTGGTGCA GTTTAGGCTT TATTGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA



Seq ID NO: 564 Protein sequence 
Protein Accession #: XP_059466.1

1          11         21         31         41         51
|          |          |          |          |          |

MEPRALVTAL SLGLSLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM 
PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF 
LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDEWHL LHLRRITAGF 
LGMAVAVLLC GCIVATVSFF WEESLTQHVA GLLFLMTGIF CTISLCTYAA SISYDLNRLP 
KLIYSLPADV EHGYSWSIFC AWCSLGFIVA AGGLCIAYPF ISRTKIAQLK SGRDSTV 

Seq ID NO: 565 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3315

1          11         21         31         41         51
|          |          |          |          |          |
ATGTCCTTTC GGGCAGCCAG GCTCAGCATG AGGAACAGAA GGAATGACAC TCTGGACAGC 60
ACCCGGACCC TGTACTCCAG CGCGTCTCGG AGCACAGACT TGTCTTACAG TGAAAGCGAC 120
TTGGTGAATT TTATTCAAGC AAATTTTAAG AAACGAGAAT GTGTCTTCTT TACCAAAGAT 180
TCCAAGGCCA CGGAGAATGT GTGCAAGTGT GGCTATGCCC AGAGCCAGCA CATGGAAGGC 240
ACCCAGATCA ACCAAAGTGA GAAATGGAAC TACAAGAAAC ACACCAAGGA ATTTCCTACC 300
GACGCCTTTG GGGATATTCA GTTTGAGACA CTGGGGAAGA AAGGGAAGTA TATACGTCTG 360
TCCTGCGACA CGGACGCGGA AATCCTTTAC GAGCTGCTGA CCCAGCACTG GCACCTGAAA 420
ACACCCAACC TGGTCATTTC TGTGACCGGG GGCGCCAAGA ACTTCGCCCT GAAGCCGCGC 480
ATGCGCAAGA TCTTCAGCCG GCTCATCTAC ATCGCGCAGT CCAAAGGTGC TTGGATTCTC 540
ACGGGAGGCA CCCATTATGG CCTGATGAAG TACATCGGGG AGGTGGTGAG AGATAACACC 600
ATCAGCAGGA GTTCAGAGGA GAATATTGTG GCCATTGGCA TAGCAGCTTG GGGCATGGTC 660
TCCAACCGGG ACACCCTCAT CAGGAATTGC GATGCTGAGG GCTATTTTTT AGCCCAGTAC 720
CTTATGGATG ACTTCACAAG AGATCCACTG TATATCCTGG ACAACAACCA CACACATTTG 780
CTGCTCGTGG ACAATGGCTG TCATGGACAT CCCACTGTCG AAGCAAAGCT CCGGAATCAG 840
CTAGAGAAGT ATATCTCTGA GCGCACTATT CAAGATTCCA ACTATGGTGG CAAGATCCCC 900
ATTGTGTGTT TTGCCCAAGG AGGTGGAAAA GAGACTTTGA AAGCCATCAA TACCTCCATC 960
AAAAATAAAA TTCCTTGTGT GGTGGTGGAA GGCTCGGGCC AGATCGCTGA TGTGATCGCT 1020
AGCCTGGTGG AGGTGGAGGA TGCCCTGACA TCTTCTGCCG TCAAGGAGAA GCTGGTGCGC 1080
TTTTTACCCC GCACGGTGTC CCGGCTGCCT GAGGAGGAGA CTGAGAGTTG GATCAAATGG 1140
CTCAAAGAAA TTCTCGAATG TTCTCACCTA TTAACAGTTA TTAAAATGGA AGAAGCTGGG 1200
GATGAAATTG TGAGCAATGC CATCTCCTAC GCTCTATACA AAGCCTTCAG CACCAGTGAG 1260
CAAGACAAGG ATAACTGGAA TGGGCAGCTG AAGCTTCTGC TGGAGTGGAA CCAGCTGGAC 1320
TTAGCCAATG ATGAGATTTT CACCAATGAC CGCCGATGGG AGTCTGCTGA CCTTCAAGAA 1380
GTCATGTTTA CGGCTCTCAT AAAGGACAGA CCCAAGTTTG TCCGCCTCTT TCTGGAGAAT 1440
GGCTTGAACC TACGGAAGTT TCTCACCCAT GATGTCCTCA CTGAACTCTT CTCCAACCAC 1500
TTCAGCACGC TTGTGTACCG GAATCTGCAG ATCGCCAAGA ATTCCTATAA TGATGCCCTC 1560
CTCACGTTTG TCTGGAAACT GGTTGCGAAC TTCCGAAGAG GCTTCCGGAA GGAAGACAGA 1620
AATGGCCGGG ACGAGATGGA CATAGAACTC CACGACGTGT CTCCTATTAC TCGGCACCCC 1680
CTGCAAGCTC TCTTCATCTG GGCCATTCTT CAGAATAAGA AGGAACTCTC CAAAGTCATT 1740
TGGGAGCAGA CCAGGGGCTG CACTCTGGCA GCCCTGGGAG CCAGCAAGCT TCTGAAGACT 1800
CTGGCCAAAG TGAAGAACGA CATCAATGCT GCTGGGGAGT CCGAGGAGCT GGCTAATGAG 1860 
TACGAGACCC GGGCTGTTGA GCTGTTCACT GAGTGTTACA GCAGCGATGA AGACTTGGCA 1920 
GAACAGCTGC TGGTCTATTC CTGTGAAGCT TGGGGTGGAA GCAACTGTCT GGAGCTGGCG 1980 
GTGGAGGCCA CAGACCAGCA TTTCATCGCC CAGCCTGGGG TCCAGAATTT TCTTTCTAAG 2040 
CAATGGTATG GAGAGATTTC CCGAGACACC AAGAACTGGA AGATTATCCT GTGTCTGTTT 2100 
ATTATACCCT TGGTGGGCTG TGGCTTTGTA TCATTTAGGA AGAAACCTGT CGACAAGCAC 2160 
AAGAAGCTGC TTTGGTACTA TGTGGCGTTC TTCACCTCCC CCTTCGTGGT CTTCTCCTGG 2220 
AATGTGGTCT TCTACATCGC CTTCCTCCTG CTGTTTGCCT ACGTGCTGCT CATGGATTTC 2280 
CATTCGGTGC CACACCCCCC CGAGCTGGTC CTGTACTCGC TGGTCTTTGT CCTCTTCTGT 2340 
GATGAAGTGA GACAGTGGTA CGTAAATGGG GTGAATTATT TTACTGACCT GTGGAATGTG 2400 
ATGGACACGC TGGGGCTTTT TTACTTCATA GCAGGAATTG TATTTCGGCT CCACTCTTCT 2460 
AATAAAAGCT CTTTGTATTC TGGACGAGTC ATTTTCTGTC TGGACTACAT TATTTTCACT 2520 
CTAAGATTGA TCCACATTTT TACTGTAAGC AGAAACTTAG GACCCAAGAT TATAATGCTG 2580 
CAGAGGATGC TGATCGATGT GTTCTTCTTC CTGTTCCTCT TTGCGGTGTG GATGGTGGCC 2640 
TTTGGCGTGG CCAGGCAAGG GATCCTTAGG CAGAATGAGC AGCGCTGGAG GTGGATATTC 2700 
CGTTCGGTCA TCTACGAGCC CTACCTGGCC ATGTTCGGCC AGGTGCCCAG TGACGTGGAT 2760 
GGTACCACGT ATGACTTTGC CCACTGCACC TTCACTGGGA ATGAGTCCAA GCCACTGTGT 2820 
GTGGAGCTGG ATGAGCACAA CCTGCCCCGG TTCCCCGAGT GGATCACCAT CCCCCTGGTG 2880 
TGCATCTACA TGTTATCCAC CAACATCCTG CTGGTCAACC TGCTGGTCGC CATGTTTGGC 2940 
TACACGGTGG GCACCGTCCA GGAGAACAAT GACCAGGTCT GGAAGTTCCA GAGGTACTTC 3000 
CTGGTGCAGG AGTACTGCAG CCGCCTCAAT ATCCCCTTCC CCTTCATCGT CTTCGCTTAC 3060 
TTCTACATGG TGGTGAAGAA GTGCTTCAAG TGTTGCTGCA AGGAGAAAAA CATGGAGTCT 3120 
TCTGTCTGCT GTTTCAAAAA TGAAGACAAT GAGACTCTGG CATGGGAGGG TGTCATGAAG 3180 
GAAAACTACC TTGTCAAGAT CAACACAAAA GCCAACGACA CCTCAGAGGA AATGAGGCAT 3240 
CGATTTAGAC AACTGGATAC AAAGCTTAAT GATCTCAAGG GTCTTCTGAA AGAGATTGCT 3300 
AATAAAATCA AATGA 

Seq ID NO: 566 Protein sequence 
Protein Accession #: Eos sequence 

1          11         21         31         41         51
|          |          |          |          |          |
MSFRAARLSM RNRRNDTLDS TRTLYSSASR STDLSYSESD LVNFIQANFK KRECVFFTKD 60
SKATENVCKC GYAQSQHMEG TQINQSEKWN YKKHTKEFPT DAFGDIQFET LGKKGKYIRL 120
SCDTDAEILY ELLTQHWHLK TPNLVISVTG GAKNFALKPR MRKIFSRLIY IAQSKGAWIL 180
TGGTHYGLMK YIGEVVRDNT ISRSSEENIV AIGIAAWGMV SNRDTLIRNC DAEGYFLAQY 240
LMDDFTRDPL YILDNNHTHL LLVDNGCHGH PTVEAKLRNQ LEKYISERTI QDSNYGGKIP 300
IVCFAQGGGK ETLKAINTSI KNKIPCVVVE GSGQIADVIA SLVEVEDALT SSAVKEKLVR 360
FLPRTVSRLP EEETESWIKW LKEILECSHL LTVIKMEEAG DEIVSNAISY ALYKAFSTSE 420
QDKDNWNGQL KLLLEWNQLD LANDEIFTND RRWESADLQE VMFTALIKDR PKFVRLFLEN 480
GLNLRKFLTH DVLTELFSNH FSTLVYRNLQ IAKNSYNDAL LTFVWKLVAN FRRGFRKEDR 540
NGRDEMDIEL HDVSPITRHP LQALFIWAIL QNKKELSKVI WEQTRGCTLA ALGASKLLKT 600
LAKVKNDINA AGESEELANE YETRAVELFT ECYSSDEDLA EQLLVYSCEA WGGSNCLELA 660
VEATDQHFIA QPGVQNFLSK QWYGEISRDT KNWKIILCLF IIPLVGCGFV SFRKKPVDKH 720
KKLLWYYVAF FTSPFVVFSW NVVFYIAFLL LFAYVLLMDF HSVPHPPELV LYSLVFVLFC 780
DEVRQWYVNG VNYFTDLWNV MDTLGLFYFI AGIVFRLHSS NKSSLYSGRV IFCLDYIIFT 840
LRLIHIFTVS RNLGPKIIML QRMLIDVFFF LFLFAVWMVA FGVARQGILR QNEQRWRWIF 900
RSVIYEPYLA MFGQVPSDVD GTTYDFAHCT FTGNESKPLC VELDEHNLPR FPEWITIPLV 960
CIYMLSTNIL LVNLLVAMFG YTVGTVQENN DQVWKFQRYF LVQEYCSRLN IPFPFIVFAY 1020
FYMVVKKCFK CCCKEKNMES SVCCFKNEDN ETLAWEGVMK ENYLVKINTK ANDTSEEMRH 1080
RFRQLDTKLN DLKGLLKEIA NKIK 

Seq ID NO: 567 DNA sequence 

Nucleic Acid Accession #: NM_006911.1 

Coding sequence: 1..558 



1          11         21         31         41         51
|          |          |          |          |          |

ATGCCTCGCC TGTTCTTGTT CCACCTGCTA GAATTCTGTT TACTACTGAA CCAATTTTCC 60 

AGAGCAGTCG CGGCCAAATG GAAGGACGAT GTTATTAAAT TATGCGGCCG CGAATTAGTT 120 

CGCGCGCAGA TTGCCATTTG CGGCATGAGC ACCTGGAGCA AAAGGTCTCT GAGCCAGGAA 180 

GATGCTCCTC AGACACCTAG ACCAGTGGCA GAAATTGTAC CATCCTTCAT CAACAAAGAT 240 

ACAGAAACTA TAATTATCAT GTTGGAATTC ATTGCTAATT TGCCACCGGA GCTGAAGGCA 300 

GCCCTATCTG AGAGGCAACC ATCATTACCA GAGCTACAGC AGTATGTACC TGCATTAAAG 360 

GATTCCAATC TTAGCTTTGA AGAATTTAAG AAACTTATTC GCAATAGGCA AAGTGAAGCC 420 

GCAGACAGCA ATCCTTCAGA ATTAAAATAC TTAGGCTTGG ATACTCATTC TCAAAAAAAG 480 

AGACGACCCT ACGTGGCACT GTTTGAGAAA TGTTGCCTAA TTGGTTGTAC CAAAAGGTCT 540 
CTTGCTAAAT ATTGCTGA 

Seq ID NO: 568 Protein sequence 
Protein Accession #: NP_008842.1

1          11         21         31         41         51
|          |          |          |          |          |
MPRLFLFHLL EFCLLLNQFS RAVAAKWKDD VIKLCGRELV RAQIAICGMS TWSKRSLSQE 60
DAPQTPRPVA EIVPSFINKD TETIIIMLEF IANLPPELKA ALSERQPSLP ELQQYVPALK 120
DSNLSFEEFK KLIRNRQSEA ADSNPSELKY LGLDTHSQKK RRPYVALFEK CCLIGCTKRS 180
LAKYC 
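
The coding-sequence range given for Seq ID NO: 567 (1..558) can be cross-checked against the protein of Seq ID NO: 568. The following is a minimal sketch of such a check, not part of the listing itself: it assumes the standard genetic code, and the helper names `clean` and `translate` are hypothetical. It strips the position counters and spacing from a row, translates the first 60 nucleotides, and compares the result with the first 20 residues of the protein.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Keep only sequence letters: drop spaces, newlines and position counters."""
    return "".join(ch for ch in block.upper() if ch.isalpha())


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Codons that are not in the table (for example, ones damaged by OCR)
    are reported as 'X' instead of raising an error.
    """
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of Seq ID NO: 567 and of Seq ID NO: 568, as printed in the listing.
first_row_dna = "ATGCCTCGCC TGTTCTTGTT CCACCTGCTA GAATTCTGTT TACTACTGAA CCAATTTTCC 60"
first_row_protein = "MPRLFLFHLL EFCLLLNQFS RAVAAKWKDD VIKLCGRELV RAQIAICGMS TWSKRSLSQE 60"

translated = translate(clean(first_row_dna))
matches = clean(first_row_protein).startswith(translated)

# A 558-nucleotide coding sequence holds 186 codons; the last one is the stop.
cds_length = 558
expected_residues = cds_length // 3 - 1
```

Under these assumptions, `expected_residues` is 185, which agrees with the three full rows of 60 residues plus the final five residues listed for Seq ID NO: 568.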

Seq ID NO: 569 DNA sequence 

Nucleic Acid Accession #: XM_036453.1
Coding sequence: 1..3978

1          11         21         31         41         51
|          |          |          |          |          |

ATGCTGCCCG TGTACCAGGA GGTGAAGCCC AACCCGCTGC AGGACGCGAA CCTCTGCTCA 60 

CGCGTGTTCT TCTGGTGGCT CAATCCCTTG TTTAAAATTG GCCATAAACG GAGATTAGAG 120 

GAAGATGATA TGTATTCAGT GCTGCCAGAA GACCGCTCAC AGCACCTTGG AGAGGAGTTG 180 

CAAGGGTTCT GGGATAAAGA AGTTTTAAGA GCTGAGAATG ACGCACAGAA GCCTTCTTTA 240 

ACAAGAGCAA TCATAAAGTG TTACTGGAAA TCTTATTTAG TTTTGGGAAT TTTTACGTTA 300 

ATTGAGGAAA GTGCCAAAGT AATCCAGCCC ATATTTTTGG GAAAAATTAT TAATTATTTT 360 

GAAAATTATG ATCCCATGGA TTCTGTGGCT TTGAACACAG CGTACGCCTA TGCCACGGTG 420 

CTGACTTTTT GCACGCTCAT TTTGGCTATA CTGCATCACT TATATTTTTA TCACGTTCAG 480 

TGTGCTGGGA TGAGGTTACG AGTAGCCATG TGCCATATGA TTTATCGGAA GGCACTTCGT 540 

CTTAGTAACA TGGCCATGGG GAAGACAACC ACAGGCCAGA TAGTCAATCT GCTGTCCAAT 600 

GATGTGAACA AGTTTGATCA GGTGACAGTG TTCTTACACT TCCTGTGGGC AGGACCACTG 660 

CAGGCGATCG CAGTGACTGC CCTACTCTGG ATGGAGATAG GAATATCGTG CCTTGCTGGG 720 

ATGGCAGTTC TAATCATTCT CCTGCCCTTG CAAAGCTGTT TTGGGAAGTT GTTCTCATCA 780 

CTGAGGAGTA AAACTGCAAC TTTCACGGAT GCCAGGATCA GGACCATGAA TGAAGTTATA 840 

ACTGGTATAA GGATAATAAA AATGTACGCC TGGGAAAAGT CATTTTCAAA TCTTATTACC 900 

AATTTGAGAA AGAAGGAGAT TTCCAAGATT CTGAGAAGTT CCTGCCTCAG GGGGATGAAT 960 

TTGGCTTCGT TTTTCAGTGC AAGCAAAATC ATCGTGTTTG TGACCTTCAC CACCTACGTG 1020 

CTCCTCGGCA GTGTGATCAC AGCCAGCCGC GTGTTCGTGG CAGTGACGCT GTATGGGGCT 1080 

GTGCGGCTGA CGGTTACCCT CTTCTTCCCC TCAGCCATTG AGAGGGTGTC AGAGGCAATC 1140 

GTCAGCATCC GAAGAATCCA GACCTTTTTG CTACTTGATG AGATATCACA GCGCAACCGT 1200 

CAGCTGCCGT CAGATGGTAA AAAGATGGTG CATGTGCAGG ATTTTACTGC TTTTTGGGAT 1260 

AAGGCATCAG AGACCCCAAC TCTACAAGGC CTTTCCTTTA CTGTCAGACC TGGCGAATTG 1320 

TTAGCTGTGG TCGGCCCCGT GGGAGCAGGG AAGTCATCAC TGTTAAGTGC CGTGCTCGGG 1380 

GAATTGGCCC CAAGTCACGG GCTGGTCAGC GTGCATGGAA GAATTGCCTA TGTGTCTCAG 1440 

CAGCCCTGGG TGTTCTCGGG AACTCTGAGG AGTAATATTT TATTTGGGAA GAAATACGAA 1500 

AAGGAACGAT ATGAAAAAGT CATAAAGGCT TGTGCTCTGA AAAAGGATTT ACAGCTGTTG 1560 

GAGGATGGTG ATCTGACTGT GATAGGAGAT CGGGGAACCA CGCTGAGTGG AGGGCAGAAA 1620 

GCACGGGTAA ACCTTGCAAG AGCAGTGTAT CAAGATGCTG ACATCTATCT CCTGGACGAT 1680 

CCTCTCAGTG CAGTAGATGC GGAAGTTAGC AGACACTTGT TCGAACTGTG TATTTGTCAA 1740 

ATTTTGCATG AGAAGATCAC AATTTTAGTG ACTCATCAGT TGCAGTACCT CAAAGCTGCA 1800 

AGTCAGATTC TGATATTGAA AGATGGTAAA ATGGTGCAGA AGGGGACTTA CACTGAGTTC 1860 

CTAAAATCTG GTATAGATTT TGGCTCCCTT TTAAAGAAGG ATAATGAGGA AAGTGAACAA 1920 

CCTCCAGTTC CAGGAACTCC CACACTAAGG AATCGTACCT TCTCAGAGTC TTCGGTTTGG 1980 

TCTCAACAAT CTTCTAGACC CTCCTTGAAA GATGGTGCTC TGGAGAGCCA AGATACAGAG 2040 

AATGTCCCAG TTACACTATC AGAGGAGAAC CGTTCTGAAG GAAAAGTTGG TTTTCAGGCC 2100 

TATAAGAATT ACTTCAGAGC TGGTGCTCAC TGGATTGTCT TCATTTTCCT TATTCTCCTA 2160 

AACACTGCAG CTCAGGTTGC CTATGTGCTT CAAGATTGGT GGCTTTCATA CTGGGCAAAC 2220 

AAACAAAGTA TGCTAAATGT CACTGTAAAT GGAGGAGGAA ATGTAACCGA GAAGCTAGAT 2280 

CTTAACTGGT ACTTAGGAAT TTATTCAGGT TTAACTGTAG CTACCGTTCT TTTTGGCATA 2340 

GCAAGATCTC TATTGGTATT CTACGTCCTT GTTAACTCTT CACAAACTTT GCACAACAAA 2400 

ATGTTTGAGT CAATTCTGAA AGCTCCGGTA TTATTCTTTG ATAGAAATCC AATAGGAAGA 2460 

ATTTTAAATC GTTTCTCCAA AGACATTGGA CACTTGGATG ATTTGCTGCC GCTGACGTTT 2520 

TTAGATTTCA TCCAGACATT GCTACAAGTG GTTGGTGTGG TCTCTGTGGC TGTGGCCGTG 2580 

ATTCCTTGGA TCGCAATACC CTTGGTTCCC CTTGGAATCA TTTTCATTTT TCTTCGGCGA 2640 

TATTTTTTGG AAACGTCAAG AGATGTGAAG CGCCTGGAAT CTACAACTCG GAGTCCAGTG 2700 

TTTTCCCACT TGTCATCTTC TCTCCAGGGG CTCTGGACCA TCCGGGCATA CAAAGCAGAA 2760 

GAGAGGTGTC AGGAACTGTT TGATGCACAC CAGGATTTAC ATTCAGAGGC TTGGTTCTTG 2820 

TTTTTGACAA CGTCCCGCTG GTTCGCCGTC CGTCTGGATG CCATCTGTGC CATGTTTGTC 2880 

ATCATCGTTG CCTTTGGGTC CCTGATTCTG GCAAAAACTC TGGATGCCGG GCAGGTTGGT 2940 

TTGGCACTGT CCTATGCCCT CACGCTCATG GGGATGTTTC AGTGGTGTGT TCGACAAAGT 3000 

GCTGAAGTTG AGAATATGAT GATCTCAGTA GAAAGGGTCA TTGAATACAC AGACCTTGAA 3060 

AAAGAAGCAC CTTGGGAATA TCAGAAACGC CCACCACCAG CCTGGCCCCA TGAAGGAGTG 3120 

ATAATCTTTG ACAATGTGAA CTTCATGTAC AGTCCAGGTG GGCCTCTGGT ACTGAAGCAT 3180 

CTGACAGCAC TCATTAAATC ACAAGAAAAG GTTGGCATTG TGGGAAGAAC CGGAGCTGGA 3240 

AAAAGTTCCC TCATCTCAGC CCTTTTTAGA TTGTCAGAAC CGGAAGGTAA AATTTGGATT 3300 

GATAAGATCT TGACAACTGA AATTGGACTT CACGATTTAA GGAAGAAAAT GTCAATCATA 3360 

CCTCAGGAAC CTGTTTTGTT CACTGGAACA ATGAGGAAAA ACCTGGATCC CTTTAATGAG 3420 

CACACGGATG AGGAACTGTG GAATGCCTTA CAAGAGGTAC AACTTAAAGA AACCATTGAA 3480 

GATCTTCCTG GTAAAATGGA TACTGAATTA GCAGAATCAG GATCCAATTT TAGTGTTGGA 3540 

CAAAGACAAC TGGTGTGCCT TGCCAGGGCA ATTCTCAGGA AAAATCAGAT ATTGATTATT 3600 

GATGAAGCGA CGGCAAATGT GGATCCAAGA ACTGATGAGT TAATACAAAA AAAAATCCGG 3660 

GAGAAATTTG CCCACTGCAC CGTGCTAACC ATTGCACACA GATTGAACAC CATTATTGAC 3720 

AGCGACAAGA TAATGGTTTT AGATTCAGGA AGACTGAAAG AATATGATGA GCCGTATGTT 3780 

TTGCTGCAAA ATAAAGAGAG CCTATTTTAC AAGATGGTGC AACAACTGGG CAAGGCAGAA 3840 

GCCGCTGCCC TCACTGAAAC AGCAAAACAG GTATACTTCA AAAGAAATTA TCCACATATT 3900 

GGTCACACTG ACCACATGGT TACAAACACT TCCAATGGAC AGCCCTCGAC CTTAACTATT 3960 
TTCGAGACAG CACTGTGA 



Seq ID NO: 570 Protein sequence 
Protein Accession #: XP_036453.1 

1          11         21         31         41         51
|          |          |          |          |          |
MLPVYQEVKP NPLQDANLCS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL 60
QGFWDKEVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF 120
ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGMRLRVAM CHMIYRKALR 180
LSNMAMGKTT TGQIVNLLSN DVNKFDQVTV FLHFLWAGPL QAIAVTALLW MEIGISCLAG 240
MAVLIILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNEVI TGIRIIKMYA WEKSFSNLIT 300
NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV LLGSVITASR VFVAVTLYGA 360
VRLTVTLFFP SAIERVSEAI VSIRRIQTFL LLDEISQRNR QLPSDGKKMV HVQDFTAFWD 420
KASETPTLQG LSFTVRPGEL LAVVGPVGAG KSSLLSAVLG ELAPSHGLVS VHGRIAYVSQ 480
QPWVFSGTLR SNILFGKKYE KERYEKVIKA CALKKDLQLL EDGDLTVIGD RGTTLSGGQK 540
ARVNLARAVY QDADIYLLDD PLSAVDAEVS RHLFELCICQ ILHEKITILV THQLQYLKAA 600
SQILILKDGK MVQKGTYTEF LKSGIDFGSL LKKDNEESEQ PPVPGTPTLR NRTFSESSVW 660
SQQSSRPSLK DGALESQDTE NVPVTLSEEN RSEGKVGFQA YKNYFRAGAH WIVFIFLILL 720
NTAAQVAYVL QDWWLSYWAN KQSMLNVTVN GGGNVTEKLD LNWYLGIYSG LTVATVLFGI 780
ARSLLVFYVL VNSSQTLHNK MFESILKAPV LFFDRNPIGR ILNRFSKDIG HLDDLLPLTF 840
LDFIQTLLQV VGVVSVAVAV IPWIAIPLVP LGIIFIFLRR YFLETSRDVK RLESTTRSPV 900
FSHLSSSLQG LWTIRAYKAE ERCQELFDAH QDLHSEAWFL FLTTSRWFAV RLDAICAMFV 960
IIVAFGSLIL AKTLDAGQVG LALSYALTLM GMFQWCVRQS AEVENMMISV ERVIEYTDLE 1020
KEAPWEYQKR PPPAWPHEGV IIFDNVNFMY SPGGPLVLKH LTALIKSQEK VGIVGRTGAG 1080
KSSLISALFR LSEPEGKIWI DKILTTEIGL HDLRKKMSII PQEPVLFTGT MRKNLDPFNE 1140
HTDEELWNAL QEVQLKETIE DLPGKMDTEL AESGSNFSVG QRQLVCLARA ILRKNQILII 1200
DEATANVDPR TDELIQKKIR EKFAHCTVLT IAHRLNTIID SDKIMVLDSG RLKEYDEPYV 1260
LLQNKESLFY KMVQQLGKAE AAALTETAKQ VYFKRNYPHI GHTDHMVTNT SNGQPSTLTI 1320
FETAL 

Seq ID NO: 571 DNA sequence 
Nucleic Acid Accession #: AF07120
Coding sequence: 116..4093

1          11         21         31         41         51
|          |          |          |          |          |

GGACAGGCGT GGCGGCCGGA GCCCCAGCAT CCCTGCTTGA GGTCCAGGAG CGGAGCCCGC 60 
GGCCACCGCC GCCTGATCAG CGCGACCCCG GCCCGCGCCC GCCCCGCCCG GCAAGATGCT 120 
GCCCGTGTAC CAGGAGGTGA AGCCCAACCC GCTGCAGGAC GCGAACATCT GCTCACGCGT 180 
GTTCTTCTGG TGGCTCAATC CCTTGTTTAA AATTGGCCAT AAACGGAGAT TAGAGGAAGA 240 
TGATATGTAT TCAGTGCTGC CAGAAGACCG CTCACAGCAC CTTGGAGAGG AGTTGCAAGG 300 
GTTCTGGGAT AAAGAAGTTT TAAGAGCTGA GAATGACGCA CAGAAGCCTT CTTTAACAAG 360 
AGCAATCATA AAGTGTTACT GGAAATCTTA TTTAGTTTTG GGAATTTTTA CGTTAATTGA 420 
GGAAAGTGCC AAAGTAATCC AGCCCATATT TTTGGGAAAA ATTATTAATT ATTTTGAAAA 480 
TTATGATCCC ATGGATTCTG TGGCTTTGAA CACAGCGTAC GCCTATGCCA CGGTGCTGAC 540 
TTTTTGCACG CTCATTTTGG CTATACTGCA TCACTTATAT TTTTATCACG TTCAGTGTGC 600 
TGGGATGAGG TTACGAGTAG CCATGTGCCA TATGATTTAT CGGAAGGCAC TTCGTCTTAG 660 
TAACATGGCC ATGGGGAAGA CAACCACAGG CCAGATAGTC AATCTGCTGT CCAATGATGT 720 
GAACAAGTTT GATCAGGTGA CAGTGTTCTT ACACTTCCTG TGGGCAGGAC CACTGCAGGC 780 
GATCGCAGTG ACTGCCCTAC TCTGGATGGA GATAGGAATA TCGTGCCTTG CTGGGATGGC 840 
AGTTCTAATC ATTCTCCTGC CCTTGCAAAG CTGTTTTGGG AAGTTGTTCT CATCACTGAG 900 
GAGTAAAACT GCAACTTTCA CGGATGCCAG GATCAGGACC ATGAATGAAG TTATAACTGG 960 
TATAAGGATA ATAAAAATGT ACGCCTGGGA AAAGTCATTT TCAAATCTTA TTACCAATTT 1020 
GAGAAAGAAG GAGATTTCCA AGATTCTGAG AAGTTCCTGC CTCAGGGGGA TGAATTTGGC 1080 
TTCGTTTTTC AGTGCAAGCA AAATCATCGT GTTTGTGACC TTCACCACCT ACGTGCTCCT 1140 
CGGCAGTGTG ATCACAGCCA GCCGCGTGTT CGTGGCAGTG ACGCTGTATG GGGCTGTGCG 1200 
GCTGACGGTT ACCCTCTTCT TCCCCTCAGC CATTGAGAGG GTGTCAGAGG CAATCGTCAG 1260 
CATCCGAAGA ATCCAGACCT TTTTGCTACT TGATGAGATA TCACAGCGCA ACCGTCAGCT 1320 
GCCGTCAGAT GGTAAAAAGA TGGTGCATGT GCAGGATTTT ACTGCTTTTT GGGATAAGGC 1380 
ATCAGAGACC CCAACTCTAC AAGGCCTTTC CTTTACTGTC AGACCTGGCG AATTGTTAGC 1440 
TGTGGTCGGC CCCGTGGGAG CAGGGAAGTC ATCACTGTTA AGTGCCGTGC TCGGGGAATT 1500 
GGCCCCAAGT CACGGGCTGG TCAGCGTGCA TGGAAGAATT GCCTATGTGT CTCAGCAGCC 1560 
CTGGGTGTTC TCGGGAACTC TGAGGAGTAA TATTTTATTT GGGAAGAAAT ATGAAAAGGA 1620 
ACGATATGAA AAAGTCATAA AGGCTTGTGC TCTGAAAAAG GATTTACAGC TGTTGGAGGA 1680 
TGGTGATCTG ACTGTGATAG GAGATCGGGG AACCACGCTG AGTGGAGGGC AGAAAGCACG 1740 
GGTAAACCTT GCAAGAGCAG TGTATCAAGA TGCTGACATC TATCTCCTGG ACGATCCTCT 1800 
CAGTGCAGTA GATGCGGAAG TTAGCAGACA CTTGTTCGAA CTGTGTATTT GTCAAATTTT 1860 
GCATGAGAAG ATCACAATTT TAGTGACTCA TCAGTTGCAG TACCTCAAAG CTGCAAGTCA 1920 
GATTCTGATA TTGAAAGATG GTAAAATGGT GCAGAAGGGG ACTTACACTG AGTTCCTAAA 1980 
ATCTGGTATA GATTTTGGCT CCCTTTTAAA GAAGGATAAT GAGGAAAGTG AACAACCTCC 2040 
AGTTCCAGGA ACTCCCACAC TAAGGAATCG TACCTTCTCA GAGTCTTCGG TTTGGTCTCA 2100 
ACAATCTTCT AGACCCTCCT TGAAAGATGG TGCTCTGGAG AGCCAAGATA CAGAGAATGT 2160 
CCCAGTTACA CTATCAGAGG AGAACCGTTC TGAAGGAAAA GTTGGTTTTC AGGCCTATAA 2220 
GAATTACTTC AGAGCTGGTG CTCACTGGAT TGTCTTCATT TTCCTTATTC TCCTAAACAC 2280 
TGCAGCTCAG GTTGCCTATG TGCTTCAAGA TTGGTGGCTT TCATACTGGG CAAACAAACA 2340 
AAGTATGCTA AATGTCACTG TAAATGGAGG AGGAAATGTA ACCGAGAAGC TAGATCTTAA 2400 
CTGGTACTTA GGAATTTATT CAGGTTTAAC TGTAGCTACC GTTCTTTTTG GCATAGCAAG 2460 
ATCTCTATTG GTATTCTACG TCCTTGTTAA CTCTTCACAA ACTTTGCACA ACAAAATGTT 2520 
TGAGTCAATT CTGAAAGCTC CGGTATTATT CTTTGATAGA AATCCAATAG GAAGAATTTT 2580 
AAATCGTTTC TCCAAAGACA TTGGACACTT CGATGATTTG CTGCCGCTGA CGTTTTTAGA 2640 
TTTCATCCAG ACATTGCTAC AAGTGGTTGG TGTGGTCTCT GTGGCTGTGG CCGTGATTCC 2700 
TTGGATCGCA ATACCCTTGG TTCCCCTTGG AATCATTTTC ATTTTTCTTC GGCGATATTT 2760 
TTTGGAAACG TCAAGAGATG TGAAGCGCCT GGAATCTACA ACTCGGAGTC CAGTGTTTTC 2820 
CCACTTGTCA TCTTCTCTCC AGGGGCTCTG GACCATCCGG GCATACAAAG CAGAAGAGAG 2880 
GTGTCAGGAA CTGTTTGATG CACACCAGGA TTTACATTCA GAGGCTTGGT TCTTGTTTTT 2940 
GACAACGTCC CGCTGGTTCG CCGTCCGTCT GGATGCCATC TGTGCCATGT TTGTCATCAT 3000 
CGTTGCCTTT GGGTCCCTGA TTCTGGCAAA AACTCTGGAT GCCGGGCAGG TTGGTTTGGC 3060 
ACTGTCCTAT GCCCTCACGC TCATGGGGAT GTTTCAGTGG TGTGTTCGAC AAAGTGCTGA 3120 
AGTTGAGAAT ATGATGATCT CAGTAGAAAG GGTCATTGAA TACACAGACC TTGAAAAAGA 3180 
AGCACCTTGG GAATATCAGA AACGCCCACC ACCAGCCTGG CCCCATGAAG GAGTGATAAT 3240 
CTTTGACAAT GTGAACTTCA TGTACAGTCC AGGTGGGCCT CTGGTACTGA AGCATCTGAC 3300 
AGCACTCATT AAATCACAAG AAAAGGTTGG CATTGTGGGA AGAACCGGAG CTGGAAAAAG 3360 
TTCCCTCATC TCAGCCCTTT TTAGATTGTC AGAACCCGAA GGTAAAATTT GGATTGATAA 3420 
GATCTTGACA ACTGAAATTG GACTTCACGA TTTAAGGAAG AAAATGTCAA TCATACCTCA 3480 
GGAACCTGTT TTGTTCACTG GAACAATGAG GAAAAACCTG GATCCCTTTA AGGAGCACAC 3540 
GGATGAGGAA CTGTGGAATG CCTTACAAGA GGTACAACTT AAAGAAACCA TTGAAGATCT 3600 
TCCTGGTAAA ATGGATACTG AATTAGCAGA ATCAGGATCC AATTTTAGTG TTGGACAAAG 3660
ACAACTGGTG TGCCTTGCCA GGGCAATTCT CAGGAAAAAT CAGATATTGA TTATTGATGA 3720
AGCGACGGCA AATGTGGATC CAAGAACTGA TGAGTTAATA CAAAAAAAAA TCCGGGAGAA 3780
ATTTGCCCAC TGCACCGTGC TAACCATTGC ACACAGATTG AACACCATTA TTGACAGCGA 3840
CAAGATAATG GTTTTAGATT CAGGAAGACT GAAAGAATAT GATGAGCCGT ATGTTTTGCT 3900
GCAAAATAAA GAGAGCCTAT TTTACAAGAT GGTGCAACAA CTGGGCAAGG CAGAAGCCGC 3960
TGCCCTCACT GAAACAGCAA AACAGGTATA CTTCAAAAGA AATTATCCAC ATATTGGTCA 4020
CACTGACCAC ATGGTTACAA ACACTTCCAA TGGACAGCCC TCGACCTTAA CTATTTTCGA 4080
GACAGCACTG TGAATCCAAC CAAAATGTCA AGTCCGTTCC GAAGGCATTT TCCACTAGTT 4140
TTTGGACTAT GTAAACCACA TTGTACTTTT TTTTACTTTG GCAACAAATA TTTATACATA 4200
CAAGATGCTA GTTCATTTGA ATATTTCTCC C



Seq ID NO: 572 Protein sequence 
Protein Accession #: AAC27076.1 



1          11         21         31         41         51
|          |          |          |          |          |
MLPVYQEVKP NPLQDANICS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL 60
QGFWDKEVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF 120
ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGMRLRVAM CHMIYRKALR 180
LSNMAMGKTT TGQIVNLLSN DVNKFDQVTV FLHFLWAGPL QAIAVTALLW MEIGISCLAG 240
MAVLIILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNEVI TGIRIIKMYA WEKSFSNLIT 300
NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV LLGSVITASR VFVAVTLYGA 360
VRLTVTLFFP SAIERVSEAI VSIRRIQTFL LLDEISQRNR QLPSDGKKMV HVQDFTAFWD 420
KASETPTLQG LSFTVRPGEL LAVVGPVGAG KSSLLSAVLG ELAPSHGLVS VHGRIAYVSQ 480
QPWVFSGTLR SNILFGKKYE KERYEKVIKA CALKKDLQLL EDGDLTVIGD RGTTLSGGQK 540
ARVNLARAVY QDADIYLLDD PLSAVDAEVS RHLFELCICQ ILHEKITILV THQLQYLKAA 600
SQILILKDGK MVQKGTYTEF LKSGIDFGSL LKKDNEESEQ PPVPGTPTLR NRTFSESSVW 660
SQQSSRPSLK DGALESQDTE NVPVTLSEEN RSEGKVGFQA YKNYFRAGAH WIVFIFLILL 720
NTAAQVAYVL QDWWLSYWAN KQSMLNVTVN GGGNVTEKLD LNWYLGIYSG LTVATVLFGI 780
ARSLLVFYVL VNSSQTLHNK MFESILKAPV LFFDRNPIGR ILNRFSKDIG HLDDLLPLTF 840
LDFIQTLLQV VGVVSVAVAV IPWIAIPLVP LGIIFIFLRR YFLETSRDVK RLESTTRSPV 900
FSHLSSSLQG LWTIRAYKAE ERCQELFDAH QDLHSEAWFL FLTTSRWFAV RLDAICAMFV 960
IIVAFGSLIL AKTLDAGQVG LALSYALTLM GMFQWCVRQS AEVENMMISV ERVIEYTDLE 1020
KEAPWEYQKR PPPAWPHEGV IIFDNVNFMY SPGGPLVLKH LTALIKSQEK VGIVGRTGAG 1080
KSSLISALFR LSEPEGKIWI DKILTTEIGL HDLRKKMSII PQEPVLFTGT MRKNLDPFKE 1140
HTDEELWNAL QEVQLKETIE DLPGKMDTEL AESGSNFSVG QRQLVCLARA ILRKNQILII 1200
DEATANVDPR TDELIQKKIR EKFAHCTVLT IAHRLNTIID SDKIMVLDSG RLKEYDEPYV 1260
LLQNKESLFY KMVQQLGKAE AAALTETAKQ VYFKRNYPHI GHTDHMVTNT SNGQPSTLTI 1320
FETAL



Seq ID NO: 573 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1365

1          11         21         31         41         51
|          |          |          |          |          |
ATGGAATCAA TCTCTATGAT GGGAAGCCCT AAGAGCCTTA GTGAAACTTG TTTACCTAAT 60
GGCATAAATG GTATCAAAGA TGCAAGGAAG GTCACTGTAG GTGTGATTGG AAGTGGAGAT 120
TTTGCCAAAT CCTTGACCAT TCGACTTATT AGATGCGGCT ATCATGTGGT CATAGGAAGT 180
AGAAATCCTA AGTTTGCTTC TGAATTTTTT CCTCATGTGG TAGATGTCAC TCATCATGAA 240
GATGCTCTCA CAAAAACAAA TATAATATTT GTTGCTATAC ACAGAGAACA TTATACCTCC 300
CTGTGGGACC TGAGACATCT GCTTGTGGGT AAAATCCTGA TTGATGTGAG CAATAACATG 360
AGGATAAACC AGTACCCAGA ATCCAATGCT GAATATTTGG CTTCATTATT CCCAGATTCT 420
TTGATTGTCA AAGGATTTAA TGTTGTCTCA GCTTGGGCAC TTCAGTTAGG ACCTAAGGAT 480
GCCAGCCGGC AGGTTTATAT ATGCAGCAAC AATATTCAAG CGCGACAACA GGTTATTGAA 540
CTTGCCCGCC AGTTGAATTT CATTCCCATT GACTTGGGAT CCTTATCATC AGCCAGAGAG 600
ATTGAAAATT TACCCCTACG ACTCTTTACT CTCTGGAGAG GGCCAGTGGT GGTAGCTATA 660
AGCTTGGCCA CATTTTTTTT CCTTTATTCC TTTGTCAGAG ATGTGATTCA TCCATATGCT 720
AGAAACCAAC AGAGTGACTT TTACAAAATT CCTATAGAGA TTGTGAATAA AACCTTACCT 780
ATAGTTGCCA TTACTTTGCT CTCCCTAGTA TACCTCGCAG GTCTTCTGGC AGCTGCTTAT 840
CAACTTTATT ACGGCACCAA GTATAGGAGA TTTCCACCTT GGTTGGAAAC CTGGTTACAG 900
TGTAGAAAAC AGCTTGGATT ACTAAGTTTT TTCTTCGCTA TGGTCCATGT TGCCTACAGC 960
CTCTGCTTAC CGATGAGAAG GTCAGAGAGA TATTTGTTTC TCAACATGGC TTATCAGCAG 1020
GTTCATGCAA ATATTGAAAA CTCTTGGAAT GAGGAAGAAG TTTGGAGAAT TGAAATGTAT 1080
ATCTCCTTTG GCATAATGAG CCTTGGCTTA CTTTCCCTCC TGGCAGTCAC TTCTATCCCT 1140
TCAGTGAGCA ATGCTTTAAA CTGGAGAGAA TTCAGTTTTA TTCAGTCTAC ACTTGGATAT 1200
GTCGCTCTGC TCATAAGTAC TTTCCATGTT TTAATTTATG GATGGAAACG AGCTTTTGAG 1260
GAAGAGTACT ACAGATTTTA TACACCACCA AACTTTGTTC TTGCTCTTGT TTTGCCCTCA 1320
ATTGTAATTC TGGATCTTTT GCAGCTTTGC AGATACCCAG ACTGA



Seq ID NO: 574 Protein sequence 
Protein Accession #: Eos sequence 



1          11         21         31         41         51
|          |          |          |          |          |
MESISMMGSP KSLSETCLPN GINGIKDARK VTVGVIGSGD FAKSLTIRLI RCGYHVVIGS 60
RNPKFASEFF PHVVDVTHHE DALTKTNIIF VAIHREHYTS LWDLRHLLVG KILIDVSNNM 120
RINQYPESNA EYLASLFPDS LIVKGFNVVS AWALQLGPKD ASRQVYICSN NIQARQQVIE 180
LARQLNFIPI DLGSLSSARE IENLPLRLFT LWRGPVVVAI SLATFFFLYS FVRDVIHPYA 240
RNQQSDFYKI PIEIVNKTLP IVAITLLSLV YLAGLLAAAY QLYYGTKYRR FPPWLETWLQ 300
CRKQLGLLSF FFAMVHVAYS LCLPMRRSER YLFLNMAYQQ VHANIENSWN EEEVWRIEMY 360
ISFGIMSLGL LSLLAVTSIP SVSNALNWRE FSFIQSTLGY VALLISTFHV LIYGWKRAFE 420
EEYYRFYTPP NFVLALVLPS IVILDLLQLC RYPD

Seq ID NO: 575 DNA sequence
Nucleic Acid Accession #: NM_001873
Coding sequence: 3..1721

1          11         21         31         41         51
|          |          |          |          |          |

AAATGGCGTG CCCGTCTCTC CGCCGGCCCC CTGCCTCGCA GTGGTTTCTC CTGCAGCTCC 60 

CCTGGGCTCC GCGGCCAGTA GTGCAGCCCG TGGAGCCGCG GCTTTGCCCG TCTCCTCTGG 120 

GTGGCCCCAG TGCGCGGGCT GACACTCATT CAGCCGGGGA AGGTGAGGCG AGTAGAGGCT 180 

GGTGCGGAAC TTGCCGCCCC CAGCAGCGCC GGCGGGCTAA GCCCAGGGCC GGGCAGACAA 240 

AAGAGGCCGC CCGCGTAGGA AGGCACGGCC GGCGGCGGCG GAGCGCAGCG ATGGCCGGGC 300 

GAGGGGGCAG CGCGCTGCTG GCTCTGTGCG GGGCACTGGC TGCCTGCGGG TGGCTCCTGG 360 

GCGCCGAAGC CCAGGAGCCC GGGGCGCCCG CGGCGGGCAT GAGGCGGCGC CGGCGGCTGC 420 

AGCAAGAGGA CGGCATCTCC TTCGAGTACC ACCGCTACCC CGAGCTGCGC GAGGCGCTCG 480 

TGTCCGTGTG GCTGCAGTGC ACCGCCATCA GCAGGATTTA CACGGTGGGG CGCAGCTTCG 540 

AGGGCCGGGA GCTCCTGGTC ATCGAGCTGT CCGACAACCC TGGCGTCCAT GAGCCTGGTG 600 

AGCCTGAATT TAAATACATT GGGAATATGC ATGGGAATGA GGCTGTTGGA CGAGAACTGC 660 

TCATTTTCTT GGCCCAGTAC CTATGCAACG AATACCAGAA GGGGAACGAG ACAATTGTCA 720 

ACCTGATCCA CAGTACCCGC ATTCACATCA TGCCTTCCCT GAACCCAGAT GGCTTTGAGA 780 

AGGCAGCGTC TCAGCCTGGT GAACTCAAGG ACTGGTTTGT GGGTCGAAGC AATGCCCAGG 840 

GAATAGATCT GAACCGGAAC TTTCCAGACC TGGATAGGAT AGTGTACGTG AATGAGAAAG 900 

AAGGTGGTCC AAATAATCAT CTGTTGAAAA ATATGAAGAA AATTGTGGAT CAAAACACAA 960 

AGCTTGCTCC TGAGACCAAG GCTGTCATTC ATTGGATTAT GGATATTCCT TTTGTGCTTT 1020 

CTGCCAATCT CCATGGAGGA GACCTTGTGG CCAATTATCC ATATGATGAG ACGCGGAGTG 1080 

GTAGTGCTCA CGAATACAGC TCCTCCCCAG ATGACGCCAT TTTCCAAAGC TTGGCCCGGG 1140 

CATACTCTTC TTTCAACCCG GCCATGTCTG ACCCCAATCG GCCACCATGT CGCAAGAATG 1200 

ATGATGACAG CAGCTTTGTA GATGGAACCA CCAACGGTGG TGCTTGGTAC AGCGTACCTG 1260 

GAGGGATGCA AGACTTCAAT TACCTTAGCA GCAACTGTTT TGAGATCACC GTGGAGCTTA 1320 

GCTGTGAGAA GTTCCCACCT GAAGAGACTC TGAAGACCTA CTGGGAGGAT AACAAAAACT 1380 

CCCTCATTAG CTACCTTGAG CAGATACACC GAGGAGTTAA AGGATTTGTC CGAGACCTTC 1440 

AAGGTAACCC AATTGCGAAT GCCACCATCT CCGTGGAAGG AATAGACCAC GATGTTACAT 1500 

CCGCAAAGGA TGGTGATTAC TGGAGATTGC TTATACCTGG AAACTATAAA CTTACAGCCT 1560 

CAGCTCCAGG CTATCTGGCA ATAACAAAGA AAGTGGCAGT TCCTTACAGC CCTGCTGCTG 1620 

GGGTTGATTT TGAACTGGAG TCATTTTCTG AAAGGAAAGA AGAGGAGAAG GAAGAATTGA 1680 

TGGAATGGTG GAAAATGATG TCAGAAACTT TAAATTTTTA AAAAGGCTTC TAGTTAGCTG 1740 

CTTTAAATCT ATCTATATAA TGTAGTATGA TGTAATGTGG TCTTTTTTTT AGATTTTGTG 1800 

CAGTTAATAC TTAACATTGA TTTATTTTTT AATCATTTAA ATATTAATCA ACTTTCCTTA 1860 

AAATAAATAG CCTCTTAGGT AAAAATATAA GAACTTGATA TATTTCATTC TCTTATATAG 1920 

TATTCATTTT CCTACCTATA TTACACAAAA AAGTATAGAA AAGATTTAAG TAATTTTGCC 1980 

ATCCTAGGCT TAAATGCAAT ATTCCTGGTA TTATTTACAA TGCAGAATTT TTTGAGTAAT 2040 

TCTAGCTTTC AAAAATTAGT GAAGTTCTTT TACTGTAATT GGTGACAATG TCACATAATG 2100 

AATGCTATTG AAAAGGTTAA CAGATACAGC TCGGAGTTGT GAGCACTCTA CTGCAAGACT 2160 

TAAATAGTTC AGTATAAATT GTCGTTTTTT TCTTGTGCTG ACTAACTATA AGCATGATCT 2220 

TGTTAATGCA TTTTTGATGG GAAGAAAAGG TACATGTTTA CAAAGAGGTT TTATGAAAAG 2280 

AATAAAAATT GACTTCTTGC TTGTACATAT AGGAGCAATA CTATTATATT ATGTAGTCCG 2340 

TTAACACTAC TTAAAAGTTT AGGGTTTTCT CTTGGTTGTA GAGTGGCCCA GAATTGCATT 2400 
CTGAATGAAT AAAGGTTAAA AAAAAATCCC CAGTGAAAAA AAA 



Seq ID NO: 576 Protein sequence 
Protein Accession #: NP_001864.1 



1          11         21         31         41         51
|          |          |          |          |          |
MAGRGGSALL ALCGALAACG WLLGAEAQEP GAPAAGMRRR RRLQQEDGIS FEYHRYPELR 60
EALVSVWLQC TAISRIYTVG RSFEGRELLV IELSDNPGVH EPGEPEFKYI GNMHGNEAVG 120
RELLIFLAQY LCNEYQKGNE TIVNLIHSTR IHIMPSLNPD GFEKAASQPG ELKDWFVGRS 180
NAQGIDLNRN FPDLDRIVYV NEKEGGPNNH LLKNMKKIVD QNTKLAPETK AVIHWIMDIP 240
FVLSANLHGG DLVANYPYDE TRSGSAHEYS SSPDDAIFQS LARAYSSFNP AMSDPNRPPC 300
RKNDDDSSFV DGTTNGGAWY SVPGGMQDFN YLSSNCFEIT VELSCEKFPP EETLKTYWED 360
NKNSLISYLE QIHRGVKGFV RDLQGNPIAN ATISVEGIDH DVTSAKDGDY WRLLIPGNYK 420
LTASAPGYLA ITKKVAVPYS PAAGVDFELE SFSERKEEEK EELMEWWKMM SETLNF



Seq ID NO: 577 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..933

1          11         21         31         41         51
|          |          |          |          |          |
ATGTGCAGCA ATGGACGGTG CATCCCGGGC GCCTGGCAGT GTGACGGGCT GCCTGACTGC 60
TTCGACAAGA GTGATGAGAA GGAGTGCCCC AAGGCTAAGT CGAAATGTGG CCCGACCTTC 120
TTCCCCTGTG CCAGCGGCAT CCATTGCATC ATTGGTCGCT TCCGGTGCAA TGGGTTTGAG 180
GACTGTCCCG ATGGCAGCGA TGAAGAGAAC TGCACAGCAA ACCCTCTGCT TTGCTCCACC 240
GCCCGCTACC ACTGCAAGAA CGGCCTCTGT ATTGACAAGA GCTTCATCTG GGATGGACAG 300
AATAACTGTC AAGACAACAG TGATGAGGAA AGCTGTGAAA GTTCTCAAGA ACCCGGCAGT 360
GGGCAGGTGT TTGTGACTTC AGAGAACCAA CTTGTGTATT ACCCCAGCAT CACCTATGCC 420
ATCATCGGCA GCTCCGTCAT TTTTGTGCTG GTGGTGGCCC TGCTGGCACT GGTCTTGCAC 480
CACCAGCGGA AGCGGAACAA CCTCATGACG CTGCCCGTGC ACCGGCTGCA GCACCCTGTG 540
CTGCTGTCCC GCCTGGTGGT CCTGGACCAC CCCCACCACT GCAACGTCAC CTACAACGTC 600
AATAATGGCA TCCAGTATGT GGCCAGCCAG GCGGAGCAGA ATGCGTCGGA AGTAGGCTCC 660
CCACCCTCCT ACTCCGAGGC CTTGCTGGAC CAGAGGCCTG CGTGGTATGA CCTTCCTCCA 720
CCGCCCTACT CTTCTGACAC GGAATCTCTG AACCAAGCCG ACCTGCCCCC CTACCGCTCC 780
CGGTCCGGGA GTGCCAACAG TGCCAGCTCC CAGGCAGCCA GCAGCCTCCT GAGCGTGGAA 840
GACACCAGCC ACAGCCCGGG GCAGCCTGGC CCCCAGGAGG GCACTGCTGA GCCCAGGGAC 900
TCTGAGCCCA GCCAGGGCAC TGAAGAAGTA TAA 

Seq ID NO: 578 Protein sequence 
Protein Accession #: Eos sequence 

1          11         21         31         41         51
|          |          |          |          |          |
MCSNGRCIPG AWQCDGLPDC FDKSDEKECP KAKSKCGPTF FPCASGIHCI IGRFRCNGFE 60
DCPDGSDEEN CTANPLLCST ARYHCKNGLC IDKSFICDGQ NNCQDNSDEE SCESSQEPGS 120
GQVFVTSENQ LVYYPSITYA IIGSSVIFVL VVALLALVLH HQRKRNNLMT LPVHRLQHPV 180
LLSRLVVLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP 240
PPYSSDTESL NQADLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD 300
SEPSQGTEEV 

Seq ID NO: 579 DNA sequence 
Nucleic Acid Accession #: AF179274.1 
Coding sequence: 1..1125 

1          11         21         31         41         51
|          |          |          |          |          |

ATGGTGCTGT GGGAGTCCCC GCGGCAGTGC AGCAGCTGGA CACTTTGCGA GGGCTTTTGC 60 

TGGCTGCTGC TGCTGCCCGT CATGCTACTC ATCGTAGCCC GCCCGGTGAA GCTCGCTGCT 120 

TTCCCTACCT CCTTAAGTGA CTGCCAAACG CCCACCGGCT GGAATTGCTC TGGTTATGAT 180 

GACAGAGAAA ATGATCTCTT CCTCTGTGAC ACCAACACCT GTAAATTTGA TGGGGAATGT 240 

TTAAGAATTG GAGACACTGT GACTTGCGTC TGTCAGTTCA AGTGCAACAA TGACTATGTG 300 

CCTGTGTGTG GCTCCAATGG GGAGAGCTAC CAGAATGAGT GTTACCTGCG ACAGGCTGCA 360 

TGCAAACAGC AGAGTGAGAT ACTTGTGGTG TCAGAAGGAT CATGTGCCAC AGATGCAGGA 420 

TCAGGATCTG GAGATGGAGT CCATGAAGGC TCTGGAGAAA CTAGTCAAAA GGAGACATCC 480 

ACCTGTGATA TTTGCCAGTT TGGTGCAGAA TGTGACGAAG ATGCCGAGGA TGTCTGGTGT 540 

GTGTGTAATA TTGACTGTTC TCAAACCAAC TTCAATCCCC TCTGCGCTTC TGATGGGAAA 600 

TCTTATGATA ATGCATGCCA AATCAAAGAA GCATCGTGTC AGAAACAGGA GAAAATTGAA 660 

GTCATGTCTT TGGGTCGATG TCAAGATAAC ACAACTACAA CTACTAAGTC TGAAGATGGG 720 
CATTATGCAA GAACAGATTA TGCAGAGAAT GCTAACAAAT TAGAAGAAAG TGCCAGAGAA 780

CACCACATAC CTTGTCCGGA ACATTACAAT GGCTTCTGCA TGCATGGGAA GTGTGAGCAT 840 

TCTATCAATA TGCAGGAGCC ATCTTGCAGG TGTGATGCTG GTTATACTGG ACAACACTGT 900 

GAAAAAAAGG ACTACAGTGT TCTATACGTT GTTCCCGGTC CTGTACGATT TCAGTATGTC 960 

TTAATCGCAG CTGTGATTGG AACAATTCAG ATTGCTGTCA TCTGTGTGGT GGTCCTCTGC 1020 

ATCACAAGGA AATGCCCCAG AAGCAACAGA ATTCACAGAC AGAAGCAAAA TACAGGGCAC 1080 
TACAGTTCAG ACAATACAAC AAGAGCGTCC ACGAGGTTAA TCTGA 

Seq ID NO: 580 Protein sequence 
Protein Accession #: NP_057276.2 

1 11 21 31 41 51 

I I I I I I

MVLWESPRQC SSWTLCEGFC WLLLLPVMLL IVARPVKLAA FPTSLSDCQT PTGWNCSGYD 60 

DRENDLFLCD TNTCKFDGEC LRIGDTVTCV CQFKCNNDYV PVCGSNGESY QNECYLRQAA 120 

CKQQSEILVV SEGSCATDAG SGSGDGVHEG SGETSQKETS TCDICQFGAE CDEDAEDVWC 180

VCNIDCSQTN FNPLCASDGK SYDNACQIKE ASCQKQEKIE VMSLGRCQDN TTTTTKSEDG 240 

HYARTDYAEN ANKLEESARE HHIPCPEHYN GFCMHGKCEH SINMQEPSCR CDAGYTGQHC 300 

EKKDYSVLYV VPGPVRFQYV LIAAVIGTIQ IAVICVVVLC ITRKCPRSNR IHRQKQNTGH 360
YSSDNTTRAS TRLI 

Seq ID NO: 581 DNA sequence
Nucleic Acid Accession #: S78203.1
Coding sequence: 1..2190

1 11 21 31 41 51

I I I I I I

ATGAATCCTT TCCAGAAAAA TGAGTCCAAG GAAACTCTTT TTTCACCTGT CTCCATTGAA 60 

GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120 

AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180 

TATGGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240 

ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300 

GCAGCCATTG CTGACTCGTG GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360 

TATGTGCTTG GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420 

GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480 

AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540 

ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT 600 

ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA TGCATTGGCT 660 

TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 720 

ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 780 

TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 840 

CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 900 

AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960 

TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 1020 

CCGGACCAGA TGCAGGTTCT AAATCCCTTT CTGGTTCTTA TCTTCATCCC GTTGTTTGAC 1080 

TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATCACT TAGGAAAATG 1140 

GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG CGGCAGCTGT AGAGATAAAA 1200 

ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260 

CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320 

GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380 

AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440 

GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC 1500 






WO 03/042661 



PCT/US02/36810 






ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560

AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 1620

GAAGACTATG GTGTGTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 1680

TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740

TATCTGTTTG TTATTACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 1800

ATTCCAGCCA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860

GGGGAGGTCA TGTTCTCTGT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920

ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 1980

CTTGTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040

CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100

ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160
AAACTAGAGA CCAAGAAGAC AAAACTCTGA



Seq ID NO: 582 Protein sequence
Protein Accession #: AAB34388.1

1 11 21 31 41 51

I I I I I I

MNPFQKNESK ETLFSPVSIE EVPPRPPSPP KKPSPTICGS NYPLSIAFIV VNEFCERFSY 60
YGMKAVLILY FLYFLHWNED TSTSIYHAFS SLCYFTPILG AAIADSWLGK FKTIIYLSLV 120
YVLGHVIKSL GALPILGGQV VHTVLSLIGL SLIALGTGGI KPCVAAFGGD QFEEKHAEER 180
TRYFSVFYLS INAGSLISTF ITPMLRGDVQ CFGEDCYALA FGVPGLLMVI ALVVFAMGSK 240
IYNKPPPEGN IVAQVFKCIW FAISNRFKNR SGDIPKRQHW LDWAAEKYPK QLIMDVKALT 300
RVLFLYIPLP MFWALLDQQG SRWTLQAIRM NRNLGFFVLQ PDQMQVLNPF LVLIFIPLFD 360
FVIYRLVSKC GINFSSLRKM AVGMILACLA FAVAAAVEIK INEMAPAQSG PQEVFLQVLN 420
LADDEVKVTV VGNENNSLLI ESIKSFQKTP HYSKLHLKTK SQDFHFHLKY HNLSLYTEHS 480
VQEKNWYSLV IREDGNSISS MMVKDTESKT TNGMTTVRFV NTLHKDVNIS LSTDTSLNVG 540
EDYGVSAYRT VQRGEYPAVH CRTEDKNFSL NLGLLDFGAA YLFVITNNTN QGLQAWKIED 600
IPANKMSIAW QLPQYALVTA GEVMFSVTGL EFSYSQAPSS MKSVLQAAWL LTIAVGNIIV 660
LVVAQFSGLV QWAEFILFSC LLLVICLIFS IMGYYYVPVK TEDMRGPADK HIPHIQGNMI 720
KLETKKTKL



Seq ID NO: 583 DNA sequence 

Nucleic Acid Accession #: NM_032642.1 

Coding sequence: 184..1263

1 11 21 31 41 51 

I I I I I I 

GACCATTAGC AGGCACCCAG GCCTGTCTTT GGCTCGGAAA CGGTGGCCCC CAATGTAGCC 60
TAGTTTGAAC CTAGGAACTG CAGGACCAGA GAGATTCCAC TGGAGCCTGA TGGACGGGTG 120
ACAGAGGGAA CCCTACTCTG GAAACTGTCA GTCCCAGGGC ACTGGGGAGG GCTGAGGCCG 180
ACCATGCCCA GCCTGCTGCT GCTGTTCACG GCTGCTCTGC TGTCCAGCTG GGCTCAGCTT 240
CTGACAGACG CCAACTCCTG GTGGTCATTA GCTTTGAACC CGGTGCAGAG ACCCGAGATG 300
TTTATCATCG GTGCCCAGCC OGTGTGCAGT CAGCTTCCCG GGCTCTCCCC TGGCCAGAGG 360
AAGCTGTGCC AATTGTACCA GGAGCACATG GCCTACATAG GGGAGGGAGC CAAGACTGGC 420
ATCAAGGAAT GCCAGCACCA GTTCCGGCAG CGGCGGTGGA ATTGCAGCAC AGCGGACAAC 480
GCATCTGTCT TTGGGAGAGT CATGCAGATA GGCAGCCGAG AGACCGCCTT CACCCACGCG 540
GTGAGCGCCG CGGGCGTGGT CAACGCCATC AGCCGGGCCT GCCGCGAGGG CGAGCTCTCC 600
ACCTGCGGCT GCAGCCGGAC GGCGCGGCCC AAGGACCTGC CCCGGGACTG GCTGTGGGGC 660
GGCTGTGGGG ACAACGTGGA GTACGGCTAC CGCTTCGCCA AGGAGTTTGT GGATGCCCGG 720
GAGCGAGAGA AGAACTTTGC CAAAGGATCA GAGGAGCAGG GCGGGGTGCT CATGAACCTG 780
CAAAACAACG AGGCCGGTCG CAGGGCTGTG TATAAGATGG CAGACGTAGC CTGCAAATGC 840
CACGGCGTCT CGGGGTCCTG CAGCCTCAAG ACCTGCTGGC TGCAGCTGGC CGAGTTCCGC 900
AAGGTCGGGG ACCGGCTGAA GGAGAAGTAC GACAGCGCGG CCGCCATGCG CGTCACCCGC 960
AAGGGCCGGC TGGAGCTGGT CAACAGCCGC TTCACCCAGC CCACCCCGGA GGACCTGGTC 1020
TATGTGGACC CCAGCCCCGA CTACTGCCTG CGCAACGAGA GCACGGGCTC CCTGGGCACG 1080
CAGGGCCGCC TCTGCAACAA GACCTCGGAG GGCATGGATG GCTGTGAGCT CATGTGCTGC 1140
GGGCGTGGCT ACAACCAGTT CAAGAGCGTG CAGGTGGAGC GCTGCCACTG CAAGTTCCAC 1200
TGGTGCTGCT TCGTCAGGTG TAAGAAGTGC ACGGAGATCG TGGACCAGTA CATCTGTAAA 1260
TAGCCCGGAG GGCCTGCTCC CGGCCCCCCC TGCACTCTGC CTCACAAAGG TCTATATTAT 1320
ATAAATCTAT ATAAATCTAT TTTATATTTG TATAAGTAAA TGGGTGGGTG CTATACAATG 1380
GAAAGATGAA AATGGAAAGG AAGAGCTTAT TTAAGAGACG CTGGAGATCT CTGAGGAGTG 1440
GACTTTGCTG GTTCTCTCCT CTTGGTGGGT GGGAGACAGG GCTTTTTCTC TCCCTCTGGC 1500
GAGGACTCTC AGGATGTAGG GACTTGGAAA TATTTACTGT CTGTCCACCA OGGCCTGGAG 1560
GAGGGAGGTT GTGGTTGGAT GGAGGAGATG ATCTTGTCTG GAAGTCTAGA GTCTTTGTTG 1620
GTTAGAGGAC TGCCTGTGAT CCTGGCCACT AGGCCAAGAG GCCCTATGAA GGTGGCGGGA 1680
ACTCAGCTTC AACCTCGATG TCTTCAGGGT CTTGTCCAGA ATGTAGATGG GTTCCGTAAG 1740
AGGCCTGGTG CTCTCTTACT CTTTCATCCA GGTGCACTTG TGCGGCATCT GCAGTTTACA 1800
GGAACGGCTC CTTCCCTAAA ATGAGAAGTC CAAGGTCATC TCTGGCCCAG TGACCACAGA 1860
GAGATCTGCA CCTCCCGGAC TTCAGGCCTG CCTTTCCAGC GAGAATTCTT CATCCTCCAC 1920
GGTTCACTAG CTCCTACCTG AAGAGGAAAG GGGGCCATTT GACCTGACAT GTCAGGAAAG 1980
CCCTAAACTG AATGTTTGCG CCTGGGCTGC AGAAGCCAGG GTGCATGACC AGGCTGCGTG 2040
GACGTTATAC TGTCTTCCCC CACCCCCGGG GAGGGGAAGC TTGAGCTGCT GCTGTCACTC 2100
CTCCACCGAG GGAGGCCTCA CAAACCACAG GACGCTGCAA CGGGTCAGGC TGGCGGGCCC 2160
GGCGTGCTCA TCATCTCTGC CCCAGGTGTA CGGTTTCTCT CTGACATTAA ATGCCCTTCA 2220
TGGAAAAAAA AAAAAGAAAA AAAAAAAAAA AA

Seq ID NO: 584 Protein sequence
Protein Accession #: NP_116031.1

1 11 21 31 41 51

I I I I I I

MPSLLLLFTA ALLSSWAQLL TDANSWWSLA LNPVQRPEMF IIGAQPVCSQ LPGLSPGQRK 60
LCQLYQEHMA YIGEGAKTGI KECQHQFRQR RWNCSTADNA SVFGRVMQIG SRETAFTHAV 120
SAAGVVNAIS RACREGELST CGCSRTARPK DLPRDWLWGG CGDNVEYGYR FAKEFVDARE 180
REKNFAKGSE EQGRVLMNLQ NNEAGRRAVY KMADVACKCH GVSGSCSLKT CWLQLAEFRK 240
VGDRLKEKYD SAAAMRVTRK GRLELVNSRF TQPTPEDLVY VDPSPDYCLR NESTGSLGTQ 300
GRLCNKTSEG MDGCELMCCG RGYNQFKSVQ VERCHCKFHW CCFVRCKKCT EIVDQYICK

Seq ID NO: 585 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1479 

1 11 21 31 41 51 

I I I I I I 

ATGGCTTTGA ACTCAGGGTC ACCACCAGCT ATTGGACCTT ACTATGAAAA CCATGGATAC 60 

CAACCGGAAA ACCCCTATCC CGCACAGCCC ACTGTGGTCC CCACTGTCTA CGAGGTGCAT 120 

CCGGCTCAGT ACTACCCGTC CCCCGTGCCC CAGTACGCCC CGAGGGTCCT GACGCAGGCT 180 

TCCAACCCCG TCGTCTGCAC GCAGCCCAAA TCCCCATCCG GGACAGTGTG CACCTCAAAG 240

ACTAAGAAAG CACTGTGCAT CACCTTGACC CTGGGGACCT TCCTCGTGGG AGCTGCGCTG 300

GCCGCTGGCC TACTCTGGAA GTTCATGGGC AGCAAGTGCT CGAACTCTGG GATAGAGTGC 360 

GACTCCTCAG GTACCTGCAT CAACCCCTCT AACTGGTGTG ATGGCGTGTC ACACTGCCCC 420 

GGCGGGGAGG ACGAGAATCG GTGTGTTCGC CTCTACGGAC CAAACTTCAT CCTTCAGGTG 480 

TACTCATCTC AGAGGAAGTC CTGGCACCCT GTGTGCCAAG ACGACTGGAA CGAGAACTAC 540 

GGGCGGGCGG CCTGCAGGGA CATGGGCTAT AAGAATAATT TTTACTCTAG CCAAGGAATA 600 

GTGGATGACA GCGGATCCAC CAGCTTTATG AAACTGAACA CAAGTGCCGG CAATGTCGAT 660 

ATCTATAAAA AACTGTACCA CAGTGATGCC TGTTCTTCAA AAGCAGTGGT TTCTTTACGC 720 

TGTATAGCCT GCGGGGTCAA CTTGAACTCA AGCCGCCAGA GCAGGATCGT GGGCGGCGAG 780 

AGCGCGCTCC CGGGGGCCTG GCCCTGGCAG GTCAGCCTGC ACGTCCAGAA CGTCCACGTG 840

TGCGGAGGCT CCATCATCAC CCCCGAGTGG ATCGTGACAG CCGCCCACTG CGTGGAAAAA 900 

CCTCTTAACA ATCCATGGCA TTGGACGGCA TTTGCGGGGA TTTTGAGACA ATCTTTCATG 960 

TTCTATGGAG CCGGATACCA AGTAGAAAAA GTGATTTCTC ATCCAAATTA TGACTCCAAG 1020 

ACCAAGAACA ATGACATTGC GCTGATGAAG CTGCAGAAGC CTCTGACTTT CAACGACCTA 1080 

GTGAAACCAG TGTGTCTGCC CAACCCAGGC ATGATGCTGC AGCCAGAACA GCTCTGCTGG 1140 

ATTTCCGGGT GGGGGGCCAC CGAGGAGAAA GGGAAGACCT CAGAAGTGCT GAACGCTGCC 1200 

AAGGTGCTTC TCATTGAGAC ACAGAGATGC AACAGCAGAT ATGTCTATGA CAACCTGATC 1260 

ACACCAGCCA TGATCTGTGC CGGCTTCCTG CAGGGGAACG TCGATTCTTG CCAGGGTGAC 1320 

AGTGGAGGGC CTCTGGTCAC TTCGAAGAAC AATATCTGGT GGCTGATAGG GGATACAAGC 1380

TGGGGTTCTG GCTGTGCCAA AGCTTACAGA CCAGGAGTGT ACGGGAATGT GATGGTATTC 1440 
ACGGACTGGA TTTATCGACA AATGAGGGCA GACGGCTAA 

Seq ID NO: 586 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MALNSGSPPA IGPYYENHGY QPENPYPAQP TVVPTVYEVH PAQYYPSPVP QYAPRVLTQA 60

SNPVVCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC 120

DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQV YSSQRKSWHP VCQDDWNENY 180

GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAVVSLR 240

CIACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300

PLNNPWHWTA FAGILRQSFM FYGAGYQVEK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360

VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420

TPAMICAGFL QGNVDSCQGD SGGPLVTSKN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480
TDWIYRQMRA DG 

Seq ID NO: 587 DNA sequence 

Nucleic Acid Accession #: NM_005656.1

Coding sequence: 57..1535

1 11 21 31 41 51 

I I I I I I 

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560 

CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 1620 

ATGATTCAGA GGTCACTTCA TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680 

TGCCATACTG TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 1740 

CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800 

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460
CTGAGTTCAA AGCCATCTT 

Seq ID NO: 588 Protein sequence 
Protein Accession #: NP_005647.1

1 11 21 31 41 51

I I I I I I

MALNSGSPPA IGPYYENHGY QPENPYPAQP TVVPTVYEVH PAQYYPSPVP QYAPRVLTQA 60
SNPVVCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC 120
DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY 180
GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAVVSLR 240
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300
PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360
VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420
TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480
TDWIYRQMKA NG

Seq ID NO: 589 DNA sequence
Nucleic Acid Accession #: NM_001935.1
Coding sequence: 1..2301

1 11 21 31 41 51

I I I I I I

ATGAAGACAC CGTGGAAGAT TCTTCTGGGA CTGCTGGGTG CTGCTGCGCT TGTCACCATC 60 

ATCACCGTGC CCGTGGTTCT GCTGAACAAA GGCACAGATG ATGCTACAGC TGACAGTCGC 120

AAAACTTACA CTCTAACTGA TTACTTAAAA AATACTTATA GACTGAAGTT ATACTCCTTA 180 

AGATGGATTT CAGATCATGA ATATCTCTAC AAACAAGAAA ATAATATCTT GGTATTCAAT 240 

GCTGAATATG GAAACAGCTC AGTTTTCTTG GAGAACAGTA CATTTGATGA GTTTGGACAT 300 

TCTATCAATG ATTATTCAAT ATCTCCTGAT GGGCAGTTTA TTCTCTTAGA ATACAACTAC 360 

GTGAAGCAAT GGAGGCATTC CTACACAGCT TCATATGACA TTTATGATTT AAATAAAAGG 420

CAGCTGATTA CAGAAGAGAG GATTCCAAAC AACACACAGT GGGTCACATG GTCACCAGTG 480 

GGTCATAAAT TGGCATATGT TTGGAACAAT GACATTTATG TTAAAATTGA ACCAAATTTA 540 

CCAAGTTACA GAATCACATG GACGGGGAAA GAAGATATAA TATATAATGG AATAACTGAC 600 

TGGGTTTATG AAGAGGAAGT CTTCAGTGCC TACTCTGCTC TGTGGTGGTC TCCAAACGGC 660 

ACTTTTTTAG CATATGCCCA ATTTAACGAC ACAGAAGTCC CACTTATTGA ATACTCCTTC 720

TACTCTGATG AGTCACTGCA GTACCCAAAG ACTGTACGGG TTCCATATCC AAAGGCAGGA 780 

GCTGTGAATC CAACTGTAAA GTTCTTTGTT GTAAATACAG ACTCTCTCAG CTCAGTCACC 840 

AATGCAACTT CCATACAAAT CACTGCTCCT GCTTCTATGT TGATAGGGGA TCACTACTTG 900 

TGTGATGTGA CATGGGCAAC ACAAGAAAGA ATTTCTTTGC AGTGGCTCAG GAGGATTCAG 960 

AACTATTCGG TCATGGATAT TTGTGACTAT GATGAATCCA GTGGAAGATG GAACTGCTTA 1020

GTGGCACGGC AACACATTGA AATGAGTACT ACTGGCTGGG TTGGAAGATT TAGGCCTTCA 1080 

GAACCTCATT TTACCCTTGA TGGTAATAGC TTCTACAAGA TCATCAGCAA TGAAGAAGGT 1140 

TACAGACACA TTTGCTATTT CCAAATAGAT AAAAAAGACT GCACATTTAT TACAAAAGGC 1200 

ACCTGGGAAG TCATCGGGAT AGAAGCTCTA ACCAGTGATT ATCTATACTA CATTAGTAAT 1260 

GAATATAAAG GAATGCCAGG AGGAAGGAAT CTTTATAAAA TCCAACTTAG TGACTATACA 1320

AAAGTGACAT GCCTCAGTTG TGAGCTGAAT CCGGAAAGGT GTCAGTACTA TTCTGTGTCA 1380 

TTCAGTAAAG AGGCGAAGTA TTATCAGCTG AGATGTTCCG GTCCTGGTCT GCCCCTCTAT 1440

ACTCTACACA GCAGCGTGAA TGATAAAGGG CTGAGAGTCC TGGAAGACAA TTCAGCTTTG 1500 

GATAAAATGC TGCAGAATGT CCAGATGCCC TCCAAAAAAC TGGACTTCAT TATTTTGAAT 1560 

GAAACAAAAT TTTGGTATCA GATGATCTTG CCTCCTCATT TTGATAAATC CAAGAAATAT 1620

CCTCTACTAT TAGATGTGTA TGCAGGCCCA TGTAGTCAAA AAGCAGACAC TGTCTTCAGA 1680

CTGAACTGGG CCACTTACCT TGCAAGCACA GAAAACATTA TAGTAGCTAG CTTTGATGGC 1740 

AGAGGAAGTG GTTACCAAGG AGATAAGATC ATGCATGCAA TCAACAGAAG ACTGGGAACA 1800 

TTTGAAGTTG AAGATCAAAT TGAAGCAGCC AGACAATTTT CAAAAATGGG ATTTGTGGAC 1860 

AACAAACGAA TTGCAATTTG GGGCTGGTCA TATGGAGGGT ACGTAACCTC AATGGTCCTG 1920

GGATCGGGAA GTGGOGTGTT CAAGTGTGGA ATAGCCGTGG OGCCTGTATC CCGGTGGGAG 1980 

TACTATGACT CAGTGTACAC AGAACGTTAC ATGGGTCTCC CAACTCCAGA AGACAACCTT 2040 

GACCATTACA GAAATTCAAC AGTCATGAGC AGAGCTGAAA ATTTTAAACA AGTTGAGTAC 2100 

CTCCTTATTC ATGGAACAGC AGATGATAAC GTTCACTTTC AGCAGTCAGC TCAGATCTCC 2160 

AAAGCCCTGG TCGATGTTGG AGTGGATTTC CAGGCAATGT GGTATACTGA TGAAGACCAT 2220

GGAATAGCTA GCAGCACAGC ACACCAACAT ATATATACCC ACATGAGCCA CTTCATAAAA 2280 
CAATGTTTCT CTTTACCTTA G 






Seq ID NO: 590 Protein sequence 
Protein Accession #: NP_001926.1 



1 11 21 31 41 51

RWISDHEYLY KQENNILVFN AEYGNSSVFL ENSTFDEFGH SINDYSISPD GQFILLEYNY 120 

VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAYVWNN DIYVKIEPNL 180

PSYRITWTGK EDIIYNGITD WVYEEEVFSA YSALWWSPNG TFLAYAQFND TEVPLIEYSF 240 

YSDESLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT NATSIQITAP ASMLIGDHYL 300

CDVTWATQER ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGRFRPS 360

EPHFTLDGNS FYKIISNEEG YRHICYFQID KKDCTFITKG TWEVIGIEAL TSDYLYYISN 420 

EYKGMPGGRN LYKIQLSDYT KVTCLSCELN PERCQYYSVS FSKEAKYYQL RCSGPGLPLY 480 

TLHSSVNDKG LRVLEDNSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY 540 

PLLLDVYAGP CSQKADTVFR LNWATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT 600 

FEVEDQIEAA RQFSKMGFVD NKRIAIWGWS YGGYVTSMVL GSGSGVFKCG IAVAPVSRWE 660 

YYDSVYTERY MGLPTPEDNL DHYRNSTVMS RAENFKQVEY LLIHGTADDN VHFQQSAQIS 720 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP 



Seq ID NO: 591 DNA sequence

Nucleic Acid Accession #: NM_016077.1

Coding sequence: 128..667

1 11 21 31 41 51 

TCGCTTTGTG ATTCTTGATC CGGAACTTTG TCACCCAGGA ACCCCGGAAG AGGTAGCTCA 60 

CGCGATAGAA ACGTGTTCGC TTGCCCAGAA GAAGGGAAGG CGCGAGTGAG GAAAGGAGGT 120 

ACTGTAGATG CCCTCCAAAT CCTTGGTTAT GGAATATTTG GCTCATCCCA GTACACTCGG 180 

CTTGGCTGTT GGAGTTGCTT GTGGCATGTG CCTGGGCTGG AGCCTTCGAG TATGCTTTGG 240 

GATGCTCCCC AAAAGCAAGA CGAGCAAGAC ACACACAGAT ACTGAAAGTG AAGCAAGCAT 300 

CTTGGGAGAC AGCGGGGAGT ACAAGATGAT TCTTGTGGTT CGAAATGACT TAAAGATGGG 360 

AAAAGGGAAA GTGGCTGCCC AGTGCTCTCA TGCTGCTGTT TCAGCCTACA AGCAGATTCA 420 

AAGAAGAAAT CCTGAAATGC TCAAACAATG GGAATACTGT GGCCAGCCCA AGGTGGTGGT 480 

CAAAGCTCCT GATGAAGAAA CCCTGATTGC ATTATTGGCC CATGCAAAAA TGCTGGGACT 540 

GACTGTAAGT TTAATTCAAG ATGCTGGACG TACTCAGATT GCACCAGGCT CTCAAACTGT 600 

CCTAGGGATT GGGCCAGGAC CAGCAGACCT AATTGACAAA GTCACTGGTC ACCTAAAACT 660 

TTACTAGGTG GACTTTGATA TGACAACAAC CCCTCCATCA CAAGTGTTTG AAGCCTGTCA 720 

GATTCTAACA ACAAAAGCTG AATTTCTTCA CCCAACTTAA ATGTTCTTGA GATGAAAATA 780 
AAACCTATTC CCATGTTCTA AAAAAA 

Seq ID NO: 592 Protein sequence 
Protein Accession #: NP_057161.1

1 11 21 31 41 51

I I I I I I

MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASILG 60
DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVVVKA 120
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY

Seq ID NO: 593 DNA sequence

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..1896 

1 11 21 31 41 51 

ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGGCC CGAACCCGAG 120 

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 

ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 

CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 1080 

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 

GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260

ACCCATCCTG TCAGGTTGGC TCGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GGTCAGCGGG TACCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAOAGAAG CTCAGTGACT 1500 

GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1680 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 

GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800

TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT 1860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCCTGA 



Seq ID NO: 594 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51

I I I I I I

MRAVPLPAPL LPLLLLALLA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRFGSG 60
AAGGSGSSSS NSSGDALVTR ISILLRDLPT LKAAVIVAFA FTTLLIACLL LRVFRSGKRL 120
KKTRKYDIIT TPAERVEMAP LNEEDDEDED STVFDIKYRV SLPAALRRQL PGCQTLLTVP 180
VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHPVESW SAATWGVKDW TWKPSCVGGV 240
ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 300
LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 360
STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPLS 420
THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480
ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540
ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKMVLM SEEGPPSLTG 600
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P



Seq ID NO: 595 DNA sequence 

Nucleic Acid Accession #: NM_021614.1 

Coding sequence: 1..1740 



1 11 21 31 41 51

I I I I I I

ATGAGCAGCT GCAGGTACAA CGGGGGCGTC ATGCGGCCGC TCAGCAACTT GAGCGCGTCC 60
CGCCGGAACC TGCACGAGAT GGACTCAGAG GCGCAGCCCC TGCAGCCCCC CGCGTCTGTC 120
GGAGGAGGTG GCGGCGCGTC CTCCCCGTCT GCAGCCGCTG CCGCCGCCGC CGCTGTTTCG 180
TCCTCAGCCC CCGAGATCGT GGTGTCTAAG CCCGAGCACA ACAACTCCAA CAACCTGGCG 240
CTCTATGGAA CCGGCGGCGG AGGCAGCACT GGAGGAGGCG GCGGCGGTGG CGGGAGCGGG 300
CACGGCAGCA GCAGTGGCAC CAAGTCCAGC AAAAAGAAAA ACCAGAACAT CGGCTACAAG 360
CTGGGCCACC GGCGCGCCCT GTTCGAAAAG CGCAAGCGGC TCAGCGACTA CGCGCTCATC 420
TTCGGCATGT TCGGCATCGT GGTCATGGTC ATCGAGACCG AGCTGTCGTG GGGCGCCTAC 480
GACAAGGCGT CGCTGTATTC CTTAGCTCTG AAATGCCTTA TCAGTCTCTC CACGATCATC 540
CTGCTCGGTC TGATCATCGT GTACCACGCC AGGGAAATAC AGTTGTTCAT GGTGGACAAT 600
GGAGCAGATG ACTGGAGAAT AGCCATGACT TATGAGCGTA TTTTCTTCAT CTGCTTGGAA 660
ATACTGGTGT GTGCTATTCA TCCCATACCT GGGAATTATA CATTCACATG GACGGCCCGG 720
CTTGCCTTCT CCTATGCCCC ATCCACAACC ACCGCTGATG TGGATATTAT TTTATCTATA 780
CCAATGTTCT TAAGACTCTA TCTGATTGCC AGAGTCATGC TTTTACATAG CAAACTTTTC 840
ACTGATGCCT CCTCTAGAAG CATTGGAGCA CTTAATAAGA TAAACTTCAA TACACGTTTT 900
GTTATGAAGA CTTTAATGAC TATATGCCCA GGAACTGTAC TCTTGGTTTT TAGTATCTCA 960
TTATGGATAA TTGCCGCATG GACTGTCCGA GCTTGTGAAA GGTACCATGA TCAACAGGAT 1020
GTTACTAGCA ACTTCCTTGG AGCGATGTGG TTGATATCAA TAACTTTTCT CTCCATTGGT 1080
TATGGTGACA TGGTACCTAA CACATACTGT GGAAAAGGAG TCTGCTTACT TACTGGAATT 1140
ATGGGTGCTG GTTGCACAGC CCTGGTGGTA GCTGTAGTGG CAAGGAAGCT AGAACTTACC 1200
AAAGCAGAAA AACACGTGCA CAATTTCATG ATGGATACTC AGCTGACTAA AAGAGTAAAA 1260
AATGCAGCTG CCAATGTACT CAGGGAAACA TGGCTAATTT ACAAAAATAC AAAGCTAGTG 1320
AAAAAGATAG ATCATGCAAA AGTAAGAAAA CATCAACGAA AATTCCTGCA AGCTATTCAT 1380
CAATTAAGAA GTGTAAAAAT GGAGCAGAGG AAACTGAATG ACCAAGCAAA CACTTTGGTG 1440
GACTTGGCAA AGACCCAGAA CATCATGTAT GATATGATTT CTGACTTAAA CGAAAGGAGT 1500
GAAGACTTCG AGAAGAGGAT TGTTACCCTG GAAACAAAAC TAGAGACTTT GATTGGTAGC 1560
ATCCACGCCC ---------- CATAAGCCAG ACCATCAGGC AGCAGCAGAG AGATTTCATT 1620
GAGGCTCAGA TGGAGAGCTA CGACAAGCAC GTCACTTACA ATGCTGAGCG GTCCCGGTCC 1680
TCGTCCAGGA GGCGGCGGTC CTCTTCCACA GCACCACCAA CTTCATCAGA GAGTAGCTAG

Seq ID NO: 596 Protein sequence
Protein Accession #: NP_067627.1

1 11 21 31 41 51

I I I I I I

MSSCRYNGGV MRPLSNLSAS RRNLHEMDSE AQPLQPPASV GGGGGASSPS AAAAAAAAVS 60
SSAPEIVVSK PEHNNSNNLA LYGTGGGGST GGGGGGGGSG HGSSSGTKSS KKKNQNIGYK 120
LGHRRALFEK RKRLSDYALI FGMFGIVVMV IETELSWGAY DKASLYSLAL KCLISLSTII 180
LLGLIIVYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCAIHPIP GNYTFTWTAR 240
LAFSYAPSTT TADVDIILSI PMFLRLYLIA RVMLLHSKLF TDASSRSIGA LNKINFNTRF 300
VMKTLMTICP GTVLLVFSIS LWIIAAWTVR ACERYHDQQD VTSNFLGAMW LISITFLSIG 360
YGDMVPNTYC GKGVCLLTGI MGAGCTALVV AVVARKLELT KAEKHVHNFM MDTQLTKRVK 420
NAAANVLRET WLIYKNTKLV KKIDHAKVRK HQRKFLQAIH QLRSVKMEQR KLNDQANTLV 480
DLAKTQNIMY DMISDLNERS EDFEKRIVTL ETKLETLIGS IHALPGLISQ TIRQQQRDFI 540
EAQMESYDKH VTYNAERSRS



Seq ID NO: 597 DNA sequence 

Nucleic Acid Accession #: NM_016029.1 

Coding sequence: 78..1097



60 
120 
180 
240 
300 
360 
420 
480 
540 



1 
I 

CTGCGATCCC 
GGGCGTGCGC 
TGCTCCTGCT 
TATGGGCCGA 
TGACTGGAGC 
TTTCTCTTGT 
TAGAGAATGG 
CTGGTTCCCA 
TGGTCAACAA 
ACAGAAAGCT 
CTCACATGAT 



11 
I 

GCAGGGCAGC 
GGCCGCAATG 
CTTGGTGCAG 
GTGGCAGGGA 
CTCGAGTGGA 
GCTGTCAGCC 
CAATTTAAAA 
TGAAGCGGCT 
TGGTGGAATG 
AATAGAGCTT 
CGAGAGGAAG 



21 
I 

GACGCGACTC 
AACTGGGAGC 
CTGCTGCGCT 
CGACGCCCAG 
ATTGGTGAGG 
AGAAGAGTGC 
GAAAAAGATA 
ACCAAAGCTG 
TCCCAGCGTT 
AACTACTTAG 
CAAGGAAAGA 



31 
I 

TGGTGCGGGC 
TGCTGCTGTG 
TCCTGAGGGC 
AATGGGAGCT 
AGCTGGCTTA 
ATGAGCTGGA 
TACTTGTTTT 
TTCTCCAGGA 
CTCTGTGCAT 
GGACGGTGTC 
TTGTTACTGT 



41 

I 

CGTCTTCTTC 
GCTGCTGGTG 
TGACGGCGAC 
GACTGATATG 
CCAGTTGTCT 
AAGGGTGAAA 
GCCCCTTGAC 
GTTTGGTAGA 
GGATACCAGC 
CTTGACAAAA 
GAATAGCATC 




CTGGGTATCA 



TATCTGTACC
ATGGCCTTCG 
GACCTGTGCA 
GCAATAATGG 
TCAGCATGGC 
CATATTTGTG 
AAAGGATTGA 
AGACAAAACA 
AAACATGAAA 
ACTTTTTAAT 
AGATTGOCAT 



TCTTTCCATT 
AACAGAACTT 
ATCAAATATT
AGACCAGTCC 
CAATGATTTG 
GCAATACATG 
GAACTTTAAG 
TGACTGAAAA 
ACAGCAATCT 
AGATATGACT 
GAATCTTGCA 



GGATACTGTG 
GCCACATACC 
GTGGAGAATT 
CACAAGATGA 
AAAGAAGTTT 
CCAACCTGGG 
AGTGGTGTGG 
GAGCACCTGT 
TCTTATGCTT 
TTGCTTCCAA 
AA 



CTAGCAAGCA 
CAGGTATAAT 
CCCTAGCTGG 
CAACCAGTCG 
GGATCTCAGA 
CCTGGTGGAT 
ATGCAGACTC 
ACTTTTCAAG 
CTGAATAATC 
CATGGAATGA 



TGCTCTCCGG 
AGTTTCTAAC 
AGAAGTCACA 
TTGTGTGCGG 
ACAACCTTTC 
AACCAACAAG 
TTCTTATTTT 
CCACTGGAGG 
AAAGACTAAT 
AATAAAAAAT 



ATTTGCCCAG 
AAGACTATAG 
CTGATGTTAA 
TTGTTAGTAA 
ATGGGGAAGA 
AAAATCTTTA 
GAGAAATGGA 
TTGTGATTTT 
AAATAATAAA 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



Seq ID NO: 598 Protein sequence 
Protein Accession #: NP_057113.1

1 11 21 31 41 51 

I I I I I I 

MNWELLLWLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWELTD MVVWVTGASS
GIGEELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGNL KEKDILVLPL DLTDTGSHEA 
ATKAVLQEFG RIDILVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMIER 
KQGKIVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN 
IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 
MPTWAWWITN KMGKKRIENF KSGVDADSSY FKIFKTKHD 

Seq ID NO: 599 DNA sequence 

Nucleic Acid Accession #: NM_000793.2

Coding sequence: 401..1222



1 11 21 31 41 51

I I I I I I

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 
GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 
AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 
GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 
CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 
TTGGAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 
ACCAGAGCGC ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 
CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 
CTATGACTCG GTCATTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCCGCT CCAAGTCCAC 
TCGGGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGCGTCT GGAAGAGCTT 
CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 
GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 
TGAGGGAGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGCGCCCAC TAGTGGTCAA 
CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 
GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG CTCATCCATC 
AGATGGCTGG GCGATACCGG GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACCAGAA 
CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA
GTGGCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 
TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 
CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 
GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 
ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 
TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCTTT 
CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 
GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 
TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 
ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 
TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 
TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 
TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 
GGTATATTTG TAGATACOGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 
AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 
AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 
AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 
GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 
CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 
TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 
TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 
GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 
TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 
ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 
ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 
TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 
ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT
TCCATTGACA TTTGAATTTG GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 
AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 
ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 
AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 
ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 
CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 
AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 
CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG
GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 
CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 
ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC
TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 
ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 
CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 
XGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 
AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTCGT TTATGTGTGC AGACAATTCT 
CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 
TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 
GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 
TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATCGCAA CATGAATTTT 
TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 
CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 
GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 
AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 
GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 
AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 
TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTGC TTAGTGAAAA TACTAGTGGG 
ATCAACAATG ATTTTCTTGA ATGGGCATGA ATGGAGATGC CCGCACAGTA ATGTAGAAAT
GTTTCATACA GCTATTAAAA TGTAACTGAC CTCCTTAGAG GCAGATTAGT AACTGTTCCT 
ACTTTGTATA GCTAAGTGAC AGTCACTTAA CTTACATGAC TTTCTTTTTT CACATTGGGT 
CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCACGTCTCC TTGATTTTTG GTAGTATCAA 
CTTCCCAGTG ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA CCAGGAAGTG CTTATTCTCT 
CATCACAGTG GGAAGAATAG CCTATTGTCT TTCATTTTGC CTGAGTGTAT TTTACTATTT 
GGGCTCTGAA ATAAAAATTA TGAAATATGG TGAGGTCACA TGTTGGTGCT GCCTTGCTGC 
ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GGCCTTTCGG GAAAATTCAA 
AGGGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCTATG TGGAAGCAAG 
AAGAATTGAC TGATTTACAG GACTTCTCTT TATGTCAATC TTAAGAGGAT GGATGAATCT 
GGACATTTGT TCCACCCGAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAGGATCA 
TATGACCATT GAAAAAGGAA AAATGTAGAC TCTGACTTCC GTCCCACTGA AGGATTAATG 
AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG 
TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT TTTTTTTTTT TTTTTTCCCC 
TAAATGGTAT GGCCAAAAGT CAGAGTTAAA ATATATATAG TTAGATTCCA ACTTCCTCCT 
TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT GGGGCTTAAG 
TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA 
AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG
GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT 
GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 
ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 
CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 
TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT
CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 
AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 
AACCCACAGA AGGGGATGGG AAATAAAGAA AATTCTCTCA AGACTTCTCC AGGCCCATGT 
CACTGGTCAG CGTGGTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 
AGTACTTTAT AACCAAAGCA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT 
TTAGTTTTGC CATCACATTG TCACCCAGAC CTCACCTAGC CCCAAGTAAT CGGGCGCCCC 
GAAGAGGGAG ACAGAGATGT GCCAGAGTTG ACCCAGTGTG CGGATGATAA CTACTGACGA 
AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 
TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG 
GTGAGAATGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT 
GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC 
TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA 
AAAAAAAAA 

Seq ID NO: 600 Protein sequence 
Protein Accession #: NP_000784.2 

1 11 21 31 41 51 

| | | | | |

MGILSVDLLI TLQILPVFFS NCLPLALYDS VILLKHWLL LSRSKSTRGE WRRMLTSEGL 
RCVWKSFLLD AYKQVKLGED APKSSWHVS STEGGDKSGN GTQEKIAEGA TCHLLDFASP 
ERPLWNFGS ATUPPFTSQL PAFRKLVEEF SSVADFLLVY IDEAHPSDGW AIPGDSSLSF 
EVKKHQNQED RCAAAQQLLE RFSLPPQCKV VADRMDNNAN IAYGVAFERV CIVQRQKIAY 
LGGKGPFSYN LQEVRHWLEK NFSKRUKKTR LAG 

Seq ID NO: 601 DNA sequence 

Nucleic Acid Accession #: NM_005233.1 

Coding sequence: 101..3052

1 11 21 31 41 51

| | | | | |

CCATGGATGG TAACTTCTCC AGCAATCAGA GCGCTCCCCC TCACATCAGT GGCATGCTTC 60
ATGGAGATAT GCTCCTCTCA CTGCCCTCTG CACCAGCAAC ATGGATTGTC AGCTCTCCAT 120
CCTCCTCCTT CTCAGCTGCT CTGTTCTCGA CAGCTTCGGG GAACTGATTC CGCAGCCTTC 180
CAATGAAGTC AATCTACTGG ATTCAAAAAC AATTCAAGGG GAGCTGGGCT GGATCTCTTA 240
TCCATCACAT GGGTGGGAAG AGATCAGTGG TGTGGATGAA CATTACACAC CCATCAGGAC 300
TTACCAGGTG TGCAATGTCA TGGACCACAG TCAAAACAAT TGGCTGAGAA CAAACTGGGT 360
CCCCAGGAAC TCAGCTCAGA AGATTTATGT GGAGCTCAAG TTCACTCTAC GAGACTGCAA 420
TAGCATTCCA TTGGTTTTAG GAACTTGCAA GGAGACATTC AACCTGTACT ACATGGAGTC 480
TGATGATGAT CATGGGGTGA AATTTCGAGA GCATCAGTTT ACAAAGATTG ACACCATTGC 540
AGCTGATGAA AGTTTCACTC AAATGGATCT TGGGGACCGT ATTCTGAAGC TCAACACTGA 600
GATTAGAGAA GTAGGTCCTG TCAACAAGAA GGGATTTTAT TTGGCATTTC AAGATGTTGG 660
TGCTTGTGTT GCCTTGGTGT CTGTGAGAGT ATACTTCAAA AAGTGCCCAT TTACAGTGAA 720
GAATCTGGCT ATGTTTCCAG ACACGGTACC CATGGACTCC CAGTCCCTGG TGGAGGTTAG 780
AGGGTCTTGT GTCAACAATT CTAAGGAGGA AGATCCTCCA AGGATGTACT GCAGTACAGA 840
AGGCGAATGG CTTGTACCCA TTGGCAAGTG TTCCTGCAAT GCTGGCTATG AAGAAAGAGG 900
TTTTATGTGC CAAGCTTGTC GACCAGGTTT CTACAAGGCA TTGGATGGTA ATATGAAGTG 960
TGCTAAGTGC CCGCCTCACA GTTCTACTCA GGAAGATGGT TCAATGAACT GCAGGTGTGA 1020
GAATAATTAC TTCCGGGCAG ACAAAGACCC TCCATCCATG GCTTGTACCC GACCTCCATC 1080
TTCACCAAGA AATGTTATCT CTAATATAAA CGAGACCTCA GTTATCCTGG ACTGGAGTTG 1140
GCCCCTGGAC ACAGGAGGCC GGAAAGATGT TACCTTCAAC ATCATATGTA AAAAATGTGG 1200
GTGGAATATA AAACAGTGTG AGCCATGCAG CCCAAATGTC CGCTTCCTCC CTCGACAGTT 1260
TGGACTCACC AACACCACGG TGACAGTGAC AGACCTTCTG GCACATACTA ACTACACCTT 1320
TGAGATTGAT GCCGTTAATG GGGTGTCAGA GCTGAGCTCC CCACCAAGAC AGTTTGCTGC 1380
GGTCAGCATC ACAACTAATC AGGCTGCTCC ATCACCTGTC CTGACGATTA AGAAAGATCG 1440
GACCTCCAGA AATAGCATCT CTTTGTCCTG GCAAGAACCT GAACATCCTA ATGGGATCAT 1500
ATTGGACTAC GAGGTCAAAT ACTATGAAAA GCAGGAACAA GAAACAAGTT ATACCATTCT 1560
GAGGGCAAGA GGCACAAATG TTACCATCAG TAGCCTCAAG CCTGACACTA TATACGTATT 1620
CCAAATCCGA GCCCGAACAG CCGCTGGATA TGGGACGAAC AGCCGCAAGT TTGAGTTTGA 1680
AACTAGTCCA GACTCTTTCT CCATCTCTGG TGAAAGTAGC CAAGTGGTCA TGATCGCCAT 1740
TTCAGCGGCA GTAGCAATTA TTCTCCTCAC TGTTGTCATC TATGTTTTGA TTGGGAGGTT 1800
CTGTGGCTAT AAGTCAAAAC ATGGGGCAGA TGAAAAAAGA CTTCATTTTG GCAATGGGCA 1860
TTTAAAACTT CCAGGTCTCA GGACTTATGT TGACCCACAT ACATATGAAG ACCCTACCCA 1920
AGCTGTTCAT GAGTTTGCCA AGGAATTGGA TGCCACCAAC ATATCCATTG ATAAAGTTGT 1980
TGGAGCAGGT GAATTTGGAG AGGTGTGCAG TGGTCGCTTA AAACTTCCTT CAAAAAAAGA 2040
GATTTCAGTG GCCATTAAAA CCCTGAAAGT TGGCTACACA GAAAAGCAGA GGAGAGACTT 2100
CCTGGGAGAA GCAAGCATTA TGGGACAGTT TGACCACCCC AATATCATTC GACTGGAAGG 2160
AGTTGTTACC AAAAGTAAGC CAGTTATGAT TGTCACAGAA TACATGGAGA ATGGTTCCTT 2220
GGATAGTTTC CTACGTAAAC ACGATGCCCA GTTTACTGTC ATTCAGCTAG TGGGGATGCT 2280
TCGAGGGATA GCATCTGGCA TGAAGTACCT GTCAGACATG GGCTATGTTC ACCGAGACCT 2340
CGCTGCTCGG AACATCTTGA TCAACAGTAA CTTGGTGTGT AAGGTTTCTG ATTTCGGACT 2400
TTCGCGTGTC CTGGAGGATG ACCCAGAAGC TGCTTATACA ACAAGAGGAG GGAAGATCCC 2460
AATCAGGTGG ACATCACCAG AAGCTATAGC CTACCGCAAG TTCACGTCAG CCAGCGATGT 2520
ATGGAGTTAT GGGATTGTTC TCTGGGAGGT GATGTCTTAT GGAGAGAGAC CATACTGGGA 2580
GATGTCCAAT CAGGATGTAA TTAAAGCTGT AGATGAGGGC TATCGACTGC CACCCCCCAT 2640
GGACTGCCCA GCTGCCTTGT ATCAGCTGAT GCTGGACTGC TGGCAGAAAG ACAGGAACAA 2700
CAGACCCAAG TTTGAGCAGA TTGTTAGTAT TCTGGACAAG CTTATCCGGA ATCCCGGCAG 2760
CCTGAAGATC ATCACCAGTG CAGCCGCAAG GCCATCAAAC CTTCTTCTGG ACCAAAGCAA 2820
TGTGGATATC TCTACCTTCC GCACAACAGG TGACTGGCTT AATGGTGTCC GGACAGCACA 2880
CTGCAAGGAA ATCTTCACGG GCGTGGAGTA CAGTTCTTGT GACACAATAG CCAAGATTTC 2940
CACAGATGAC ATGAAAAAGG TTGGTGTCAC CGTGGTTGGG CCACAGAAGA AGATCATCAG 3000
TAGCATTAAA GCTCTAGAAA CGCAATCAAA GAATGGCCCA GTTCCCGTGT AAAGCACGAC 3060
GGAAGTGCTT CTGGACGGAA GTGGTGGCTG TGGAAGGCGT CAAGTCATCC TGCAGACAGA 3120
CAATAATTCT GGAGATACTG GTGGAAGTT

Seq ID NO: 602 Protein sequence 
Protein Accession #: NP_005224.1 



1 11 21 31 41 51 

I I I I I I 

MDCQLSILLL LSCSVLDSFG ELIPQPSNEV NLLDSKTIQG ELGWISYPSH GWEEISGVDE 60
HYTPIRTYQV CNVMDHSQNN WLRTNWVPRN SAQKIYVELK FTLRDCNSIP LVLGTCKETF 120
NLYYKESDDD HGVKFREHQP TKIDTIAADE SFTQMDLGDR ILKLNTEIRE VGPVNKKGFY 180
LAFQDVGACV ALVSVRVYFK KCPFTVKNLA MFPDTVPMDS QSLVEVRGSC VNNSKEEDPP 240
RMYCSTEGEW LVPIGKCSCN AGYEERGFMC QACRPGPYKA LDGNMKCAKC PPHSSTQEDG 300
SMNCRCENNY FRADKDPPSM ACTRPPSSPR NVISNINETS VILDWSWPLD TGGRKDVTFN 360
IICKKCGWNI KQCEPCSPNV RFLPRQFGLT NTTVTVTDLL AHTNYTFEID AVNGVSELSS 420
PPRQFAAVSI TTNQAAPSPV LTIKKDRTSR HSISLSWQEP EHPNGIILDY EVKYYEKQEQ 480
ETSYTILRAR GTNVTISSLK PDTIYVFQIR ARTAAGYGTM SRKFEPETSP DSPSISGESS 540
QVVMIAISAA VAIILLTVVI YVLIGRFCGY KSKHGADEKR LHFGNGHLKL PGLRTYVDPH 600
TYEDPTQAVH EFAKELDATN ISIDKVVGAG EFGEVCSGRL KLPSKKEISV AIKTLKVGYT 660
EKQRRDFLGE ASIMGQFDHP NIIRLEGVVT KSKPVMIVTE YMENGSLDSF LRKHDAQFTV 720
IQLVGMLRGI ASGMKYLSDM GYVHRDLAAR NILINSNLVC KVSDFGLSRV LEDDPEAAYT 780
TRGGKIPIRW TSPEAIAYRK FTSASDVWSY GIVLWEVMSY GERPYWEMSN QDVIKAVDEG 840
YRLPPPMDCP AALYQLMLDC WQKDRNNRPK FEQIVSILDK LIRNPGSLKI ITSAAARPSN 900
LLLDQSNVDI STFRTTGDWL NGVRTAHCKE IFTGVEYSSC DTIAKISTDD MKKVGVTVVG 960
PQKKIISSIK ALETQSKNGP VPV

Seq ID NO: 603 DNA sequence

Nucleic Acid Accession #: NM_005727.1 

Coding sequence: 122..847

1 11 21 31 41 51

| | | | | |

GCCAGGCGTC CCTCTGCCTG CCCACTCAGT GGCAACACCC GGGAGCTGTT TTGTCCTTTG 60
TGGAGCCTCA GCAGTTCCCT CTTTCAGAAC TCACTGCCAA GAGCCCTGAA CAGGAGCCAC 120
CATGCAGTGC TTCAGCTTCA TTAAGACCAT GATGATCCTC TTCAATTTGC TCATCTTTCT 180
GTGTGGTGCA GCCCTGTTGG CAGTGGGCAT CTGGGTGTCA ATCGATGGGG CATCCTTTCT 240
GAAGATCTTC GGGCCACTGT CGTCCAGTGC CATGCAGTTT GTCAACGTGG GCTACTTCCT 300
CATCGCAGCC GGCGTTGTGG TCTTTGCTCT TGGTTTCCTG GGCTGCTATG GTGCTAAGAC 360
TGAGAGCAAG TGTGCCCTCG TGACGTTCTT CTTCATCCTC CTCCTCATCT TCATTGCTGA 420
GGTTGCAGCT GCTGTGGTCG CCTTGGTGTA CACCACAATG GCTGAGCACT TCCTGACGTT 480
GCTGGTAGTG CCTGCCATCA AGAAAGATTA TGGTTCCCAG GAAGACTTCA CTCAAGTGTG 540
GAACACCACC ATGAAAGGGC TCAAGTGCTG TGGCTTCACC AACTATACGG ATTTTGAGGA 600
CTCACCCTAC TTCAAAGAGA ACAGTGCCTT TCCCCCATTC TGTTGCAATG ACAACGTCAC 660
CAACACAGCC AATGAAACCT GCACCAAGCA AAAGGCTCAC GACCAAAAAG TAGAGGGTTG 720
CTTCAATCAG CTTTTGTATG ACATCCGAAC TAATGCAGTC ACCGTGGGTG GTGTGGCAGC 780
TGGAATTGGG GGCCTCGAGC TGGCTGCCAT GATTGTGTCC ATGTATCTGT ACTGCAATCT 840
ACAATAAGTC CACTTCTGCC TCTGCCACTA CTGCTGCCAC ATGGGAACTG TGAAGAGGCA 900
CCCTGGCAAG CAGCAGTGAT TGGGGGAGGG GACAGGATCT AACAATGTCA CTTGGGCCAG 960
AATGGACCTG CCCTTTCTGC TCCAGACTTG GGGCTAGATA GGGACCACTC CTTTTAGGCG 1020
ATGCCTGACT TTCCTTCCAT TGGTGGGTGG ATGGGTGGGG GGCATTCCAG AGCCTCTAAG 1080
GTAGCCAGTT CTGTTGCCCA TTCCCCCAGT CTATTAAACC CTTGATATGC CCCCTAGGCC 1140
TAGTGGTGAT CCCAGTGCTC TACTGGGGGA TGAGAGAAAG GCATTTTATA GCCTGGGCAT 1200
AAGTGAAATC AGCAGAGCCT CTGGGTGGAT GTGTAGAAGG CACTTCAAAA TGCATAAACC 1260
TGTTACAATG TTAAAAAA

Seq ID NO: 604 Protein sequence
Protein Accession #: NP_0057i9.1

1 11 21 31 41

MQCFSFIKTM MILFNLLIFL CGAALLAVGI WVSIDGASFL KIFGPLSSSA MQFVNVGYFL
IAAGVVVFAL GFLGCYGAKT ESKCALVTFF FILLLIFIAE VAAAVVALVY TTMAEHFLTL
LVVPAIKKDY GSQEDFTQVW NTTMKGLKCC GFTNYTDFED SPYFKENSAF PPFCCNDNVT
NTANETCTKQ KAHDQKVEGC FNQLLYDIRT NAVTVGGVAA GIGGLELAAM IVSMYLYCNL
Q

Seq ID NO: 605 DNA sequence 

Nucleic Acid Accession #: NM_000729.2

Coding sequence : 



1 
I 

GGCTCAGCTG 
AGCCATGAAC 
GACGCAGCCG 
GCCCCGTAGG 
CGCCCTGCTG 
CGTTAAGAAC 
CTGGATGGAT 
GCCGCCATCA 
TCACACTCAT 
TCAATGTGAA 
TGTGCAAACT 
TATGCTATTA 




GCCCAACGGA 
AACTCATTGT 
AATTGTGTCT 
GAAGACAAAA 
AAGTGATTTC 



21 
I 

CCGGTTGGAA 
GCCTGTGCGT 
CAGATCCCGC 
TATCGCAGAG 
TCCAGCAGGC 
TGGACCCCAG 
GCAGTGCCGA 
AGCAACCTCC 
CTGTGGAGTT 
GTAAGATTGT 
CTGTTTTCTT 
ATTCTGCC 



31 

I 

ACGCCAAGCC 
GCTGATGGCG 
GGGCTCCGGG 
AACGGATGGC 
CCGGAAAGCT 
CCACAGGATA 
GGAGTATGAG 
CAACCCAGAG 
TGACATTGAA 
CCAGTGCAAC 
CATCTGTGAC 



41 

I 

AGCTGCCGTC 
GTACTGGCGG 
CTGCAGCGGG 
GAGTCCCGAG 
CCTTCTGGAC 
AGTGACCGGG 
TACCCCTCCT 
GAGGCAGAAT 
TGTATCTATT 
CACACACGCT 
TCCTGTTCTG 



1 
I 

AGCACTCTCC 
ATAACCTCGG 
CAGGGCTGCT 
CTCTTTGCTC 
TCTTTGTGGT 
GGTTGGTTTC 
GCTGCAGTGA 
TGATTCTCTC 
CCATGGAGAA 
GCATTCTTGT 
ATCACCACGG 
GCGTTCTCTA 
GGTGGGGAGG 
CCGAGATAGG 
TGAGGGGATT 
GATGCTCCCT 
GTTCTTAGTG 
TGGGTCATGC 
TGGCAAGAGC 
TCTGACCCAA 
GTGAGCCATT 
GGAGGAGGAA 
CTTTTCACAA 
GTCCCTCATG 
GCTCTTCTGG 
AACTATGGGA 
TTCCCCTCAA 
TCATTGTAGC 
ATCAATATTG 
GAGTGATCAC 




CTATTCCTAC 
TCTGGTCCTG 
AGGAAGCCGT 



CTCTACTGCC 
TGATGGGGTC 
ACACACACTG 
ACTGAGGTCC 
AGATACTGTC 
AGCAAAACAT 
GTCCCTCTTT 
AGAGAGTTTG 
TGGAAAACCT 
GAGACCTCAT 
AGTTTCTCTA 
TCCAAGGGGC 
TGCAGTCAGT 
ACAATCCTAT 
AGGATCAGGG 
CTCTTGGGGA 



21 

I 

CGCAAAATTA 
CTTCCCCTCA 
GCCACTCAGA 
AAAAAAAAAA 
ACTGTTATTA 
GATGCATCAG 
TATGCCAGTG 
TGTGTCATTG 
TTCTTCCTCT 
ATCTACACTA 
ATCCTGGGCT 
AGAAAGAAAT 
TGAATCTGGG 
GGGGGAAGCA 
AAGCCCCTGC 
CAGAGAGCCT 
TCTGGGGCCC 
ACAGACCTAC 
TTTGTGCTGA 
CACATTCCAG 
GGAACAGATA 
TAAGGTCATG 
GGGGGATGGT 
GCCATGGTCT 
AAGTGACTAG 
AGTCTTGCAA 
GTTTCTTTTA 
TACTCTTCCT 
CTCCTAGGCT 
CCCTGCCTAT 



31 
1 

CACACCCCAG 
GTGCX3GTCAC 
GCGCCTCTGT 
GAGCCAACAT 
TGCTATTTGT 
TAGGTCTTTG 
AAGATGCCCT 
CCCTCCTGGT 
CAGGGGCCAC 
GTCATTATGC 
GGATCTGCTT 
AAGGCCGGAC 
AGGGAAGTGG 
AAGGGGGGAG 
CCTGGGGAGA 
CCCTGCAGOC 
CATCAGCTGC 
TGCACTGAGT 
ATACGCTAAG 
TCTGAAGTGC 
TTTAGCTCTG 
CTGGTGGGTT 
CAGAGCCCAG 
TTGCTAGGCC 
TGAACAATTC 
CAGTGCCATG 
AGTATACAAC 
CTAACATTTT 
CAGTGGTAGC 
CCCACTTCAC 



41 
I 

TACACCAGCA 
ATACTTCCAG 
CGCTGGGACC 
GTTGGTATTG 
TAGGACCATT 
GAAAAACTGT 
CAAGACAGTG 
CTTCGTGTTC 
CACACTGGTG 
GAATCGTGAT 
CTGCTTCAGC 
GAGTTCATGG 
AGGTTGCTGT 
GTCAAATCCC 
AAGTAGTTGG 
ACCAGACTTG 
CACAACACCA 
TAAAATAGCG 
CCTGGAAGCC 
CTACTGGGGG 
TGGAATTCAG 
AGCTAAACCA 
TCGAGACCTC 
TCTTGCTGAA 
GGTGGTAAAA 
TTAGGGTTAT 
AGGAGAGAGA 
TGAGGAAGTT 
TCTGGCTTAG 
AGGTGAGGCA 



60 
120 
180 
240 



51 
I 

CTAATCCAAA 
CTGGCGCCCT 
CAGAGGAGGC 
CGCACCTGGG 
GAATGTCCAT 
ACTACATGGG 
AGAGGACCCA 
AAGACAACAA 
TATTAAGTTC 
CACCAGAAGT 
AAAATGTTGT 



Seq ID NO: 606 Protein sequence 
Protein Accession #: NP_000720.1 

1 11 21 31 41 51 

I I I I I I 

MNSGVCLCVL MAVLAAGALT QPVPPADPAG SGLQRAEEAP RRQLRVSQRT DGESRAHLGA 
LLARYIQQAR KAPSGRMSIV KNLQNLDPSH RISDRDYMGW MDFGRRSAEE YEYPS 

Seq ID NO: 607 DNA sequence 

Nucleic Acid Accession #: NM_001423.1

Coding sequence: 219..692



51 
I 

GAGGAAACTT 
AAGAGCGGAC 
CTTCAGAACT 
CTGGCTGGTA 
GCCAATGTCT 
ACCAACATTA 
CAGGCCTTCA 
CAGCTCTTCA 
TGCTGGCTGT 
GGAACGCAGT 
TTCATCATCG 
GGATCTGGGG 
ACAGGAAAAA 
AAACCATTAC 
CTAGTACTTT 
GCCTCCAGCT 
GCCCCACTTC 
GTACAAGTTC 
ATCCTGCCCT 

TGACAAAATG 
AGAAGGAGAC 
ACACACGGCT 
AGCCAAGGCA 
GTACCACACA 
GTTTTTAGGA 
TGGACATGGC 
TTGTCTAATT 
ACACCACCTG 
TGGCAATTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



WO 03/042661 



GGAAGCTGAT TAAAACACAC ATAAACCAAA ACCAAACAAC AGGCCCTTGG GTGAAAGGTG 1860 

CTATATAATT GTGAAGTATT AAGCCTACCG TATTTCAGCC ATGATAAGAA CAGAGTGCCT 1920 

GCATTCCCAG GAAAATACGA AAATCCCATG AGATAAATAA AAATATAGGT GATGGGCAGA 1980 

TCTTTTCTTT AAAATAAAAA AGCAAAAACT CTTGTGGTAC CTAGTCAGAT GGTAGACGAG 2040 

CTGTCTGCTG CCGCAGGAGC ACCTCTATAC AGGACTTAGA AGTAGTATGT TATTCCTGGT 2100 

TAAGCAGGCA TTGCTTTGCC CTGGAGCAGC TATTTTAAGC CATCTCAGAT TCTGTCTAAA 2160 

GGGGTTTTTT GGGAAGACGT TTTCTTTATC GCCCTGAGAA GATCTACCCC AGGGAGAATC 2220

TGAGACATCT TGCCTACTTT TCTTTATTAG CTTTCTCCTC ATCCATTTCT TTTATACCTT 2280 

TCCTTTTTGG GGAGTTGTTA TGCCATGATT TTTGGTATTT ATGTAAAAGG ATTATTACTA 2340 

ATTCTATTTC TCTATGTTTA TTCTAGTTAA GGAAATGTTG AGGGCAAGCC ACCAAATTAC 2400 

CTAGGCTGAG GTTAGAGAGA TTGGCCAGCA AAAACTGTGG GAAGATGAAC TTTGTCATTA 2460 

TGATTTCATT ATCACATGAT TATAGAAGGC TGTCTTAGTG CAAAAAACAT ACTTACATTT 2520 

CAGACATATC CAAAGGGAAT ACTCACATTT TGTTAAGAAG TTGAACTATG ACTGGAGTAA 2580

ACCATGTATT CCCTTATCTT TTACTTTTTT TCTGTGACAT TTATGTCTCA TGTAATTTGC 2640 

ATTACTCTGG TGGATTGTTC TAGTACTGTA TTGGGCTTCT TCGTTAATAG ATTATTTCAT 2700 

ATACTATAAT TGTAAATATT TTGATACAAA TGTTTATAAC TCTAGGGATA TAAAAACAGA 2760 
TTCTGATTCC CTTCAAAAAA AAAAAA 

Seq ID NO: 608 Protein sequence 
Protein Accession #: NP_001414.1

1 11 21 31 41 51

| | | | | |

MLVLLAGIFV VHIATVIMLP VSTIANVWLV SNTVDASVGL WKNCTNISCS DSLSYASEDA 60
LKTVQAFMIL SIIFCVIALL VFVPQLPTME KGNRFFLSGA TTLVCWLCIL VGVSIYTSHY 120 
ANRDGTQYHH GYSYILGWIC FCFSFIIGVL YLVLRKK 

Seq ID NO: 609 DNA sequence 

Nucleic Acid Accession #: NM_004961.2 

Coding sequence: 55..1575

1 11 21 31 41 51

| | | | | |

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 120 
CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCGTGATG TTGTCTATGG CCCCCAGCCC 180 
CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 240 
AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 300 
GACCACAAAC TGCGCCCTGG CATTGGAGAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 360 
GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 420 
TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 480 
AATGGCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT TTTTTAGGAA TTCTAAGAGG 540 
ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 600 
GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 660 
CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720 
ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 780 
TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 840 
GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 900 
CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960 
ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 1020 
TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 1080 
TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 1140 
TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 1200 
AATAGCCGTG CCCATGCCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCCG CCAACATCAG 1260 
GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 1320 
TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 1380 
AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT 1440
TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 1500 
TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 1560 
TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 1620 
GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCGACTAG 1680 
AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 1740 
CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 1800 
TGGGCCACCT CCCTCTTCTT CAAGGAGCAT CCGTGATGCT CAGTGTTCAA AACCACAGCC 1860 
ACTTAGTGAT CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 1920 
AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 1980 
CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 2040 
GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 2100 
TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAGATAGA 2160 
CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 2220 
TTCCTCTCTC TCTGCTGCTG TGACATCTCC CTCTCCTTGC TGGCTCCATC TTTCGTCTGC 2280 
ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 2340 
TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCTTTCTGTA 2400 
ACTTTCCCAG TGACTTCCCC TAGCCCTGAC CCAGGCACTA GGCCTTGGTG ACTTCCTGGG 2460 
GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 2520
ACCCAGGGCA CACTGTGGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 2580 
CACTGTTATA CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 
GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 2700
GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 2760 
GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 2820 
ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 2880 
CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 2940 
GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 3000 
GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 3060
TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 3120
GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA

PCT/US02/36810

Seq ID NO: 610 Protein sequence
Protein Accession #: NP_004952.1

1 11 21 31 41 51

| | | | | |

MLSKVLPVLL GILLILQSRV EGPQTESKNE ASSRDVVYGP QPQPLENQLL SEETKSTETE 60
TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTVVTVE IAVNSLGPLS ILDMEYTIDI 120
IFSQTWYDER LCYNDTFESL VLNGNVVSQL WIPDTFFRNS KRTHEHEITM PNQMVRIYKD 180
GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSFSSFSYPE NEMIYKWENF KLEINEKNSW 240
KLFQFDFTGV SNKTEIITTP VGDFMVMTIF FNVSRRFGYV AFQNYVPSSV TTMLSWVSFW 300
IKTESAPART SLGITSVLTM TTLGTFSRKN FPRVSYITAL DFYIAICFVF CFCALLEFAV 360
LNFLIYNQTK AHASPKLRHP RINSRAHART RARSRACARQ HQEAFVCQIV TTEGSDGEER 420
PSCSAQQPPS PGSPEGPRSL CSKLACCEWC KRFKKYFCMV PDCEGSTWQQ GRLCIHVYRL 480
DNYSRVVFPV TFFFFNVLYW LVCLNL

Seq ID NO: 611 DNA sequence

Nucleic Acid Accession #: NM_021984.1

Coding sequence: 572..1753

1 11 21 31 41 51

| | | | | |

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG TACGACGAAC GCCTCTGTTA CAACGACACC TTTGAGTCTC TTGTTCTGAA 660 

TGGCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGGAATT CTAAGAGGAC 720 

CCACGAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG ATGGCAAGGT 780 

GTTGTACACA ATTAGGATGA CCATTGATGC CGGATGCTCA CTCCACATGC TCAGATTTCC 840 

AATGGATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGAGAT 900 

GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT GGAAGCTCTT 960 

CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTGA 1020 

CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGCGG TTTGGCTATG TTGCCTTTCA 1080 

AAACTATGTC CCTTCTTCCG TGACCACGAT GCTCTCCTGG GTTTCCTTTT GGATCAAGAC 1140 

AGAGTCTGCT CCAGCCCGGA CCTCTCTAGG GATCACCTCT GTTCTGACCA TGACCACGTT 1200 

GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT ATCACAGCCT TGGATTTCTA 1260

TATCGCCATC TGCTTCGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 1320 

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTCGTATCAA 1380 

TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCCGCC AACATCAGGA 1440

AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTG 1500 

CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 1560 

GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TGAGGGCAGT ACCTGGCAGC AGGCCCGCCT CTGCATCCAT GTCTACCGCC TGGATAACTA 1680 

CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 1740 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG 1860 

TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 2100 

TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 2160 

TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT CCCTTTAGGA AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400 

CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TCGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CCTGTGACTC TTTCTGTAAC 2580 

TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 2640

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT CGCCATTGAT TGGTGCCCAC 2700 

CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC TGGACCCATA AACCAGTCCA 2760 

CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC TTAAATTTGT 2820 

ATGGCACTGG AACTTTGGCA AAGCACTTTT GACAAGTTGT GTCTGATTGG AGCTTCATGA 2880 

TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCAGATG AAAACCCTGA 2940 

GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 3000 

CTTCTAGACC ACATGATAGG GCTAGACAGC TCAGTTCACC ATGATTCTCT TCTGTCACCT 3060 

CTGCTGGCAC ACCAGTGGCA AGGCCCAGAA TGGCGACCTC TCTTTAGCTC AATTTCTGGG 3120 

CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 3180 

ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTGT CTGTAATTTG 3240 

TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 3300 
AAATATGTAA ATAAATATAT CAGCAAAGC 

Seq ID NO: 612 Protein sequence 
Protein Accession #: NP_068819.1 



1 11 21 31 41 51

| | | | | |

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNVVSQLWIP DTFFRNSKRT HEHEITMPNQ 60

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 

INEKNSWKLF QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL 360 
CIHVYRLDNY SRVVFPVTFF FFNVLYWLVC LNL

Seq ID NO: 613 DNA sequence

Nucleic Acid Accession #: NM_021987.1

Coding sequence: 572..1657

1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 660 

CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCCGGATG 720 

CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT TCCGTGACCA CGATGCTCTC 1020

CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 1200 

GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 1320 

AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 1380

TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG GTAGCCCTGA 1440 

GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC GTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 

CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 

CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 1740 

CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 

AGGGTTTGTC TTTGCTGCCC CTGTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ATCCGTGATG 1920 

CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG CCTAAGTACA 1980 

GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 

GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG TATAACAGGA 2160 

GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTGCTACC 2220 

CCTTTCTCTG CAGATAGATA GACACTGGCA TTATCCCTTT AGGAAGAGGG GGGGGCAGCA 2280 

AGAGAGCCTA TTTGGGACAG CATTCCTCTC TCTCTGCTGC TGTGACATCT CCCTCTCCTT 2340 

GCTGGCTCCA TCTTTCGTCT GCACTACCAA TTCAATGCCC TTCATCCAAT GGGTATCTAT 2400 

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATG CCACCCTCTT CCTTCTCTTT 2460 

GACCCCTGTG ACTCTTTCTG TAACTTTCCC AGTGACTTCC CCTAGCCCTG ACCAGGCACT 2520 

AGGCCTTGGT GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TCGGCTTTGC AACAGGCATT 2580

ACTCGCCATT GATTGGTGCC CACCCAGGGC ACACTGTCGG AGTTCTATCA CTTGCTTGAC 2640 

CCCTGGACCC ATAAACCAGT CCACTGTTAT ACCCGGGGCA CTCTAACCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGGAACTTTG GCAAAGCACT TTTGACAAGT 2760 

TGTGTCTGAT TGGAGCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820 

CATTTTGCAG ATGAAAACCC TGAGTCACAG ATTTCTGTGG GACTGTGGAT CTCACTGGAA 2880 

GCTATCCAAG AGCCCACTGT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAGTTC 2940 

ACCATGATTC TCTTCTGTCA CCTCTGCTGG CACACCAGTG GCAAGGCCCA GAATGGCGAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCCCCAAGA TCAAATCTCT 3060 

CCTGGCTGTA GTAACCCAGT GGAATGAATT TGGACATGCC CCAATGCTTC TATATGCTAA 3120 

GTGAAATCTG TGTCTGTAAT TTGTTGGGGG GTGGATAGGG TGGGGTCTCC ATCTACTTTT 3180 
TGTCACCATC ATCTGAAATG GGGAAATATG TAAATAAATA TATCAGCAAA GC 

Seq ID NO: 614 Protein sequence 
Protein Accession #: NP_068822.1

1 11 21 31 41 51

| | | | | |

MEYTIDIIFS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 60 

SHSCPLSFSS FSYPENEMIY KWENFKLEIN EKNSWKLFQF DFTGVSNKTE IITTPVGDFM 120 

VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 180 

FSRKNFPRVS YITALDFYIA ICFVFCFCAL LEFAVLNFLI YNQTKAHASP KLRHPRINSR 240 

AHARTRARSR ACARQHQEAF VCQIVTTEGS DGEERPSCSA QQPPSPGSPE GPRSLCSKLA 300 

CCEWCKRFKK YFCMVPDCEG STWQQGRLCI HVYRLDNYSR VVFPVTFFFF NVLYWLVCLN 360
L 

Seq ID NO: 615 DNA sequence 

Nucleic Acid Accession #: NM_021990.1 

Coding sequence: 1309..2490 

1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 300 

CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 600 

CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 660 

TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 780 

TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 900 

CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAGAGC TTCATTTTTC GTTTGAATGT 1020 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG TGATGTTGTC 1080 

TATGGCCCCC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 1140 

GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 1200 

ATCCTGAGTA ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA GACCTGGTAC GACGAACGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTGAATGG CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCCGCATC 1500 

TACAAGGATG GCAAGGTGTT GTACACAATT AGGATGACCA TTGATGCCGG ATGCTCACTC 1560 

CACATGCTCA GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

TATCCTGAGA ATGAGATGAT CTACAAGTGG GAAAATTTCA AGCTTGAAAT CAATGAGAAG 1680 

AACTCCTGGA AGCTCTTCCA GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

ACAACCCCAG TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 1800 

GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCCGTGA CCACGATGCT CTCCTGGGTT 1860 

TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCCCGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 

ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 

CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 

GCCCGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 

CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 

TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 

TACCGCCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 

TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 

ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 

TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000 

ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 

CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 

CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 

TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 

GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 

GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 

ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 
CCCATAAACC AGTCCACTGT TATACCCGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 
ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 
GATTGGAGCT TCATGATAGC CTTGTGACAT CTTTAGGGCA GGATTCTTAT CCCCATTTTG 3660 
CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC 3720 
AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 
TTCTCTTCTG TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 
TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTGGCT 3900 
GTAGTAACCC AGTGGAATGA ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTGAAAT 3960 
CTGTGTCTGT AATTTGTTGG GGGGTGGATA GGGTGGGGTC TCCATCTACT TTTTGTCACC 4020 
ATCATCTGAA ATGGGGAAAT ATGTAAATAA ATATATCAGC AAAGC 

Seq ID NO: 616 Protein sequence 
Protein Accession #: NP_068830.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNVVSQLWIP DTFFRNSKRT HEHEITMPNQ 60 
MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 
INEKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 
LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 
ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 
GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQGRL 360 
CIHVYRLDNY SRVVFPVTFF FFNVLYWLVC LNL 

Seq ID NO: 617 DNA sequence 

Nucleic Acid Accession #: NM_004864.1 

Coding sequence: 26..952 

1 11 21 31 41 51 

I I I I I I 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 
AAAA 
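
The listing's layout (ten-base blocks, a running position counter at the end of each line, and a "Coding sequence: M..N" range) can be read mechanically. The following is a minimal sketch, not part of the original listing; it assumes the range is 1-based and inclusive and that the standard genetic code applies. The function names `join_blocks`, `parse_cds_range`, and `translate` are hypothetical helpers introduced only for illustration.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the listing uses the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(lines):
    """Join 10-base blocks into one string, dropping the running position counter."""
    blocks = []
    for line in lines:
        for token in line.split():
            if not token.isdigit():  # purely numeric tokens are position counters
                blocks.append(token.upper())
    return "".join(blocks)


def parse_cds_range(text):
    """Read 'Coding sequence: M..N' and return (M, N) as 1-based inclusive ints."""
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", text)
    if match is None:
        raise ValueError(f"no M..N range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def translate(seq, start, end):
    """Translate seq[start..end] (1-based, inclusive), stopping at the first stop codon."""
    cds = seq[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First printed line of Seq ID NO: 617, used here as sample input.
first_line = ["CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60"]
seq = join_blocks(first_line)
start, end = parse_cds_range("Coding sequence: 26..952")
print(translate(seq, start, end))  # compare with the start of Seq ID NO: 618
```

Because only the first 60 bases are supplied, the slice is shorter than the full coding range; the translated prefix can be compared against the opening residues of the protein sequence that follows.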

Seq ID NO: 618 Protein sequence 
Protein Accession #: NP_004855.1 

1 11 21 31 41 51 

I I I I I I 

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 60 

EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 120 

HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240 

IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 
LAKDCHCI 

Seq ID NO: 619 DNA sequence 

Nucleic Acid Accession #: NM_003979.2 

Coding sequence: 254..1327 

1 11 21 31 41 51 

I I I I I I 

ATAACAGCAT GAAGTGCCGT GGAACTGGAA TAGGCGTGTC CTCTCCCTCG ACCCTCCCCC 60 

TCCTTGTCCC TCTGCTCACC CCTCGCTCGT TCCCTCCCTC CGGCGAGGGC CGCCTTTATA 120 

ACAACTGCTC AGAGTGCGAG GGCGGGATAG CTGTCCAAGG TCTCCCCCAG CACTGAGGAG 180 

CTCGCCTGCT GCCCTCTTGC GCGCGGGAAG CAGCACCAAG TTCACGGCCA ACGCCTTGGC 240 

ACTAGGGTCC AGAATGGCTA CAACAGTCCC TGATGGTTGC CGCAATGGCC TGAAATCCAA 300 

GTACTACAGA CTTTGTGATA AGGCTGAAGC TTGGGGCATC GTCCTAGAAA CGGTGGCCAC 360 

AGCCGGGGTT GTGACCTCGG TGGCCTTCAT GCTCACTCTC CCGATCCTCG TCTGCAAGGT 420 

GCAGGACTCC AACAGGCGAA AAATGCTGCC TACTCAGTTT CTCTTCCTCC TGGGTGTGTT 480 

GGGCATCTTT GGCCTCACCT TCGCCTTCAT CATCGGACTG GACGGGAGCA CAGGGCCCAC 540 

ACGCTTCTTC CTCTTTGGGA TCCTCTTTTC CATCTGCTTC TCCTGCCTGC TGGCTCATGC 600 

TGTCAGTCTG ACCAAGCTCG TCCGGGGGAG GAAGCCCCTT TCCCTGTTGG TGATTCTGGG 660 

TCTGGCCGTG GGCTTCAGCC TAGTCCAGGA TGTTATCGCT ATTGAATATA TTGTCCTGAC 720 

CATGAATAGG ACCAACGTCA ATGTCTTTTC TGAGCTTTCC GCTCCTCGTC GCAATGAAGA 780 

CTTTGTCCTC CTGCTCACCT ACGTCCTCTT CTTGATGGCG CTGACCTTCC TCATGTCCTC 840 

CTTCACCTTC TGTGGTTCCT TCACGGGCTG GAAGAGACAT GGGGCCCACA TCTACCTCAC 900 

GATGCTCCTC TCCATTGCCA TCTGGGTGGC CTGGATCACC CTGCTCATGC TTCCTGACTT 960 

TGACCGCAGG TGGGATGACA CCATCCTCAG CTCCGCCTTG GCTGCCAATG GCTGGGTGTT 1020 

CCTGTTGGCT TATGTTAGTC CCGAGTTTTG GCTGCTCACA AAGCAACGAA ACCCCATGGA 1080 

TTATCCTGTT GAGGATGCTT TCTGTAAACC TCAACTCGTG AAGAAGAGCT ATGGTGTGGA 1140 

GAACAGAGCC TACTCTCAAG AGGAAATCAC TCAAGGTTTT GAAGAGACAG GGGACACGCT 1200 

CTATGCCCCC TATTCCACAC ATTTTCAGCT GCAGAACCAG CCTCCCCAAA AGGAATTCTC 1260 

CATCCCACGG GCCCACGCTT GGCCGAGCCC TTACAAAGAC TATGAAGTAA AGAAAGAGGG 1320 

CAGCTAACTC TGTCCTGAAG AGTGGGACAA ATGCAGCCGG GCGGCAGATC TAGCGGGAGC 1380 

TCAAAGGGAT GTGGGCGAAA TCTTGAGTCT TCTGAGAAAA CTGTACAAGA CACTACGGGA 1440 

ACAGTTTGCC TCCCTCCCAG CCTCAACCAC AATTCTTCCA TGCTGGGGCT GATGTGGGCT 1500 

AGTAAGACTC CAGTTCTTAG AGGCGCTGTA GTATTTTTTT TTTTTTGTCT CATCCTTTGG 1560 

ATACTTCTTT TAAGTGGGAG TCTCAGGCAA CTCAAGTTTA GACCCTTACT CTTTTTTTTT 1620 

GTTTTTTGAA ACAGGATCTT GCTCTGTCAC CCAGGCTTGA GTGCAGTGGT GCGATCACAG 1680 

CCCAGTGCAG CCTCGACCAC CTGTGCTCAA GCAATCCTCC CATCTCCATC TCCCAAAGTG 1740 

CTGGGATGAC AGGCGTGAGC CACAGCTCCC AGCCTAGGCC CTTAATCTTG CTGTTATTTT 1800 

CCATGGACTA AAGGTCTGGT CATCTGAGCT CACGCTGGCT CACACAGCTC TAGGGGCCTG 1860 

CTCCTCTAAC TCACAGTGGG TTTTGTGAGG CTCTGTGGCC CAGAGCAGAC CTGCATATCT 1920 

GAGCAAAAAT AGCAAAAGCC TCTCTCAGCC CACTGGCCTG AATCTACACT GGAAGCCAAC 1980 

TTGCTGGCAC CCCCGCTCCC CAACCCTTCT TGCCTGGGTA GGAGAGGCTA AAGATCACCC 2040 
TAAATTTACT CATCTCTCTA GTGCTGCCTC ACATTGGGCC TCAGCAGCTC CCCAGCACCA 2100 
ATTCACAGGT CACCCCTCTC TTCTTGCACT GTCCCCAAAC TTGCTGTCAA TTCCGAGATC 2160 
TAATCTCCCC CTACGCTCTG CCAGGAATTC TTTCAGACCT CACTAGCACA AGCCCGGTTG 2220 
CTCCTTGTCA GGAGAATTTG TAGATCATTC TCACTTCAAA TTCCTGGGGC TGATACTTCT 2280 
CTCATCTTGC ACCCCAACCT CTGTAAATAG ATTTACCGCA TTTAGGGCTG CATTCTGTAA 2340 
GTGGGCATGG TCTCCTAATG GAGGAGTGTT CATTGTATAA TAAGTTATTC ACCTGAGTAT 2400 
GCAATAAAGA TGTGGTGGCC ACTCTTTCAT GGTGGTGGCA GCAAAAAAAA AAAAAA 

Seq ID NO: 620 Protein sequence 
Protein Accession #: NP_003970.1 

1 11 21 31 41 51 

I I I I I I 

MATTVPDGCR NGLKSKYYRL CDKAEAWGIV LETVATAGVV TSVAFMLTLP ILVCKVQDSN 60 
RRKMLPTQFL FLLGVLGIFG LTFAFIIGLD GSTGPTRFFL FGILFSICFS CLLAHAVSLT 120 
KLVRGRKPLS LLVILGLAVG FSLVQDVIAI EYIVLTMNRT NVNVFSELSA PRRNEDFVLL 180 
LTYVLFLMAL TFLMSSFTFC GSFTGWKRHG AHIYLTMLLS IAIWVAWITL LMLPDFDRRW 240 
DDTILSSALA ANGWVFLLAY VSPEFWLLTK QRNPMDYPVE DAFCKPQLVK KSYGVENRAY 300 
SQEEITQGFE ETGDTLYAPY STHFQLQNQP PQKEFSIPRA HAWPSPYKDY EVKKEGS 

Seq ID NO: 621 DNA sequence 

Nucleic Acid Accession #: NM_002423.2 

Coding sequence: 48..851 

1 11 21 31 41 51 

I I I I I I 

ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 60 
TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 120 
GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 180 
ATGACTCAGA AACAAAAAAT GCCAACAGTT TAGAAGCCAA ACTCAAGGAG ATGCAAAAAT 240 
TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 300 
CCAGATGTGG AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 360 
CTTCCAAAGT GGTCACCTAC AGGATCGTAT CATATACTCG AGACTTACCG CATATTACAG 420 
TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 480 
GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 540 
ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 600 
GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 660 
GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 720 
CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 780 
AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA AGTAATTCAA 840 
GAAAGAAATA GAAACTTCAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 900 
TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 960 
CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 1020 
ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 1080 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 

Seq ID NO: 622 Protein sequence 
Protein Accession #: NP_002414.1 

1 11 21 31 41 51 

I I I I I I 

MRLTVLCAVC LLPGSLALPL PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 60 
EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKVVT YRIVSYTRDL 120 
PHITVDRLVS KALNMWGKEI PLHFRKVVWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 180 
APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 240 
PQNFKLSQDD IKGIQKLYGK RSNSRKK 

Seq ID NO: 623 DNA sequence 
Nucleic Acid Accession #: NM_031457.1 
Coding sequence: 204..956 

1 11 21 31 41 51 

I I I I I I 

AAACAGGAAA TAAATACGAA TGAAACTGAG CTCTAAGCAG CATGTAACCT GGCCTGCATC 60 
CAGGAAATAG AGGACTTCGG ATCCTTCTAA CCCTACCACC CAACTGGCCC CAGTACATTC 120 
ATTCTCTCAG GAAAAAAAAC AAGGTCCCCA CAGCAAAGAA AAGGAATAGG ATCAAGAGAT 180 
ACGTGGCTGC TGGCAGAGCA AGCATGAATT CGATGACTTC AGCAGTTCCG GTGGCCAATT 240 
CTGTGTTGGT GGTGGCACCC CACAATGGTT ATCCTGTGAC CCCAGGAATT ATGTCTCACG 300 
TGCCCCTGTA TCCAAACAGC CAGCGGCAAG TCCACCTAGT TCCTGGGAAC CCACCTAGTT 360 
TGGTGTCGAA TGTGAATGGG CAGCCTGTGC AGAAAGCTCT GAAAGAAGGC AAAACCTTGG 420 
GGGCCATCCA GATCATCATT GGCCTGGCTC ACATCGGCCT CGGCTCCATC ATGGCGACGG 480 
TTCTCGTAGG GGAATACCTG TCTATTTCAT TCTACGGAGG CTTTCCCTTC TGGGGAGGCT 540 
TGTGGTTTAT CATTTCAGGA TCTCTCTCCG TGGCAGCAGA AAATCAGCCA TATTCTTATT 600 
GCCTGCTGTC TGGCAGTTTG GGCTTGAACA TCGTCAGTGC AATCTGCTCT GCAGTTGGAG 660 
TCATACTCTT CATCACAGAT CTAAGTATTC CCCACCCATA TGCCTACCCC GACTATTATC 720 
CTTACGCCTG GGGTGTGAAC CCTGGAATGG CGATTTCTGG CGTGCTGCTG GTCTTCTGCC 780 
TCCTGGAGTT TGGCATCGCA TGCGCATCTT CCCACTTTGG CTGCCAGTTG GTCTGCTGTC 840 
AATCAAGCAA TGTGAGTGTC ATCTATCCAA ACATCTATGC AGCAAACCCA GTGATCACCC 900 
CAGAACCGGT GACCTCACCA CCAAGTTATT CCAGTGAGAT CCAAGCAAAT AAGTAAGGCT 960 
ACAGATTCTG GAAGCATCTT TCACTGGGAC CAAAAGAAGT CCTCCTCCCT TTCTGGGCTT 1020 
CCATAACCCA GGTCGTTCCT GTTCTGACAG CTGAGGAAAC GTCTCTCCCA CTGTTTGTAC 1080 
TCTCACCTTC ATTCTTCAAT TCAGTCTAGG AAACCATGCT GTTTCTCTAT CAAGAAGAAG 1140 
ACAGAGATTT TAAACAGATG TTAACCAAGA GGGACTCCCT AGGGCACATG CATCAGCACA 1200 
TATGTGGGCA TCCAGCCTCT GGGGCCTTGG CACACACACA TTCGTGTGCT CTGCTGCATG 1260 

WO 03/042661 



PCT/US02/36810 



TGAGCTTGTG GGTTAGAGGA ACAAATATCT AGACATTCAA TCTTCACTCT TTCAATTGTG 1320 
CATTCATTTA ATAAATAGAT ACTGAGCATT CAAAAAAAAA AAAAAAAAA 

Seq ID NO: 624 Protein sequence 
Protein Accession #: NP_113645.1 

1 11 21 31 41 51 

I I I I I I 

MNSMTSAVPV ANSVLVVAPH NGYPVTPGIM SHVPLYPNSQ PQVHLVPGNP PSLVSNVNGQ 60 
PVQKALKEGK TLGAIQIIIG LAHIGLGSIM ATVLVGEYLS ISFYGGFPFW GGLWFIISGS 120 
LSVAAENQPY SYCLLSGSLG LNIVSAICSA VGVILFITDL SIPHPYAYPD YYPYAWGVNP 180 
GMAISGVLLV FCLLEFGIAC ASSHFGCQLV CCQSSNVSVI YPNIYAANPV ITPEPVTSPP 240 
SYSSEIQANK 

Seq ID NO: 625 DNA sequence 

Nucleic Acid Accession #: NM_005221.3 
Coding sequence: 1..870 

1 11 21 31 41 51 

I I I I I I 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 300 

TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840 

CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 

Seq ID NO: 626 Protein sequence 
Protein Accession #: NP_005212.1 

1 11 21 31 41 51 

I I I I I I 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 
PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 
EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 
VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 

Seq ID NO: 627 DNA sequence 
Nucleic Acid Accession #: NM_014420 
Coding sequence: 118..792 

1 11 21 31 41 51 

I I I I I I 

GCACGAGAGA CGACGTGCTG AGCTGCCAGC TTAGTGGAAG CTCTGCTCTG GGTGGAGAGC 60 

AGCCTCGCTT TGGTGACGCA CAGTGCTGGG ACCCTCCAGG AGCCCCGGGA TTGAAGGATG 120 

GTGGCGGCCG TCCTGCTGGG GCTGAGCTGG CTCTGCTCTC CCCTGGGAGC TCTGGTCCTG 180 

GACTTCAACA ACATCAGGAG CTCTGCTGAC CTGCATGGGG CCCGGAAGGG CTCACAGTGC 240 

CTGTCTGACA CGGACTGCAA TACCAGAAAG TTCTGCCTCC AGCCCCGCGA TGAGAAGCCG 300 

TTCTGTGCTA CATGTCGTGG GTTGCGGAGG AGGTGCCAGC GAGATGCCAT GTGCTGCCCT 360 

GGGACACTCT GTGTGAACGA TGTTTGTACT ACGATGGAAG ATGCAACCCC AATATTAGAA 420 

AGGCAGCTTG ATGAGCAAGA TGGCACACAT GCAGAAGGAA CAACTGGGCA CCCAGTCCAG 480 

GAAAACCAAC CCAAAAGGAA GCCAAGTATT AAGAAATCAC AAGGCAGGAA GGGACAAGAG 540 

GGAGAAAGTT GTCTGAGAAC TTTTGACTGT GGCCCTGGAC TTTGCTGTGC TCGTCATTTT 600 

TGGACGAAAA TTTGTAAGCC AGTCCTTTTG GAGGGACAGG TCTGCTCCAG AAGAGGGCAT 660 

AAAGACACTG CTCAAGCTCC AGAAATCTTC CAGCGTTGCG ACTGTGGCCC TGGACTACTG 720 

TGTCGAAGCC AATTGACCAG CAATCGGCAG CATGCTCGAT TAAGAGTATG CCAAAAAATA 780 

GAAAAGCTAT AAATATTTCA AAATAAAGAA GAATCCACAT TGCAAAAAAA AAAAAAAAAA 840 
A 

Seq ID NO: 628 Protein sequence 
Protein Accession #: NP_055235 

1 11 21 31 41 51 

I I I I I I 

MVAAVLLGLS WLCSPLGALV LDFNNIRSSA DLHGARKGSQ CLSDTDCNTR KFCLQPRDEK 60 

PFCATCRGLR RRCQRDAMCC PGTLCVNDVC TTMEDATPIL ERQLDEQDGT HAEGTTGHPV 120 

QENQPKRKPS IKKSQGRKGQ EGESCLRTFD CGPGLCCARH FWTKICKPVL LEGQVCSRRG 180 

HKDTAQAPEI FQRCDCGPGL LCRSQLTSNR QHARLRVCQK IEKL 



Seq ID NO: 629 DNA sequence 

Nucleic Acid Accession #: NM_002448.1 

Coding sequence: 241..1134 

1 11 21 31 41 51 

I I I I I I 

GCGCGAGTGC TCCCGGGAAC TCTGCCTGCG CGGCGGCAGC GACCGGAGGC CAGGCCCAGC 60 
ACGCCGGAGC TGGCCTGCTG GGGAGGGGCG GGAGGCGCGC GCGGGAGGGT CCGCCCGGCC 120 
AGGCCCCGGG CCCTCGCAGA GGCCGGCCGC GGTCCCAGCC CGCCCGGAGC CCATGCCCGG 180 
CGGCTGGCCA GTGCTGCGGC AGAAGGGGGG GCCCGGCTCT GCATGGCCCC GGCTGCTGAC 240 
ATGACTTCTT TGCCACTCGG TGTCAAAGTG GAGGACTCCG CCTTCGGCAA GCCGGCGGGG 300 
GGAGGCGCGG GCCAGGCCCC CAGCGCCGCC GCGGCCACGG CAGCCGCCAT GGGCGCGGAC 360 
GAGGAGGGGG CCAAGCCCAA AGTGTCCCCT TCGCTCCTGC CCTTCAGCGT GGAGGCGCTC 420 
ATGGCCGACC ACAGGAAGCC GGGGGCCAAG GAGAGCGCCC TGGCGCCCTC CGAGGGCGTG 480 
CAGGCGGCGG GTGGCTCGGC GCAGCCACTG GGCGTCCCGC CGGGGTCGCT GGGAGCCCCG 540 
GACGCGCCCT CTTCGCCGCG GCCGCTCGGC CATTTCTCGG TGGGGGGACT CCTCAAGCTG 600 
CCAGAAGATG CGCTCGTCAA AGCCGAGAGC CCCGAGAAGC CCGAGAGGAC CCCGTGGATG 660 
CAGAGCCCCC GCTTCTCCCC GCCGCCGGCC AGGCGGCTGA GCCCCCCAGC CTGCACCCTC 720 
CGCAAACACA AGACGAACCG TAAGCCGCGG ACGCCCTTCA CCACCGCGCA GCTGCTGGCG 780 
CTGGAGCGCA AGTTCCGCCA GAAGCAGTAC CTGTCCATCG CCGAGCGCGC GGAGTTCTCC 840 
AGCTCGCTCA GCCTCACTGA GACGCAGGTG AAGATATGGT TCCAGAACCG CCGCGCCAAG 900 
GCAAAGAGAC TACAAGAGGC AGAGCTGGAG AAGCTGAAGA TGGCCGCCAA GCCCATGCTG 960 
CCACCGGCTG CCTTCGGCCT CTCCTTCCCT CTCGGCGGCC CCGCAGCTGT AGCGGCCGCG 1020 
GCGGGTGCCT CGCTCTACGG TGCCTCTGGC CCCTTCCAGC GCGCCGCGCT GCCTGTGGCG 1080 
CCCGTGGGAC TCTACACGGC CCATGTGGGC TACAGCATGT ACCACCTGAC ATAGAGGGTC 1140 
CCAGGTCCCC ACCTGTGGGC CAGCCGATTC CTCCAGCCCT GGTGCTGTAC CCCCGACGTG 1200 
CTCCCCTGCT CGGCACCGCC AGCCGCCTTC CCTTTAACCC TCACACTGCT CCAGTTTCAC 1260 
[illegible] CCTGAGTTCA CTCTCCGAAG TCTGATCCCT GCCAAAAAGT GGCTGGAAGA 1320 
GTCCCTTAGT ACTCTTCTAG CATTTAGATC TACACTCTCG AGTTAAAGAT GGGGAAACTG 1380 
AGGGCAGAGA GGTTAACAGA TTTATCTAGG GTCCCCAGCA GAATTGACAG TTGAACAGAG 1440 
CTAGAGGCCA TGTCTCCTGC ATAGCTTTTC CCTGTCCTGA CACCAGGCAA GAAAAGCGCA 1500 
GAGAAATCGG TGTCTGACGA TTTTGGAAAT GAGAACAATC TCAAAAAAAA AAAAAAAAAA 1560 
AAAAAAAAAA GAAAAGAGAA AAAAAAGACT AGCCAGCCAG GAAGATGAAT CCTAGCTTCT 1620 
TCCATTGGAA AATTTAAGAC AAGTTCAACA ACAAAACATT TGCTCTGGGG GGCAGGGAAA 1680 
ACACAGATGT GTTGCAAAGG TAGGTTGAAG GGA 

Seq ID NO: 630 Protein sequence 
Protein Accession #: NP_002439.1 

1 11 21 31 41 51 

I I I I I I 

MTSLPLGVKV EDSAFGKPAG GGAGQAPSAA AATAAAMGAD EEGAKPKVSP SLLPFSVEAL 60 
MADHRKPGAK ESALAPSEGV QAAGGSAQPL GVPPGSLGAP DAPSSPRPLG HFSVGGLLKL 120 
PEDALVKAES PEKPERTPWM QSPRFSPPPA RRLSPPACTL RKHKTNRKPR TPFTTAQLLA 180 
LERKFRQKQY LSIAERAEFS SSLSLTETQV KIWFQNRRAK AKRLQEAELE KLKMAAKPML 240 
PPAAFGLSFP LGGPAAVAAA AGASLYGASG PFQRAALPVA PVGLYTAHVG YSMYHLT 



Seq ID NO: 631 DNA sequence 

Nucleic Acid Accession #: NM_002557.1 

Coding sequence: 13..2049 

1 11 21 31 41 51 

I I I I I I 

CAGACCATTG AGATGTGGAA GCTGTTGCTG TGGGTTGGGC TGGTTCTTGT GCTGAAACAC 60 
CACGATGGTG CTGCCCATAA ACTCGTGTGT TATTTCACCA ACTGGGCACA CAGTCGGCCA 120 
GGCCCTGCCT CGATCTTGCC CCATGACCTG GACCCCTTTC TCTGCACCCA CCTGATATTT 180 
GCCTTTGCCT CAATGAACAA CAATCAGATT GTTGCTAAGG ATCTCCAGGA TGAGAAAATT 240 
CTCTACCCAG AGTTCAACAA ACTAAAGGAG AGGAACAGAG AGCTGAAAAC ACTACTGTCC 300 
ATCGGCGGGT GGAACTTTGG CACCTCAAGA TTCACCACTA TGTTGTCCAC ATTTGCCAAC 360 
CGTGAAAAGT TTATTGCTTC AGTTATATCC CTTCTGAGGA CACATGACTT TGATGGTCTT 420 
GACCTTTTCT TCTTATATCC TGGACTAAGA GGCAGCCCCA TGCATGACCG GTGGACTTTT 480 
CTCTTCTTAA TTGAAGAGCT CCTGTTTGCC TTCCGGAAGG AGGCACTGCT CACCATGCGC 540 
CCGAGGCTGC TGCTGTCTGC TGCTGTTTCT GGGGTCCCAC ACATCGTCCA AACATCCTAT 600 
GATGTGCGCT TTCTAGGAAG ACTCCTGGAT TTCATCAATG TCTTGTCTTA TGACTTACAT 660 
GGAAGTTGGG AAAGGTTCAC AGGACATAAT AGCCCCCTCT TCTCTCTGCC TGAAGACCCC 720 
AAATCTTCGG CATATGCTAT GAATTATTGG AGAAAGCTTG GGGCACCCTC AGAGAAGCTC 780 
ATCATGGGGA TCCCCACCTA TGGACGTACC TTTCGCCTCC TCAAAGCCTC TAAGAATGGG 840 
TTGCAGGCCA GAGCGATCGG ACCAGCATCT CCAGGGAAGT ACACCAAGCA AGAAGGCTTC 900 
TTGGCTTATT TTGAGATTTG TTCCTTTGTC TGGGGAGCGA AGAAGCACTG GATTGATTAC 960 
CAGTATGTCC CGTATGCCAA CAAGGGGAAA GAGTGGGTTG GCTATGACAA TGCCATCAGC 1020 
TTCAGTTACA AGGCATGGTT TATAAGGCGA GAGCATTTTG GGGGGGCCAT GGTGTGGACA 1080 
TTGGACATGG ATGACGTCAG GGGCACGTTC TGTGGCACTG GCCCTTTCCC CCTTGTCTAC 1140 
GTATTGAATG ATATCCTGGT GCGGGCTGAG TTCAGTTCAA CTTCTTTACC ACAATTTTGG 1200 
CTGTCATCTG CTGTGAATTC TTCAAGCACT GACCCTGAAA GGCTGGCTGT GACCACGGCA 1260 
TGGACCACTG ATAGTAAGAT TTTGCCCCCA GGAGGAGAGG CTGGGGTCAC TGAGATCCAC 1320 
GGAAAGTGTG AAAATATGAC TATAACCCCT AGAGGTACAA CTGTGACCCC TACAAAGGAA 1380 
ACTGTATCCC TTGGAAAGCA CACTGTAGCT CTAGGAGAGA AGACTGAGAT CACTGGGGCA 1440 
ATGACCATGA CTTCTGTGGG TCATCAGTCC ATGACCCCTG GAGAGAAGGC CCTGACCCCT 1500 
GTGGGTCATC AATCTGTGAC CACTGGACAG AAGACCCTGA CCTCTGTGGG TTATCAGTCT 1560 
GTGACCCCTG GGGAAAAGAC CCTGACCCCT GTGGGTCATC AGTCTGTGAC CCCTGTGAGT 1620 
CATCAGTCTG TGAGCCCTGG AGGAACGACT ATGACCCCTG TCCATTTTCA GACTGAGACC 1680 
CTTAGACAGA ATACAGTGGC CCCTAGAAGG AAGGCTGTGG CCCGTGAAAA GGTGACTGTC 1740 
CCCTCCAGAA ACATATCAGT CACCCCTGAA GGGCAGACTA TGCCTTTAAG AGGGGAGAAT 1800 
TTGACTTCTG AGGTGGGCAC TCACCCCAGG ATGGGTAACT TGGGTCTTCA GATGGAAGCT 1860 
GAAAACAGGA TGATGCTGTC CTCCAGCCCC GTCATCCAGC TCCCGGAACA AACTCCTCTA 1920 
GCTTTTGACA ACCGCTTTGT TCCCATCTAT GGAAACCATT CCTCTGTCAA CTCAGTAACC 1980 
CCTCAAACAA GTCCTCTTTC TCTAAAAAAA GAAATCCCAG AAAACTCTGC TGTGGATGAA 2040 
GAAGCCTAAG CCCCTCTGGT GTCAGAAACC AGGGAAAACC CTTGTCTTTT CTTCTAAGTG 2100 
ACATGTTGGA AGCCTTCTCA TCCCGGGGCA AAGCAGGCAT CAAAACCAGA ATAGGCCAAT 2160 
CTCTTTTCCA TTAAATAAAC TGTAAACACA AGAACCCA 

Seq ID NO: 632 Protein sequence 
Protein Accession #: NP_002548.1 

1 11 21 31 41 51 

I I I I I I 

MWKLLLWVGL VLVLKHHDGA AHKLVCYFTN WAHSRPGPAS ILPHDLDPFL CTHLIFAFAS 60 

MNNNQIVAKD LQDEKILYPE FNKLKERNRE LKTLLSIGGW NFGTSRFTTM LSTFANREKF 120 

IASVISLLRT HDFDGLDLFF LYPGLRGSPM HDRWTFLFLI EELLFAFRKE ALLTMRPRLL 180 

LSAAVSGVPH IVQTSYDVRF LGRLLDFINV LSYDLHGSWE RFTGHNSPLF SLPEDPKSSA 240 

YAMNYWRKLG APSEKLIMGI PTYGRTFRLL KASKNGLQAR AIGPASPGKY TKQEGFLAYF 300 

EICSFVWGAK KHWIDYQYVP YANKGKEWVG YDNAISFSYK AWFIRREHFG GAMVWTLDMD 360 

DVRGTFCGTG PFPLVYVLND ILVRAEFSST SLPQFWLSSA VNSSSTDPER LAVTTAWTTD 420 

SKILPPGGEA GVTEIHGKCE NMTITPRGTT VTPTKETVSL GKHTVALGEK TEITGAKTMT 480 

SVGHQSMTPG EKALTPVGHQ SVTTGQKTLT SVGYQSVTPG EKTLTPVGHQ SVTPVSHQSV 540 

SPGGTTMTPV HFQTETLRQN TVAPRRKAVA REKVTVPSRN ISVTPEGQTM PLRGENLTSE 600 

VGTHPRMGNL GLQMEAENRM MLSSSPVIQL PEQTPLAFDN RFVPIYGNHS SVNSVTPQTS 660 
PLSLKKEIPE NSAVDEEA 

Seq ID NO: 633 DNA sequence 

Nucleic Acid Accession #: NM_003685.1 

Coding sequence: 98..1021 

1 11 21 31 41 51 

I I I I I I 

AAACTCAGAA TTTTCGCGGG CTCGGTGAGC GGTTTTATCC CTCCGGCCGG CAGGCTGGGC 60 

GCAGGGGGCG AGCCCCCGCC CGGCGCGCAG CAGCACCATG GGCACGGTGC TGTCCCTGTC 120 

TCCCAGCTAC CGGAAGGCCA CGCTGTTTGA GGATGGCGGG GCCACCGTGG GCCACTATAC 180 

GGCCGTACAG AACAGCAAGA ACGCCAAGGA CAAGAACCTG AAGCGCCACT CCATCATCTC 240 

CGTGCTGCCT TGGAAGAGAA TCGTGGCCGT GTCGGCCAAG AAGAAGAACT CCAAGAAGGT 300 

GCAGCCTAAC AGCAGCTACC AGAACAACAT CACGCACCTC AACAATGAGA ACCTGAAGAA 360 

GTCGCTGTCG TGCGCCAACC TGTCCACATT CGCCCAGCCC CCACCGGCCC AGCCGCCTGC 420 

ACCCCCGGCC AGCCAGCTCT CGGGTTCCCA GACCGGGGGC TCCTCCTCAG TCAAGAAAGC 480 

CCCTCACCCT GCCGTCACCT CCGCAGGGAC GCCCAAACGG GTCATCGTCC AGGCGTCCAC 540 

CAGTGAGCTG CTTCGCTGCC TGGGTGAGTT TCTCTGCCGC CGGTGCTACC GCCTGAAGCA 600 

CCTGTCCCCC ACGGACCCCG TGCTCTGGCT GCGCAGCGTG GACCGCTCGC TGCTTCTGCA 660 

GGGCTGGCAG GACCAGGGCT TCATCACGCC GGCCAACGTG GTCTTCCTCT ACATGCTCTG 720 

CAGGGATGTT ATCTCCTCCG AGGTGGGCTC GGATCACGAG CTCCAGGCCG TCCTGCTGAC 780 

ATGCCTGTAC CTCTCCTACT CCTACATGGG CAACGAGATC TCCTACCCGC TCAAGCCCTT 840 

CCTGGTGGAG AGCTGCAAGG AGGCCTTTTG GGACCGTTGC CTCTCTGTCA TCAACCTCAT 900 

GAGCTCAAAG ATGCTGCAGA TAAATGCCGA CCCACACTAC TTCACACAGG TCTTCTCCGA 960 

CCTGAAGAAC GAGAGCGGCC AGGAGGACAA GAAGCGGCTC CTCCTAGGCC TGGATCGGTG 1020 

AGCACTGTAG CCTGCGTCAT GGCTCAAGGA TTCAATGCAT TTTTAAGAAT TTATTATTAA 1080 
ATCAGTTTTG TGTACAG 

Seq ID NO: 634 Protein sequence 
Protein Accession #: NP_003876.1 

1 11 21 31 41 51 

I I I I I I 

MGTVLSLSPS YRKATLFEDG AATVGHYTAV QNSKNAKDKN LKRHSIISVL PWKRIVAVSA 60 

KKKNSKKVQP NSSYQNNITH LNNENLKKSL SCANLSTFAQ PPPAQPPAPP ASQLSGSQTG 120 

GSSSVKKAPH PAVTSAGTPK RVIVQASTSE LLRCLGEFLC RRCYRLKHLS PTDPVLWLRS 180 

VDRSLLLQGW QDQGFITPAN VVFLYMLCRD VISSEVGSDH ELQAVLLTCL YLSYSYMGNE 240 

ISYPLKPFLV ESCKEAFWDR CLSVINLMSS KMLQINADPH YFTQVFSDLK NESGQEDKKR 300 
LLLGLDR 






TABLE 79A 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

Seq ID No.: Sequence identification number linking information in Table 79A to sequences in Table 80 
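
The five columns defined in the legend can be held in a small record type. The sketch below is illustrative only: the class name `Table79ARow`, the helper names, and the tab-separated input format are assumptions, not something the table specifies. The sample row is assembled from the first entry of each column as printed, on the assumption that those entries belong to the same row.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Table79ARow:
    pkey: int                   # Pkey: Eos probeset identifier
    ex_accn: str                # ExAccn: exemplar Genbank accession
    unigene_id: Optional[str]   # UnigeneID, e.g. "Hs.143131"; None when absent
    unigene_title: str          # Unigene Title (may be truncated in the table)
    seq_ids: List[str]          # Seq ID No. entries, e.g. ["C1", "C217"]


def parse_seq_ids(text: str) -> List[str]:
    """Extract identifiers such as C1 and C217 from 'Seq ID No. C1 & C217'."""
    return re.findall(r"C\d+", text)


def parse_row(line: str, sep: str = "\t") -> Table79ARow:
    """Parse one row; a five-field, tab-separated layout is assumed here."""
    pkey, ex_accn, unigene_id, title, seq = [f.strip() for f in line.split(sep)]
    return Table79ARow(int(pkey), ex_accn, unigene_id or None, title,
                       parse_seq_ids(seq))


row = parse_row(
    "424212\tNM_005814\tHs.143131\tglycoprotein A33 (transmembrane)\t"
    "Seq ID No. C1 & C217"
)
print(row.pkey, row.ex_accn, row.seq_ids)
```

Keeping the Seq ID entries as a list accommodates rows that carry a single identifier as well as rows that carry a pair.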
Pkey ExAccn UnigeneID Unigene Title Seq ID No. 


424212 
424503 
418007 
418007 
418738 
4438^6 
409956 
422867 
444381 
421582 
411789 
452281 
428698 
421552 
425247 
432201 
447377 
446921 



432179 
422578 



447033 

447033 

411975 

434206 

423936 

447400 

449032 

415214 

443247 

422048 

410418 

446342 

411274 

104978 

422260 

409041 

420344 

422163 

437935 

422330 

408908 

407811 

437852 

408243 

426088 

439738 

419741 

450983 

417771 

421379 

442006 

413048 

443324 

424917 

424917 

444527 

442652 

450726 

416965 

442275 

431745 

431745 

453142 

421659 

444090 

421563 

430413 

414991 

419833 

424943 



NM_005814 

NM_002205 

M13509 

M13509 

AW388633 

AI085198 

AW103364 

L32137 

BE387335 

AI910275 

AF245505 

T93500 

AAB52773 

AF026692 

NM_005940 

AI538613 

X77343 

AB012113 

AU076801 

X75208 

AF239666 

AW630041 

AI357412 

AI357412 

AI916058 

AW136973 

U77629 

AK000322 

M045573 

AI445236 

BE614387 

NM_012445 

D31382 

BE298665 

NM_002776 

AI199268 

AA315993 

AB033025 

BE463721 

AF027208 

AW939591 

D30783 

BE296227 

AW190902 

BE001836 

Y00787 

AF038007 

BE246502 

NM_007019 

AA305384 

AA804698 

Y15221 

AW975183 

M93221 

R44013 

AI636208 

AI636208 

NM_005408 

AI005163 

AW204600 

N26223 

AW449467 

AW972448 

AW972448 

AA033648 

NM_014459 

S59115 

NM_006433 

AW842182 

C17898 

AA251131 

AU077260 



Hs.83169 
Hs.83169 
Hs.6682 



Hs.143131 glycoprotein A33 (transmembrane) 
Hs.149609 integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 1 (interstitial 
matrix metalloproteinase 1 (interstitial 
sohite carrier family 7, (calionic amino 
Hs.1 64226 Thrombos port din 1 
Hs.727 bihlbin, beta A (acfivin A, acfivin AB a 
Hs.1584 cartilage oligomeric matrix protein (pse 
Hs.283713 hypothetical protein BC014245 
Hs.350470 trefoil factor 1 (breast cancer, estroge 
Hs.72157 Adlican 

Hs.28792 Homo sapiens cONA FU11041 lis, clone PL 
Ks.334838 K1AA1866 protein 
Hs.1 0570O secreted frizzled-related protein 4 
Hs. 1 55324 matrix metalloproteinase 1 1 (strometysin 
Hs.298241 Transmembrane protease, serine 3 
Hs.334334 transcription factor AP-2 alpha 
Hs.1 6530 small inducible cytokine subfamily A (Cy 
cadherin 17, LI cadherin (llver-intestin 
EphB3 

caudal type homeo box transcription fact 
suppression of lumorigenicity 14 (colon 
Predicted gene: Eos cloned; secreted WV 
Predicted gene: Eos cloned; secreted w/V 
Hs.1 44583 3UTR of: dead ringer (Drosophaa)-fike 
Hs.362915 ESTs,WeaWy similar to S69890 mitogen 1 
Hs.1 35639 achaete-scute complex (Drosophfla) homol 
Hs.1 8457 hypothetical protein FLJ20315 
Hs.22900 nuclear factor (erythroid-derived 2)-lik . 
Hs.125124 Eph82 
Hs.333893 c-Myc target JP01 

spondin 2, extracellular matrix protein 
transmembrane protease, serine 4 
solute carrier family 7 (cafionic amino 
kaffitaeh.10 

Homo sapiens, Similar to RIKEN cDNA 2010
Hs.105484 regenerating gene type IV
Hs.50081 Hypothetical protein, XP_051860 (KIAA119
Hs.97101 putative G protein-coupled receptor
Hs.112360 prominin (mouse)-like 1
Hs.5940 mucin 13, epithelial transmembrane
Hs.115263 epiregulin
Hs.250822 serine/threonine kinase 15
Hs.40098 cysteine knot superfamily 1, BMP antagon
Hs.256897 putative GPCR
Hs.624 interleukin 8
Hs.166196 ATPase, Class I, type 8B, member 1
Hs.9598 sema domain, immunoglobulin domain (Ig),
Hs.93002 ubiquitin carrier protein E2-C
Hs.25740 ERO1 (S. cerevisiae)-like
Hs.82547 retinoic acid receptor responder (tazaro
Hs.103982 small inducible cytokine subfamily B (Cy
Hs^72210 ESTs, Weakly similar to S72482 hypotheti
Hs.75182 mannose receptor, C type 1
Hs.164225 ESTs 

Hs.96901 hypothetical protein FLJ23049
hypothetical protein FLJ23049
small inducible cytokine subfamily A (Cy
Hs.201378 Homo sapiens cDNA FLJ40427 fis
Hs.355462 HUMPSPBA Human pulmonary surfactant-asso
Hs.160436 MDAC1

Hs.54795 Homo sapiens secretoglobin, family 3A, m
Hs.163425 Novel FGENESH predicted cadherin repeat 
Hs.163425 Novel FGENESH predicted cadherin repeat 
Homo sapiens gap junction protein, alpha 



Hs.89436 

Hs.2913 

Hs.1545 

Hs.56937 

Hs.157601 

Hs.157601 



Hs.288126 

Hs.63325 

Hs.14846 

Hs.69423 

Hs.19322 



Hs.96901 
Hs.11383



Hs.7473 
Hs.106511 
Hs.10306 



17 



natural killer cell group 7 sequence 
Hs.105806 granulysin

Hs.241392 small inducible cytokine A5 (RANTES)

Homo sapiens upregulated by BCG-CWS (LO
Hs.220697 Homo sapiens tryptophanyl-tRNA synthetas
Hs.153924 death-associated protein kinase 1



Seq ID No. 

Seq ID No. C1 & C217
Seq ID No. C2 & C218
Seq ID No. C3 & C219
Seq ID No. C4 & C220
Seq ID No. C5 & C221
Seq ID No. C6 & C222
Seq ID No. C7 & C223
Seq ID No. C8 & C224
Seq ID No. C9 & C225
Seq ID No. C10 & C226
Seq ID No. C11 & C227
Seq ID No. C12
Seq ID No. C13 & C228
Seq ID No. C14 & C229
Seq ID No. C15 & C230
Seq ID No. C16 & C231
Seq ID No. C17 & C232
Seq ID No. C18 & C233
Seq ID No. C19 & C234
Seq ID No. C20 & C235
Seq ID No. C21 & C236
Seq ID No. C22 & C237
Seq ID No. C23 & C238
Seq ID No. C24 & C239
Seq ID No. C25 & C240
Seq ID No. C26 & C241
Seq ID No. C27 & C242
Seq ID No. C28 & C243
Seq ID No. C29 & C244
Seq ID No. C30 & C245
Seq ID No. C31 & C246
Seq ID No. C32 & C247
Seq ID No. C33 & C248
Seq ID No. C34 & C249
Seq ID No. C35 & C250
Seq ID No. C36 & C251
Seq ID No. C37 & C252
Seq ID No. C38 & C253
Seq ID No. C39 & C254
Seq ID No. C40 & C255
Seq ID No. C41 & C256
Seq ID No. C42 & C257
Seq ID No. C43 & C258
Seq ID No. C44 & C259
Seq ID No. C45 & C260
Seq ID No. C46 & C261
Seq ID No. C47 & C262
Seq ID No. C48 & C263
Seq ID No. C49 & C264
Seq ID No. C50 & C265
Seq ID No. C51 & C266
Seq ID No. C52 & C267
Seq ID No. C53 & C268
Seq ID No. C54 & C269
Seq ID No. C55 & C270
Seq ID No. C56 & C271
Seq ID No. C57 & C272
Seq ID No. C58 & C273
Seq ID No. C59 & C274
Seq ID No. C60 & C275
Seq ID No. C61 & C276
Seq ID No. C62 & C277
Seq ID No. C63 & C278
Seq ID No. C64 & C279
Seq ID No. C65 & C280
Seq ID No. C66 & C281
Seq ID No. C67 & C282
Seq ID No. C68 & C283
Seq ID No. C69 & C284
Seq ID No. C70 & C285
Seq ID No. C71 & C286
Seq ID No. C72 & C287

430890 X54232
452401 NM_007115
439180 AI393742
410407 X65839 
418526 BE019020 
422627 BE336857 
430486 BE062109
423673 BE003054 
423673 BE003054 
431846 BE019924 
431958 X63629 
448733 NM_005629
426440 BE382756 
428484 AF104032 
429211 AF052693 
423634 AW959908 
457819 AA057484 
424687 J05070 
418462 BE001596 
439606 W79123 
407720 AB037776 
418543 NM_005329 
417512 X76534 
415817 U88967 
415817 U88967 
415817 U88967 
415817 U88967 
415817 U88967
415817 U88967 
415817 U88967 
415817 U88967 
421817 AF146074
421817 AF146074 
409420 Z15008 
440659 AF134160 
408790 AW580227 
408790 AW580227 
408790 AW580227 
450701 H39960 
414774 X02419 
413691 AB023173 
453857 AL080235 
449101 AA205847 
429263 AA019004 
421474 U76362 
421753 BE314828 
408482 NM_000676 
426761 AI015709 
429736 AF125304 
430985 AA490232 
431890 X17033 
432583 AW023624 
446872 X97058 
453102 NM_007197
428513 BE220806 
430280 AA361258 
428486 AW583497 
457489 AI693815 
432874 W94322 
445891 AW391342 
445891 AW391342 
404682 

429547 AW009166 

404287 

404287 

404287 

418318 U47732 
444754 T83911 
432596 AJ224741 
444006 BE395085 
428505 AL035461 
448844 AI581519 
448844 AI581519 
428392 H10233 
448030 N30714 
422109 S73265 
449048 Z45051 
417931 W95642 
419216 AU076718 
426227 U67058 
413554 AA319146 
445417 AK001058
426322 J05068 



Hs.2699 glypican 1

Hs.29352 tumor necrosis factor, alpha-induced pro
Hs.199067 v-erb-b2 avian erythroblastic leukemia v
Hs.63287 carbonic anhydrase IX
Hs.85838 solute carrier family 16 (monocarboxylic
Hs.118787 transforming growth factor, beta-induced
Hs.241551 chloride channel, calcium activated, fam
matrix metalloproteinase 12 (macrophage 
matrix metalloproteinase 12 (macrophage 
Hs.271580 uroplakin IB 

Hs.2877 cadherin 3, type 1, P-cadherin (placenta
Hs.187958 solute carrier family 6 (neurotransmitte
Hs.169902 solute carrier family 2 (facilitated glu
Hs.184601 solute carrier family 7 (cationic amino
Hs.198249 gap junction protein, beta 5 (connexin 3
Hs.1690 heparin-binding growth factor binding pr
FLJ20522 Hypothetical protein FLJ20522
matrix metalloproteinase 9 (gelatinase B
integrin, beta 4 
G protein-coupled receptor 87 
immunoglobulin superfamily, member 9
hyaluronan synthase 3
glycoprotein (transmembrane) nmb
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
protein tyrosine phosphatase, receptor-t
Hs.108660 ATP-binding cassette, sub-family C (CFTR
Hs.108660 ATP-binding cassette, sub-family C (CFTR
laminin, gamma 2 (nicein (100kD), kalini
claudin 1

neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
Hs.288467 hypothetical protein XP_098151 (leucine-
Hs.77274 plasminogen activator, urokinase 
ATPase, Class VI, type 11B 
Ras-induced senescence 1 (RIS1) 
G protein-coupled receptor 
ATP-binding cassette, sub-family A (ABC1 
solute carrier family 1 (glutamate trans
ATP-binding cassette, sub-family B (MDR/
adenosine A2b receptor
PORIMIN Pro-oncosis receptor inducing me
tumor necrosis factor receptor superfami
ESTs, Weakly similar to (78685 serine/th
integrin, alpha 2 (CD49B, alpha 2 subuni
potassium channel TASK-4; potassium chan
pyrimidinergic receptor P2Y, G-protein c
frizzled (Drosophila) homolog 10
Hs.184697 plexin C1
Hs.237868 interleukin 7 receptor
pancreatic polypeptide 
cryptic gene 

melanoma inhibitory activity 
Hs.199460 DPCR1 protein 
Hs.199460 DPCR1 protein 

ortholog of mouse polydomain protein 
FGENESH predicted novel secreted protein 
FGENESH predicted novel CUB-domain conta
FGENESH predicted novel CUB-domain conta 
FGENESH predicted novel CUB-domain conta 
transmembrane 4 superfamily member 3
transmembrane 4 superfamily member 4
matrilin 3

Hs.334762 type I transmembrane protein Fn14 
Hs.2281 chromogranin B (secretogranin 1)

FGENESH predicted novel cell surface pr 
FGENESH predicted novel cell surface pr 
secretory granule, neuroendocrine protei 
membrane-spanning 4-domains, subfamily A 



Hs.1695 
Hs.1695 



Hs.35406 

Hs.151738 

Hs.85266 

Hs.58561 

Hs.38002 

Hs.85962 

Hs.82226 

Hs.78867 

Hs.78867

Hs.78867 

Hs.78867 

Hs.78867 

Hs.78867 

Hs.78867 

Hs.78867 



Hs.54451 

Hs.7327 

Hs.47860 

Hs.47860 

Hs.47860 



Hs.75478 

Hs.35861 

Hs.23016 

Hs.198396 

Hs.104637 

Hs.107911 

Hs.45743 

Hs.172089 

Hs.212680 

Hs.27323 

Hs.271986 

Hs.162282 

Hs.16362 

Hs.31664 



Hs.184604 
Hs.127179
Hs.279651 



Hs.99376 



Hs.84072 
Hs.11881 
Hs.278461 



Hs.177164 
Hs.177164 
Hs.2265 
Hs.325960 
Hs.1473 
Hs.22920 
Hs.82961 
Hs.164021 
Hs.154299 
HsJ5426 
Hs.12680 
Hs.2012 



similar to S68401 (cattle) glucose induc
trefoil factor 3 (intestinal)
small inducible cytokine subfamily B (Cy
Human proteinase activated receptor-2 mR
secretogranin II (chromogranin C)
a disintegrin-like and metalloprotease w
transcobalamin I (vitamin B12 binding pr



Seq ID No. C73 & C288
Seq ID No. C74 & C289
Seq ID No. C75 & C290
Seq ID No. C76 & C291
Seq ID No. C77 & C292
Seq ID No. C78 & C293
Seq ID No. C79 & C294
Seq ID No. C80 & C295
Seq ID No. C81 & C296
Seq ID No. C82 & C297
Seq ID No. C83 & C298
Seq ID No. C84 & C299
Seq ID No. C85 & C300
Seq ID No. C86 & C301
Seq ID No. C87 & C302
Seq ID No. C88 & C303
Seq ID No. C89 & C304
Seq ID No. C90 & C305
Seq ID No. C91 & C306
Seq ID No. C92 & C307
Seq ID No. C93 & C308
Seq ID No. C94 & C309
Seq ID No. C95 & C310
Seq ID No. C96 & C311
Seq ID No. C97 & C312
Seq ID No. C98 & C313
Seq ID No. C99 & C314
Seq ID No. C100 & C315
Seq ID No. C101 & C316
Seq ID No. C102 & C317
Seq ID No. C103 & C318
Seq ID No. C104 & C319
Seq ID No. C105 & C320
Seq ID No. C106 & C321
Seq ID No. C107 & C322
Seq ID No. C108 & C323
Seq ID No. C109 & C324
Seq ID No. C110 & C325
Seq ID No. C111 & C326
Seq ID No. C112 & C327
Seq ID No. C113 & C328
Seq ID No. C114 & C329
Seq ID No. C115 & C330
Seq ID No. C116 & C331
Seq ID No. C117 & C332
Seq ID No. C118 & C333
Seq ID No. C119 & C334
Seq ID No. C120 & C335
Seq ID No. C121 & C336
Seq ID No. C122 & C337
Seq ID No. C123 & C338
Seq ID No. C124 & C339
Seq ID No. C125 & C340
Seq ID No. C126 & C341
Seq ID No. C127 & C342
Seq ID No. C128 & C343
Seq ID No. C129 & C344
Seq ID No. C130 & C345
Seq ID No. C131 & C346
Seq ID No. C132 & C347
Seq ID No. C133 & C348
Seq ID No. C134 & C349
Seq ID No. C135 & C350
Seq ID No. C136 & C351
Seq ID No. C137 & C352
Seq ID No. C138 & C353
Seq ID No. C139 & C354
Seq ID No. C140 & C355
Seq ID No. C141 & C356
Seq ID No. C142 & C357
Seq ID No. C143 & C358
Seq ID No. C144 & C359
Seq ID No. C145 & C360
Seq ID No. C146 & C361
Seq ID No. C147 & C362
Seq ID No. C148 & C363
Seq ID No. C149 & C364
Seq ID No. C150 & C365
Seq ID No. C151 & C366
Seq ID No. C152 & C367
Seq ID No. C153 & C368
Seq ID No. C154 & C369
Seq ID No. C155 & C370

413719 Bb4Jyt>oU Hs.75498
431462 Hs.256311
416498 U33632 Hs.79351
413095 AA494359 Hs.30715
426125 X87241 Hs.166994
436729 BE621807 Hs.351316
437145 AF007216 Hs.5462
451620 AW058357 Hs.199248
427557 NM_002659 Hs.179657
408308 AL033377 Hs.44197
421340 F07783 Hs.1369
428187 AI687303 Hs.285529
428187 AI687303 Hs.285529
422278 AF072873 Hs.114218
446619 AU076643 Hs.313
419452 U33635 Hs.90572
428242 H55709 Hs.2250
439659 AW970780 Hs.59483
411825 AK000334 Hs.352415
412314 AA825247 Hs.356084
429150 AF120103 Hs.197366
419073 AW372170 Hs.183918
411828 AW161449 Hs.72290
419508 AW997938 Hs.90786
421779 AI879159 Hs.108219
439668 AI091277 Hs.302634
433336 AF017986 Hs.31386
436972 AA284679 Hs.25640
410268 AA316181 Hs.61635
416370 N90470 Hs.203697
437052 AA861697 Hs.120591
421481 AW391972 Hs.104696
444151 AW972917 Hs.128749
426174 AA547959 Hs.115838
410037 AB020725 Hs.58009
425071 NM_013989 Hs.154424
421829 AB018330 Hs.108708
418576 AW968159 Hs.302740
419693 AA133749 Hs.301350
419693 AA133749 Hs.301350
448988 Y09763 Hs.22785
448988 Y09763 Hs.22785
448988 Y09763 Hs.22785
448988 Y09763 Hs.22785
430144 AI732722 Hs.98927
408833 AW612232 Hs.254335
452017 AF109302 Hs.27495
415992 C05837 Hs.145807
415992 C05837 Hs.145807
443991 NM_002250 Hs.10082
425976 C75094 Hs.334514
432800 BE391046 Hs.278962
452955 AW390282 Hs.31130
424339 BE257148 Hs.145416
425263 NM_001197 Hs.155419
421537 BE383488 Hs.105547
434293 NM_004445 Hs.3796
427715 BE245274 Hs.180428
413049 NM_002151 Hs.823
414555 N98569 Hs.76422
422424 AI186431 Hs.296638
432378 AI493046 Hs.146133
409041 AB033025 Hs.50081


TABLE 79B 





small inducible cytokine subfamily A (Cy
granin-like neuroendocrine peptide precu
potassium channel, subfamily K, member 1
potassium voltage-gated channel, Isk-rel
FAT tumor suppressor (Drosophila) homolo
transmembrane 4 superfamily member 1
solute carrier family 4, sodium bicarbon
ESTs 

plasminogen activator, urokinase recepto 
hypothetical protein DKFZp564D0462
decay accelerating factor for complement 
G protein-coupled receptor 49 
G protein-coupled receptor 49
frizzled (Drosophila) homolog 6 
secreted phosphoprotein 1 (osteopontin, 
PTK7 protein tyrosine kinase 7 
leukemia inhibitory factor (cholinergic 
leucine-rich repeat-containing G protein 
solute carrier family 39 (zinc transport 
G protein-coupled receptor 27 (GPR27) (S 
smoothened (Drosophila) homolog 
transmembrane receptor Unc5H2 mfUMA 
wingless-type MMTV integration site fami
ATP-binding cassette, sub-family C (CFTR 
wingless-type MMTV integration site fami 
frizzled (Drosophila) homolog 8 
secreted frizzled-related protein 2 (sir
claudin 3

six transmembrane epithelial antigen of

CD38 antigen (p45) 

ESTs 

KIAA1324 protein
alpha-methylacyl-CoA racemase 
Homo sapiens similar to Echinoidin (LOC1 
KIAA0918 protein 
deiodinase, iodothyronine, type II 
calcium/calmodulin-dependent protein kin
Epithelial calcium channel 2, CaT-like A
FXYD domain-containing ion transport reg
FXYD domain-containing ion transport reg
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto
gamma-aminobutyric acid (GABA) A recepto
ERGL protein; ERGIC-53-like protein
ESTs 

prostate cancer associated protein 7 
hypothetical protein FLJ13593
hypothetical protein FLJ13593
potassium intermediate/small conductance 
NG22 protein 
AIM-1 protein 

transmembrane 7 superfamily member 2 
endoglycan

BCL2-interacting killer (apoptosis-induc
neural proliferation, differentiation an
EphB6 

KIAA1181 protein

hepsin (transmembrane protease, serine 1 
phospholipase A2, group IIA (platelets, 



ESTs 

Hypothetical protein, XP_051860 (KIAA119



Seq ID No. C156 & C371
Seq ID No. C157 & C372
Seq ID No. C158 & C373
Seq ID No. C159 & C374
Seq ID No. C160 & C375
Seq ID No. C161 & C376
Seq ID No. C162 & C377
Seq ID No. C163 & C378
Seq ID No. C164 & C379
Seq ID No. C165 & C380
Seq ID No. C166 & C381
Seq ID No. C167 & C382
Seq ID No. C168 & C383
Seq ID No. C169 & C384
Seq ID No. C170 & C385
Seq ID No. C171 & C386
Seq ID No. C172 & C387
Seq ID No. C173 & C388
Seq ID No. C174 & C389
Seq ID No. C175 & C390
Seq ID No. C176 & C391
Seq ID No. C177 & C392
Seq ID No. C178 & C393
Seq ID No. C179 & C394
Seq ID No. C180 & C395
Seq ID No. C181 & C396
Seq ID No. C182 & C397
Seq ID No. C183 & C398
Seq ID No. C184 & C399
Seq ID No. C185 & C400
Seq ID No. C186 & C401
Seq ID No. C187 & C402
Seq ID No. C188 & C403
Seq ID No. C189 & C404
Seq ID No. C190 & C405
Seq ID No. C191 & C406
Seq ID No. C192 & C407
Seq ID No. C193 & C408
Seq ID No. C194 & C409
Seq ID No. C195 & C410
Seq ID No. C196 & C411
Seq ID No. C197 & C412
Seq ID No. C198 & C413
Seq ID No. C199 & C414
Seq ID No. C200 & C415
Seq ID No. C201 & C416
Seq ID No. C202 & C417
Seq ID No. C203 & C418
Seq ID No. C204 & C419
Seq ID No. C205 & C420
Seq ID No. C206 & C421
Seq ID No. C207 & C422
Seq ID No. C208 & C423
Seq ID No. C209 & C424
Seq ID No. C210 & C425
Seq ID No. C211 & C426
Seq ID No. C212 & C427
Seq ID No. C213 & C428
Seq ID No. C214 & C429
Seq ID No. C215 & C430
Seq ID No. C216 & C431
Seq ID No. C432 & C433
Seq ID No. C434 & C435



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT Number Accession 

414991 1785136J D78831 C17898 D78863 

TABLE 79C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position




WO 03/042661 



404682 9797231 Minus 40977-41150 

404287 2326514 Plus 53134-53281 

404287 2326514 Plus 53134-53281 

404287 2326514 Plus 53134-53281 
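
The four columns defined for Table 79C (Pkey, Ref, Strand, Nt_position) can be read mechanically. The sketch below is only an illustration, assuming whitespace-separated rows with a single start-end range per row and inclusive coordinates; the names `ExonRow` and `parse_exon_row` are hypothetical and do not come from the source.

```python
from typing import NamedTuple


class ExonRow(NamedTuple):
    pkey: int    # Eos probeset identifier
    ref: str     # sequence source (e.g. a GI number), kept as text
    strand: str  # "Plus" or "Minus"
    start: int   # first nucleotide of the predicted exon
    end: int     # last nucleotide of the predicted exon


def parse_exon_row(line: str) -> ExonRow:
    """Split one whitespace-separated table row into typed fields."""
    pkey, ref, strand, span = line.split()
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand: {strand!r}")
    start, end = (int(x) for x in span.split("-"))
    return ExonRow(int(pkey), ref, strand, start, end)


row = parse_exon_row("404682 9797231 Minus 40977-41150")
# Assumption: both endpoints belong to the exon (inclusive range).
exon_length = row.end - row.start + 1
```

Rows that list several exon ranges, or rows whose fields were damaged in extraction, would need extra handling that this sketch does not attempt.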



PCT/US02/36810 





Table 80: 

Seq ID NO: C1 DNA Sequence
Nucleic Acid Accession #: NM_005814
Coding sequence: 345..1304

1 11 21 31 41 51 

I I I I I I 

CTACCCCTTT GTGAGCAGTC TAGGACTTTG TACACCTGTT AAGTAGGGAG AAGGCAGGGG 60 

AGGTGGCTGG TTTAAGGGGA ACTTGAGGGA AGTAGGGAAG ACTCCTCTTG GGACCTTTGG 120

AGTAGGTGAC ACATGAGCCC AGCCCCAGCT CACCTGCCAA TCCAGCTGAG GAGCTCACCT 180 

GCCAATCCAG CTGAGGCTGG GCAGAGGTGG GTGAGAAGAG GGAAAATTGC AGGGACCTCC 240 

AGTTGGGCCA GGCCAGAAGC TGCTGTAGCT TTAACCAGAC AGCTCAGACC TGTCTGGAGG 300 

CTGCCAGTGA CAGGTTAGGT TTAGGGCAGA GAAGAAGCAA GACCATGGTG GGGAAGATGT 360 

GGCCTGTGTT GTGGACACTC TGTGCAGTCA GGGTGACCGT CGATGCCATC TCTGTGGAAA 420

CTCCGCAGGA CGTTCTTCGG GCTTCGCAGG GAAAGAGTGT CACCCTGCCC TGCACCTACC 480 

ACACTTCCAC CTCCAGTCGA GAGGGACTTA TTCAATGGGA TAAGCTCCTC CTCACTCATA 540 

CGGAAAGGGT GGTCATCTGG CCGTTTTCAA ACAAAAACTA CATCCATGGT GAGCTTTATA 600 

AGAATCGCGT CAGCATATCC AACAATGCTG AGCAGTCCGA TGCCTCCATC ACCATTGATC 660 

AGCTGACCAT GGCTGACAAC GGCACCTACG AGTGTTCTGT CTCGCTGATG TCAGACCTGG 720

AGGGCAACAC CAAGTCACGT GTCCGCCTGT TGGTCCTCGT GCCACCCTCC AAACCAGAAT 780 

GCGGCATCGA GGGAGAGACC ATAATTGGGA ACAACATCCA GCTGACCTGC CAATCAAAGG 840 

AGGGCTCACC AACCCCTCAG TACAGCTGGA AGAGGTACAA CATCCTGAAT CAGGAGCAGC 900 

CCCTGGCCCA GCCAGCCTCA GGTCAGCCTG TCTCCCTGAA GAATATCTCC ACAGACACAT 960 

CGGGTTACTA CATCTGTACC TCCAGCAATG AGGAGGGGAC GCAGTTCTGC AACATCACGG 1020

TGGCCGTCAG ATCTCCCTCC ATGAACGTGG CCCTGTATGT GGGCATCGCG GTGGGCGTGG 1080 

TTGCAGCCCT CATTATCATT GGCATCATCA TCTACTGCTG CTGCTGCCGA GGGAAGGACG 1140 

ACAACACTGA AGACAAGGAG GATGCAAGGC CGAACCGGGA AGCCTATGAG GAGCCACCAG 1200 

AGCAGCTAAG AGAACTTTCC AGAGAGAGGG AGGAGGAGGA TGACTACAGG CAAGAAGAGC 1260 

AGAGGAGCAC TGGGCGTGAA TCCCCGGACC ACCTCGACCA GTGACAGGCC AGCAGCAGAG 1320

GGCGGCGGAG GAAGGGTTAG GGGTTCATTC TCCCGCTTCC TGGCCTCCCT TCTCCTTTCT 1380 

AAGCCCTGTT CTCCTGTCCC TCCATCCCAG ACATTGATGG GGACATTTCT TCCCCAGTGT 1440 

CAGCTGTGGG GAACATGGCT GGCCTGGTAA GGGGGTCCCT GTGCTGATCC TGCTGACCTC 1500 

ACTGTCCTGT GAAGTAACCC CTCCTGGCTG TGACACCTGG TGCGGGCCTG GCCCTCACTC 1560 

AAGACCAGGC TGCAGCCTCC ACTTCCCTCG TAGTTGGCAG GAGCTCCTGG AAGCACAGCG 1620

CTGAGCATGG GGCGCTCCCA CTCAGAACTC TCCAGGGAGG CGATGCCAGC CTTGGGGGGT 1680 

GGGGGCTGTC CTGCTCACCT GTGTGCCCAG CACCTGGAGG GGCACCAGGT GGAGGGTTTG 1740 

CACTCCACAC ATCTTTCTTG AATGAATGAA AGAATAAGTG AGTATGCTTG GGCCCTGCAT 1800 

TGGCCTGGCC TCCAGCTCCC ACTCCCTTTC CAACCTCACT TCCCGTAGCT GCCAGTATGT 1860

TCCAAACCCT CCTGGGAAGG CCACCTCCCA CTCCTGCTGC ACAGGCCCTG GGGAGCTTTT 1920

GCCCACACAC TTTCCATCTC TGCCTGTCAA TATCGTACCT GTCCCTCCAG GCCCATCTCA 1980 

AATCACAAGG ATTTCTCTAA CCCTATCCTA ATTGTCCACA TACGTGGAAA CAATCCTGTT 2040 

ACTCTGTCCC ACGTCCAATC ATGGGCCACA AGGCACAGTC TTCTGAGCGA GTGCTCTCAC 2100 

TGTATTAGAG CGCCAGCTCC TTGGGGCAGG GCCTGGGCCT CATGGCTTTT GCTTTCCCTG 2160 

AAGCCCTAGT AGCTGGCGCC CATCCTAGTG GGCACTTAAG CTTAATTGGG GAAACTGCTT 2220
TGATTGGTTG TGCCTTCCCT TCTCTGGTCT CCTTGAGATG ATCGTAGACA CAGGGATGAT 2280 
TCCCACCCAA ACCCACGTAT TCATTCAGTG AGTTAAACAC GAATTGATTT AAAGTGAACA 2340 
CACACAAGGG AGCTTGCTTG CAGATGGTCT GAGTTCTTGT GTCCTGGTAA TTCCTCTCCA 2400 
GGCCAGAATA ATTGGCATGT CTCCTCAACC CACATGGGGT TCCTGGTTGT TCCTGCATCC 2460 
CGATACCTCA GCCCTGGCCC TGCCCAGCCC ATTTGGGCTC TGGTTTTCTG GTGGGGCTGT 2520
CCTGCTGCCC TCCCACAGCC TCCTTCTGTT TGTCGAGCAT TTCTTCTACT CTTGAGAGCT 2580 
CAGGCAGCGT TAGGGCTGCT TAGGTCTCAT GGACCAGTGG CTGGTCTCAC CCAACTGCAG 2640 
TTTACTATTG CTATCTTTTC TGGATGATCA GAAAAATAAT TCCATAAATC TATTGTCTAC 2700 
TTGCGATTTT TTAAAAAATG TATATTTTTA TATATATTGT TAAATCCTTT GCTTCATTCC 2760 
AAATGCTTTC AGTAATAATA AAATTGTGGG TGG 2793
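
Each entry in this listing gives a "Coding sequence: start..end" range alongside rows of ten-base groups that end in a running position count. The sketch below shows one way to recover the coding region from such rows. It assumes the range is 1-based and inclusive, which matches Seq ID NO: C1, where position 345 begins an ATG, position 1304 ends a TGA, and the span of 960 bases is a multiple of three. The function names and the toy rows are hypothetical.

```python
import re


def join_listing(lines):
    """Concatenate base groups, skipping position counters and other tokens."""
    tokens = " ".join(lines).split()
    return "".join(t for t in tokens if re.fullmatch(r"[ACGT]+", t))


def coding_region(sequence, coords):
    """Slice a 'start..end' range, assumed 1-based and inclusive."""
    start, end = (int(x) for x in coords.split(".."))
    return sequence[start - 1:end]


listing = [
    "ATGGCC AAATGA 12",  # made-up rows, not taken from the table
    "GGGCCC 18",
]
seq = join_listing(listing)
cds = coding_region(seq, "1..12")
```

Note that `join_listing` silently drops any group containing a character other than A, C, G or T, so groups damaged in extraction shorten the result and shift later coordinates; a stricter version could raise an error instead.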






Seq ID NO: C2 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3150 



1          11         21         31         41         51
|          |          |          |          |          |
ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCOGTGC AGCTGCGCTG GGGCCCCCGG 60
CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120
GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180
GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240
CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300
TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360
CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420
TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480
AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540
GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600
GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660
TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720
ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780
CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840
TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900
GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960
CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020
GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080
GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140
CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200
GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260
CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320






CAGGTTCTGC 
CGAGGAGGCC 
GTGGACAAGG 
ATCTTCCCCG 
GCCTGCATCA 
GGTTTCACAG 
CTGTTCCTGG 
CGAGAGGATT 
CTCTCGCCGA 
CACGGCCTCA 
ATCTTGCTGG 
GGGGAGCAGA 
CAGAATGTGG 
GCTGAGTACT 
TTTGCCGTQA 
GCCAGTCTGT 
ATCCAGTTTG 
TCCTTTCGGC 
GAGGCAGTGC 
GACCTGGGAC 
AGCCAGGGTG 
GTGACCAGAG 
GAGTTGGATC 
TCTGCTTCCT 
TGTGAGCTCG 
TGGGCCAAGA 
TACAAAGCCC 
CAGGTGGCCA 
ATCATCATCC 
TACAAGCTTG 
CTCAAGCCTC 



AGCCCCTGTG 
GAGACCTGGA 
CTGTGGTATA 
CCATGTTCAA 
ACCTTAGCTT 
TGGAACTTCA 
CCTCCAGGCA 
GCAGAGAGAT 
TTCACATCGC 
GGCCAGCCCT 
ACTGTGGAGA 
ACCATGTGTA 
GTGAGGGTGG 
CAGGACTCGT 
ACCAGAGCCG 
GGGGTGGCCT 
ACTTCCAGAT 
TCTCCGTGGA 
TATTCCCAGT 
CTGCTGTCCA 
TGCTGGAACT 
TTACGGGACT 
CCGAGGGTTC 
CGGGACCTCA 
GGCCCCTGCA 
CTTTCTTGCA 
TGAAGATGCC 
CAGCTGTGCA 
TAGCCATCCT 
GATTCTTCAA 
CAGCCACCTC 



GGCAGCCAGC 
TGGCAATGGA 
CAGGGGCCGC 
CCCAGAGGAG 
CTGCCTCAAT 
GCTGGACTGG 
GGCAACCCTG 
GAAGATCTAC 
TCTCAACTTC 
ACATTATCAG 
AGACAACATC 
CCTGGGTGAC 
CGCCTATGAG 
CAGACACCCA 
CCTGCTGGTG 
TCGGTTTACA 
CCTCAGCAAG 
GGCTCAGGCC 
AAGCGACTGG 
CCATGTCTAT 
CAGCTGTCCC 
CAACTGCACC 
CCTGCACCAC 
GATCCTGAAA 
CCAACAAGAG 
GGGGGAGCAC 
CTACCGAATC 
ATGGACCAAG 
GTTTGGCCTC 
ACGCTCCCTC 
TGATGCCTGA 



CACACCCCAG 
TATCCTGATC 
CCCATCGTGT 
CGGAGCTGCA 
GCTTCTGGAA 
CAGAAGCAGA 
ACCCAGACCC 
CTCAGGAACG 
TCCTTGGACC 
AGCAAGAGCC 



AAGAATGCCC 
GCTGAGCTTC 
GGGAACTTCT 
TGTGACCTGG 
GTCCCTCATC 
AATCTCAACA 
CAGGTCACCC 
CATCCCCGAG 
GAGCTCATCA 
CAGGCTCTGG 
ACCAATCACC 
CAGCAAAAAC 
TGCCCGGAGG 
AGCCAAAGTC 
CAGCCATTTA 
CTGCCTCGGC 
GCAGAAGGCA 
CTGCTCCTAG 
CCATATGGCA 



ACTTCTTTGG 
TGATTGTGGG 
CCGCTAGTGC 
GCTTAGAGGG 
AACACGTTGC 
AGGGAGGGGT 
TGCTCATCCA 
AGTCAGAATT 
CCCAAGCCCC 
GGATAGAGGA 
ACCTGCAGCT 
TGAACCTCAC 
GGGTCACOGC 
CCAGCCTGAG 
GCAACCCCAT 
TCCGGGACAC 
ACTCGCAAAG 
TGAACGGTGT 
ACCAGCCTCA 
ACCAAGGCCC 
AAGGTCAGCA 
CCATTAACCC 
GGGAAGCTCC 
CTGAGTGTTT 
TGCAGTTGCA 
GCCTGCAGTG 
AGCTGCCCCA 
GCTATGGCGT 
GTCTACTCAT 
CCGCCATGGA 



CTCTGCCCTT 
GTCCTTTGGT 
CTCCCTCACC 
GAACCCTGTG 
TGACTCCATT 
ACGGCGGGCA 
GAATGGGGCT 
TCGAGACAAA 
AGTGGACAGC 
CAAGGCTCAG 
GGAAGTGTTT 
TTTCCATGCC 
CCCTCCAGAG 
CTGTGACTAC 
GAAGGCAGGA 
TAAGAAAACC 
CGACGTGGTT 
CTCCAAGCCT 
GAAGGAGGAG 
CAGCTCCATT 
GCTCCTATAT 
AAAGGGCCTG 
AAGCCGCAGC 
CAGGCTGCGC 
TTTCCGAGTC 
TGAGGCTGTG 
AAAAGAGCGT 
CCCACTGTGG 
CTACATCCTC 
AAAAGCTCAG 



1380
1440
1500
1560
1620
1680
1740
1800
1860
1920
1980
2040
2100
2160
2220
2280
2340
2400
2460
2520
2580
2640
2700
2760
2820
2880
2940
3000
3060
3120
3150

Seq ID NO: C3 DNA Sequence
Nucleic Acid Accession #: NM_002421.2
Coding sequence: 1..1410

1          11         21         31         41         51
|          |          |          |          |          |
ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60
CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120
TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180
GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240
GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300
GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 660
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840
AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900
TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380
AATAGCTGGT TCAACTGCAG GAAAAATTAG 1410

Seq ID NO: C4 DNA Sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..1410

1          11         21         31         41         51
|          |          |          |          |          |
ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60
CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120
TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180
GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240
GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300
GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840
AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900
TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380
AATAGCTGGT TCAACTGCAG GAAAAATTAG 1410



Seq ID NO: C5 DNA Sequence
Nucleic Acid Accession #: NM_014331.2
Coding sequence: 1..1506

1          11         21         31         41         51
|          |          |          |          |          |
ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60
AAOGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120
AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180
GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240
ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300
GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360
TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420
GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480
CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540
AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600
GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660
TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720
TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780
AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840
TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900
AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960
ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020
TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCXCTCCAT GATTCATGTC 1080
CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140
TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200
GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260
TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320
CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380
GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440
TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500
TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560
TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620
CAGTTATTTT TATTCATATA TTTTAGCATA TTOGAACTAA TTTCTAAGAA ATTTAGTTAT 1680
AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740
GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800
TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860
TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920
TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980
ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040
GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100
TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160
AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220
TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280
AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340
TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400
TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460
CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520
TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580
CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640
AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700
ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760
CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880
GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940
CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000
GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060
GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120
CCATCTCCAA AAAAAAAAAA AAAA 3144






Seq ID NO: C6 DNA Sequence

Nucleic Acid Accession #: NM_003246.1

Coding sequence: 112..3624

1 11 21 31 41 51

I I I I I I

GGACGCACAG GCATTCCCCG CGCCCCTCCA GCCCTCGCCG CCCTCGCCAC OGCTCCCGGC 60

CGCCGCGCTC OGGTACACAC AGGATCCCTG CTGGGCACCA ACAGCTCCAC CATGGGGCTG 120

GCCTGGGGAC TAGGOGTCCT GTTCCTGATG CATGTGTGTG GCACCAACCG CATTCCAGAG 180

TCTGGCGGAG ACAACAGCGT GTTTGACATC TTTGAACTCA CCGGGGCCGC CCGCAAGGGG 240

TCTGGGOGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCOG CATCGAGGAT 300

GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 360

GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGGGGCACG 420






CTGCTGGCCC TGGAGCGGAA AGACCACTCT GGCCAGGTCT TCAGCGTGGT GTCCAATGGC 480 

AAGGCGGGCA CCCTGGACCT CAGCCTGACC GTCCAAGGAA AGCAGCACGT GGTGTCTGTG 540 

GAAGAAGCTC TCCTGGCAAC CGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600 

AGGGCCCAGC TGTACATCGA CTGTGAAAAG ATGGAGAATG CTGAGTTGGA CGTCCCCATC 660 

CAAAGCGTCT TCACCAGAGA CCTGGCCAGC ATCGCCAGAC TCCGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTGAGGT TTGTCTTTGG AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA GTGTCCTCCT CACCCTTGAC 840 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCCGCACTA ACTACATTGG CCACAAGACA 900 

AAGGACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

CTCAGGGGCC TGCGCACCAT TGTGACCACG CTGCAGGACA GCATCCGCAA AGTGACTGAA 1020 

GAGAACAAAG AGTTGGCCAA TGAGCTGAGG CGGCCTCCCC TATGCTATCA CAACGGAGTT 1080 

CAGTACAGAA ATAACGAGGA ATGGACTGTT GATAGCTGCA CTGAGTGTCA CTGTCAGAAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTGCTCCAA TGCCACAGTT 1200 

CCTGATGGAG AATGCTGTCC TCGCTGTTGG CCCAGCGACT CTGCGGACGA TGGCTGGTCT 1260 

CCATGGTCCG AGTGGACCTC CTGTTCTACG AGCTGTGGCA ATGGAATTCA GCAGCGCGGC 1320 

CGCTCCTGCG ATAGCCTCAA CAACCGATGT GAGGGCTCCT CGGTCCAGAC ACGGACCTGC 1380 

CACATTCAGG AGTGTGACAA AAGATTTAAA CAGGATGGTG GCTGGAGCCA CTGGTCCCCG 1440 

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTGATCA CAAGGATCCG GCTCTGCAAC 1500 

TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG AAGCGCGGGA GACCAAAGCC 1560 

TGCAAGAAAG ACGCCTGCCC CATCAATGGA GGCTGGGGTC CTTGGTCACC ATGGGACATC 1620 

TGTTCTGTCA CCTGTGGAGG AGGGGTACAG AAACGTAGTC GTCTCTGCAA CAACCCCGCA 1680 

CCCCAGTTTG GAGGCAAGGA CTGCGTTGGT GATGTAACAG AAAACCAGAT CTGCAACAAG 1740 

CAGGACTGTC CAATTGATGG ATGCCTGTCC AATCCCTGCT TTGCCGGCGT GAAGTGTACT 1800 

AGCTACCCTG ATGGCAGCTG GAAATGTGGT GCTTGTCCCC CTGGTTACAG TGGAAATGGC 1860 

ATCCAGTGCA CAGATGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

GGAGAGCACC GGTGTGAGAA CACGGACCCC GGCTACAACT GCCTGCCCTG CCCCCCACGC 1980 

TTCACCGGCT CACAGCCCTT CGGCCAGGGT GTCGAACATG CCACGGCCAA CAAACAGGTG 2040 

TGCAAGCCCC GTAACCCCTG CACGGATGGG ACCCACGACT GCAACAAGAA CGCCAAGTGC 2100

AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGGG AGTGCAAGCC TGGCTACGCT 2160 

GGCAATGGCA TCATCTGCGG GGAGGACACA GACCTGGATG GCTGGCCCAA TGAGAACCTG 2220 

GTGTGCGTGG CCAATGCGAC TTACCACTGC AAAAAGGATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGGGCAGG AAGACTATGA CAAGGATGGA ATTGGTGATG CCTGTGATGA TGACGATGAC 2340 

AATGATAAAA TTCCAGATGA CAGGGACAAC TGTCCATTCC ATTACAACCC AGCTCAGTAT 2400 

GACTATGACA GAGATGATGT GGGAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 2460 

GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTGATGGA 2520 

GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT ACAATGTGGA CCAGAGAGAC 2580

ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC CCTTGGAACA CAATCCGGAT 2640 

CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760

GACCATGACA AAGATGGCAA GGGAGATGCC TGTGACCACG ATGATGACAA CGATGGCATT 2820 

CCTGATGACA AGGACAACTG CAGACTCGTG CCCAATCCCG ACCAGAAGGA CTCTGACGGC 2880 

GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG ACAGTGTGCC AGACATCGAT 2940 

GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT TCCGCCGATT CCAGATGATT 3000 

CCTCTGGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTCG CTGTAGGTTA TGATGAGTTT 3120 

AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG AAAGGGACGA TGACTATGCT 3180
GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG TTGTGATGTG GAAGCAAGTC 3240 
ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG GATACTCGGG CCTTTCTGTG 3300 
AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAGCACCTGC GGAACGCCCT GTGGCACACA 3360 
GGAAACACCC CTGGCCAGGT GCGCACCCTG TGGCATGACC CTCGTCACAT AGGCTGGAAA 3420 
GATTTCACCG CCTACAGATG GCGTCTCAGC CACAGGCCAA AGACGGGTTT CATTAGAGTG 3480 
GTGATGTATG AAGGGAAGAA AATCATGGCT GACTCAGGAC CCATCTATGA TAAAACCTAT 3540 
GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA TGGTGTTCTT CTCTGACCTG 3600 
AAATACGAAT GTAGAGATCC CTAATCATCA AATTGTTGAT TGAAAGACTG ATCATAAACC 3660 
AATGCTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGAA AACCCCCAGG ATCACTTCTC 3720 
CTTGGCTTCC TTCTTTTCTG TGCTTGCATC AGTGTGGACT CCTAGAACGT GCGACCTGCC 3780 
TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT CAGCCTCCAA TGAATAAGAC 3840 
ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG AAAAAGCATC TACTTGCTTC 3900 
AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA GCAGGGTGCT ATTGTGAGGC 3960 
CATCTCT 3967 

Seq ID NO: C7 DNA Sequence

Nucleic Acid Accession #: NM_002192

Coding sequence: 86..1366

1 11 21 31 41 51

I I I I I I

TCCACACACA CAAAAAACCT GCGOGTGAGG GGGGAGGAAA AGCAGGGCCT TTAAAAAGGC 60

AATCACAACA ACTTTTGCTG CCAGGATGCC CTTGCTTTGG CTGAGAGGAT TTCTGTTGGC 120

AAGTTGCTGG ATTATAGTGA GGAGTTCCCC CACCCCAGGA TCCGAGGGGC ACAGCGCGGC 180

CCCCGACTGT CCGTCCTGTG CGCTGGCCGC CCTCCCAAAG GATGTACCCA ACTCTCAGCC 240

AGAGATGGTG GAGGCCGTCA AGAAGCACAT TTTAAACATG CTGCACTTGA AGAAGAGACC 300

CGATGTCACC CAGCCGGTAC CCAAGGCGGC GCTTCTGAAC GCGATCAGAA AGCTTCATGT 360

GGGCAAAGTC GGGGAGAACG GGTATGTGGA GATAGAGGAT GACATTGGAA GGAGGGCAGA 420

AATGAATGAA CTTATGGAGC AGACCTCGGA GATCATCACG TTTGCCGAGT CAGGAACAGC 480

CAGGAAGACG CTGCACTTCG AGATTTCCAA GGAAGGCAGT GACCTGTCAG TGGTGGAGCG 540

TGCAGAAGTC TGGCTCTTCC TAAAAGTCCC CAAGGCCAAC AGGACCAGGA CCAAAGTCAC 600

CATCCGCCTC TTCCAGCAGC AGAAGCACCC GCAGGGCAGC TTGGACACAG GGGAAGAGGC 660

CGAGGAAGTG GGCTTAAAGG GGGAGAGGAG TGAACTGTTG CTCTCTGAAA AAGTAGTAGA 720

CGCTCGGAAG AGCACCTGGC ATGTCTTCCC TGTCTCCAGC AGCATCCAGC GGTTGCTGGA 780

CCAGGGCAAG AGCTCCCTGG AOGTTCGGAT TGCCTGTGAG CAGTGCCAGG AGAGTGGCGC 840

CAGCTTGGTT CTCCTGGGCA AGAAGAAGAA GAAAGAAGAG GAGGGGGAAG GGAAAAAGAA 900

GGGCGGAGGT GAAGGTGGGG CAGGAGCAGA TGAGGAAAAG GAGCAGTCGC ACAGACCTTT 960

CCTCATGCTG CAGGCCCGGC AGTCTGAAGA CCACCCTCAT CGCCGGCGTC GGCGGGGCTT 1020

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GGAGTGTGAT GGCAAGGTCA ACATCTGCTG TAAGAAACAG TTCTTTGTCA GTTTCAAGGA 1080 

CATOGGCTGG AATGACTGGA TCATTGCTCC CTCTGGCTAT CATGCCAACT ACTGCGAGGG 1140 

TGAGTGCCCG AGCCATATAG CAGGCACGTC CGGGTCCTCA CTGTCCTTCC ACTCAACAGT 1200 

CATCAACCAC TACOGCATGC GGGGCCATAG CCCCTTTGCC AACCTCAAAT CGTGCTGTGT 1260 

GCCCACCAAG CTGAGACCCA TGTCCATGTT GTACTATGAT GATGGTCAAA ACATCATCAA 1320 

AAAGGACATT CAGAACATGA TCGTGGAGGA GTGTGGGTGC TCATAGAGTT GCCCAGCCCA 1380 

GGGGGAAAGG GAGCAAGAGT TGTCCAGAGA AGACAGTGGC AAAATGAAGA AATTTTTAAG 1440 

GTTTCTGAGT TAACCAGAAA AATAGAAATT AAAAACAAAA CAAAACAAAA AAAAAAACAA 1500 

AAAAAAACAA AAGTAAATTA AAAACAAACC TGATGAAACA GATGAAACAG ATGAAGGAAG 1560 

ATGTGGAAAT CTTAGCCTGC CTTAGCCAGG GCTCAGAGAT GAAGCAGTGA AGAGACAGAT 1620 

TGGGAGGGAA AGGGAGAATG GTGTACCCTT TATTTCTTCT GAAATCACAC TGATGACATC 1680 

AGTTGTTTAA AOGGGGTATT GTCCTTTCCC CCCTTGAGGT TCCCTTGTGA GCTTGAATCA 1740 

ACCAATCTGA TCTGCAGTAG TGTGGACTAG AACAACCCAA ATAGCATCTA GAAAGCCATG 1800 

AGTTTGAAAG GGCCCATCAC AGGCACTTTC CTAGCCTAAT 1840 



Seq ID NO: C8 DNA Sequence

Nucleic Acid Accession #: NM_000095.1

Coding sequence: 26..2299

1 11 21 31 41 51 

I I I I I I 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 

CCCCTGCCCC GCGGGCTTCA CGGGCAAOGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGAOGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC COGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500

GGAGGAGGCG GACAGGGACG GCGTGGGCGA OGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 

AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 2439 



Seq ID NO: C9 DNA Sequence

Nucleic Acid Accession #: XM_057014

Coding sequence: 143..874

1 11 21 31 41 51 

I I I I I I 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 

CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 

CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 



CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840

TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900

TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020

TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140

GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260

CAACCTTAAA AAAAAAAAAA AAAA 1284



Seq ID NO: C10 DNA Sequence
Nucleic Acid Accession #: NM_003225
Coding sequence: 41..295

1 11 21 31 41 51

I I I I I I

ATCCCTGACT CGGGGTCGCC TTTGGAGCAG AGAGGAGGCA ATGGCCACCA TGGAGAACAA 60

GGTGATCTGC GCCCTGGTCC TGGTGTCCAT GCTGGCCCTC GGCACCCTGG CCGAGGCCCA 120

GACAGAGACG TGTACAGTGG CCCCCCGTGA AAGACAGAAT TGTGGTTTTC CTGGTGTCAC 180

GCCCTCCCAG TGTGCAAATA AGGGCTGCTG TTTCGACGAC ACCGTTCGTG GGGTCCCCTG 240

GTGCTTCTAT CCTAATACCA TCGACGTCCC TCCAGAAGAG GAGTGTGAAT TTTAGACACT 300

TCTGCAGGGA TCTGCCTGCA TCCTGACGGG GTGCCGTCCC CAGCACGGTG ATTAGTCCCA 360

GAGCTCGGCT GCCACCTCCA CCGGACACCT CAGACACGCT TCTGCAGCTG TGCCTCGGCT 420

CACAACACAG ATTGACTGCT CTGACTTTGA CTACTCAAAA TTGGCCTAAA AATTAAAAGA 480

GATCGATATT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 540



Seq ID NO: C11 DNA Sequence

Nucleic Acid Accession #: NM_015419.1

Coding sequence: 1..8487

1 11 21 31 41 51

I I I I I I

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540 

AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 

TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720 

TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080 

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 1200 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1800 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060 
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TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 . 
ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 
AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 
GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 
CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 
ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 
AAATTCOGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 
GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660 
AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 
CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGOGTCOG GATCCAAGCC CAGCCCTTCT 3780 
CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 
AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 
TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 
ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 
ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 
AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 
GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 
ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 
GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 
CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 
GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 
GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680
CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 
TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 
CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 
CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 
CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 
AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 
CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 
GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 
GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCAGGGG ATCACCCTCA 5400 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 
TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 
CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 
CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 
CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 
CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 
GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGAOGTC 5880 
ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 
CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 
CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 
GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGOCTGGC CATCCGCCTG 6120 
CACGTGGOGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 
CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 
GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 
GAGTGGGTGG COGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 
CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 
GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 
TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 
TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 
AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 
CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 
GACGGGAGTC TCGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 
TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 
GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 
GTGGTGACAG CGCCCGCCAC CATCOGGAAC AAGACTTACT TGGGGGTTCA GGTGCCCTAT 7080 
GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 
GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 
AGGAACAGOG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 
AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380
CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC OGAGGGTGTT ATGGGCTTTT 7440 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 
GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 
CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 
AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680
AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGAOGGCATG 7800 
CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 
GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 
AAGCAGTATC ATAACCTGGT CAGCATCATG AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 
CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 
GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 
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GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

'I Ti 'LTTCTT T TGCAAATGCC ACTOGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 

ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 9645 



Seq ID NO: C12 DNA Sequence
Nucleic Acid Accession #: AK001903
Coding sequence: none

1 11 21 31 41 51

I I I I I I

TATCATGCAT GTGGGAAGGT GGGTGTGGTG AGAAAAGTTT TAAGGCAAGA GTAGATGGCC 60 

ATGTTCAACT TTACAAAATT TCTTGGAAAA CTGGCAGTAT TTTGAACTGC ATCTTCTTTG 120 

GTACCGGAAC CTGCAGAAAC AGTGTGAGAA ATTAAGTCCT GGTTCACTGC GCAGTAGCAA 180 

AGATGGTCAA GGCCATGGAA AAAGCAGAAA TTTACCAAGA AAGCTGATAC CCATGTATAG 240 

TTCCCACTCA TCTCAAATAC ATCTGCTATC TTTTTAAGCT AAGTCCTAGA CATATCGGGG 300 

ATAACATGGG GGTTGATTAG TGACCACAGT TATCAGAAGC AGAGAAATGT AATTCCATAT 360 

TTTATTTGAA ACTTATTCCA TATTTTAATT GGATATTGAG TGATTGGGTT ATCAAACACC 420 

CACAAACTTT AATTTTGTTA AATTTATATG GCTTTGAAAT AGAAGTATAA GTTGCTACCA 480 

TTTTTTGATA ACATTGAAAG ATAGTATTTT ACCATCTTTA ATCATCTTGG AAAATACAAG 540 

TCCTGTGAAC AACCACTCTT TCACCTAGCA GCATGAGGCC AAAAGTAAAG GCTTTAAATT 600 

ATAACATATG GGATTCTTAG TAGTATGTTT TTTTCTTGAA ACTCAGTGGC TCTATCTAAC 660 

CTTACTATCT CCTCACTCTT TCTCTAAGAC TAAACTCTAG GCTCTTAAAA ATCTGCCCAC 720 

ACCAATCTTA GAAGCTCTGA AAAGAATTTG TCTTTAAATA TCTTTTAATA GTAACATGTA 780 

TTTTATGGAC CAAATTGACA TTTTCGACTA TTTTTTCCAA AAAAGTCAGG TGAATTTCAG 840 

CACACTGAGT TGGGAATTTC TTATCCCAGA AGACCAACCA ATTTCATATT TATTTAAGAT 900 

TGATTCCATA CTCCGTTTTC AAGGAGAATC CCTGCAGTCT CCTTAAAGGT AGAACAAATA 960 

CTTTCTATTT TTTTTTTCAC CATTGTGGGA TTGGACTTTA AGAGGTGACT CTAAAAAAAC 1020 

AGAGAACAAA TATGTCTCAG TTGTATTAAG CACGGACCCA TATTATCATA TTCACTTAAA 1080 

AAAATGATTT CCTGTGCACC TTTTGGCAAC TTCTCTTTTC AATGTAGGGA AAAACTTAGT 1140 

CACCCTGAAA ACCCACAAAA TAAATAAAAC TTGTAGATGT GGGCAGAAGG TTTGGGGGTG 1200 

GACATTGTAT GTGTTTAAAT TAAACCCTGT ATCACTGAGA AGCTGTTGTA TGGGTCAGAG 1260 

AAAATGAATG CTTAGAAGCT GTTCACATCT TCAAGAGCAG AAGCAAACCA CATGTCTCAG 1320 

CTATATTATT ATTTATTTTT TATGCATAAA GTGAATCATT TGTTCTGTAT TAATTTCCAA 1380 

AGGGTTTTAC CCTCTATTTA AATGCTTTGA AAAACAGTGC ATTGACAATG GGTTGATATT 1440 

TTTCTTTAAA AGAAAAATAT AATTATGAAA GCCAAGATAA TCTGAAGCCT GTTTTATTTT 1500 

AAAACTTTTT ATGTTCTGTG GTTGATGTTG TTTGTTTGTT TGTTTCTATT TTGTTGGTTT 1560 

TTTACTTTGT TTTTTGTTTT GTTTTGTTTT GTTTTGCATA CTACATGCAG TTCTTTAACC 1620 

AATGTCTGTT TGGCTAATGT AATTAAAGTT GTTAATTTAT ATGAGTGCAT TTCAACTATG 1680 

TCAATGGTTT CTTAATATTT ATTGTGTAGA AGTACTGGTA ATTTTTTTAT TTACAATATG 1740 

TTTAAAGAGA TAACAGTTTG ATATGTTTTC ATGTGTTTAT AGCAGAAGTT ATTTATTTCT 1800 

ATGGCATTCC AGCGGATATT TTGGTGTTTG CGAGGCATGC AGTCAATATT TTGTACAGTT 1860 

AGTGGACAGT ATTCAGCAAC GCCTGATAGC TTCTTTGGCC TTATGTTAAA TAAAAAGACC 1920 

TGTTTGGGAT GT 1932 



Seq ID NO: C13 Protein Sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..5001

1 11 21 31 41 51 

I I I I I I 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 
ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120 
CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180
TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 
CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 
GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 
ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 
AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 
GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 
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TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780

ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTOGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

GTTGCTCCOG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATGGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 

GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGAOGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCGCAGOGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGAOGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320 

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCOCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580
CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640
AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700
TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760
TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820
ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880
TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940
TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000
6007



Seq ID NO: C14 DNA Sequence 
Nucleic Acid Accession #: NM_003014 
Coding sequence: 238.. 1278 

1 11 21 31 41 51 

| | | | | |

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 180
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 240
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300
GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600
AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACGGTGGCG TGTGCATTTC GCCTGAAGCC 660
ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780
AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840
AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900
GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960
TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020
CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080
AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140
AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200
GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260
AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320
GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380
ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440
GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 1620
AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 1680
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 1800
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 1860
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 1920
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 1980
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2040
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 2160
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 2220
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 2340
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 2460
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2640
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 2820

Seq ID NO: C15 DNA Sequence
Nucleic Acid Accession #: NM_005940
Coding sequence: 23..1489

1 11 21 31 41 51
| | | | | |
AAGCCCAGCA GCCCCGGGGC [missing] GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60
CGCCCTCCTG CCCCCGATGC [missing] GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120
TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180
AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240
CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300
CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360
GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420
CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480
TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540
TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600
CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660
AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720
GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780
TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840
CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900
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CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 

CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCOG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 

GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860

TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 

TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 2260 
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Seq ID NO: C16 DNA Sequence 
Nucleic Acid Accession #: NM_024022 
Coding sequence: 202..1563



1 11 21 31 41 51

| | | | | |

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 16 BO 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 

AAAAA 2465 



Seq ID NO: C17 DNA Sequence
Nucleic Acid Accession #: NM_003220 
Coding sequence: 63.. 1376 

1 11 21 31 41 51

| | | | | |

GAATTCCGGC TCTCTGGGTG AGAGACCGAG AGGGGCATAT CCGTTCACGC CGATCCATGA 60 

AAATGCTTTG GAAATTGACG GATAATATCA AGTACGAGGA CTGCGAGGAC CGTCACGACG 120 

GCACCAGCAA CGGGACGGCA CGGTTGCCCC AGCTGGGCAC TGTAGGTCAA TCTCCCTACA 180 

CGAGCGCCCC GCCGCTGTCC CACACCCCCA ATGCCGACTT CCAGCCCCCA TACTTCCCCC 240 

CACCCTACCA GCCTATCTAC CCCCAGTCGC AAGATCCTTA CTCCCACGTC AACGACCCCT 300 

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ACAGCCTGAA CCCCCTGCAC GCCCAGCCGC AGCCGCAGCA CCCAGGCTGG CCCGGCCAGA 
GGCAGAGCCA GGAGTCTGGG CTCCTGCACA CGCACCGGGG GCTGCCTCAC CAGCTGTCGG 
GCCTGGATCC TCGCAGGGAC TACAGGCGGC ACGAGGACCT CCTGCACGGC CCACACGCGC 
TCAGCTCAGG ACTCGGAGAC CTCTCGATCC ACTCCTTACC TCACGCCATC GAGGAGGTCC 
CGCATGTAGA AGACCCGGGT ATTAACATCC CAGATCAAAC TGTAATTAAG AAAGGCCCCG 
TGTCCCTGTC CAAGTCCAAC AGCAATGCCG TCTCCGCCAT CCCTATTAAC AAGGACAACC 
TCTTCGGCGG CGTGGTGAAC CCCAACGAAG TCTTCTGTTC AGTTCCGGGT CGCCTCTCGC 
TCCTCAGCTC CACCTCGAAG TACAAGGTCA CGGTGGCGGA AGTGCAGCGG CGGCTCTCAC 
CACCOGAGTG TCTCAACGCG TCGCTGCTGG GCGGAGTGCT CCGGAGGGCG AAGTCTAAAA 
ATGGAGGAAG ATCTTTAAGA GAAAAACTGG ACAAAATAGG ATTAAATCTG CCTGCAGGGA 
GACGTAAAGC TGCCAACGTT ACCCTGCTCA CATCACTAGT AGAGGGAGAA GCTGTCCACC 
TAGCCAGGGA CTTTGGGTAC GTGTGCGAAA CCGAATTTCC TGCCAAAGCA GTAGCTGAAT 
TTCTCAACCG ACAACATTCC GATCCCAATG AGCAAGTGAC AAGAAAAAAC ATGCTCCTGG 
CTACAAAACA GATATGCAAA GAGTTCACCG ACCTGCTGGC TCAGGACCGA TCTCCCCTGG 
GGAACTCACG GCCCAACCCC ATCCTGGAGC CCGGCATCCA GAGCTGCTTG ACCCACTTCA 
ACCTCATCTC CCACGGCTTC GGCAGCCCCG CGGTGTGTGC CGCGGTCACG GCCCTGCAGA 
ACTATCTCAC CGAGGCCCTC AAGGCCATGG ACAAAATGTA CCTCAGCAAC AACCCCAACA 
GCCACACGGA CAACAACGCC AAAAGCAGTG ACAAAGAGGA GAAGCACAGA AAGTGAGGCT 
CTCCTCCCGC CCCGCCCCTC CCACGCCTCA CCAGCCCCCC GCGCGCCCAC CCTCCGGCGG 
GTGACAGCTC CGGGATCAGC AACCCTTCCT GCTGCTGCTA CTGCTGCTGC TGCTGCCGCC 
GCCGCCGCCG CCGCTGCCCT TGGGTCCCCC CGAGTCTCCG GGACTGCCCT CTCGACTGTC 
AGTGGGGCAG CCTCTCCGAC TCTGCACCCG CCTCGACCTC CCCACCCGCT CCCACACCCC 
TGTGCCCCCG GAATTC 

Seq ID NO: C18 DNA Sequence
Nucleic Acid Accession #: NM_002988 
Coding sequence: 71.. 340 

1 11 21 31 41 51 

| | | | | |

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60

GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 

CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 

GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 

AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 

GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 

GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 

CCACCCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 

CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 

ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 

TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 

CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 

ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 

AAAAAAAAAA AAAAAAAAAA AAA 803 

Seq ID NO: C19 DNA Sequence 
Nucleic Acid Accession #: NM_004063 
Coding sequence: 121.. 2619 

1 11 21 31 41 51 

| | | | | |

AGGGAGTGTT CCCGGGGGAG ATACTCCAGT CGTAGCAAGA GTCTCGACCA CTGAATGGAA 60 

GAAAAGGACT TTTAACCACC ATTTTGTGAC TTACAGAAAG GAATTTGAAT AAAGAAAACT 120 

ATGATACTTC AGGCCCATCT TCACTCCCTG TGTCTTCTTA TGCTTTATTT GGCAACTGGA 180 

TATGGCCAAG AGGGGAAGTT TAGTGGACCC CTGAAACCCA TGACATTTTC TATTTATGAA 240 

GGCCAAGAAC CGAGTCAAAT TATATTCCAG TTTAAGGCCA ATCCTCCTGC TGTGACTTTT 300 

GAACTAACTG GGGAGACAGA CAACATATTT GTGATAGAAC GGGAGGGACT TCTGTATTAC 360 

AACAGAGCCT TGGACAGGGA AACAAGATCT ACTCACAATC TCCAGGTTGC AGCCCTGGAC 420 

GCTAATGGAA TTATAGTGGA GGGTCCAGTC CCTATCACCA TAGAAGTGAA GGACATCAAC 480 

GACAATCGAC CCACGTTTCT CCAGTCAAAG TACGAAGGCT CAGTAAGGCA GAACTCTCGC 540 

CCAGGAAAGC CCTTCTTGTA TGTCAATGCC ACAGACCTGG ATGATCCGGC CACTCCCAAT 600

GGCCAGCTTT ATTACCAGAT TGTCATCCAG CTTCCCATGA TCAACAATGT CATGTACTTT 660 

CAGATCAACA ACAAAACGGG AGCCATCTCT CTTACCCGAG AGGGATCTCA GGAATTGAAT 720 

CCTGCTAAGA ATCCTTCCTA TAATCTGGTG ATCTCAGTGA AGGACATGGG AGGCCAGAGT 780 

GAGAATTCCT TCAGTGATAC CACATCTGTG GATATCATAG TGACAGAGAA TATTTGGAAA 840 

GCACCAAAAC CTGTGGAGAT GGTGGAAAAC TCAACTGATC CTCACCCCAT CAAAATCACT 900 

CAGGTGCGGT GGAATGATCC CGGTGCACAA TATTCCTTAG TTGACAAAGA GAAGCTGCCA 960 

AGATTCCCAT TTTCAATTGA CCAGGAAGGA GATATTTACG TGACTCAGCC CTTGGACCGA 1020 

GAAGAAAAGG ATGCATATGT TTTTTATGCA GTTGCAAAGG ATGAGTACGG AAAACCACTT 1080 

TCATATCCGC TGGAAATTCA TGTAAAAGTT AAAGATATTA ATGATAATCC ACCTACATGT 1140 

CCGTCACCAG TAACCGTATT TGAGGTCCAG GAGAATGAAC GACTGGGTAA CAGTATCGGG 1200 

ACCCTTACTG CACATGACAG GGATGAAGAA AATACTGCCA ACAGTTTTCT AAACTACAGG 1260 

ATTGTGGAGC AAACTCCCAA ACTTCCCATG GATGGACTCT TCCTAATCCA AACCTATGCT 1320 

GGAATGTTAC AGTTAGCTAA ACAGTCCTTG AAGAAGCAAG ATACTCCTCA GTACAACTTA 1380 

ACGATAGAGG TGTCTGACAA AGATTTCAAG ACCCTTTGTT TTGTGCAAAT CAACGTTATT 1440 

GATATCAATG ATCAGATCCC CATCTTTQAA AAATCAGATT ATGGAAACCT GACTCTTGCT 1500 

GAAGACACAA ACATTGGGTC CACCATCTTA ACCATCCAGG CCACTGATGC TGATGAGCCA 1560 

TTTACTGGGA GTTCTAAAAT TCTGTATCAT ATCATAAAGG GAGACAGTGA GGGACGCCTG 1620 

GGGGTTGACA CAGATCCCCA TACCAACACC GGATATGTCA TAATTAAAAA GCCTCTTGAT 1680
TTTGAAACAG CAGCTGTTTC CAACATTGTG TTCAAAGCAG AAAATCCTGA GCCTCTAGTG 1740 
TTTGGTGTGA AGTACAATGC AAGTTCTTTT GCCAAGTTCA CGCTTATTGT GACAGATGTG 1800 
AATGAAGCAC CTCAATTTTC CCAACACGTA TTCCAAGCGA AAGTCAGTGA GGATGTAGCT 1860 
ATAGGCACTA AAGTGGGCAA TGTGACTGCC AAGGATCCAG AAGGTCTGGA CATAAGCTAT 1920 
TCACTGAGGG GAGACACAAG AGGTTGGCTT AAAATTGACC ACGTGACTGG TGAGATCTTT 1980
AGTGTGGCTC CATTGGACAG AGAAGCCGGA AGTCCATATC GGGTACAAGT GGTGGCCACA 2040
GAAGTAGGGG GGTCTTCCTT GAGCTCTGTG TCAGAGTTCC ACCTGATCCT TATGGATGTG 2100
AATGACAACC CTCCCAGGCT AGCCAAGGAC TACACGGGCT TGTTCTTCTG CCATCCCCTC 2160
AGTGCACCTG GAAGTCTCAT TTTCGAGGCT ACTGATGATG ATCAGCACTT ATTTCGGGGT 2220
CCCCATTTTA CATTTTCCCT CGGCAGTGGA AGCTTACAAA ACGACTGGGA AGTTTCCAAA 2280
ATCAATGGTA CTCATGCCCG ACTGTCTACC AGGCACACAG AGTTTGAGGA GAGGGAGTAT 2340
GTCGTCTTGA TCCGCATCAA TGATGGGGGT CGGCCACCCT TGGAAGGCAT TGTTTCTTTA 2400
CCAGTTACAT TCTGCAGTTG TGTGGAAGGA AGTTGTTTCC GGCCAGCAGG TCACCAGACT 2460
GGGATACCCA CTGTGGGCAT GGCAGTTGGT ATACTGCTGA CCACCCTTCT GGTGATTGGT 2520
ATAATTTTAG CAGTTGTGTT TATCCGCATA AAGAAGGATA AAGGCAAAGA TAATGTTGAA 2580
AGTGCTCAAG CATCTGAAGT CAAACCTCTG AGAAGCTGAA TTTGAAAAGG AATGTTTGAA 2640
TTTATATAGC AAGTGCTATT TCAGCAACAA CCATCTCATC CTATTACTTT TCATCTAACG 2700
TGCATTATAA TTTTTTAAAC AGATATTCCC TCTTGTCCTT TAATATTTGC TAAATATTTC 2760
TTTTTTGAGG TGGAGTCTTG CTCTGTCGCC CAGGCTGGAG TACAGTGGTG TGATCCCAGC 2820
TCACTGCAAC CTCCGCCTCC TGGGTTCACA TGATTCTCCT GCCTCAGCTT CCTAAGTAGC 2880
TGGGTTTACA GGCACCCACC ACCATGCCCA GCTAATTTTT GTATTTTTAA TAGAGACGGG 2940
GTTTCGCCAT TTGGCCAGGC TGGTCTTGAA CTCCTGACGT CAAGTGATCT [missing] 3000
GTCTCCCAAT ACAGGCATGA ACCACTGCAC CCACCTACTT AGATATTTCA TGTGCTATAG 3060
ACATTAGAGA GATTTTTCAT TTTTCCATGA CATTTTTCCT CTCTGCAAAT GGCTTAGCTA 3120
CTTGTGTTTT TCCCTTTTGG GGCAAGACAG ACTCATTAAA TATTCTGTAC ATTTTTTCTT 3180
TATCAAGGAG ATATATCAGT GTTGTCTCAT AGAACTGCCT GGATTCCATT TATGTTTTTT 3240
CTGATTCCAT CCTGTGTCCC CTTCATCCTT GACTCCTTTG GTATTTCACT GAATTTCAAA 3300
CATTTGTCAG AGAAGAAAAA OGTGAGGACT CAGGAAAAAT AAATAAATAA AAGAACAGCC 3360
TTTTCCCTTA GTATTAACAG AAATGTTTCT GTGTCATTAA CCATCTTTAA TCAATGTGAC 3420
ATGTTGCTCT TTGGCTGAAA TTCTTCAACT TGGAAATGAC ACAGACCCAC AGAAGGTGTT 3480
CAAACACAAC CTACTCTGCA AACCTTGGTA AAGGAACCAG TCAGCTGGCC AGATTTCCTC 3540
ACTACCTGCC ATGCATACAT GCTGCGCATG TTTTCTTCAT TCGTATGTTA GTAAAGTTTT 3600
GGTTATTATA TATTTAACAT GTGGAAGAAA ACAAGACATG AAAAGAGTGG TGACAAATCA 3660
AGAATAAACA CTGGTTGTAG TCAGTTTTGT TTGTTAA 3697



Seq ID NO: C20 DNA Sequence 
Nucleic Acid Accession #: NM_004443
Coding sequence: 28.. 3024 

1 11 21 31 41 51 

| | | | | |

GGCTCGGCTC CTAGAGCTGC CACGGCCATG GCCAGAGCCC GCCCGCCGCC GCCGCCGTCG 60 

CCGCCGCCGG GGCTTCTGCC GCTGCTCCCT CGGCTGCTGC TGCTGCCGCT GCTGCTGCTG 120 

CCCGCCGGCT GCOGGGCGCT GGAAGAGACC CTCATGGACA CAAAATGGGT AACATCTGAG 180 

TTGGCGTGGA CATCTCATCC AGAAAGTGGG TGGGAAGAGG TGAGTGGCTA CGATGAGGCC 240 

ATGAATCCCA TCCGCACATA CCAGGTGTGT AATGTGCGCG AGTCAAGCCA GAACAACTGG 300 

CTTCGCACGG GGTTCATCTG GCGGCGGGAT GTGCAGCGGG TCTACGTGGA GCTCAAGTTC 360 

ACTGTGCGTG ACTGCAACAG CATCCCCAAC ATCCCCGGCT CCTGCAAGGA GACCTTCAAC 420 

CTCTTCTACT ACGAGGCTGA CAGCGATGTG GCCTCAGCCT CCTCCCCCTT CTGGATGGAG 480 

AACCCCTACG TGAAAGTGGA CACCATTGCA CCCGATGAGA GCTTCTCGCG GCTGGATGCC 540 

GGCCGTGTCA ACACCAAGGT GCGCAGCTTT GGGCCACTTT CCAAGGCTGG CTTCTACCTG 600 

GCCTTCCAGG ACCAGGGCGC CTGCATGTCG CTCATCTCCG TGCGCGCCTT CTACAAGAAG 660 

TGTGCATCCA CCACCGCAGG CTTCGCACTC TTCCCCGAGA CCCTCACTGG GGCGGAGCCC 720 

ACCTOGCTGG TCATTGCTCC TGGCACCTGC ATCCCTAACG CCGTGGAGGT GTCGGTGCCA 780 

CTCAAGCTCT ACTGCAACGG CGATGGGGAG TGGATGGTGC CTGTGGGTGC CTGCACCTGT 840 

GCCACCGGCC ATGAGCCAGC TGCCAAGGAG TCCCAGTGCC GCCCCTGTCC CCCTGGGAGC 900 

TACAAGGCGA AGCAGGGAGA GGGGCCCTGC CTCCCATGTC CCCCCAACAG CCGTACCACC 960 

TCCCCAGCCG CCAGCATCTG CACCTGCCAC AATAACTTCT ACCGTGCAGA CTCGGACTCT 1020 

GCGGACAGTG CCTGTACCAC CGTGCCATCT CCACCCCGAG GTGTGATCTC CAATGTGAAT 1080 

GAAACCTCAC TGATCCTCGA GTGGAGTGAG CCCCGGGACC TGGGTGGCCG GGATGACCTC 1140 

CTGTACAATG TCATCTGCAA GAAGTGCCAT GGGGCTGGAG GGGCCTCAGC CTGCTCAOGC 1200 

TGTGATGACA ACGTGGAGTT TGTGCCTCGG CAGCTGGGCC TGACGGAGCG CCGGGTCCAC 1260 

ATCAGCCATC TGCTGGCCCA CACGCGCTAC ACCTTTGAGG TGCAGGCGGT CAACGGTGTC 1320 

TCGGGCAAGA GCCCTCTGCC GCCTCGTTAT GCGGCCGTGA ATATCACCAC AAACCAGGCT 1380 

GCCCCGTCTG AAGTGCCCAC ACTACGCCTG CACAGCAGCT CAGGCAGCAG CCTCACCCTA 1440 

TCCTGGGCAC CCCCAGAGCG GCCCAACGGA GTCATCCTGG ACTACGAGAT GAAGTACTTT 1500 

GAGAAGAGCG AGGGCATCGC CTCCACAGTG ACCAGCCAGA TGAACTCCGT GCAGCTGGAC 1560 

GGGCTTCGGC CTGACGCCCG CTATGTGGTC CAGGTCCGTG CCCGCACAGT AGCTGGCTAT 1620 

GGGCAGTACA GCCGCCCTGC CGAGTTTGAG ACCACAAGTG AGAGAGGCTC TGGGGCCCAG 1680 

CAGCTCCAGG AGCAGCTTCC CCTCATCGTG GGCTCCGCTA CAGCTGGGCT TGTCTTCGTG 1740 

GTGGCTGTCG TGGTCATCGC TATCGTCTGC CTCAGGAAGC AGCGACACGG CTCTGATTCG 1800 

GAGTACACGG AGAAGCTGCA GCAGTACATT GCTCCTGGAA TGAAGGTTTA TATTGACCCT 1860 

TTTACCTACG AGGACGCTAA TGAGGCTGTT CGGGAGTTTG CCAAGGAGAT CGACGTGTCC 1920 

TGCGTCAAGA TCGAGGAGGT GATCGGAGCT GGGGAATTTG GGGAAGTGTG CCGTGGTCGA 1980 

CTGAAACAGC CTGGCCGCCG AGAGGTGTTT GTGGCCATCA AGACGCTGAA GGTGGGCTAC 2040 

ACCGAGAGGC AGCGGCGGGA CTTCCTAAGC GAGGCCTCCA TCATGGGTCA GTTTGATCAC 2100 

CCCAATATAA TCCGGCTCGA GGGCGTGGTC ACCAAAAGTC GGCCAGTTAT GATCCTCACT 2160 

GAGTTCATGG AAAACTGCGC CCTGGACTCC TTCCTCCGGC TCAACGATGG GCAGTTCACG 2220 

GTCATCCAGC TGGTGGGCAT GTTGCGGGGC ATTGCTGCCG GCATGAAGTA CCTGTCCGAG 2280 

ATGAACTATG TGCACCGCGA CCTGGCTGCT OGCAACATCC TTGTCAACAG CAACCTGGTC 2340 

TGCAAAGTCT CAGACTTTGG CCTCTCCCGC TTCCTGGAGG ATGACCCCTC CGATCCTACC 2400 

TACACCAGTT CCCTGGGCGG GAAGATCCCC ATCCGCTGGA CTGCCCCAGA GGCCATAGCC 2460 

TATCGGAAGT TCACTTCTGC TAGTGATGTC TGGAGCTACG GAATTGTCAT GTGGGAGGTC 2520 

ATGAGCTATG GAGAGCGACC CTACTGGGAC ATGAGCAACC AGGATGTCAT CAATGCCGTG 2580 

GAGCAGGATT ACOGGCTGCC ACCACCCATG GACTGTCCCA CAGCACTGCA CCAGCTCATG 2640 

CTGGACTGCT GGGTGCGGGA CCGGAACCTC AGGCCCAAAT TCTCCCAGAT TGTCAATACC 2700 

CTGGACAAGC TCATCCGCAA TGCTGCCAGC CTCAAGGTCA TTGCCAGCGC TCAGTCTGGC 2760 

ATGTCACAGC CCCTCCTGGA CCGCACGGTC CCAGATTACA CAACCTTCAC GACAGTTGGT 2820 
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GATTGGCTGG 
GCATCTTTTG 
CTGGCOGGCC 
CAGACGCTGC 
GATGCCAAGC 
TGTGGCCCAG 
TCCCTCCCCA 
TGATGACCCC 
CCACAACCTC 
TAAGCCGGGG 
AGCAGTCCCT 
CCATCCTGAA 
CCCTGTCCCC 
AGTTGCCCTT 
TGTGTGTGTG 
GCATGGGTGA 
AGCAGAATAA 



ATGCCATCAA 
ACCTGGTGGC 
ACCAGAAGAA 
CTGTGCAGGT 
AGCCGGCTGG 
AAGCTGGAAG 
GGAAGTGCGC 
TCCCCAAGCC 
ACACTTGTCT 
TTCCACAGGG 
CCCTCAGGAA 
GCCAGCTTGC 
ACCCCCGCCC 
TGCCCCCCAG 
TGTGCGCGCG 
GCGTGTAAAA 
AGGCAATAAG 



GATGGGGCGG 
CCAGATGACG 
GATCCTGAGC 
CTGACACCGG 
ACTTTCGGAC 
TTTGGGAAAG 
CCCAAACCTC 
CCTCAGGGCC 
GTTCTTCAGT 
CCCAGCCCTG 
CTGGAGGAGG 
ACCTCCAGTT 
TTGGTGCTGT 
AGACTGACTC 
CGCGCGCGTG 
GCTTGGCCCT 
ATGAA 



TACAAGGAGA 
GCAGAAGACC 
AGTATCCAGG 
CTCCCACGGG 
TCTTGGACTT 
GCCCAAGCTG 
TTCATATTGA 
CAGACCTTCC 
GCTGGAGGTC 
GCAGGGGTCT 
GGACTCCAGG 
TGCACAGGGA 
CATAAAAGGG 
TCAGAGCCAG 
TGTGTGTGCA 
GTGCCCTACA 



GCTTCGTCAG 
TGCTCCGTAT 
ACATGCGGCT 
GACCCTGAGG 
TTGGATGCCT 
GGACTTCTCC 
AGATGGATTA 
TGCTCTCCAG 
CTGGCAGGGT 
GGCCCCCCAG 
AATGGGGAAA 
TTTGTCCTGG 
CAGGCAGGGG 
AGATGGGATG 
CGCACTGGCC 
ATGGGGCCAG 



TGCGGGGTTT 
TGGGGTCACC 
GCAGATGAAC 
ACCGTGCAGG 
GGCCTTAGGC 
AGGCCTGTGT 
GGAGAGGGGG 
CAGGGGATCC 
CAGGCTGGGG 
GTAGGCGGAG 
TGTGACACCA 
GGGCTGAGGG 
CAGGCTGAGG 
TGTGAGTGTG 
TGCACAGAGA 
CTGGGCCGAC 



Seq ID NO: C21 DNA Sequence 
Nucleic Acid Accession #: NM_001804 
Coding sequence: 82.. 879 



AGGTGAGCGG 
GCGGGGGACC 
TACCCCGGCC 
GCCCCGCCCC 



GCCGCCTACG 
CCCCCTCCAG 
CAGCCCCTCG 
GAGTGGATGC 
AAGGACAAGT 
CATTACAGCC 
ACTGAACGGC 
AAGAAGAAAC 
ACCCCAGCCG 
TCCTCTCCAA 
CGGGGGACCT 
CCGAGTCTCA 
CCCCTTGGCC 
TAGACCTCTG 
AAGCCCCAAA 
TACGTTCAGA 
ATGGAGCTGA 
AACTCACACC 
GCCCTTCCCA 
CACTGGGTTA 
GGAGGCCCTG 
CAGGCTGCCC 
CACAGCTGGA 
TTTTTCTTCT 



11 
I 

TTGCTCGTCG 
CCGCGGCCAC 
CAGCCAGGCC 
CGGCGCCCCC 
CCCCGACGGC 
GCCCGGGCCC 
ACTTTAGCCC 
GGGGCCCGGG 
GGCGCAGCGT 
ACCGCGTGGT 
GTTACATCAC 
AGGTGAAGAT 
AGCAGCAGCA 
GGCCATCCCT 
TGCCTGTGAA 
GGGGACTCGG 
GCCCTGACCT 
CATTGTGTGC 
GGGGATAAGG 
TGGTTGGGGG 
GGTGCAGCTG 
AAAAGATGGA 
TGCCTCTTCT 
GGCTGGATAC 
CAGAATCACA 
GGAGGAGAGA 
ATCAGGGCCC 
CAGGTGTTGT 
CTTTTCAAA 



21 

! 

TCGGGGCGGC 
CATGTATGTG 
AGCCAGCCTC 
GCAGTACCCC 
CTGGGGGGCG 
CGCGGCCCCT 
GGTGCCGGCG 
CACACCGTCC 
GGCGGCCGGA 
CTACACCGAC 
AATCCGGCGG 
CTGGTTCCAA 
ACAGCCCCCA 
GGGGGGCCTG 
AGAGGAGTTT 
GTGCTGGGAG 
TCTGGGACAT 
AGTAAGCCTG 
GAGTCCAGGG 
AGGGGCCTAG 
GAGGCCTGTG 
ATGCTTGCAG 
GCAGCCTCAC 
TAAGCACAAA 
GCCCTCATGA 
AGGGGGCAGA 
AGGGAGCCCC 
ATATAGAGTG 



31 
I 

CGGCAGCGGC 
GGCTATGTGC 
GGCCTGGGCC 
GACTTCTCCA 
CCCTTCCCTG 
GCCGCCAGCC 
CCCCCTGGGC 
TCGCCCGGAG 
GGCGGCGGTG 
CACCAACGCC 
AAATCAGAGC 
AACCGGCGGG 
CAGCCGCCGA 
TGTCCCAGCA 
CTGCCATAGC 
TGTGGCTCCT 
GGTGGACAGT 
TTGGATAAAG 
TGGATGATCT 
ACAAGGCTCC 
TGGGGACCAC 
AGCATGACCT 
CTCTACCTGC 
GCCCATAGCA 
TCATTCTCAG 
GTCTTCCCTA 
CAGAGGACTT 
GAATCTCTTG 



41 
I 

GGCTCCAGGG 
TGGACAAGGA 
CGGCAAACTA 
GCTACTCTCA 
CGCCCAAGGA 
CAGCTTCGCT 
CCGGCCCGGG 
CGCAGAGGCC 
GCAGCGGTAA 
TGGAGCTGGA 
TGGCTGCCAA 
CAAAGGAGCG 
TGGCCCACGA 
ACACCAGCCT 
CCCATGCCCA 
GTGGGCCCAG 
CACCTATCCA 
ACCTTCCAGC 
CAATCTCCCG 
AGGCCCCACC 
ACTGATCCTG 
GAGGAGGGAG 
CCCCATCATA 
CTGGGCTCTG 
TGAGGGCTCT 
CCAGGTTTCT 
TATTCGGACC 
GATGCAGCTT 



CCCAGCATGC 

CGGCCCCCCG 
CGTGGAGCCG 
CGACTGGGCC 
GGCATTCGGG 
CCTCCTGGCG 
GACGCCCTAC 
GACTCGGACC 
GAAGGAGTTT 
TCTGGGGCTC 
CAAAGTGAAC 
CATCACGGCC 
CCTGGCCACC 
GCCTGTGCGC 
GAGGTCTGGT 
CCCTCTGCAT 
TCCTGTGTTC 
TGGGCATCTC 
TCCTCCTCCA 
GAGAAAAGGG 
GAACGTGGTC 
AGGGCACTGA 
ATGGCTGCTC 
GGATTGAGAG 
ACACCCCCGC 
AAGCAGAGCT 
CAAGAATAAA 



Seq ID NO: C22 DNA Sequence 
Nucleic Acid Accession #: NM_021978 
Coding sequence i 3 6 .. 2 60 3 



GACGCCTGTG 
GCGGAGGGGG 
TGAATGGCTT 
AGCATGGCCC 
TGCTGGGGAT 
TCTTCAATGG 
ACTCCACTGA 
GCGGAGTCCC 
GCAGCGTCAT 
CCGAGCGCGT 
AGTCCTTTGT 
CCCAGGACAA 
CCACGCCCGG 
GGGGGGACGC 
ACGAGCGCGG 
CCCTGGTGCA 
AGAACGTCCT 
CCACCTTCTT 
GGACATTCAA 
ACATTGAGGT 
AGCCCGGCGT 
ACTGCGGAGA 
TCCACTCAGA 



11 
I 

AGACCCGCGA 
CCCGAAGGAC 
GGAGGAAGGC 
GGGGCGCTGG 
CGGCTTCCTG 
CTACATGAGG 
GTTTGTAAGC 
ATTCCTGGGC 
CGCCTACTAC 
CATGGCCGAG 
GGTCACCTCA 
CAGCTGCAGC 
CTTCCCTGAC 
CGACTCAGTG 
CAGCGACCTG 
GTTGTGTGGC 
GCTCATCACA 
CCAGCTGCCT 
CAGCCCCTAC 
GCCCAACAAC 
GCCTGCGGGC 
GAGGTCCCAG 
TCAGTCCTAC 



21 
I 

GCGGCCTCGG 
TTCGGCGCGG 
GTGGAGTTCC 
GTGGTGCTGG 
GTGTGGCATT 
ATCACAAATG 
CTGGCCAGCA 
CCCTACCACA 
TGGTCTGAGT 
GAGCGCGTAG 
GTGGTGGCTT 
TTTGGCCTGC 
AGCCCCTACC 
CTGAGCCTCA 
GTGACGGTGT 
ACCTACCCTC 
CTGATAACCA 
AGGATGAGCA 
TACCCAGGCC 
CAGCATGTGA 
ACCTGCCCCA 
TTCGTCGTCA 
ACCGACACCG 



31 
I 

GGACCATGGG 
GACTCAAGTA 
TGCCAGTCAA 
CAGCCGTGCT 
TGCAGTACCG 
AGAATTTTGT 
AGGTGAAGGA 
AGGAGTCGGC 
TCAGCATCCC 
TCATGCTGCC 
TCCCCACGGA 
ACGCCCGCGG 
CCGCTCATGC 
CCTTCCGCAG 
ACAACACCCT 
CCTCCTACAA 
ACACTGAGCG 
GCTGTGGAGG 
ACTACCCACC 
AGGTGCGCTT 
AGGACTACGT 
CCAGCAACAG 
GCTTCTTAGC 



41 
I 

GAGCGATOGG 
CAACTCCCGG 
CAACGTCAAG 
GATCGGCCTC 
GGACGTGCGT 
GGATGCCTAC 
CGCGCTGAAG 
TGTGAOGGCC 
GCAGCACCTG 
CCCGCGGGCG 
CTCCAAAACA 
TGTGGAGCTG 
CCGCTGCCAG 
CTTTGACCTT 
GAGCCCCATG 
CCTGACCTTC 
GCGGCATCCC 
CCGCTTACGT 
CAACATTGAC 
CAAATTCTTC 
GGAGATCAAT 
CAACAAGATC 
TGAATACCTC 



51 
I 

GCCCGCAAGG 
CACGAGAAAG 
AAGGTGGAAA 
CTCTTGGTCT 
GTCCAGAAGG 
GAGAACTCCA 
CTGCTGTACA 
TTCAGOGAGG 
GTGGAGGAGG 
CGCTCCCTGA 
GTACAGAGGA 
ATGCGCTTCA 
TGGGCCCTGC 
GCGTCCTGCG 
GAGCCCCACG 
CACTCCTCCC 
GGCTTTGAGG 
AAAGCCCAGG 
TGCACATGGA 
TACCTGCTGG 
GGGGAGAAAT 
ACAGTTCGCT 
TCCTACGACT 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3805 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

X080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1699 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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5 

10 
15 
20 
25 
30 



CCAGTGACCC 
TGCGCTGTGA 
ACGCCGGCCA 
ACAGTGTGAA 
CCTTCAGGTG 
ACTGTGGGGA 
AACACACCTA 
GGAAGGAGGA 
TCACGAGACA 
AGGTAAGCCT 
ACTGGCTGGT 
CCACGCAGTG 
TGCAGGAGCG 
ACTATGACAT 
CCATCTGCCT 
GCTGGGGACA 
GCGTCATCAA 
TGTGCGTGGG 
TGTCCAGCGT 
GCTGCGCTCA 
TCAAAGAGAA 
ACCCATCGTC 
AGCGCCCCCA 
CCTCTCGCTT 
TTCTACTGAC 
CTGGGCCGAG 
GAGCTTCGGA 
TGGGCCACGC 
ACTGAAAATG 
AACATTTTAT 



ATGCCCGGGG 
TGGCTGGGCC 
CCAGTTCACG 
CGACTGCGGA 
TTCCAATGGG 
CGGGTCCGAC 
CCGCTGCCTC 
CTGTAGCGAC 
GGCTCGTGTT 
GCATGCTCTG 
CTCTGCCGCA 
GACGGCCTTC 
CAGGCTCAAG 
CGCGCTGCTG 
GCCGGACGCC 
CACCCAGTAT 
CCAGACCACC 
CTTCCTCAGC 
GGAGGCGGAT 
GAGGAACAAG 
CACTGGGGTA 
CACCCCAGTG 
GAACATACAC 
CCTCAGCCTC 
CCAACTGGGG 
GCGCGTTTGT 
GCCTCCTCAG 
TCTTGAGGAA 
GTTTACCAGC 
TTCTTTTTAA 



CAGTTCACGT 
GACTGCACCG 
TGCAAGAACA 
GACAACAGCG 
AAGTGCCTCT 
GAGGCCTCCT 
AATGGGCTCT 
GGCTCAGATG 
GTTGGGGGCA 
GGCCAGGGCC 
CACTGCTACA 
CTGGGCTTGC 
CGCATCATCT 
GAGCTGGAGA 
TCCCATGTCT 
GGAGGCACTG 
TGCGAGAACC 
GGCGGCGTGG 
GGGCGGATCT 
CCAGGCGTGT 
TAGGGGCCGG 
TGCACGCCTG 
TGTGAACTCA 
CAAAGTGGAG 
GCAAAGGTTT 
GTATATCTGC 
TGAAGGTGGT 
GCCCAGGCTC 
TCCCAGGTGA 
AAAAAAAAA 



GCCGCACGGG 
ACCACAGCGA 
AGTTCTGCAA 
ACGAGCAGGG 
CGAAAAGCCA 
GCCCCAAGGT 
GCTTGAGCAA 
AGAAGGACTG 
CGGATGCGGA 
ACATCTGCGG 
TCGATGACAG 
ACGACCAGAG 
CCCACCCCTT 
AACCGGCAGA 
TCGGTGCCGG 
GCGCGCTGAT 
TCCTGCCGCA 
ACTCCTGCCA 
TCCAGGCCGG 
ACACAAGGCT 
GGCCACCCAA 
CAGGCTGGAG 
ATCTCCAGGG 
CTGGGAGGTA 
GAAGACACAG 
CTCCCCTGTC 
GGGGCTGCCG 
GGAGGACCCT 
CTTCAGTGTG 



GCGGTGTATC 
TGAGCTCAAC 
GCCCCTCTTC 
GTGCAGTTGT 
GCAGTGCAAT 
GAACGTCGTC 
GGGCAACCCT 
CGACTGTGGG 
TGAGGGCGAG 
TGCTTCCCTC 
AGGATTCAGG 
CCAGCGCAGC 
CTTCAATGAC 
GTACAGCTCC 
CAAGGCCATC 
CCTGCAAAAG 
GCAGATCACG 
GGGTGATTCC 
TGTGGTGAGC 
CCCTCTGTTT 
ATGTGTACAC 
ACTGGACCGC 
CTCCAAATCT 
GAAGGGGAGG 
CCTCCCCCGC 
TGTAAGGAGC 
GATCTGGGCT 
GGAAAACAGA 
TGTATTGTGT 



CGGAAGGAGC 
TGCAGTTGCG 
TGGGTCTGCG 
CCGGCCCAGA 
GGGAAGGACG 
ACTTGTACCA 
GAGTGTGACG 
CTGCGGTCAT 
TGGCCCTGGC 
ATCTCTCCCA 
TACTCAGACC 

TTCACCTTCG 
ATGGTGCGGC 
TGGGTCACGG 
GGTGAGATCC 
CCGCGCATGA 
GGGGGACCCC 
TGGGGAGACG 
CGGGACTGGA 
CTGCGGGGCC 
TGACTGCACC 
GCCTAGAAAA 
ACACTGGTGG 
CAGCCCCAAG 
AGCGGGAACG 
GTGGGGCCCT 
CGGGTCTGAG 
AAATGAGTAA 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
Seq ID NO: C23 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2268 

1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAAGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 
TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 
AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 
AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 
CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 
CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 
ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 
GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 
AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 
ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 
CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 
AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 
AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 
TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 
CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 
AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 
CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 
CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 
CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 
CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 
GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 
GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 
GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 
TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 
CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 
TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 
TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 
ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 
GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 
GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 
GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 
TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 
CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 
CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CGGCCAGTCA ACCTCTGCAA ACCCAGCCCG 
TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 
GGCTGGGAGG GCCCCCACTG CGAGAACCGA TTCTTGAGAC GCCCCTGA 

Seq ID NO: C24 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2424 

1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 
TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 
AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 
AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 
CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 
CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 
ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 
GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 
AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 
ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 
CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 
AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 
AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 
TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 
CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 
AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 
CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 
CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 
CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 
CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 
GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 
GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 
GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 
TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 
CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 
TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 
TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 
ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 
GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 
GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 
GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 
TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 
CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 
CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 
TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 
GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 
CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 
CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 
AATGTCTGTG CCCCAGGTCC TTAG 

Seq ID NO: C25 DNA Sequence 

Nucleic Acid Accession #: XM_097386.3 

Coding sequence: 142.. 795 



1 
I 

CTCGCAGAAC 
GGGACCCCCG 
CTACCTCAGG 
TCCCGTGCAC 
CCTGGACCCC 
GGGTCGAGGG 
GTGTCTGTGG 
TTGAGCCTGC 
CGAGGCTGGG 
GGAAATGGGA 
CCGTGTCTGC 
TCTCCTGCCA 
CAGCACCCCC 
CTGGATGGGT 
CACAGTCAAG 
GGAACCTTCC 
ACGGGGGGCA 
TCTTATCATG 
CCTTGGCAAA 
GTTTGTCAGA 
AACTTAAAAA 
GTCTCGCCCT 
GTATTAAAAG 




GGCGGTTGCA 
AGAGGCTCCT 
CCGCCTGTCG 
CCTTATGCAG 
TCTAGTTCAC 
GCCAACGGGG 
GTGGGACCCG 
GCTGGGGTGG 
TTTTACCCAC 
TGTAAACTCA 
CGGCGTGGGA 
AATTTTTAAA 
GGCCGCGGCA 
GAAAAAA 



21 
I 

CTGTCCGTGT 
GGGCCCCGCG 
GATGCCAAAA 
AGCAGCCACT 
GTGCACCTGC 
TCCCTCGGGC 
CTCCCGGCGG 
GATGATTGTA 
GGAGAAGGAA 
GAGGGTCTAT 
CCCTGCCTCA 
GTGGGGGTTT 
ACTGGGAGGG 
TTGGGACCGT 
GCTCCCCCTG 
TAAGTGGCTG 
TTGTTAAGGG 
CTTGTCCCTT 
GCCTTTCATT 
ACCACGCCTG 
AAACATAAAA 
GGGTGGCCTG 




GGTGGCCCCG 
AAATTCGGTG 
GGTCGGGCAG 
GGGGCCTGGC 
CTCTGAGATG 
TCCAGAAAGG 
TCACGCATCT 
TTTTCCCCAA 
CATGACGTGT 
AAACTCAGGT 
CTACTCTCTA 
TAACAATTTC 



41 
I 

GCCTCCAGGG 
CGCCTGCTGC 
TGGGGTGCCT 
TGCCTGCTGC 
GCCTGGGGAG 
TGTCGGGAAT 
TGGATTCTAA 
GGGTGGGTGG 
GCAGTGGCCA 
CCTGGATACT 
TTCAACCCCA 
TGGCTTTCTG 
TCCCCTCAGC 
TGCGTACTGA 
TTGGGAGAAA 
GGGGAGGGTG 
GTACAGTTGA 
TTGTGCTTTT 
GAAATTTCAG 
AATAGGAGGA 
CCTCTGGCTG 
AGTTTTCGCA 



51 
I 

CTCCTCTCCC 
CTATCTCTGT 
CTCCCCCGCC 
ATCTGGCGGC 
GGGCCGTGCA 
CCTGCGTGTC 
CTCAGAGGAC 
GGGGTGGGCG 
TGCTGGCCGT 
CGGCAGGAAG 
GCCGTGCCCA 
GGGTGCAGCT 
CACGAGGAGC 
GTGGGTGCCC 
GGCGGCTTCT 
GGCACGGGGC 
ATTTCCTTTC 
GCATTTTTTT 
TTTCTCTGGA 
AAAAAAAAAA 
GGCCCAGCCT 
GAACATTCAG 



Seq ID NO: C26 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 95.. 2126 



1 
I 

GGGGTAGTTT 
CTCCTGGAAA 
TGCTTCCTGG 
CACACTGAGA 
GATCAATATA 
GCTTCTGAAA 
CTTCACCCCT 
GCGAGTCAGC 



11 
I 

GTAGGGACGC 
GGAGAGACAC 
GAGCTTGTGG 
GTATCTGGAG 
TCCCAAGACT 
ACCCACTGGA 
CAGCATAAAA 
TTCTCAGCTG 



21 
I 

AGCTCTCCAC 
CAGCATTTGC 
TCCGCGTTGA 
ACCTTCATGT 
GGTCAGACTT 
CCCTGGACAA 
TGCTGCGCCT 
TGGTTTTTAA 



31 

I 

GTGCGCGACT 
CACAATGCTG 
CCATCCCAAT 
TGGAGGAGTG 



ATATGGGGTC 
TCGTCTGCCG 
AGCTGTCAGT 



41 
I 

GCGAGGCTGG 
TCATCCACTG 
GAAGAGCAGC 
ATGCTCAAGT 
TGGGAACAGA 
CAGGCAGATG 
AATTTGAAGA 
GATATCTGCA 
51 
I 

ACGCTACGGG 
ACTTTACATT 
AGAAAGACGT 
TAGTAGAACA 
AGCATTGCTG 
CAAAGCTTCT 
TGGTGAGGTT 
AAATCCTGAA 
TATTAGAAGA TCAGAAGAGC TTTCCTTGTT AAAGCCGTCT GGTGACTATT TTAAGAAGAA 540 

GAAGAAAAAA GACAAAAATA ATAAGGAACC CATAATTGAA GATATTCTAA ACCTGGAGAG 600 

TTCTCCAACA GCTTCAGGTT CATCAGTAAG TCCTGGTTTA TACAGTAAAA CCATGACCCC 660 

TATATATGAC CCCATCAATG GAACACCAGC ATCATCCACC ATGACTTGGT TCAGTGACAG 720 

CCCTTTGACG GAACAAAACT GCAGCATCCT CGCATTCAGC CAACCCCCCC AGTCCCCAGA 780 

AGCACTTGCG GATATGTACC AGCCTCGGTC TCTGGTTGAT AAAGCCAAGC TCAATGCAGG 840 

TTGGCTAGAC TCCTCACGCT CCCTTATGGA ACAAGGCATC CAAGAGGATG AGCAGCTGCT 900 

CTTACGATTT AAATATTATT CTTTCTTCGA CTTGAATCCT AAATATGATG CTGTCCGAAT 960 

AAACCAACTC TATGAGCAAG CCAGGTGGGC CATTCTCTTA GAAGAAATTG ATTGCACAGA 1020 

GGAAGAAATG TTGATCTTTG CAGCTCTACA GTACCACATT AGCAAACTGT CGTTGTCTGC 1080 

TGAAACACAG GATTTTGCAG GCGAGTCCGA GGTTGATGAA ATAGAAGCGG CGCTTTCTAA 1140 

TTTGGAAGTA ACCCTAGAAG GTGGAAAAGC GGACAGCCTT TTGGAGGACA TTACTGATAT 1200 

CCCTAAACTT GCAGATAATC TCAAATTATT TAGGCCCAAG AAGTTACTAC CAAAAGCTTT 1260 

CAAACAATAT TGGTTTATCT TTAAAGACAC ATCCATAGCA TACTTTAAAA ATAAGGAACT 1320 

TGAACAAGGA GAACCACTAG AAAAACTAAA TCTTAGAGGC TGCGAAGTTG TGCCCGATGT 1380 

AAATGTAGCA GGAAGAAAAT TTGGAATCAA GTTACTAATC CCTGTTGCCG ATGGTATGAA 1440 

TGAAATGTAT TTGAGATGTG ACCATGAGAA TCAATACGCC CAATGGATGG CTGCCTGCAT 1500 

GTTGGCATCG AAGGGCAAAA CCATGGCAGA CAGCTCCTAC CAGCCAGAGG TCCTCAACAT 1560 

CCTTTCATTT CTGAGGATGA AAAACAGGAA CTCTGCATCT CAGGTGGCTT CCAGTCTCGA 1620 

AAACATGGAT ATGAACCCAG AATGTTTTGT GTCACCACGG TGTGCAAAAA GACACAAATC 1680 

CAAACAGCTG GCCGCCCGGA TCCTGGAGGC GCACCAGAAC GTGGCCCAGA TGCCCCTGGT 1740 

CGAAGCCAAG CTGCGGTTCA TCCAGGCGTG GCAGTCACTG CCTGAGTTTG GCCTCACCTA 1800 

CTACCTTGTC AGATTTAAAG GAAGCAAAAA AGATGACATT CTGGGAGTTT CATATAACAG 1860 

GTTGATTAAA ATTGATGCAG CCACCGGGAT TCCAGTGACA ACATGGAGAT TCACAAATAT 1920 

CAAACAGTGG AATGTAAACT GGGAAACCCG GCAGGTGGTC ATCGAGTTTG ACCAAAACGT 1980 

CTTTACTGCT TTCACCTGCC TGAGTGCAGA TTGCAAGATT GTGCACGAGT ACATTGGCGG 2040 

CTACATTTTC TTGTCCACCC GCTCCAAGGA CCAGAATGAA ACACTCGATG AGGACTTGTT 2100 

CCACAAATTG ACCGGCGGTC AGGATTGAAA CAAGCACGCG TGCTCGGCTC ACACCAACAA 2160 

GGCAAGCCAA AGGCGCCCCT CCCCAGAGGG ATCCCTAACG TGCCCAGCAT GTAGATTCTG 2220 

GACTAACAGA CAACATACAT TCACCGCTGG TCACCCAGAT CCTCATTCAA ACCCACTGCT 2280 

GGCACATCCC TTTCCTTACT TTGCCCTGTG CTACCAGCCA CGGAAGGAGC CTCTCTTGTT 2340 

TTTTCTATAA AATGGGTAGG CAGGAGAAAA GCAGGTGCCC TAAGATTGCT CTAAGGCCCA 2400 

GCATGTGGTT ACAGTTCTCT GACTTGCAGA ACCTGCCAGG TGTATGGCTA CAAGTTATCC 2460 

TCGTGCTGAT CTGTCTCATT ACTAAGTCAA TGGAGAAGAC AGAAAGGTAA AAATCACGTG 2520 

TAGCAAGAAC AACTCTTATT TCACAAACTC AGGTATGAAA CGAAACGCCT GTCCTTCATG 2580 

GAACTGCTTT TAGCTCCTGT CTTTTCAAAA TGGCAGAGGG AGTTCCTACA CACACTTTTT 2640 

CCCTGGAGGC CAAGGTCTAG GGGTAGAAAG GGGAGGGGTG GGGCTACCAG GTAGCAGTTG 2700 

ACAACCCAAG GTCAGAGGAG TGGCCCTCAG TGTCATCTGT CCACAGTGAT ACCTGCCAAG 2760 

ATGACCACTG ACCCACATCT GGTCTTAGTC ATTGGTCTCC TCAGATTTCT GGGGCCACCT 2820 

GCAAGCCCCA TTCCATTCCT ACAGATCTCT CAGCCACCTG TAAGTCCTTT GTGAAGATGT 2880 

GGGTGACACA GGGGGACAGG AAAACCCATT TCTCAACCCA GATCCATGTC TCCACTGCTT 2940 

CTACTCTGGG TTGGGATTCA GGAAGACAGG CACAGTCCTC TCTGTTCATA GAAACACCTG 3000 

CCAGTGTCAA GGATTCCAGT CAGGTGTCTA TCCCAACTGG TCAGGGAGAG AAGGGCAGAC 3060 

CCATTCTCAA AGACCACCAT GTCCAAGGTC TGACAGCTCC CCACTGGCTG CCCCCACAGG 3120 

GGCTTTAGGC TGGTCTGGGT CATGGGGAAG CGTCCCTCTT ATCGCTGGTC TGTGTTCTCC 3180 

TGGATTTGGT ATCTATGTTG GTACGACTCC TGGCCTTTTA TCTAAAGGAC TTTGGCTTTT 3240 

GTAAATCACA AGCCAATAAT AGACTTTTTT CTCCCCCTCT GTTTTTTGCT GTGTCATCTC 3300 

TGCCTTGAGA CTGCCTTGAG ACAGTGCTTG CCTTGAGAGA GTGAGCCAAT TAACAGCTGC 3360 

CTGAATTGTC ATTTTCCATT TTGGTTTGTT AGAGGTGGGA GGGGTGGGTT TTGAGAAGGT 3420 

CAAAAGCAAT ACCAGAAGTA AAGGGAAATA TCAGACAATA TTTTATTATT TTTTCATAGA 3480 

TGTTCTGCCA CACAAAGAAC TTGGGGTGTA AGGATAAGGC AAAAGCTCCA ATCCCATTTT 3540 

TCAGTTCTCC TAGGATGCAC CCCTCAGGGA GCCTGGCCAG AGTTCCGAGG CCCGTGAGCG 3600 

TCAGCTGTTG CTTTATTTTC CATCAAAGCC CTCTGAGAAG TGAGACCTCA GCAATTCCGG 3660 

GAGCCACATA GAGACAGACT TGGCAAGGGA CCCCCTGGTT CTGAGCCAGT AGCTGCCATC 3720 

TGGAAATTCC TCTTTTAGCC TCTCCTTAGA GGTGAATGTG AATGAAGCCT CCCAGGCACC 3780 

CGCTGAATTT CTGAGGCCTT GCTTAAAGCT CAGAAGTGGT TTAGGCATTT GGAAAATCTG 3840 

GTTCACATCA TAAAGAACTT GATTTGAAAT GTTTTCTATA GAAACAAGTG CTAAGTGTAC 3900 

CGTATTATAC TTGATGTTGG TCATTTCTCA GTCCTATTTC TCAGTTCTAT TATTTTAGAA 3960 

CCTAGTCAGT TCTTTAAGAT TATAACTGGT CCTACATTAA AATAATGCTT CTCGATGTCA 4020 

GATTTTACCT GTTTGCTGCT GAGAACATCT CTGCCTAATT TACCAAAGCC AGACCTTCAG 4080 

TTCAACATGC TTCCTTAGCT TTTCATAGTT GTCTGACATT TCCATGAAAA CAAAGGAACC 4140 

AACTTTGTTT TAACCAAACT TTGTTTGGTT ACAGTTTTCA GGGGAGCGTT TCTTCCATGA 4200 

CACACAGCAA CATCCCAAAG AAATAAACAA GTGTGACAAA AAAAAAAAAA AACAAACCTA 4260 

AATGCTACTG TTCCAAAGAG CAACTTGATG GTTTTTTTTA ATACTGAGTG CAAAAGGTCA 4320 

CCCAAATTCC TATGATGAAA TTTTAAATTA ATGGGCACCT TTCAACATCA TTTGCTTCCT 4380 

TATCTACAGT TGATTCAGAA ATCTGCATTT TTTATTCTTT TATATGACTT TTAAGTAAAA 4440 

GATTTATATG GATTTGAAAA AAAAAAAAAA A 4471 

Seq ID NO: C27 Protein Sequence 
Protein Accession #: NP_005161.1 

1 11 21 31 41 51 

I I I I I I 

MDGGTLPRSA PPAPPVPVGC AARRRPASPE LLRCSRRRRP ATAETGGGAA AVARRNERER 60 

NRVKLVNLGF QALRQHVPHG GASKKLSKVE TLRSAVEYIR ALQRLLAEHD AVRNALAGGL 120 

RPQAVRPSAP RGPPGTTPVA ASPSRASSSP GRGGSSEPGS PRSAYSSDDS GCEGALSPAE 180 

RELLDFSSWL GGY 193 

Seq ID NO: C28 DNA Sequence 
Nucleic Acid Accession #: NM_017763 
Coding sequence: 169.. 2520 

1 11 21 31 41 51 

I I I I I I 

WO 03/042661 
AAAAAAAAAA AACTTTAGAG AAAGGAAGGG CCAAAACTAC GACTTGGCTT TCTGAAACGG 60 

AAGCATAAAT GTTCTTTTCC TCCATTTGTC TGGATCTGAG AACCTGCATT TGGTATTAGC 120 

TAGTGGAAGC AGTATGTATG GTTGAAGTGC ATTGCTGCAG CTGGTAGCAT GAGTGGTGGC 180 

CACCAGCTGC AGCTGGCTGC CCTCTGGCCC TGGCTGCTGA TGGCTACCCT GCAGGCAGGC 240 

TTTGGACGCA CAGGACTGGT ACTGGCAGCA GCGGTGGAGT CTGAAAGATC AGCAGAACAG 300 

AAAGCTGTTA TCAGAGTGAT CCCCTTGAAA ATGGACCCCA CAGGAAAACT GAATCTCACT 360 

TTGGAAGGTG TGTTTGCTGG TGTTGCTGAA ATAACTCCAG CAGAAGGAAA ATTAATGCAG 420 

TCCCACCCAC TGTACCTGTG CAATGCCAGT GATGACGACA ATCTGGAGCC TGGATTCATC 480 

AGCATCGTCA AGCTGGAGAG TCCTCGACGG GCCCCCCGCC CCTGCCTGTC ACTGGCTAGC 540 

AAGGCTCGGA TGGCGGGTGA GCGAGGAGCC AGTGCTGTCC TCTTTGACAT CACTGAGGAT 600 

CGAGCTGCTG CTGAGCAGCT GCAGCAGCCG CTGGGGCTGA CCTGGCCAGT GGTGTTGATC 660 

TGGGGTAATG ACGCTGAGAA GCTGATGGAG TTTGTGTACA AGAACCAAAA GGCCCATGTG 720 

AGGATTGAGC TGAAGGAGCC CCCGGCCTGG CCAGATTATG ATGTGTGGAT CCTAATGACA 780 

GTGGTGGGCA CCATCTTTGT GATCATCCTG GCTTCGGTGC TGCGCATCCG GTGCCGCCCC 840 

CGCCACAGCA GGCCGGATCC GCTTCAGCAG AGAACAGCCT GGGCCATCAG CCAGCTGGCC 900 

ACCAGGAGGT ACCAGGCCAG CTGCAGGCAG GCCCGGGGTG AGTGGCCAGA CTCAGGGAGC 960 

AGCTGCAGCT CAGCCCCTGT GTGTGCCATC TGTCTGGAGG AGTTCTCTGA GGGGCAGGAG 1020 

CTACGGGTCA TTTCCTGCCT CCATGAGTTC CATCGTAACT GTGTGGACCC CTGGTTACAT 1080 

CAGCATCGGA CTTGCCCCCT CTGCGTGTTC AACATCACAG AGGGAGATTC ATTTTCCCAG 1140 

TCCCTGGGAC CCTCTCGATC TTACCAAGAA CCAGGTCGAA GACTCCACCT CATTCGCCAG 1200 

CATCCCGGCC ATGCCCACTA CCACCTCCCT GCTGCCTACC TGTTGGGCCC TTCCCGGAGT 1260 

GCAGTGGCTC GGCCCCCACG ACCTGGTCCC TTCCTGCCAT CCCAGGAGCC AGGCATGGGC 1320 

CCTCGGCATC ACCGCTTCCC CAGAGCTGCA CATCCCCGGG CTCCAGGAGA GCAGCAGCGC 1380 

CTGGCAGGAG CCCAGCACCC CTATGCACAA GGCTGGGGAA TGAGCCACCT CCAATCCACC 1440 

TCACAGCACC CTGCTGCTTG CCCAGTGCCC CTACGCCGGG CCAGGCCCCC TGACAGCAGT 1500 

GGATCTGGAG AAAGCTATTG CACAGAACGC AGTGGGTACC TGGCAGATGG GCCAGCCAGT 1560 

GACTCCAGCT CAGGGCCCTG TCATGGCTCT TCCAGTGACT CTGTGGTCAA CTGCACGGAC 1620 

ATCAGCCTAC AGGGGGTCCA TGGCAGCAGT TCTACTTTCT GCAGCTCCCT AAGCAGTGAC 1680 

TTTGACCCCC TAGTGTACTG CAGCCCTAAA GGGGATCCCC AGCGAGTGGA CATGCAGCCT 1740 

AGTGTGACCT CTCGGCCTCG TTCCTTGGAC TCGGTGGTGC CCACAGGGGA AACCCAGGTT 1800 

TCCAGCCATG TCCACTACCA CCGCCACCGG CACCACCACT ACAAAAAGCG GTTCCAGTGG 1860 

CATGGCAGGA AGCCTGGCCC AGAAACCGGA GTCCCCCAGT CCAGGCCTCC TATTCCTCGG 1920 

ACACAGCCCC AGCCAGAGCC ACCTTCTCCT GATCAGCAAG TCACCGGATC CAACTCAGCA 1980 

GCCCCTTCGG GGCGGCTCTC TAACCCACAG TGCCCCAGGG CCCTCCCTGA GCCAGCCCCT 2040 

GGCCCAGTTG ACGCCTCCAG CATCTGCCCC AGTACCAGCA GTCTGTTCAA CTTGCAAAAA 2100 

TCCAGCCTCT CTGCCCGACA CCCACAGAGG AAAAGGCGGG GGGGTCCCTC CGAGCCCACC 2160 

CCTCGCTCTC GGCCCCAGGA TGCAACTGTG CACCCAGCTT GCCAGATTTT TCCCCATTAC 2220 

ACCCCCAGTG TGGCATATCC TTGGTCCCCA GAGGCACACC CCTTGATCTG TGGACCTCCA 2280 

GGCCTGGACA AGAGGCTGCT ACCAGAAACC CCAGGCCCCT GTTACTCAAA TTCACAGCCA 2340 

GTGTGGTTGT GCCTGACTCC TCGCCAGCCC CTGGAACCAC ATCCACCTGG GGAGGGGCCT 2400 

TCTGAATGGA GTTCTGACAC CGCAGAGGGC AGGCCATGCC CTTATCCGCA CTGCCAGGTG 2460 

CTGTCGGCCC AGCCTGGCTC AGAGGAGGAA CTCGAGGAGC TGTGTGAACA GGCTGTGTGA 2520 

GATGTTCAGG CCTAGCTCCA ACCAAGAGTG TGCTCCAGAT GTGTTTGGGC CCTACCTGGC 2580 

ACAGAGTCCT GCTCCTGGGA AAGGAAAGGA CCACAGCAAA CACCATTCTT TTTGCCGTAC 2640 
TTCCTAGAAG CACTGGAAGA GGACTGGTGA TGGTGGAGGG TGAGAGGGTG CCGTTTCCTG 2700 
CTCCAGCTCC AGACCTTGTC TGCAGAAAAC ATCTGCAGTG CAGCAAATCC ATGTCCAGCC 2760 
AGGCAACCAG CTGCTGCCTG TGGCGTGTGT GGGCTGGATC CCTTGAAGGC TGAGTTTTTG 2820 
AGGGCAGAAA GCTAGCTATG GGTAGCCAGG TGTTACAAAG GTGCTGCTCC TTCTCCAACC 2880 
CCTACTTGGT TTCCCTCACC CCAAGCCTCA TGTTCATACC AGCCAGTGGG TTCAGCAGAA 2940 
CGCATGACAC CTTATCACCT CCCTCCTTGG GTGAGCTCTG AACACCAGCT TTGGCCCCTC 3000 
CACAGTAAGG CTGCTACATC AGGGGCAACC CTGGCTCTAT CATTTTCCTT TTTTGCCAAA 3060 
AGGACCAGTA GCATAGGTGA GCCCTGAGCA CTAAAAGGAG GGGTCCCTGA AGCTTTCCCA 3120 
CTATAGTGTG GAGTTCTGTC CCTGAGGTGG GTACAGCAGC CTTGGTTCCT CTGGGGGTTG 3180 
AGAATAAGAA TAGTGGGGAG GGAAAAACTC CTCCTTGAAG ATTTCCTGTC TCAGAGTCCC 3240 
AGAGAGGTAG AAAGGAGGAA TTTCTGCTGG ACTTTATCTG GGCAGAGGAA GGATGGAATG 3300 
AAGGTAGAAA AGGCAGAATT ACAGCTGAGC GGGGACAACA AAGAGTTCTT CTCTGGGAAA 3360 
AGTTTTGTCT TAGAGCAAGG ATGGAAAATG GGGACAACAA AGGAAAAGCA AAGTGTGACC 3420 
CTTGGGTTTG GACAGCCCAG AGGCCCAGCT CCCCAGTATA AGCCATACAG GCCAGGGACC 3480 
CACAGGAGAG TGGATTAGAG CACAAGTCTG GCCTCACTGA GTGGACAAGA GCTGATGGGC 3540 
CTCATCAGGG TGACATTCAC CCCAGGGCAG CCTGACCACT CTTGGCCCCT CAGGCATTAT 3600 
CCCATTTGGA ATGTGAATGT GGTGGCAAAG TGGGCAGAGG ACCCCACCTG GGAACCTTTT 3660 
TCCCTCAGTT AGTGGGGAGA CTAGGACCTA GGTACCCACA TGGGTATTTA TATCTGAACC 3720 
AGACAGACGC TTGAATCAGG CACTATGTTA AGAAATATAT TTATTTGCTA ATATATTTAT 3780 
CCACAAAAAA AAAAAAAAAA AA 3802 



Seq ID NO: C29 Protein Sequence 
Protein Accession #: NP_004280.2 

1 11 21 31 41 51 

I I I I I I 

MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPEQ 60 

TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDEINL MSLATEDNFD PIDVSQLFDE 120 

PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHE SSSHHDLEGA VGGYYPEPSK 180 

LCHLDQSDSD FHGDLTFQHV FHNHTYHLQP TAPESTSEPF PWPGKSQKIR SRYLEDTDRN 240 

LSRDEQRAKA LHIPFSVDEI VGMPVDSFNS MLSRYYLTDL QVSLIRDIRR RGKNKVAAQN 300 

CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 360 

RPVNPNHYAL QCTHDGSILI VPKELVASGH KKETQKGKRK 400 

Seq ID NO: C30 DNA Sequence 

Nucleic Acid Accession #: NM_004442 
Coding sequence: 19.. 2982 

1 11 21 31 41 51 

I I I I I I 

GCCCCGGGAA GCGCAGCCAT GGCTCTGCGG AGGCTGGGGG CCGCGCTGCT GCTGCTGCCG 60 

CTGCTCGCCG CCGTGGAAGA AACGCTAATG GACTCCACTA CAGCGACTGC TGAGCTGGGC 120 

TGGATGGTGC ATCCTCCATC AGGGTGGGAA GAGGTGAGTG GCTACGATGA GAACATGAAC 180 

ACGATCCGCA CGTACCAGGT GTGCAACGTG TTTGAGTCAA GCCAGAACAA CTGGCTACGG 240 

ACCAAGTTTA TCCGGCGCCG TGGCGCCCAC CGCATCCACG TGGAGATGAA GTTTTCGGTG 300 

CGTGACTGCA GCAGCATCCC CAGCGTGCCT GGCTCCTGCA AGGAGACCTT CAACCTCTAT 360 

TACTATGAGG CTGACTTTGA CTCGGCCACC AAGACCTTCC CCAACTGGAT GGAGAATCCA 420 

TGGGTGAAGG TGGATACCAT TGCAGCCGAC GAGAGCTTCT CCCAGGTGGA CCTGGGTGGC 480 

CGCGTCATGA AAATCAACAC CGAGGTGCGG AGCTTCGGAC CTGTGTCCCG CAGCGGCTTC 540 

TACCTGGCCT TCCAGGACTA TGGCGGCTGC ATGTCCCTCA TCGCCGTGCG TGTCTTCTAC 600 

CGCAAGTGCC CCCGCATCAT CCAGAATGGC GCCATCTTCC AGGAAACCCT GTCGGGGGCT 660 

GAGAGCACAT CGCTGGTGGC TGCCCGGGGC AGCTGCATCG CCAATGCGGA AGAGGTGGAT 720 

GTACCCATCA AGCTCTACTG TAACGGGGAC GGCGAGTGGC TGGTGCCCAT CGGGCGCTGC 780 

ATGTGCAAAG CAGGCTTCGA GGCCGTTGAG AATGGCACCG TCTGCCGAGG TTGTCCATCT 840 

GGGACTTTCA AGGCCAACCA AGGGGATGAG GCCTGTACCC ACTGTCCCAT CAACAGCCGG 900 

ACCACTTCTG AAGGGGCCAC CAACTGTGTC TGCCGCAATG GCTACTACAG AGCAGACCTG 960 

GACCCCCTGG ACATGCCCTG CACAACCATC CCCTCCGCGC CCCAGGCTGT GATTTCCAGT 1020 

GTCAATGAGA CCTCCCTCAT GCTGGAGTGG ACCCCTCCCC GCGACTCCGG AGGCCGAGAG 1080 

GACCTCGTCT ACAACATCAT CTGCAAGAGC TGTGGCTCGG GCCGGGGTGC CTGCACCCGC 1140 

TGCGGGGACA ATGTACAGTA CGCACCACGC CAGCTAGGCC TGACCGAGCC ACGCATTTAC 1200 

ATCAGTGACC TGCTGGCCCA CACCCAGTAC ACCTTCGAGA TCCAGGCTGT GAACGGCGTT 1260 

ACTGACCAGA GCCCCTTCTC GCCTCAGTTC GCCTCTGTGA ACATCACCAC CAACCAGGCA 1320 

GCTCCATCGG CAGTGTCCAT CATGCATCAG GTGAGCCGCA CCGTGGACAG CATTACCCTG 1380 

TCGTGGTCCC AGCCAGACCA GCCCAATGGC GTGATCCTGG ACTATGAGCT GCAGTACTAT 1440 

GAGAAGGAGC TCAGTGAGTA CAACGCCACA GCCATAAAAA GCCCCACCAA CACGGTCACC 1500 

GTGCAGGGCC TCAAAGCCGG CGCCATCTAT GTCTTCCAGG TGCGGGCACG CACCGTGGCA 1560 

GGCTACGGGC GCTACAGCGG CAAGATGTAC TTCCAGACCA TGACAGAAGC CGAGTACCAG 1620 

ACAAGCATCC AGGAGAAGTT GCCACTCATC ATCGGCTCCT CGGCCGCTGG CCTGGTCTTC 1680 

CTCATTGCTG TGGTTGTCAT CGCCATCGTG TGTAACAGAA GACGGGGGTT TGAGCGTGCT 1740 

GACTCGGAGT ACACGGACAA GCTGCAACAC TACACCAGTG GCCACATGAC CCCAGGCATG 1800 

AAGATCTACA TCGATCCTTT CACCTACGAG GACCCCAACG AGGCAGTGCG GGAGTTTGCC 1860 

AAGGAAATTG ACATCTCCTG TGTCAAAATT GAGCAGGTGA TCGGAGCAGG GGAGTTTGGC 1920 

GAGGTCTGCA GTGGCCACCT GAAGCTGCCA GGCAAGAGAG AGATCTTTGT GGCCATCAAG 1980 

ACGCTCAAGT CGGGCTACAC GGAGAAGCAG CGCCGGGACT TCCTGAGCGA AGCCTCCATC 2040 

ATGGGCCAGT TCGACCATCC CAACGTCATC CACCTGGAGG GTGTCGTGAC CAAGAGCACA 2100 

CCTGTGATGA TCATCACCGA GTTCATGGAG AATGGCTCCC TGGACTCCTT TCTCCGGCAA 2160 

AACGATGGGC AGTTCACAGT CATCCAGCTG GTGGGCATGC TTCGGGGCAT CGCAGCTGGC 2220 

ATGAAGTACC TGGCAGACAT GAACTATGTT CACCGTGACC TGGCTGCCCG CAACATCCTC 2280 

GTCAACAGCA ACCTGGTCTG CAAGGTGTCG GACTTTGGGC TCTCACGCTT TCTAGAGGAC 2340 

GATACCTCAG ACCCCACCTA CACCAGTGCC CTGGGCGGAA AGATCCCCAT CCGCTGGACA 2400 

GCCCCGGAAG CCATCCAGTA CCGGAAGTTC ACCTCGGCCA GTGATGTGTG GAGCTACGGC 2460 

ATTGTCATGT GGGAGGTGAT GTCCTATGGG GAGCGGCCCT ACTGGGACAT GACCAACCAG 2520 

GATGTAATCA ATGCCATTGA GCAGGACTAT CGGCTGCCAC CGCCCATGGA CTGCCCGAGC 2580 

GCCCTGCACC AACTCATGCT GGACTGTTGG CAGAAGGACC GCAACCACCG GCCCAAGTTC 2640 

GGCCAAATTG TCAACACGCT AGACAAGATG ATCCGCAATC CCAACAGCCT CAAAGCCATG 2700 

GCGCCCCTCT CCTCTGGCAT CAACCTGCCG CTGCTGGACC GCACGATCCC CGACTACACC 2760 

AGCTTTAACA CGGTGGACGA GTGGCTGGAG GCCATCAAGA TGGGGCAGTA CAAGGAGAGC 2820 

TTCGCCAATG CCGGCTTCAC CTCCTTTGAC GTCGTGTCTC AGATGATGAT GGAGGACATT 2880 

CTCCGGGTTG GGCTCACTTT GGCTGGCCAC CAGAAAAAAA TCCTGAACAG TATCCAGGTG 2940 

ATGCGGGCGC AGATGAACCA GATTCAGTCT GTGGAGGTTT GACATTCACC TGCCTCGGCT 3000 

CACCTCTTCC TCCAAGCCCC GCCCCCTCTG CCCCACGTGC CGGCCCTCCT GGTGCTCTAT 3060 

CCACTGCAGG GCCAGCCACT CGCCAGGAGG CCACGGGCCA CGGGAAGAAC CAAGCGGTGC 3120 

CAGCCACGAG ACGTCACCAA GAAAACATGC AACTCAAACG ACGGAAAAAA AAAGGGAATG 3180 

GGAAAAAAGA AAACAGATCC TGGGAGGGGG CGGGAAATAC AAGGAATATT TTTTAAAGAG 3240 

GATTCTCATA AGGAAAGCAA TGACTGTTCT TGCGGGGGAT AAAAAAGGGC TTGGGAGATT 3300 

CATGCGATGT GTCCAATCGG AGACAAAAGC AGTTTCTCTC CAACTCCCTC TGGGAAGGTG 3360 

ACCTGGCCAG AGCCAAGAAA CACTTTCAGA AAAACAAATG TGAAGGGGAG AGACAGGGGC 3420 

CGCCCTTGGC TCCTGTCCCT GCTGCTCCTC TAGGCCTCAC TCAACAACCA AGCGCCTGGA 3480 
GGACGGGACA GATGGACAGA CAGCCACCCT GAGAACCCCT CTGGGAAAAT CTATTCCTGC 3540 
CACCACTGGG CAAACAGAAG AATTTTTCTG TCTTTGGAGA GTATTTTAGA AACTCCAATG 3600 
AAAGACACTG TTTCTCCTGT TGGCTCACAG GGCTGAAAGG GGCTTTTGTC CTCCTGGGTC 3660 
AGGGAGAACG CGGGGACCCC AGAAAGGTCA GCCTTCCTGA GGATGGGCAA CCCCAGGTCT 3720 
GCAGCTCCAG GTACATATCA CGCGCACAGC CTGGCAGCCT GGCCCTCCTG GTGCCCACTC 3780 
CCGCCAGCCC CTGCCTCGAG GACTGATACT GCAGTGACTG CCGTCAGCTC CGACTGCCGC 3840 
TGAGAAGGGT TGATCCTGCA TCTGGGTTTG TTTACAGCAA TTCCTGGACT CGGGGGTATT 3900 
TTGGTCACAG GGTGGTTTTG GTTTAGGGGG TTTGTTTGTT GGGTTGTTTT TTGTTTTTTG 3960 
GTTTTTTTTA ATGACAATGA AGTGACACTT TGACATTTCC TACCTTTTGA GGACTTGATC 4020 
CTTCTCCAGG AAGAAGGTGC TTTCTGCTTA CTGACTTAGG CAATACACCA AGGGCGANAT 4080 
TTTATATGCA CATTTCTGGA TTTTTTTATA CGGTTTTCAT TGACACTCTT CCCTCCTCCC 4140 
ACCTGCCACC AGGCCTCACC AAAGCCCACT GCCATGGGGC CATCTGGGCC ATTCAGAGAC 4200 
TGGAGTGAGA TTTGGGTGTG GAGGGGGAGG CGCCAAGGTG GAGGAGCTTC CCACTCCAGG 4260 
ACTGTTGATG AAAGGGACAG ATTGAGGAGG AAGTGGGCTC TGAGGCTGCA GGGCTGGAAG 4320 
TCCTTGCCCA CTTCCCACTC TCCTGCCCCA ATCTATCTAG TACTTCCCAG GCAAATAGGC 4380 
CCCTTTGAGG CTCCTGAGTG CCCTCAGATG GTCAAAACCC AGTTTTCCCT CTGGGAGCCT 4440 
AAACCAGGCT GCATCGGAGG CCAGGACCCG GATCATTCAC TGTGATACCC TGCCCTCCAG 4500 
AGGGTGCGCT CAGAGACACG GGCAAGCATG CCTCTTCCCT TCCCTGGAGA GAAAGTGTGT 4560 
GATTTCTCTC CCACCTCCTT CCCCCCACCA GACCTTTGCT GGGCCTAAAG GTCTTGGCCA 4620 
TGGGGACGCC CTCAGTCTAG GGATCTGGCC ACAGACTCCC TCCTGTGAAC CAACACAGAC 4680 
ACCCAAGCAG AGCAATCAGT TAGTGAATTG GAATTCCCCA AGTCTTTGCT ATTGTGAATA 4740 
GTGCTGCAAT AAACATACGT GTGCATGTGT CTTTATAGTA GAATGATCTA TAATCCTCTG 4800 
GGTATGTACC CAGTAATGGG ATTGCTGGGT CAAATGGTTT TTCTGGTTCT AGATCCTTGA 4860 
GGAATTGCCA CACTGTCTTT CACAATGGTT GAACTAATTT ACACTCCTAC CAACAGTGTA 4920 
AAAGTCTTCC TGTTTCTCCA CATCCTCTCC AGCATCTGTT GTTTCCTGAC TTTTTAATGA 4980 
TTGCCATTCT 
TGACCAGTGA 
TGAGAAGTGT 
TAAACTTAAT 
ACAAAGCGTC 
GCTGAAAATG 
GAGAACTTCT 
CCCAAAGATA 
TTGAAATGAA 
GGAAGCCCAT 
GGGGACCAAT 
CCAAACTAAG 
GAGATTTTGT 
AAAAGAAAAA 
ATGAAGAAAC 
AGATTCACAC 
ACGCAGACTG 
CCATCTCACG 



AACTGGTGTG 
AGATGAGCTT 
CTGTTCATAA 
GAAATAAAGC 
CAAGAAATAT 
ACAGGGAGAA 
CCAACCTAGC 
TTTCTCGAGA 
GGAAAAAATG 
CAGACTAACA 
ATTCAACATT 
CTTCATAAGC 
CACCACCAGG 
CTGGTGCCAG 
CGTGTCAACT 
ATAACAATAT 
GCAAATTGGT 
TGCAAAGACA 



AGATGGTATC 
TTTTTCATAT 
CCTTCACCAA 
ATGAAGACAA 
GGGACTATGT 
TGAAACCAAG 
AAGACAGACC 
AGAGCAACCC 
CTAAGGGCAG 
GCAGATCTCT 
CACAAAGAAA 
AAAGGAGAAA 
CCTGCCTTAC 
CCACTGCAAA 
AATGGGCAAA 
TAACCTTATA 
TAAAGAGTCA 
CACATAGGCT 



TCATTGTGGG 



TTTTTGATGA 
GATTAGAAGA 
GACAAGAACA 
TTGGAAAACA 
AACGTTCAAA 
GAAGACACAT 
CCAGAGAGAA 
CTGCAGAAAC 
AGAATTTTCA 
TAAAATCCTT 
AAGACATCCT 
AAATACCAAA 
ATAACCAGCT 
TCTAAATGGG 
AGACTCATTG 
CAGAGTAAAA 



TTTGATTTGC 
CACATGTTTG 
GGTTGTTTGT 
AAAAGAATGA 
AACTTACGTT 
CTCTTCAGGA 
TTCAGGAAAT 
AATTGTCAGA 
AGGTCAGGTT 
CCTACAACCT 
ATCCAGAATT 
TACAGACAAG 
GAAGGAAGCA 
TTGTAGAGAC 
AGTATCATAA 
CTAAATCCCC 
GTGTGCCGTA 
GGGATACAGG 



ATTTCTCTAA 
TTGTTTCTTT 
TCTTTTCTTG 
AAAGGAACAA 
TGACTGGTGT 
TATTATCCAG 
ACAGAGAACA 
TTCACCAAGG 
ACTCACAAAA 
AGAAGAGAGT 
TCATATCCAG 
CAAATCCTGG 
CTAAATATGG 
CATTGACACT 
TGACAGGATC 
CAATTAAAAG 
TTCAGGAGAC 
GGAATTC 



5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6057 



Seq ID NO: C31 DNA Sequence 

Nucleic Acid Accession #: NM_031942.1 

Coding sequence: 145.. 1260 

1 11 21 31 41 51 

I I I I I I 

CCCGAGCCCC GCCCCTCCGG GCCCGGGTCG GCGCGCCCAG CCTGCCAGCC GCGCTGCTGC 
TGCTCCTCCT GCTGTGGGAC CGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 
CCGATCTGGG CACCCGCCAC CAGCATGGAC GCTCGCCGCG TGCCGCAGAA AGATCTCAGA 
GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 
TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 
TCAGTTCGGG AAGGCTGTAG GACCCGCAGC CAGTGCAGGC ACTCTGGACC TCTCAGGGTG 
GCGATGAAGT TTCCAGCGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 
CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 
AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 
ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 
GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 
CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 
GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 
ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 
GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 
GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 
TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 
CGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG GTGAAGAGGT CAGGGATGCT 
CTGCTGGATC CGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 
CGGCAGCGAG ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 
TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 
TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 
TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 
AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 
TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 
TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 
TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 
GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 
CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 
TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 
CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 
CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 
AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 
ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 
TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 
TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 
CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 
GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 
CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 
AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 
GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 

Seq ID NO: C32 DNA Sequence 

Nucleic Acid Accession #: NM_012445.1 

Coding sequence: 276.. 1271 



GCACGAGGGA 
GCGGCAGCCC 
TCTCGCTGGA 
GGCCCGGGGC 
GCTGCCGGCC 
CCCTGGGCAA 
TTGGGGGAGA 
GCAAGTGGAG 
GGTCTTCGCT 
ACGTCAGTAA 



11 
I 

AGAGGGTGAT 
CCGCCGCCCC 
GGCCAGGCCG 
GCCGGCCTCG 
GCGCTCCCGC 
GGCCCTCTGC 
GTCCATCTGT 
CCAGACGGCC 
GCTGGGGGCC 
CGGGCTGCGC 



21 

I 

CCGACCCGGG 
CGCAGCCCCT 
TGCAGCATCG 
GGCTTAAATA 
TGCTCCTGCC 
GCTCTCCTCC 
TCCGCCAGAG 
TTCCCCAAGC 
GCGCATAGCT 
GACTTTGCGG 



31 
I 

GAAGGTCGCT 
TCTCCTCCTT 
AAGACAGGAG 
GGAGCTCCGG 
GGGTGATGGA 
TGGCCACTCT 
CCCCGGCCAA 
AGTACCCCCT 
CCGACTACAG 
AGCGCGGCGA 



41 
I 

GGGCAGGGCG 
TCTCCCACGT 
GAACTGGAGC 
GCTCTGGCTG 
AAACCCCAGC 
CGGCGCCGCC 
ATACAGCATC 
GTTCCGCCCC 
CATGTGGAGG 
GGCCTGGGCG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2510 



51 
I 

AGTTGGGAAA 
CCTATCTGCC 
CTCATTGGCC 
GGACCCGACC 
CCGGCCGCCG 
GGCCAGCCTC 
ACCTTCACGG 
CCTGCGCAGT 
AAGAACCAGT 
CTGATGAAGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
AGATCGAGGC 
TCCCCAGCGG 
TCTCGTTTGT 
ACCTGTGCGA 
CCGGGACGGA 
CGGTGACCGA 
GGCTGAAGGC 
GGGCCTTCAT 
CCTCAGTTCC 
GCGGAGGCCA 
CCGCCAACAA 
ACTGCGTCTA 
GGCTCCTGTG 
GACCGCGGTG 
GGCATTGGGA 
TCTGCTCTCA 
AATTATGGTC 
CCTGGCTCCC 
CTCTCCCGAG 
GGAAGCGTCA 
TGCTCAC 



GGCGGGGGAG 
CACCGGGCAG 
GGTGCGCATC 
CGGGGACCGT 
CAGCGGCTTC 
GATAACGTCC 
CCTGCCTCCC 
CCCTCCCGCC 
AGAAACGCCG 
CTGTGGGAGG 
CGGGAGCCCC 
AGACCAGAGC 
CAGGCTCATG 
AGGCCGCGCC 
AACAGCCTCC 
GCCTCCTCCT 
TCCTTATAAG 
ACGTGGTTGC 
GGCGCATCCA 
GTGTTTCCAT 



GCGCTGCAGA 
ACGTCGGCGG 
GTGCCCAGCC 
TGGCGGGAAC 
ACCTTCTCCT 
TCXTCTCCCA 
ATCGCCAGGG 
CCAGTCCTGC 
CTGGACTGCG 
CTCGGGACCA 
TGCCCCGAGC 
CCCGCAGCCC 
CTGCAGGCGG 
GACCATCTCT 
TCCTTTCCCA 
CCTGCAGGAT 
TTATTGCTGC 
AGATACCTCA 
AGCGGGGGCC 
GTTATGGATC 



GCGTGCACGC 
AGCTGGAGGT 
CCGACTGGTT 
AGGCGGCGCT 
CCCCCAACTT 
GCCACCCGGC 
TGACACTGGT 
CCAGCAGGGA 
AGGTCTCCCT 
AGAGCAGGAC 
TCGAAGAAGA 
CTGGGGCCCC 
CCGAGGCACA 
GCACTGAAGG 
ACCTTGCTTC 
AAAGTCATCC 
TCCAGGAGAT 
GACCTGGTGC 
ACTTGAGAAG 
TCTCTGCGTT 



GCAGCGCAGG 
CGTGGGCGTG 
GGACCTGTAC 
CGCCACCATC 
CAACTCCTTC 
GCGGCTGCGA 
CAATGAGATT 
GTGGTCGTCC 
TCGCTACGTC 
GGCTGAGTGC 
CGGAGCCATG 
GGGGGTTTCG 
GCCCTCTGGT 
TTAGGGGCCC 
CCAAGGCTCC 
TGTCCTTCAT 
TCTAGGCTGT 
TGAATAAATG 
TGAATAAAGA 



GCGCCCGCCG 
CACTCGCTGG 
GACAGCCTGG 
CCCTACGACG 
CCGCAGGACA 
TACTACCCGC 
CAGAGCCCCA 
GTAGACAGCG 
TGGGGACTGT 
CGGGTCCAGC 
GTCCCTGATA 
GGGTGTCGGG 
CGCTGCTCCT 
GGCCGGCACG 

AGCTACTCTA 
CGTCCAGGGG 
GCTGAGCCCA 



CTATCTCTGT 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1807 



Seq ID NO: C33 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1314 



ATGTTACAGG 
AAACCCCGTA 
CTGAGCCTGG 
TACTTCCTCT 
CTGGACTGTC 
GCAGTGGCAG 
GGGAACTGGT 
AGGCAGATGG 
GATCTGGATG 

AAGACCCCCC 
AGCATCCAGT 
CTCACGGCAG 
GGCTCAGACA 
TTCAACCCCA 
ACTTTCTCAG 
GCCACCCCAC 
GACATACTGC 
GCGTACCAGG 
GACACCTGCC 
GTGGGCATCG 
AAGGTCTCAG 



11 
I 

ATCCTGACAG 
TCCCCATGGA 
CGAGTATCAT 
GCGGGCAGCC 
CCTTGGGGGA 
TCCGCCTCTC 
TCTCTGCCTG 
GCTACAGCAG 
TTGTTGAAAT 
TCTCAGGCTC 
GTGTGGTGGG 
ACGACAAACA 
CCCACTGCTT 
AACTGGGCAG 
TGTACCCCAA 
GCACAGTCAG 
TCTGGATCAT 
TGCAGGCGTC 
GGGAAGTCAC 
AGGGTGACAG 
TTAGCTGGGG 
CCTATCTCAA 



21 
I 

TGATCAACCT 
GACCTTCAGA 
CATTGTGGTT 
TCTCCACTTC 
GGACGAGGAG 
CAAGGACCGA 
TTTCGACAAC 
CAAACCCACT 
CACAGAAAAC 

TGGGGAGGAG 
GCACGTCTGT 
CAGGAAACAT 
CTTCCCATCC 
AGACAATGAC 
GCCCATCTGT 
TGGATGGGGC 
AGTCCAGGTC 
CGAGAAGATG 

CTATGGCTGC 
CTGGATCTAC 




TTTACGAAGC 
ATTGACAGCA 
ATGTGTGCAG 
CTGATGTACC 
GGGGGCCCGA 
AATGTCTGGA 



41 
I 

TGGATGTCAA 
TCCCCATCAT 
AGGTGATTCT 
AGCAGCTGTG 
AGAGCTTCCC 
AGGTGCTGGA 
CTCTCGCTGA 
TGGAGATTGG 
TTCGCATGCG 
TTGCCTGTGG 
ATTCTTGGCC 
TCCTGGACCC 
TCAACTGGAA 
CCAAGATCAT 
TGAAGCTGCA 
TTGATGAGGA 
AGAATGGAGG 
CACGGTGCAA 
GCATCCCGGA 
AATCTGACCA 
GCACCCCAGG 
AGGCTGAGCT 



Seq ID NO: C34 DNA Sequence 

Nucleic Acid Accession #: NM_003045.1

Coding sequence: 148..2037



CGATCCTGCC 
CTGAGACATC 
CGTCATATTC 
CAGATGCTGC 
CTGAACACTT 
GTCCTGGCTG 
ATCGCTGCGC 
CCCAAGACGG 
ATCACCGGCT 
TGGAGCGCCA 
ATGACTCTGA 
ATTCTCATCT 
ATATTCACTT 
GGATCGGTTA 
TGTTTGAACA 
TTCTCTGGTG 
ATCGCCACCA 
GCGTCCCTCT 
ATGCCCTACT 
TGGGAAGGTG 
CTAGGTTCCA 
TTTAAATTCT 
TCGGGTGCCG 
ATGTCCATTG 
TACCAGCCAG 
CCAGCAGACC 
GCAGAGATGT 



11 
I 

GGAGCCCCGC 
TTTGCTGCAA 
CAGCTCTGAA 
GGCGGAAGGT 
TTGATCTGGT 
GAGCTGTGGC 
TGGCCTCAGT 
GCTCAGCTTA 
GGAACTTAAT 
CCTTCGACGA 
ACGCCCCCGG 
TGACAGGACT 
GTATTAACGT 
AAAACTGGCA 
ATGACACAAA 
TCCTGTCGGG 
CAGGTGAAGA 
TGATCTGCTT 
TCTGCCTGGA 
CCAAGTACGC 
TGTTTCCCAT 
TAGCCAACGT 
TTGCTGCTGT 
GCACTCTCCT 
AGCAGCCTAA 
AAAATGAATT 
TCTCTTTGAA 



21 

i 

CGCCGCCGGC 
GATCGAGGCT 
CAGCAACATG 
GGTGGACTGT 
GGCCCTCGGG 
CCGTGAGAAT 
GCTGGCTGGC 
CCTCTACAGC 
CCTCTCCTAC 
GCTGATAGGC 
CGTGCTGGCT 
TTTAACTCTT 
CCTGGTCCTG 
GCTCACGGAG 
AGAAGGGAAG 
GGCAGCGACT 
GGTGAAGAAC 
CATCGCCTAC 
CAATAACAGC 
AGTGGCCGTG 
GCCTCGGGTT 
CAATGATAGG 
GATGGCCTTC 
GGCTTACTCG 
CCTGGTATAC 
GGCAAGCACC 
AACCATACTC 



31 
I 

TTGGATTCTG 
GTCCTCTGGT 
GGGTGCAAAG 
AGCCGGGAGG 
GTGGGCAGCA 
GCAGGCCCTG 
CTGTGCTATG 
TATGTCACCG 
ATCATCGGTA 
AGACCCATCG 
GAAAACCCCG 
GGTGTGAAAG 
GGCTTCATAA 
GAGGATTTTG 
CCCGGTGTTG 
TGCTTCTATG 
CCACAGAAGG 
TTTGGGGTGT 
CCCCTGCCCG 
GGCTCCCTCT 
ATCTATGCCA 
ACCAAAACAC 
CTCTTTGACC 
TTGGTGGCTG 
CAGATGGCCA 
AATGATTCCC 
TCACCCAAAA 




CCATCCCCGT 
CGGCTGCCCT 
ACGCCTTTAA 
GCGCTCTTTC 
TGGCTGAGGA 
CAATAATCGC 
TGAAGGACTT 
CCTGTQTGTT 
GTACTTCCGA 
AGCTGGGGTT 
ACATGGAGCC 



51 
I 

ACCCCTGCGC 
CATAGCACTA 
GGATAAATAC 
TGACGGAGAG 
CGAAGGGCCT 
CTCGGCCACA 
GACAGCCTGT 
CCCAGACCAG 
GAACTCAAGT 
GAAGAGCCTG 
TTGGCAGGTC 
CCACTGGGTC 
GGTGCGGGCA 
CATCATTGAA 
GTTCCCACTC 
GCTCACTCCA 
GAAGATGTCT 
TGCAGACGAT 
AGGGGGTGTG 
GTGGCATGTG 
AGTATACACC 
GTAA 



51 
I 

TGTATCCCTC 
TGAGGCTTCC 
CATTGGGCAG 
GTCTCGCTGC 
TGGTGTCTAC 
CTCCTTCCTG 
TGCTCGGGTC 
CTGGGCCTTC 
AGCGAGGGCC 
ACGGACACAC 
AGTGATCATA 
GGTCAACAAA 
ATTTGTGAAA 
AGGCCGTCTC 
GCCCTTCGGG 
CTTTGACTGC 
GGGGATCGTG 
CACGCTCATG 
GCACGTGGGC 
CGCCAGTCTT 
TGGACTGCTA 
CACATTAGCC 
GGTGGACCTC 
GGTCTTACGG 
CGAGTTAGAT 
TTTACCAGAG 
TTCCAAAATC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1314 
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1218 
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5 




TCTGGGCTAA 
ATTGTGACCG 
CTCGCAGGGT 
AGCAAGACCA 
TTCGTGAACG 
TGGATGCTGA 
TCCCTGGATG 
ACAGCCCCGC 
CACCCCACCC 



TTGTGAACAT 
TGCTTGGAAG 
CTGCCCTCCT 
AGCTCTCATT 
TCTATCTCAT 
TAGGCTTCAT 
CCGACCAAGC 
CCCCCGGAGG 
TCCCCACCAG 



TTCAACCAGC 
GGAGGCTCTC 
CTGTGCCGTG 
TAAGGTTCCC 
GATGCAGCTG 
CATCTACTTT 
AAGGACTCCT 
TGGCAGCAGC 
TGCAACAGAA 



CTTATAGCTG 
ACCAAAGGGG 
GTCACGGGCG 
TTCCTGCCAG 
GACCAGGGCA 
GGCTATGGCC 
GACGGCAACT 
CCCGAGGGAC 
ACCACCTGCG 



TTCTCATCAT 
CGCTGTGGGC 
TCATCTGGAG 
TGCTCCCCAT 
CCTGGGTCCG 
TGTGGCACAG 
TGGACCAGTG 
GCCCCCAGAG 
TCCACACCCT 



CACCTTCTGC 
AGTCTTTCTG 
GCAGCCCGAG 
CCTGAGCATC 
GTTTGCTGTG 
CGAGGAGGCG 
CAAGTGACGC 
GACCGGGAGG 
CACTGCA 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2157



Seq ID NO: C35 DNA Sequence 

Nucleic Acid Accession #: NM_002776.1 

Coding sequence: 82..912



1 
I 

ACCAGCGGCA 
GGCGACTCCC 
GCCCGGGCTC 
GCGCTGCTCC 
CGCGGCTCGC 
GTCCTGGTGG 
GCTCGAGTAG 
CGCTCTGTTG 
GATGAGCACG 
CGGGCCCTGC 
TGGGGCACCA 
ACTATCCTGA 
ATATGTGCTG 
GTCTGTGACG 
CAGCATCCAG 
CGCTCCAACT 
CAGATGCCCA 
TGTCTGCACT 
CATTCCCCCA 
CAAAGGTTTA 
CTGGGGTCAC 
AGTGCCCTCT 
TCTTAGACAT 
GTCATGTAAG 
AAAAAAAAAA 



11 
I 

GACCACAGGC 
AGATCCTGGC 
TGGOGAAGCT 
CCCAAAACGA 
AGCCCTGGCA 
ACCAGAGTTG 
GGGATGATCA 
TCCATCCCAA 
ATCTCATGTT 
AGCTTCCCTA 
CGGCCGCCCG 
GCCCTAAAGA 
GACTGGACCG 
AGACCCTCCA 
CTGTCTACAC 
GATCCAGATG 
GAGGCTCCAT 
GTTCAAACCT 
CCTATCCCCA 
TTCCAGAGAA 
CCAACCTGAC 
CTGAACCTCA 
GTTGTGAGGA 
GCTTAACACA 
AAAA 



21 
I 

AGGGCAGAGG 
CATGAGAGCT 
GCTGCCGCTG 
CACGCGCTTG 
GGTCTCGCTC 
GGTGCTGACG 
CCTGCTGCTT 
GTACCACCAG 
GCTAAAGCTG 
CCGCTGTGCT 
GAGAGTGAAG 
GTGTGAGGTC 
GGGCCAGGAC 
AGGCATCCTC 
CCAGATCTGC 
CTACGCTCCA 
CGTCCATCCT 
CTGCCGCCCT 
TTCTCTGCCT 
GCCAGGAAGC 
TTCCTCTGCC 
GTTTCCTCAT 
GACTATGATA 
GTGGGTGGTG 




CCACACCTCT 
GTACTGAAGC 
CGGTCATCAC 
ACTCCCCGCT 
CTGGAAAATG 
TAACATGTGT 
AGTTCTGACT 



41 
I 

TCCCCTCCCT 
ACCTCTCCGC 
AACTCTGGGC 
CCTATGGCGC 
TCTCGTTCCA 
GCGGAAACAA 
AGCAGCTCCG 
CCATCCTGCC 
TAGTGCCGGG 
ACCAGTGCCA 
GCCTGACCTG 
GCGTGGTCAC 
GTGACTCTGG 
TTTACCCCTG 
CCTGGATCAA 
ATGTTATGCT 
AGTCGGCTGA 
AAACATCTCC 
TGAAATGCAG 
CCAGCCTCTG 
GTGTGACTTT 
GGAACAATGA 
ATGTAAATCT 
AAAGGTTACC 



Seq ID NO: C36 DNA Sequence 
Nucleic Acid Accession #: XM_095088
Coding sequence: 1..4074 



51 

I 

CCTTCCTATC 
CGCCTCTGGC 
CGCAGAGGCG 
CCCGTGCGCG 
CTGGGCGGGT 
GCCACTGTGG 
CCGGACGACT 
AAGGCGAACG 
GCCCCGCGTC 
GGTTGCTGGC 
CTCCAGCATC 
CAACAACATG 
AGGCCCCCTG 
TGGCTCTGCC 
TAAAGTCATA 
CCTGCTGATC 
ACTCTCCCCT
CCTCTCACCT 
GAAGTGGTGG 
AGAGCAGTTA 
GGGCAAGCCA 
CGTGCCTACC 
TCATGTGATT 
TGTTGTCGTG 



51 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1454 



1 11 21 31 41 51

| | | | | |

ATGACCCGGG CCGCGACAGC CGAGCCGGGC CGCGTGTCTC CCGCCTCGCC CGCGCGCAGC 60 

ACGGCGGGCC TGCCGCGCGC CTTCCTGCAG AGCCTGCGCA CCCTGTTAGA CATCCTGGAC 120 

GACTGGCAGC GCGGCTGCGT GCACCTGCGA GAGATCCAGT CCCTCTGGGT CGAAGCGCGG 180 

GAGCTGCCAA GCGGGGTGCT GGAGGGCCTG AGCCAGCGGC GCGGGCCGCA GCCGGGAGCA 240 

GCTGTACGCT CCCGCCGAGG CGGCGCAGTG CCCCGCGGGG CCCGAGCGGT TCCAGAGCGC 300 

TGCGCTGGAA CGGAGACCCG GCGCGGACGC AGGTGCAGCG GCCTGCAGCG CCTTGGAGGC 360 

GGATTCAGGG GATGCCCGGC GGACCCCTGT GCCCGAGGGG AACACCGGAG GCACACCATC 420 

ACCAGGGGCG TGGACTGOGG CCTGCTGAAG CAGATGAAGG AGCTGGAGCA GGAGAAGGAG 480 

GTGCTGCTAC AGGGTTTGGA GATGATGGCG CAGGGCCGCG ATTGGTACCA GCAGCAGCTG 540 

CAACAAGTGC AGGAGCGCCA GTGCCGCCTG GGCCAGAGCA GAGCCAGCGC CGACTTTGGG 600 

GCCGTGGGGA GCCCTCGCCC ACTGGGACGG CTACTGCCCA AGGTACAGGA GGTGGCCCGG 660 

TGGCTGGGGG AGCTGCTGGC TGAGGCCTGT GCCGGTCGGG CCCTGCCCAC ATCCTCCTCC 720 

GGGCCCCCTT GCTCTGCCCT GACGTCCACC TCGTCCCCGG GCTGGCAGCA GCAGATCATC 780 

CTCATGCTGA AGGAGCAGAA CCGACTCCTC ACCCAGGAGG TGACCGAGAA GAGTGAGCGC 840 

ATCACGCAGC TGGAGCAGAA GTCGGCGCTC ATTAAGCAGC TATTTGAGGC CCGCGCCCTG 900 

AGCCAGCAGG ATGGGGGCCT GTCCCCGGCT GGCCCCCACA TTGAGCCCTT GACTCGGTTT 960 

CGGCTTCCGG TGCTGACATG GGCTGGGGCT CTCTTGAGTC CGCATAGTCC GCAGCTACTA 1020 

CTGCCGCTGT CAGCGGACAG TGGGGGACCC CTCCACGAGT TACCAGATAC CTGGTTTCCA 1080 

GCGGTGCTGC TTTGGGTCCC ATCTCCAGGG AAAAGAACTG CTCACGCCAG GCTGCACTTC 1140 

CACCAAAGGC CAGCAGAGGG CGCGTGGCAG CTCGGATGCG GCGCTGAGGC TGCGCCCGAG 1200 

ACCTGCGGGA CGCTGCCCCA CTTTGAGTCC CACAAAACAA CCTGTGAGCC TGACTCCCTC 1260 

GGAGGGCCCT GTCCCCAGGA GGGGGATCGC AGCTGGAGCC ACCTGGGCGC AGCGTTTGAT 1320 

GTGGCACCTG CAGTGGCCAA AGTGACACCC AACCGTGAGG ACGCTGCAGG GAGTCGGCAC 1380 

GGAGACATCT GTCCCCTCTG TCCCAAGGGA CTGTTGACAT TCAGAGACAT AGCTATAGAA 1440 

TTCTCTCTGG CGGAGTGGCA ATGCCTGGAT CATGCTCAGC AGAATTTATA TAGAGATGTG 1500 

ATGTTAGAGA ACTACAGAAA CCTGTTCTCC CTGGGTATGA CTGTCTCTAA GCCAGACTTG 1560 

ATCGCCTGTC TGGAGCAAAA TAAAGAGCCC CAGAATATAA AGAGAAATGA GATGGCAGCC 1620 

AAACACCCAG TTACATGTTC TCATTTCAAC CAAGACCTTC AGCCAGAGCA GAGCATAAAA 1680 

GATTCACTCC AAAAAGTAAT ACCAAGAACA TATGGAAAAT GTGGACATGA GAATTTACAA 1740 

TTAAAAAAAT GTTGTAAAAG AGTAGATGAG TGTGAGGTGC ACAAAGGAGG TTATAATGAC 1800 

CTTAACCAAT GTTTGTCAAA TACCCAAAAC AAAATATTTC AGACTCATAA ATGTGTCAAA 1860 

GTCTTCAGTA AATTTTCAAA TTCCAATAGA CACAATGCAA GATATACTGG AAAGAAACAT 1920 

TTGAAATGTA AAAAATATGG CAAATCATTT TGCATGTTTT CACACCTAAA TCAACATCAG 1980 

ATAATTCATA CTAAGGAGAA GTCCTACAAA TGTGAAGAAT GTGGCAAATC CTTTAACCAC 2040 

TCCTCAAGCG GTACTACACA TAAAAGAATT CTTACTGGAG AGAAACCCTA CAGATGTGAG 2100 

GAATGTGGCA AAGCCTTTAG GTGGCCCTCA AACCTTACTA GACATAAGAG AATTCACACT 2160 




WO 03/042661 PCT/US02/36810 






GGAGAGAAAC CCTACGCATG TGAAGAATGT GGCCAAGCCT TTAGGCGCTC CTCAACACTT 2220 

ACTAACCACA AGAGAATTCA TACTGGAGAG AGACCCTACA AATGTGAAGA ATGTGGCAAA 2280 

GCCTTTAGCG TATCCTCAGC CCTCATTTAC CACAAGAGAA TTCATACTGG AGAGAAACCC 2340 

TACACATGTG AAGAATGTGG CAAAGCCTTT AACTGCTCCT CGACTCTTAA GACACATAAG 2400 

ATAATTCATA CTGGAGAGAA ACCCTACACA TGTGAAGAAT GTGGCAGAAC CTTTAACTGC 2460 

TCCTCAACTG TAAAGGCACA TAAGAGAATT CATACTGGAG AGAAACCATA CAAATGTGAA 2520 

GAATGTGACA AAGCTTTTAA GTGGCATTCA AGTCTTGCTA AACATAAGAT AATTCACACT 2580 

GGAGAGAAAC CCTACAAATG CAGTGACAGC AAAGCCTTAG CCAAATCATC AGAAGTGCAA 2640 

AAGGTCTACT CTGGAGATGG GGAAAATGGA ATCCGTGTAC ATAAGAAAAA GGAGACACAG 2700 

GGCTGGCTTG TGAGAAACAA GAACGAAAAT AGAACAGGGC TGTTCCAGAT CCGGGCTGCC 2760 

GTGAGACCTA ACAGGGACCC TTCATGGGGA CAGCAAGAAG GTTCACTGAC TGACCCAATT 2820 

CAGAGGAAGG AGGAACCTGA CCTTCAAAAT CACTATGACC ATCAGAATGC CTTAGAAGAT 2880 

CAAAGAAATA CTGGAGTGGG TGGACTGTTG ACATTCAGAG ATGTAGTCAT AGAATTCTCT 2940 



CTGGAGGAGT GGCAATGCCT GGATCACGCT CAGCAGAATT TATATAGAGA TGTGATGTTA 3000 
2^^" RCA < ^^?° rGGT CTCCCTGGGT ATTGCTGTCT CTAAGCCAGA CTTGATCACC 3060 




TGTCTGGAGC AAAATAAAGA GCCTTGGAAT ATAAAGAGAA ATGAGATGGT AACCAAACAC 3120 

CCAGACCTTC CGCCAGAGCT AGGCATAAAA GATTCACTCC AAAAAGTAAT ACCAAGAAGA 3180 

TATGGAAAAA GTGGACATGA CAATTTACAA GTAAAAACAT GTAAAAGCAT GGGTGAGTGT 3240 

GAGGTGCAAA AAGGAGGTTG TAATGAAGTT AACCAATGTT TGTCAACTAC CCAAAACAAA 3300 

ATATTTCAGA CTCATAAATG TGTCAAAGTC TTCGGCAAAT TTTCAAATTC CAATAGACAT 3360 

AAGACAAGAC ATACTGGAAA GAAACATTTC AAATGTAAAA AATATGGCAA ATCATTTTGC 3420 

ATGGTTTCAC AACTACATCA ACATCAGATA ATTCATACTA GGGAGAATTC CTACCAATGT 3480 

GAAGAATGCG GCAAACCCTT CAACTGCTCT TCAACCCTTT CTAAACATAA AAGAATTCAT 3540 

ACTGGAGAGA AACCCTACAG ATGTGAGGAA TGTGGCAAAG CTTTTACCTG GTCCTCAACC 3600

CTTACTAAAC ATAGGAGAAT TCATACTGGA GAAAAACCCT ACACATGTGA AGAATGTGGC 3660

CAAGCCTTTA GCCGCTCCTC AACACTTGCT AACCACAAGA GAATTCATAC TGGAGAGAAA 3720 

CCATACACAT GTGAAGAATG TGGCAAAGCC TTTAGCTTAT CCTCATCCCT CACTTACCAC 3780 

AAGAGAATTC ATACTGGAGA GAAACCCTAC ACATGTGAAG AATGTGGCAA AGCCTTTAAC 3840 

TGCTCCTCAA CCCTTAAGAA ACATAAGATA ATTCATACTG GAGAGAAACC CTACAAATGT 3900 

AAAGAATGTG GGAAAGCCTT TGCCTTCTCC TCAACTCTTA ATACTCATAA GAGGATTCAT 3960 

ACTGGAGAGG AACCCTACAA ATGTGAAGAA TGTGACAAAG CTTTTAAGTG GTCCTCAAGT 4020 

CTTGCTAATC ATAAGAGTAT GCATACTGGA GAGAAACCCT ACAAATGTGA ATAA 4074 
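
The rows above follow a fixed layout: blocks of ten bases, six blocks per row, and a trailing counter giving the cumulative position of the last base in the row. The following is a minimal sketch, not part of the listing itself, of how such rows could be joined and checked against their counters; the function name `parse_listing` and the strict A/C/G/T alphabet are assumptions made for illustration. A row containing a scanning error (for example the letter O in place of a base) is rejected rather than silently corrected.

```python
import re


def parse_listing(lines):
    """Join ten-base blocks into one sequence and verify the row counters.

    Each non-blank row is expected to hold whitespace-separated base blocks,
    optionally followed by an all-digit cumulative position counter.
    """
    sequence_parts = []
    running_length = 0
    for raw in lines:
        tokens = raw.split()
        if not tokens:
            continue  # blank separator row
        # A trailing all-digit token is treated as the cumulative counter.
        counter = int(tokens.pop()) if tokens[-1].isdigit() else None
        bases = "".join(tokens).upper()
        if not re.fullmatch(r"[ACGT]+", bases):
            raise ValueError(f"non-nucleotide characters in row: {raw!r}")
        sequence_parts.append(bases)
        running_length += len(bases)
        if counter is not None and counter != running_length:
            raise ValueError(
                f"counter {counter} does not match running length {running_length}"
            )
    return "".join(sequence_parts)
```

Applied to a complete, undamaged entry, the length of the returned string should equal the final counter of that entry.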



Seq ID NO: C37 DNA Sequence 
Nucleic Acid Accession #: NM_032044
Coding sequence: 182..658



1 11 21 31 41 51

| | | | | |

AAGATATAAA AGCTCCAGAA ACGTTGACTG GGACCACTGG AGACACTGAA GAAGGCAGGG 60 

GCCCTTAGAG TCTTGGTTGC CAAACAGATT TGCAGATCAA GGAGAACCCA GGAGTTTCAA 120 

AGAAGCGCTA GTAAGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCAGG GTAGGAGGAA 180 

GATGGCTTCC AGAAGCATGC GGCTGCTCCT ATTGCTGAGC TGCCTGGCCA AAACAGGAGT 240

CCTGGGTGAT ATCATCATGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 

TTGCTATGGT TACTTCAGGA AGCTGAGGAA CTGGTCTGAT GCCGAGCTCG AGTGTCAGTC 360 

TTACGGAAAC GGAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA GCACCATAGC 420 

AGAGTACATA AGTGGCTATC AGAGAAGCCA GCOGATATGG ATTGGCCTGC ACGACCCACA 480 

GAAGAGGCAG CAGTGGCAGT GGATTGATGG GGCCATGTAT CTGTACAGAT CCTGGTCTGG 540 

CAAGTCCATG GGTGGGAACA AGCACTGTGC TGAGATGAGC TCCAATAACA ACTTTTTAAC 600 

TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTCCTG TGCAAGTACC GACCATAGAG 660 

CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 720 

GCTAAATCTG CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 780 

ACTACACTGG CTTTTTTAGG CTTAGAGACA GAAACTTTAG CATTGGCCCA GTAGTGGCTT 840 

CTAGCTCTAA ATGTTTGCCC CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 900 

GTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 960 

CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAAGCTTCT 1020 

AGACCCCTTC AGCTTCTACA CCCTTCTGCC CTCTCTCCAT TGCCTGCACC CCACCCCAGC 1080 

CACTCAACTC CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 1140 

TAGGTTGATG TGGGCCATAC ATTCCTTTAA TAAACCATTG TGTACATAAG AAAAAAAAAA 1200 
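
Each entry gives its coding region as a pair of 1-based, inclusive positions in the form `start..end` (here `182..658`, a span of 477 bases, which is a multiple of three). The following is a minimal sketch, under the assumption that the full sequence has already been joined into a single string, of how the coding region could be sliced out and given a basic reading-frame check. The names `extract_cds` and `check_cds` are hypothetical helpers, not part of the listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def extract_cds(sequence, start, end):
    """Return the coding region for 1-based inclusive coordinates start..end."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    # Convert the 1-based inclusive range to a 0-based half-open slice.
    return sequence[start - 1:end].upper()


def check_cds(cds):
    """Report basic reading-frame properties of a coding region."""
    return {
        "length_multiple_of_3": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
    }
```

A region that fails any of these checks may indicate a transcription error in either the coordinates or the sequence rows, so the checks are best read as a consistency aid rather than a validation of the underlying annotation.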

Seq ID NO: C38 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52..3042

1 11 21 31 41 51 

| | | | | |

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 

GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 

GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTAOGGTTCC TCTGTGGGAA GCCTGTGAGG 180 

CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 

AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 

TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 

GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 



GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480
TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540




AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 

CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGAOGAAAGG 660 

OGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840

CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080 






GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGAOGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680

OGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 

GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

GTTOCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GOGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG AOGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CAOGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 
CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 
TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 
GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 
GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860
ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 
ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 
CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 
ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 
CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 
CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 
AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCAOGGT TTTGTTGAGT TTTCACTCTT 5280 
CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 
AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 
GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 
AGCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 
TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 
AOGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 
GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 5700 
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 5808 

Seq ID NO: C39 DNA Sequence
Nucleic Acid Accession #: NM_014373
Coding sequence: 322..1338
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1 
1 

GTGGCCTCGA 
GCCTCCGGAG 
ATGTCTCCGA 
AAATAACATA 
GACCCACTGG 
TCTTGAAATT 
CAGTTAOGTC 
GGGAAAATAT 
TTTATGGAAT 
TCCATTATAT 
ATCTGCCTAT 
CTGACAGCTT 
TGTCAAAAAT 
GTTTTGGGAG 
TGTCCTTTCT 
TTTGTAGCTT 
ACTTCCTATA 
GTGAGATCTA 
CCATTTGTAC 
GAGATGAATA 
TTTAATTGTC 
TGGAAGTGCT 
TCAATAATGA 
TATGAACAGA 
CCCTGACTGA 
GTTTTATTTT 
TTACAAATAT 
GATGTTTGTG 
ACTGACAACT 
AAAAAAAAA 



11 

I 

GGTGGTGGCA 
GCAGGTTCGG 
GCTTACTCAC 
ATTGAAGGCA 
AGAGGACAGA 
TAACTAAAAA 
AAACAAACCA 
TATTAAATAT 
ATTTTTGCAT 
TGTATTTCAG 
TTACTCAAAT 
GTATAGATTA 
TATTTTATTT 
ACCCAGCCAT 
ATGTCAGCAT 
TCATAACCTG 
TGAATGAAAC 
AAAAAATATT 
TACTTCAGGT 
TTCCCTGGTT 
ACAAGCTTAA 
GCTTCATTCC 
TTTGTTAATA 
AAGAACTCAG 
TAGCATTTCA 
AAAAACAAAA 
TACTTTGTTA
TTACTCAAAA 
TTAAGATATC 



21 
I 

GGGCCGCCCC 
CTGGAAGGAA 
ATAGCATATT
GTAAAAGTGA 
AAATGAAGCA 
TATGACTGCT 
GCCCCTAGAC 
CCTTACACTA 
TTCACTAGCA 
GGATTTTGTA 
TATTTCCTTT 
TTGCCTGAAT 
CTTTACAGTA 
CTACCAAAGC 
TCAGAGTTAC 
TTGGGAAGAA 
TATCTTATAT 
CTTATCCAAG 
AATCATTGTT 
ATACTTTGTC 
TTTAAAAGAC 
ACTTACAATT 
TTATTAATTA 
GACATATTAA 
GAATGTGTCT 
TAATTCCAAG 
TTAACACAAA 
AAACTACTGG 
AACCTAAACA 



31 
I 

CTGCAGTCCG 
CCGCTCTOGC 
GGTATATCAA 
AATTAAATAG 
GTGTTTTATC 
CTCTCTTCAG 
GTTAACTATC 
GGAATGAGAA 
TTCGTTGATC 
CTTTTAAGCA 
ACTTATGGCT 
TTCTCTAAAA 
ATTTTAATTT 
CTGAAGGCAC 
TGGCTGTCAT 
GTTACTACTT 
TTTCCTTTTT 
CTCATTGTCT 
TTACTTAAAG 
AATAGTTTTC 
ATTGGATTAC 
CCTAATCTTG 
AAAGTTACAG 
AAAATAAACT 
TTTGAAGGGC 
AAGTTTTTAT 
AAGTGATAAG 
ATGCAAACTG 
TTTTTATTAA 



41 
I 

GAGACGAACG 
TTCGTCCTAC 
AATGAAATGC 
GAAGATCATC 
ATGTGTATTT 
AGAACTGCTC 
TGCTATTCTT 
GAAAAAACAC 
TTTTACTTTT 
TTAGGTTCAC 
TTTTGCATTA 
CAACCAAGCT 
GGATTTCAGT 
AGAATGCTTA 
TTTTCATGGT 
TGGTACAGGC 
CATCCCACTC 
GTTTTCTCAG 
TTCAGATTCC 
TCATTGCTAC 
CTTTGGATCC 
AGCAAATTGA 
CTGTCATAAG 
GAACTAAAAC 
TATACCAGTT 
AGTTATTCAG 
AGTTAACATT 
TTATGTAAAT 
ATGTTCAAAT 



51 
I 

CACGGACCGG 
ACTTGCGCAA 
AAGGAACCAA 
AGTCAAGGAA 
CAGCAGGTCT 
TTTTCAGTAC 
GATCATACTT 
CTGTCAAAAT 
GGTAAACATT 
TAAATACCAC 
TCCAGTTTTC 
TTCATTTAAG 
CCTTGCTTAT 
TTCTOGTCAC 
GATGATTTTA 
TATCAGGATA 
CAGTTATACT 
TACCTGGTTA 
AGCATATATT 
AGTGTATTGG 
ATTTGTCAAC 
AAAGCCTATA 
ATCATAATTT 
AACTTTTGCC 
ATTAAATAGT 
GGACACTATA 
TGGCTATACT 
CTGAGATTTC 
GTAAGCAAGA 
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1749 



Seq ID NO: C40 DNA Sequence 
Nucleic Acid Accession #: BC012089
Coding sequence: 1..2571 

1 11 21 31 41 51 

| | | | | |

ATGGCCCTCG TACTCGGCTC CCTGTTGCTG CTGGGGCTGT GCGGGAACTC CTTTTCAGGA 60 

GGGCAGCCTT CATCCACAGA TGCTCCTAAG GCTTGGAATT ATGAATTGCC TGCAACAAAT 120 

TATGAGACCC AAGACTCCCA TAAAGCTGGA CCCATTGGCA TTCTCTTTGA ACTAGTGCAT 180 

ATCTTTCTCT ATGTGGTACA GCCGCGTGAT TTCCCAGAAG ATACTTTGAG AAAATTCTTA 240 

CAGAAGGCAT ATGAATCCAA AATTGATTAT GACAAGATTG TCTACTATGA AGCAGGGATT 300 

ATTCTATGCT GTGTCCTGGG GCTGCTGTTT ATTATTCTGA TGCCTCTGGT GGGGTATTTC 360 

TTTTGTATGT GTCGTTGCTG TAACAAATGT GGTGGAGAAA TGCACCAGCG ACAGAAGGAA 420 

AATGGGCCCT TCCTGAGGAA ATGCTTTGCA ATCTCCCTGT TGGTGATTTG TATAATAATA 480 

AGCATTGGCA TCTTCTATGG TTTTGTGGCA AATCACCAGG TAAGAACCCG GATCAAAAGG 540 

AGTCGGAAAC TGGCAGATAG CAATTTCAAG GACTTGCGAA CTCTCTTGAA TGAAACTCCA 600 

GAGCAAATCA AATATATATT GGCCCAGTAC AACACTACCA AGGACAAGGC GTTCACAGAT 660 

CTGAACAGTA TCAATTCAGT GCTAGGAGGC GGAATTCTTG ACCGACTGAG ACCCAACATC 720 

ATCCCTGTTC TTGATGAGAT TAAGTCCATG GCAACAGCGA TCAAGGAGAC CAAAGAGGCG 780 

TTGGAGAACA TGAACAGCAC CTTGAAGAGC TTGCACCAAC AAAGTACACA GCTTAGCAGC 840 

AGTCTGACCA GCGTGAAAAC TAGCCTGCGG TCATCTCTCA ATGACCCTCT GTGCTTGGTG 900 

CATCCATCAA GTGAAACCTG CAACAGCATC AGATTGTCTC TAAGCCAGCT GAATAGCAAC 960 

CCTGAACTGA GGCAGCTTCC ACCCGTGGAT GCAGAACTTG ACAACGTTAA TAAOGTTCTT 1020 

AGGACAGATT TGGATGGCCT GGTCCAACAG GGCTATCAAT CCCTTAATGA TATACCTGAC 1080 

AGAGTACAAC GCCAAACCAC GACTGTCGTA GCAGGTATCA AAAGGGTCTT GAATTCCATT 1140 

GGTTCAGATA TCGACAATGT AACTCAGCGT CTTCCTATTC AGGATATACT CTCAGCATTC 1200 

TCTGTTTATG TTAATAACAC TGAAAGTTAC ATCCACAGAA ATTTACCTAC ATTGGAAGAG 1260 

TATGATTCAT ACTGGTGGCT GGGTGGCCTG GTCATCTGCT CTCTGCTGAC CCTCATCGTG 1320 

ATTTTTTACT ACCTGGGCTT ACTGTGTGGC GTGTGCGGCT ATGACAGGCA TGCCACCCCG 1380 

ACCACCCGAG GCTGTGTCTC CAACACCGGA GGCGTCTTCC TCATGGTTGG AGTTGGATTA 1440 

AGTTTCCTCT TTTGCTGGAT ATTGATGATC ATTGTGGTTC TTACCTTTGT CTTTGGTGCA 1500 

AATGTGGAAA AACTGATCTG TGAACCTTAC ACGAGCAAGG AATTATTCCG GGTTTTGGAT 1560 

ACACCCTACT TACTAAATGA AGACTGGGAA TACTATCTCT CTGGGAAGCT ATTTAATAAA 1620 

TCAAAAATGA AGCTCACTTT TGAACAAGTT TACAGTGACT GCAAAAAAAA TAGAGGCACT 1680 

TACGGCACTC TTCACCTGCA GAACAGCTTC AATATCAGTG AACATCTCAA CATTAATGAG 1740 

CATACTGGAA GCATAAGCAG TGAATTGGAA AGTCTGAAGG TAAATCTTAA TATCTTTCTG 1800 

TTGGGTGCAG CAGGAAGAAA AAACCTTCAG GATTTTGCTG CTTGTGGAAT AGACAGAATG 1860 
AATTATGACA GCTACTTGGC TCAGACTGGT AAATCCCCCG CAGGAGTGAA TCTTTTATCA 1920

TTTGCATATG ATCTAGAAGC AAAAGCAAAC AGTTTGCCCC CAGGAAATTT GAGGAACTCC 1980 

CTGAAAAGAG ATGCACAAAC TATTAAAACA ATTCACCAGC AACGAGTCCT TCCTATAGAA 2040 

CAATCACTGA GCACTCTATA CCAAAGCGTC AAGATACTTC AACGCACAGG GAATGGATTG 2100 

TTGGAGAGAG TAACTAGGAT TCTAGCTTCT CTGGATTTTG CTCAGAACTT CATCACAAAC 2160 

AATACTTCCT CTGTTATTAT TGAGGAAACT AAGAAGTATG GGAGAACAAT AATAGGATAT 2220 

TTTGAACATT ATCTGCAGTG GATCGAGTTC TCTATCAGTG AGAAAGTGGC ATCGTGCAAA 2280 
CCTGTGGCCA CCGCTCTAGA TACTGCTGTT GATGTCTTTC TGTGTAGCTA CATTATCGAC 2340 
CCCTTGAATT TGTTTTGGTT TGGCATAGGA AAAGCTACTG TATTTTTACT TCCGGCTCTA 2400 
ATTTTTGCGG TAAAACTGGC TAAGTACTAT CGTCGAATGG ATTCGGAGGA CGTGTACGAT 2460 
GATGTTGAAA CTATACCCAT GAAAAATATG GAAAATGGTA ATAATGGTTA TCATAAAGAT 2520 
CATGTATATG GTATTCACAA TCCTGTTATG ACAAGCCCAT CACAACATTG A 2571 



Seq ID NO: C41 DNA Sequence 
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Nucleic Acid Accession #: NM_033049 
Coding sequence: 28..1566
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21 31 41 

III 

CCACGCGTCC GAGCAAGAAC AGCTAAAATG AAAGCCATCA TTCATCTTAC TCTTCTTGCT 
CTCCTTTCTG TAAACACAGC CACCAACCAA GGCAACTCAG CTGATGCTGT AACAACCACA 
GAAACTGCGA CTAGTGGTCC TACAGTAGCT GCAGCTGATA CCACTGAAAC TAATTTCCCT 
GAAACTGCTA GCACCACAGC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC 
CCCATAATTA GTACACATAG TTCCTCCACA ATTCCTACAC CTGCTCCCCC CATAATTAGT 
ACACATAGTT CCTCCACAAT TCCTATACCT ACTGCTGCAG ACAGTGAGTC AACCACAAAT 
GTAAATTCAT TAGCTACCTC TGACATAATC ACCGCTTCAT CTCCAAATGA TGGATTAATC 
ACAATGGTTC CTTCTGAAAC ACAAAGTAAC AATGAAATGT CCCCCACCAC AGAAGACAAT 
CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 
GGTCCCAGCA ATCCTTGCCA AGATGATCCC TGTGCAGATA ATTCGTTATO TGTTAAGCTG 
CATAATACAA GTTTTTGCCT GTGTTTAGAA GGGTATTACT ACAACTCTTC TACATGTAAG 
AAAGGAAAGG TATTCCCTGG GAAGATTTCA GTGACAGTAT CAGAAACATT TGACCCAGAA 
GAGAAACATT CCATGGCCTA TCAAGACTTQ CATAGTGAAA TTACTAGCTT GTTTAAAGAT 
GTATTTGGCA CATCTGTTTA TGGACAGACT GTAATTCTTA CTGTAAGCAC ATCTCTGTCA 
CCAAGATCTG AAATGCGTGC TGATGACAAG TTTGTTAATG TAACAATAGT AACAATTTTG 
GCAGAAACCA CAAGTGACAA TGAGAAGACT GTGACTGAGA AAATTAATAA AGCAATTAGA 
AGTAGCTCAA GCAACTTTCT AAACTATGAT TTGACCCTTC GGTGTGATTA TTATGGCTGT 
AACCAGACTG CGGATGACTG CCTCAATGGT TTAGCATGCG ATTGCAAATC TGACCTGCAA 
AGGCCTAACC CACAGAGCCC TTTCTGCGTT GCTTCCAGTC TCAAGTGTCC TGATGCCTGC 
AACGCACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCCCTGA GTGTGCGTGC 
GTGCCCGGCT ACCAGGAAGA TGCTAATGGG AACTGCCAAA AGTGTGCATT TGGCTACAGT 
GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGGC
ATTGTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACG 
AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 
ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC 
TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAGACACA GCAGCATGCC CCGCCCTGAC 
TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 
TTAGAGTGTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 
TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AGTTGTGAAT GTTTGCAACG 
AATCCAGCTC ACTTGCTAAA TAAGAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGCG 
CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG 
TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 
CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 
GGAGACATAC GCATCAACTG TGGAGGTCCG AGGGGATGAG AAGGGATACC CACCACCTTT 
CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA 
ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA 
AGACTGGAAA GAGACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC 
TTTTTGATGC TAATTAGAAG GCCTGGACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 
TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 
AGAGAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC 
CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCAC TTCTCTAATC ATGCCCTCCC 
GAAGACTGGC TGGGAGAAGG TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTAGGTCAG 
TTTGAAATTG GAGACAAACT GTCTGGCAAA GGGTGCGAGA GGGAGCTTGT GCTCAGGAGT 
CCAGCCGTCC AGCCTCGGGG TGTAGGTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 
CTGGTGACAG AGGCTCAGCT GTGGCCACCA ACACACAACC ACACACACAC AACCACACAC 
ACAAATGGGG GCAACCACAT CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 
ATTTCTAATG CCTTGTCCTC TTAAAAGTTA TTTTATTTGT TATTATTATT TGTTCTTGAC 
TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA AAAAAAAAAA 
AAAAAAA 

Seq ID NO: C42 DNA Sequence 

Nucleic Acid Accession #: NM_001432.1 

Coding sequence: 167..676



1 11 21 31 

I I I I 

TCACTTGCCT GATATTTCCA GTGTCAGAGG GACACAGCCA 
TGACAGCCGC TCTCCAGCCA CTGCCGCGAG CCCGTCTGCT 
TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG CCCGCTCCCA 
GAGGATGGAG ATGCTCTGTG CCGGCAGGGT CCCTGCGCTG 
TCTTCTACAG GCAGTCCTCA GTACAACTGT GATTCCATCA 
TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA 
AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT 
GGACATGAGT CAAAACTACT GCAGGTGTGA AGTGGGTTAT 
CTTCTTTTTA ACCGTCCACC AACCTTTAAG CAAAGAGTAT 
TATTATTTTG TTTCTTATCA CAGTCGTCGG TTCCACATAT 
AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA 
GTTGCCGCAA GTCTGAATAT GAGAGAGTTA CCTCAGGGGA 
GAATGGCGCC ATCAAACTTA TGGGCAGGGA TAACAGTGTG 
CATTTTATTA ATAATATTTA TGTTGGGTCA AGTGTTAGGT 
GTACTTGAAA AATGTTTTTA TTTTTGTTTT ATTTTTGACA 
TGTGCAGAAA ATATTTAATA TCAAAAGAAA ATTGATATTT 
GCTAAATGCT TCATTGAAAG CTTCAAAGTT TATATGCCTG 
AGCAATTCCC AGGTCATAGC TCAAGAATTG TTAGCAAATG 
ATATATAGTC AAATCGATTT AGTAAGTATG TTTTTTATGT 
GGTTTGACTG TACCATGGTT TGATATGTAG TTGGCACCAT 
TAATGCAATT AGAATTTGGG AGAAGCAAAT ATAGGTCCTG 
AAACAAGCTA ACCCTGGGGA GTCTATGGTC TCTTCACTCA 
TATATGAGGG GCAGTGGACA GTTCCCTATG CCAACTCACG 



41 

|

ACGTGGGGTC 
CCCGCCCTGC 
TCGCCGATGA 
CTGCTCTGCC 
TGTATCCCAG 

GGACAGTGCA 
ACTGGTGTCC 
GTGGCTTTGA 
TATTTCTGCA 
GTTACCTCAG 
TCCAGAGTTG 
CCTGGTTAAT 
CAATAACACT 
GACTATTTGC 
TTATACAAGT 
GTGCACAGTG 
ACAGATTTCT 
TCCTCAAATC 
GGTATCATAT 
TGTTAAACAC 
GGTCTCAGCT 
ACTCCTACAG 



60 
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480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
1200 
1260 
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1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2887 



51 
I 

CCTTCTAGGC
CCGTGCACTC 
CCGCGGGGAG 
TGGGTTTCCA 
GAGAGTCCAG 
AAGTGTCAAT 
TCTATCTGGT 
GATGTGAACA 
CCGTGATTCT 
GATGGTACAG 
GGGATCCAGA 
CCGCAAGTCT 
ATTAATATTC 
GTATTTTAAT 
TAATGTATAA 
AATTTCCTGA 
CTTAGAAGTA 
GTAAGCCTAT 
AGTGATAATT 
ATTAAAACAA 
TACACATTTG 
ATAATTCTGT 
GTACTAGTCA 
CTCATCTACC AGATTCTGCC TATGTAAAAT GAATTGAAAA ACAATTTTCT GTAATCTTTT 1440
ATTTAAGTAG TGGGCATTTC ATAGCTTCAC AATGTTCCTT TTTTGTATAT TACAACATTT 1500
ATGTGAGGTA ATTATTGCTC AACAGACAAT TAGAAAAAAG TCCACACTTG AAGCCTAAAT 1560
TTGTGCTTTT TAAGAATATT TTTAGACTAT TTCTTTTTAT AGGGGCTTTG CTGAATTCTA 1620
ACATTAAATC ACAGCCCAAA ATTTGATGGA CTAATTATTA TTTTAAAATA TATGAAGACA 1680
ATAATTCTAC ATGTTGTCTT AAGATGGAAA TACAGTTATT TCATCTTTTA TTCAAGGAAG 1740
TTTTAACTTT AATACAGCTC AGTAAATGGC TTCTTCTAGA ATGTAAAGTT ATGTATTTAA 1800
AGTTGTATCT TGACACAGGA AATGGGAAAA AACTTAAAAA TTAATATGGT GTATTTTTCC 1860
AAATGAAAAA TCTCAATTGA AAGCTTTTAA AATGTAGAAA CTTAAACACA CCTTCCTGTG 1920
GAGGCTGAGA TGAAAACTAG GGCTCATTTT CCTGACATTT GTTTATTTTT TGGAAGAGAC 1980
AAAGATTTCT TCTGCACTCT GAGCCCATAG GTCTCAGAGA GTTAATAGGA GTATTTTTGG 2040
GCTATTGCAT AAGGAGCCAC TGCTGCCACC ACTTTTGGAT TTTATGGGAG GCTCCTTCAT 2100
CGAATGCTAA ACCTTTGAGT AGAGTCTCCC TGGATCACAT ACCAGGTCAG GGAGGATCTG 2160
TTCTTCCTCT AOGTTTATCC TGGCATGTGC TAGGGTAAAC GAAGGCATAA TAAGCCATGG 2220
CTGACCTCTG GAGCACCAGG TGCCAGGACT TGTCTCCATG TGTATCCATG CATTATATAC 2280
CCTGGTGCAA TCACACGACT GTCATCTAAA GTCCTGGCCC TGGCCCTTAC TATTAGGAAA 2340
ATAAACAGAC AAAAACAAGT AAATATATAT GGTCCTATAC ATATTGTATA TATATTCATA 2400
TACAAACATG TATGTATACA TGACCTTAAT GGATCATAGA ATTGCAGTCA TTTGGTGCTC 2460
TGCTAACCAT TTATATAAAA CTTAAAAACA AGAGAAAAGA AAAATCAATT AGATCTAAAC 2520
AGTTATTTCT GTTTCCTATT TAATATAGCT GAAGTCAAAA TATGTAAGAA CACATTTTAA 2580
ATACTCTACT TACAGTTGGC CCTCTGTGGT TAGTTCCACA TCTGTGGATT CAACCAACCA 2640
AGGACGGAAA ATGCTTAAAA AATAATACAA CAACAACAAA AAATACATTA TAACAACTAT 2700
TTACTTTTTT TTTTTTCTTT TTGAGATGGA GTCTCGCTCT GTTGCCCAGG TTGGAGTGCA 2760
GTGGCACGAT CTCGGCTCAC TGCAACCTCA CCTCCCGGGT TCAAGAGATC CTCCTGCCTC 2820
AGCCTCCTGA GCAGCTGGGA CTACAGGCGC ATGCCACCAT GCCCAGCTAA TTTTTGTATT 2880
TTTAGTAGAG GCGGGGTTTC ACCATGTTGG CCAGGATGGT CTCAATCTCC TAACCTTGAG 2940
ATCCACCCTC CACAGCCTCC CAAACTGCTG GGATTACAGG CGTGAGCCAC CGCACGTAGC 3000
ATTTACATTA GGTATTACAA GTAATGTAAA GATGATTTAA GTATACAGGA GGATGTGAAT 3060
AGGTTATATG CAAGCACTAT GCCCTTTTAT ATAAGTGACT TGAACATCTG TGCCCGATTT 3120
TAGTATGTGC AGGGGGGCGA TCTGGGAATC AGTCCCCTGT GGATACCAAG GTACAACTGT 3180
ATTTATTAAC GCTTACTAGA TGTGAGGAGA GTCTGAATAT TTTCAGTGAT CTTGGCTGTT 3240
TCAAAAAAAT CTATTGACTT TTCAATAAAT CAGCTGCAAT CCATTTATTT CATTTACAAA 3300
AGATTTATTG TAAGCCTCTC AATCTTGGTT TTTCAGTTGA TCTTAAGCAT GTCAATTCAT 3360
AAAAACAAGT CATTTTTGTA TTTTTCATCT TTAAGAATGC TTAAAAAAGC TAATCCCTAA 3420
AATAGTTAGA TCTTTGTAAA TGCATATTAA ATAATAAAGT ATGACCCACA TTACTTTTTA 3480
TGGGTGAAAA TAAGACAAAA ATAATAGTTT TAGTGAGGAT GGTGCTGAGT AAACATAAAA 3540
ACTGATTTGC TCTCAGCTGA TGTGTCCTGT ACACAGTGGG AAGATTTTAG TTCACACTTA 3600
GTCTAACTCC CCCATTTTAC AGATTTCTCA CTATATATAT TTCTAGAAGG GGCTATGCAT 3660
ATTCAATGTA TTGAGAACCA AAGCAACCAC AAATGCATAA ATGCATAATT TATGGTCTTC 3720
AACCAAGGCC ACATAATAAC CCAGTTAACT TACTCTTTAA CCAGGAATAT TAAGTTCTAT 3780
AACTAGTACT CAAGGTTTAA CCTTAAAATT AAGATTTCCT TAACCTTAAC CTTAAAATTG 3840
ATATTATATT AAACATACAT AATACAATGT AACTCCACTG TTCTCCTGAA TATTTTTTGC 3900
TCTAATCTCT CTGCCGAAAG TCAAAGTGAT GGGAGAATTG GTATACTGGT ATGACTACGT 3960
CTTAAGTCAG ATTTTTATTT ATGAGTCTTT GAGACTAAAT TCAATCACCA CCAGGTATCA 4020
AATCAACTTT TATGCAGCAA ATATATGATT CTAGTGTCTG ACTTTTGTTA AATTCAGTAA 4080
TGCAGTTTTT AAAAACCTGT ATCTGACCCA CTTTGTAATT TTTGCTCCAA TATCCATTCT 4140
GTAGACTTTT GAAAAAAAAG TTTTTAATTT GATGCCCAAT ATATTCTGAC CGTTAAAAAA 4200
TTCTTGTTCA TATGGGAGAA GGGGGAGTAA TGACTTGTAC AAACAGTATT TCTGGTGTAT 4260
ATTTTAATGT TTTTAAAAAG AGTAATTTCA TTTAAATATC TGTTATTCAA ATTTGATGAT 4320
GTTAAATGTA ATATAATGTA TTTTCTTTTT ATTTTGCACT CTGTAATTGC ACTTTTTAAG 4380
TTTGAAGAGC CATTTTGGTA AACGGTTTTT ATTAAAGATG CTATGGAACA TAAAGTTGTA 4440
TTGCATGCAA TTTAAAGTAA CTTATTTGAC TATGAATATT ATCGGATTAC TGAATTGTAT 4500
CAATTTGTTT GTGTTCAATA TCAGCTTTGA TAATTGTGTA CCTTAAGATA TTGAAGGAGA 4560
AAATAGATAA TTTACAAGAT ATTATTAATT TTTATTTATT TTTCTTGGGA ATTGAAAAAA 4620
ATTGAAATAA ATAAAAATGC ATTGAACATC TTGCATTCAA AATCTTCACT GAC 4673

Seq ID NO: C43 DNA Sequence

Nucleic Acid Accession #: AF011468.1

Coding sequence: 257.. 1468

1 11 21 31 41 51

| | | | | |

GGAAGACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC CGTGGGGGAT 60
ATCTCAGTGG CGGACGAGGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 120
CAAGTCCCCT GTCGGTTCCT CCGTCCCTGA GTGTCCTTGG CGCTGCCTTG TGCCCGCCCA 180
GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTGTA GGATACTGCT TGTTACTTAT 240
TACAGCTAGA GGCATCATGG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 300
TACAGCTCCA GTTGGAGGTC CAAAACGTGT TCTCGTGACT CAGCAAATTC CTTGTCAGAA 360
TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 420
GCGCGTTCCT TTGCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 480
GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 540
CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 600
ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 660
TGGTCGCCCT CTGGGTAAAG GAAAGTTTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 720
CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTGGAGAAAG CCGGAGTGGA 780
GCATCAGCTC AGAAGAGAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 840
ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 900
TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAGA GAACTGCTAC 960
TTATATAACA GAATTGGCAA ATGCCCTGTC TTACTGTCAT TCGAAGAGAG TTATTCATAG 1020
AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 1080
TGGGTGGTCA GTACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGGCA CCCTGGACTA 1140
CCTGCCCCCT GAAATGATTG AAGGTCGGAT GCATGATGAG AAGGTGGATC TCTGGAGCCT 1200
TGGAGTTCTT TGCTATGAAT TTTTAGTTGG GAAGCCTCCT TTTGAGGCAA ACACATACCA 1260
AGAGACCTAC AAAAGAATAT CACGGGTTGA ATTCACATTC CCTGACTTTG TAACAGAGGG 1320
AGCCAGGGAC CTCATTTCAA GACTGTTGAA GCATAATCCC AGCCAGAGGC CAATGCTCAG 1380
AGAAGTACTT GAACACCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTGCCAAAA 1440
CAAAGAATCA GCTAGCAAAC AGTCTTAGGA ATCGTGCAGG GGGAGAAATC CTTGAGCCAG 1500
GGCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT ATTTTACCAT TGACTGCTGC 1560
CCTCAATCTA GAACGCTACA CAAGAAATAT TTGTTTTACT CAGCAGGTGT GCCTTAACCT 1620
CCCTATTCAG AAAGCTCCAC ATCAATAAAC ATGACACTCT GAAGTGAAAG TAGCCACGAG 1680
AATTGTGCTA CTTATACTGG TTCATAATCT GGAGGCAAGG TTCGACTGCA GCCGCCCCGT 1740
CAGCCTGTGC TAGGCATGGT GTCTTCACAG GAGGCAAATC CAGAGCCTGG CTGTGGGGAA 1800
AGTGACCACT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC TTCCTAGTAC 1860
CTGAGTGAGT GTGTAACTTA TTGGGTTGGC GAAGCCTGGT AAAGCTGTTG GAATGAGTAT 1920
GTGATTCTTT TTAAGTATGA AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 1980
GGCATTCCTT TAGGAATGCT GTGTGTCTGT CGGGCACCCC GGTAGGCCTG ATTGGGTTTC 2040
TAGTCCTCCT TAACCACTTA TCTCCCATAT GAGAGTGTGA AAAATAGGAA CACGTGCTCT 2100
ACCTCCATTT AGGGATTTGC TTGGGATACA GAAGAGGCCA TGTGTCTCAG AGCTGTTAAG 2160
GGCTTATTTT TTTAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATGCAAATA 2220
AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA 2253

Seq ID NO: C44 DNA Sequence
Nucleic Acid Accession #: NM_013372
Coding sequence: 63.. 617



1 11 21 31 41 51 

| | | | | |

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020

TAAATTAAAC CTATTCTTTC AAAAAAAAA 4049



Seq ID NO: C45 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 200.. 2932 



1 11 21 31 41 51

| | | | | |

ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60 

ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 120 

TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180 

GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 240 

CACTGACGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 300 

CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 360 

GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 420 

TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA CCACAGACTG 480 

CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACACCT GGTTTCCTCC 540 

CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA 600 

ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 660 

CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 720 

CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 780 

TTTTGAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 840 

TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 900 

TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 960

AGCCCAGTGT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 1020 

CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT 1080 

CATCAGGGAG ACTTGTGTGC TCTCTCTGCT TGAAGAACTG AACAAGAATT TCAGTATGAT 1140 

TGTAGGCAAT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 1200

TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGGCTTCG GTGGTGTCGA TTCTGAGCAA 1260 

TATTTCATCT CTGTCACTGG CCAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGATGT 1320 

CATCAGTATA GCTGACAATA TCCTTAATTC AGCCTCAGTA ACCAACTGGA CAGTCTTACT 1380 

GCGGGAAGAA AAGTATGCCA GCTCACGGTT ACTAGAGACA TTAGAAAACA TCAGCACTCT 1440 

GGTGCCTCCG ACAGCTCTTC CTCTGAATTT TTCTCGGAAA TTCATTGACT GGAAAGGGAT 1500 

TCCAGTGAAC AAAAGCCAAC TCAAAAGGGG TTACAGCTAT CAGATTAAAA TGTGTCCCCA 1560 

AAATACATCT ATTCCCATCA GAGGCCGTGT GTTAATTGGG TCAGACCAAT TCCAGAGATC 1620 

CCTTCCAGAA ACTATTATCA GCATGGCCTC GTTGACTCTG GGGAACATTC TACCCGTTTC 1680 

CAAAAATGGA AATGCTCAGG TCAATGGACC TGTGATATCC ACGGTTATTC AAAACTATTC 1740 

CATAAATGAA GTTTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 1800 

TGTGTTTTGG GATTTCAGTC ATTTGCAGTG GAACGATGCA GGCTGCCACC TAGTGAATGA 1860 

AACTCAAGAC ATCGTGACGT GCCAATGTAC TCACTTGACC TCCTTCTCCA TATTGATGTC 1920 

ACCTTTTGTC CCCTCTACAA TCTTCCCCGT TGTAAAATGG ATCACCTATG TGGGACTGGG 1960

TATCTCCATT GGAAGTCTCA TTTTATGCCT GATCATCGAG GCTTTGTTTT GGAAGCAGAT 2040 

TAAAAAAAGC CAAACCTCTC ACACACGTCG TATTTGCATG GTGAACATAG CCCTGTCCCT 2100 

CTTGATTGCT GATGTCTGGT TTATTGTTGG TGCCACAGTG GACACCACGG TGAACCCTTC 2160 

TGGAGTCTGC ACAGCTGCTG TGTTCTTTAC ACACTTCTTC TACCTCTCTT TGTTCTTCTG 2220 

GATGCTCATG CTTGGCATCC TGCTGGCTTA CCGGATCATC CTCGTGTTCC ATCACATGGC 2280 

CCAGCATTTG ATGATGGCTG TTGGATTTTG CCTGGGTTAT GGGTGCCCTC TCATTATATC 2340 

TGTCATTACC ATTGCTGTCA CGCAACCTAG CAATACCTAC AAAAGGAAAG ATGTGTGTTG 2400 

GCTTAACTGG TCCAATGGAA GCAAACCACT CCTGGCTTTT GTTGTCCCTG CACTGGCTAT 2460 

TGTGGCTGTG AACTTCGTTG TGGTGCTGCT AGTTCTCACA AAGCTCTGGA GGCCGACTGT 2520

TGGGGAAAGA CTGAGTCGGG ATGACAAGGC CACCATCATC CGCGTGGGGA AGAGCCTCCT 2580 

CATTCTGACC CCTCTGCTAG GGCTCACCTG GGGCTTTGGA ATAGGAACAA TAGTGGACAG 2640 

CCAGAATCTG GCTTGGCATG TTATTTTTGC TTTACTCAAT GCATTCCAGG GATTTTTTAT 2700 

CTTATGCTTT GGAATACTCT TGGACAGTAA GCTGCGACAA CTTCTGTTCA ACAAGTTGTC 2760 

TGCCTTAAGT TCTTGGAAGC AAACAGAAAA GCAAAACTCA TCAGATTTAT CTGCCAAACC 2820 

CAAATTCTCA AAGCCTTTCA ACCCACTGCA AAACAAAGGC CATTATGCAT TTTCTCATAC 2880 

TGGAGATTCC TCCGACAACA TCATGCTAAC TCAGTTTGTC TCAAATGAAT AAGGCAAGGA 2940 

ATCATAAAAT CAAGAAAAAA TTTCCAGAAC AACTTGACAT TTAGAGACAA ATGTCAATGA 3000 

AGAAATTATG CTCAGTATTC GATCGGGTTT TCTGATTTAG GGGTCTGGGA ATAAAACAAG 3060 

AATGTCTCAG TGGCTTCA 3078 
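
The layout used in these listings (blocks of ten bases, six blocks per row, a running base count at the end of each row) and the "Coding sequence" coordinates can be read programmatically. The following is a minimal Python sketch, not part of the original listing. The function names `parse_listing` and `coding_region` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which is consistent with position 200 of Seq ID NO: C45 beginning an ATG codon.

```python
def parse_listing(rows):
    """Concatenate listing rows into a single sequence string.

    Each row holds space-separated blocks of bases followed by a running
    base count; purely numeric tokens are treated as that count and skipped.
    """
    blocks = []
    for row in rows:
        for token in row.split():
            if not token.isdigit():
                blocks.append(token.upper())
    return "".join(blocks)


def coding_region(sequence, start, end):
    """Slice a sequence using 1-based, inclusive coordinates (assumed convention)."""
    return sequence[start - 1:end]


# First four rows of Seq ID NO: C45, copied from the listing above.
rows = [
    "ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60",
    "ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 120",
    "TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180",
    "GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 240",
]

sequence = parse_listing(rows)
start_codon = coding_region(sequence, 200, 202)
print(len(sequence), start_codon)  # prints: 240 ATG
```

Rows whose trailing count was damaged in scanning (for example a count containing a letter) would not be recognized as numeric by this sketch and would need to be corrected first.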

Seq ID NO: C46 DNA Sequence 

Nucleic Acid Accession #: NM_000584.1

Coding sequence: 75.. 374 



1 11 21 31 41 51

| | | | | |

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60
CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120
CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180
GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240
AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300
TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360
CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420
GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480
CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540
CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600
CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA 660
TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 720
GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 780
TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 840
CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 900
TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG TGAGGACATG 960
TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 1020
CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT GTGCAAGAAT TTGGAAAAAT 1080
AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA 1140
GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 1200
ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 1260
AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 1320
AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 1380
AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 1440
TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 1500
AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 1560
TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 1620
AAGTAAAAAA AAAAAAAAA 1639



Seq ID NO: C47 DNA Sequence 

Nucleic Acid Accession #: NM_005603.1 

Coding sequence: 1..3756 

1 11 21 31 41 51 

| | | | | |

ATGAGTACAG AAAGAGACTC AGAAACGACA TTTGACGAGG ATTCTCAGCC TAATGACGAA 60 

GTGGTTCCCT ACAGTGATGA TGAAACAGAA GATGAACTTG ATGACCAGGG GTCTGCTGTT 120 

GAACCAGAAC AAAACCGAGT CAACAGGGAA GCAGAGGAGA ACCGGGAGCC ATTCAGAAAA 180 

GAATGTACAT GGCAAGTCAA AGCAAACGAT CGCAAGTACC ACGAACAACC TCACTTTATG 240 

AACACAAAAT TCTTGTGTAT TAAGGAGAGT AAATATGCGA ATAATGCAAT TAAAACATAC 300

AAGTACAACG CATTTACCTT TATACCAATG AATCTGTTTG AGCAGTTTAA GAGAGCAGCC 360 

AATTTATATT TCCTGGCTCT TCTTATCTTA CAGGCAGTTC CTCAAATCTC TACCCTGGCT 420 

TGGTACACCA CACTAGTGCC CCTGCTTGTG GTGCTGGGCG TCACTGCAAT CAAAGACCTG 480

GTGGACGATG TGGCTCGCCA TAAAATGGAT AAGGAAATCA ACAATAGGAC GTGTGAAGTC 540 

ATTAAGGATG GCAGGTTCAA AGTTGCTAAG TGGAAAGAAA TTCAAGTTGG AGACGTCATT 600 

CGTCTGAAAA AAAATGATTT TGTTCCAGCT GACATTCTCC TGCTGTCTAG CTCTGAGCCT 660 

AACAGCCTCT GCTATGTGGA AACAGCAGAA CTGGACGGAG AAACCAATTT AAAATTTAAG 720 

ATGTCACTTG AAATCACAGA CCAGTACCTC CAAAGAGAAG ATACATTGGC TACATTTGAT 780 

GGTTTTATTG AATGTGAAGA ACCCAATAAC CGACTAGATA AGTTTACAGG AACACTATTT 840 

TGGAGAAACA CAAGTTTTCC TTTGGATGCT GATAAAATTT TGTTACGTGG CTGTGTAATT 900 

AGGAACACCG ATTTCTGCCA CGGCTTAGTC ATTTTTGCAG GTGCTGACAC TAAAATAATG 960 

AAGAATAGTG GGAAAACCAG ATTTAAAAGA ACTAAAATTG ATTACTTGAT GAACTACATG 1020 

GTTTACACGA TCTTTGTTGT TCTTATTCTG CTTTCTGCTG GTCTTGCCAT CGGCCATGCT 1080 

TATTGGGAAG CACAGGTGGG CAATTCCTCT TGGTACCTCT ATGATGGAGA AGACGATACA 1140 

CCCTCCTACC GTGGATTCCT CATTTTCTGG GGCTATATCA TTGTTCTCAA CACCATGGTA 1200 

CCCATCTCTC TCTATGTCAG CGTGGAAGTG ATTCGTCTTG GACAGAGTCA CTTCATCAAC 1260 

TGGGACCTGC AAATGTACTA TGCTGAGAAG GACACACCCG CAAAAGCTAG AACCACCACA 1320 

CTCAATGAAC AGCTCGGGCA GATCCATTAT ATCTTCTCTG ATAAGACGGG GACACTCACA 1380 

CAAAATATCA TGACCTTTAA AAAGTGCTGT ATCAACGGGC AGATATATGG GGACCATCGG 1440 

GATGCCTCTC AACACAACCA CAACAAAATA GAGCAAGTTG ATTTTAGCTG GAATACATAT 1500 

GCTGATGGGA AGCTTGCATT TTATGACCAC TATCTTATTG AGCAAATCCA GTCAGGGAAA 1560 

GAGCCAGAAG TACGACAGTT CTTCTTCTTG CTCGCAGTTT GCCACACAGT CATGGTGGAT 1620 

AGGACTGATG GTCAGCTCAA CTACCAGGCA GCCTCTCCCG ATGAAGGTGC CCTGGTAAAC 1680 

GCTGCCAGGA ACTTTGGCTT TGCCTTCCTC GCCAGGACCC AGAACACCAT CACCATCAGT 1740 

GAACTGGGCA CTGAAAGGAC TTACAATGTT CTTGCCATTT TGGACTTCAA CAGTGACCGG 1800 

AAGCGAATGT CTATCATTGT AAGAACCCCA GAAGGCAATA TCAAGCTTTA CTGTAAAGGT 1860

GCTGACACTG TTATTTATGA ACGGTTACAT CGAATGAATC CTACTAAGCA AGAAACACAG 1920 

GATGCCCTGG ATATCTTTGC AAATGAAACT CTTAGAACCC TATGCCTTTG CTAGAAGGAA 1980 

ATTGAAGAAA AAGAATTTAC AGAATGGAAT AAAAAGTTTA TGGCTGCCAG TGTGGCCTCC 2040 

ACCAACCGGG ACGAAGCTCT GGATAAAGTA TATGAGGAGA TTGAAAAAGA CTTAATTCTC 2100 

CTGGGAGCTA CAGCTATTGA AGACAAGCTA CAGGATGGAG TTCCAGAAAC CATTTCAAAA 2160 

CTTGCAAAAG CTGACATTAA GATCTGGGTG CTTACTGGAG ACAAAAAGGA AACTGCTGAA 2220 

AATATAGGAT TTGCTTGTGA ACTTCTGACT GAAGACACCA CCATCTGCTA TGGGGAGGAT 2280 

ATTAATTCTC TTCTTCATGC AAGGATGGAA AACCAGAGGA ATAGAGGTGG CGTCTACGCA 2340 

AAGTTTGCAC CTCCTGTGCA GGAATCTTTT TTTCCACCCG GTGGAAACCG TGCCTTAATC 2400 

ATCACTGGTT CTTGGTTGAA TGAAATTCTT CTCGAGAAAA AGACCAAGAG AAATAAGATT 2460 

CTGAAGCTGA AGTTCCCAAG AACAGAAGAA GAAAGACGGA TGCGGACCCA AAGTAAAAGG 2520 

AGGCTAGAAG CTAAGAAAGA GCAGCGGCAG AAAAACTTTG TGGACCTGGC CTGCGAGTGC 2580

AGCGCAGTCA TCTGCTGCCG CGTCACCCCC AAGCAGAAGG CCATGGTGGT GGACCTGGTG 2640 

AAGAGGTACA AGAAAGCCAT CACGCTGGCC ATCGGAGATG GGGCCAATGA CGTGAACATG 2700 

ATCAAAACTG CCCACATTGG CGTTGGAATA AGTGGACAAG AAGGAATGCA AGCTGTCATG 2760 

TCGAGTGACT ATTCCTTTGC TCAGTTCCGA TATCTGCAGA GGCTACTGCT GGTGCATGGC 2820 

CGATGGTCTT ACATAAGGAT GTGCAAGTTC CTACGATACT TCTTTTACAA AAACTTTGCC 2880 

TTTACTTTGG TTCATTTCTG GTACTCCTTC TTCAATGGCT ACTCTGCGCA GACTGCATAC 2940 

GAGGATTGGT TCATCACCCT CTACAACGTG CTGTACACCA GCCTGCCOGT GCTCCTCATG 3000 

GGGCTGCTCG ACCAGGATGT GAGTGACAAA CTGAGCCTCC GATTCCCTGG GTTATACATA 3060 

GTGGGACAAA GAGACTTACT ATTCAACTAT AAGAGATTCT TTGTAAGCTT GTTGCATGGG 3120 

GTCCTAACAT CGATGATCCT CTTCTTCATA CCTCTTGGAG CTTATCTGCA AACCGTAGGG 3180 
CAGGATGGAG AGGCACCTTC CGACTACCAG TCTTTTGCCG TCACCATTGC CTCTGCTCTT 3240 
GTAATAACAG TCAATTTCCA GATTGGCTTG GATACTTCTT ATTGGACTTT TGTGAATGCT 3300 
TTTTCAATTT TTGGAAGCAT TGCACTTTAT TTTGGCATCA TGTTTGACTT TCATAGTGCT 3360 
GGAATACATG TTCTCTTTCC ATCTGCATTT CAATTTACAG GCACAGCTTC AAACGCTCTG 3420 
AGACAGCCAT ACATTTGGTT AACTATCATC CTGACTGTTG CTGTGTGCTT ACTAOCCGTC 3480 
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GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAGTGATAA GATCCAGAAG 3540 

CATCGCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

GGCGTGTCAA OGCGGCGCTC GGCCTACGCC TTCTCGCACC AGCGGGGCTA CGCGGACCTC 3660 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCCGC TTGATGCCAT CGTGGCGGAT 3720 

GGCACCGOGG AGTACAGGCG CACCGGGGAC AGCTGA 3756 

Seq ID NO: C48 DNA Sequence
Nucleic Acid Accession #: XM_044533
Coding sequence: 238.. 2751 

1 11 21 31 41 51 

| | | | | |

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGOGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560

CCCCAGGCTC GCTACCAGCG GGTGGCTGTA CACOGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CAOGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCOGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGAGCTGC 2520

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGGATCCAA GACAGCTTOG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGOGAAGCT CCTACCACCC AGACACCCAA 2940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 
CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 
CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 
GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 
GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 
CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 
CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 
GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 
TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 3766 



Seq ID NO: C49 DNA Sequence

Nucleic Acid Accession #: NM_007019.1 

Coding sequence: 41.. 580 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCG AGTTCCTGTC TCTCTGCCAA CGCCGCCCGG ATGGCTTCCC AAAACCGCGA 
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CCCAGCCGCC 
CCGGGGTCCG 
TAAAGGGATT 
TGGAGCAGCT 
TGGCTACCCT 
GGACACCCAG 
TGTCAGGACC 
CTTGAACACA 
AGAAACCTAC 
CTTGTGTCGT 
GACTCTTTAT 
GTTGAGCCCT 



ACTAGCGTCG 
GTGGGCAAAA 
TCTGCCTTCC 
GGAACAGTAT 
TACAATGCGC 
GGTAACATAT 
ATTCTGCTCT 
CATGCTGCCG 
TCAAAGCAGG 
CTTTTTAATT 
CTTGAGCTGT 
TGTATATTAA 



GGCTACAGCA 
CTGAATCAGA 
ATGAAGACCT 
CCACAGTGAA 
GCCTGGACAT 
CCATCCAGAG 
AGCTCTGGAA 
TCACCAGCCA 
TTTCCTTAGA 
GGTATTTTTG 
ATAAATGCAT 



TAAAGGAGCT 
GGAGCTGATG 
CAACCTTTTC 
GAGGTATAAG 
GTTCCTCACG 
CCTGAAGGAA 
CCTTCTAGGA 
AAACCCCACA 
GGAGCCCTGA 

TTTTGTTTTT 
TTTTGTCCTT 



GAGCCGAGCG 
ACCCTCATGA 
AAATGGGTAG 
CTCTCGCTAG 
CCCTGCTATC 
AAGTGGTCTG 
GAACCCAACA 
GCTTTTAAGA 
CCCAGGCTGC 
TTTTTGTGAT 
GTCTTTTAAA 
TTTTAAAAAA 



GGGGCGCCGC 
TGTCTGGCGA 
GGACCATCCA 
AGTTCCCCAG 
ACCCCAACGT 
CCCTGTATGA 
TTGATAGTCC 
AGTACCTGCA 
CCAGCCTGTC 
TTCTGTATAG 
TTAAGCCTCG 
AAAAAAAAAA 



120 
180 
240 
300 
360 
420 
480 
540 
600
660 
720 
780 
783 



Seq ID NO: C50 DNA Sequence 

Nucleic Acid Accession #: NM_014584.1

Coding sequence: 227..1633

1 11 21 31 41 51 

| | | | | |

GCACGAGCCC CGGGCTGCCG GCGCGGGCGC CGCGGCACGT CCACAGGCTG GGTCGCGAGG 60 

TGGCGATCGC TGAGAGGCAG GAGGGCCGAG GCGGGCCTGG GAGGCGGCCC GGAGGTGGGG 120 

CGCCGCTGGG GCCGGCCCGC ACGGGCTTCA TCTGAGGGCG CACGGCCCGC GACCGAGCGT 180 

GCGGACTGGC CTCCCAAGCG TGGGGCGACA AGCTGCCGGA GCTGCAATGG GCCGCGGCTG 240 

GGGATTCTTG TTTGGCCTCC TGGGCGCCGT GTGGCTGCTC AGCTCGGGCC ACGGAGAGGA 300 

GCAGCCCCCG GAGACAGCGG CACAGAGGTG CTTCTGCCAG GTTAGTGGTT ACTTGGATGA 360 

TTGTACCTGT GATGTTGAAA CCATTGATAG ATTTAATAAC TACAGGCTTT TCCCAAGACT 420 

ACAAAAACTT CTTGAAAGTG ACTACTTTAG GTATTACAAG GTAAACCTGA AGAGGCCGTG 480 

TCCTTTCTGG AATGACATCA GCCAGTGTGG AAGAAGGGAC TGTGCTGTCA AACCATGTCA 540 

ATCTGATGAA GTTCCTGATG GAATTAAATC TGCGAGCTAC AAGTATTCTG AAGAAGCCAA 600 

TAATCTCATT GAAGAATGTG AACAAGCTGA ACGACTTGGA GCAGTGGATG AATCTCTGAG 660 

TGAGGAAACA CAGAAGGCTG TTCTTCAGTG GACCAAGCAT GATGATTCTT CAGATAACTT 720 

CTGTGAAGCT GATGACATTC AGTCCCCTGA AGCTGAATAT GTAGATTTGC TTCTTAATCC 780 

TGAGCGCTAC ACTGGTTACA AGGGACCAGA TGCTTGGAAA ATATGGAATG TCATCTACGA 840 

AGAAAACTGT TTTAAGCCAC AGACAATTAA AAGACCTTTA AATCCTTTGG CTTCTGGTCA 900 

AGGGACAAGT GAAGAGAACA CTTTTTACAG TTGGCTAGAA GGTCTCTGTG TAGAAAAAAG 960 

AGCATTCTAC AGACTTATAT CTGGCCTACA TGCAAGCATT AATGTGCATT TGAGTGCAAG 1020 

ATATCTTTTA CAAGAGACCT GGTTAGAAAA GAAATGGGGA CACAACATTA CAGAATTTCA 1080 

ACAGCGATTT GATGGAATTT TGACTGAAGG AGAAGGTCCA AGAAGGCTTA AGAACTTGTA 1140 

TTTTCTCTAC TTAATAGAAC TAAGGGCTTT ATCCAAAGTG TTACCATTCT TCGAGGGCCC 1200 

AGATTTTCAA CTCTTTACTG GAAATAAAAT TCAGGATGAG GAAAACAAAA TGTTACTTCT 1260 

GGAAATACTT CATGAAATCA AGTCATTTCC TTTGCATTTT GATGAGAATT CATTTTTTGC 1320 

TGGGGATAAA AAAGAAGCAC ACAAACTAAA GGAGGACTTT CGACTGCATT TTAGAAATAT 1380 

TTCAAGAATT ATGGATTGTG TTGGTTGTTT TAAATGTCGT CTGTGGGGAA AGCTTCAGAC 1440 

TCAGGGTTTG GGCACTGCTC TGAAGATCTT ATTTTCTGAG AAATTGATAG CAAATATGCC 1500 

AGAAAGTGGA CCTAGTTATG AATTCCATCT AACCAGACAA GAAATAGTAT CATTATTCAA 1560 

CGCATTTGGA AGAATTTCTA CAAGTGTGAA AGAATTAGAA AACTTCAGGA ACTTGTTACA 1620 

GAATATTCAT TAAAGAAAAC AAGCTGATAT GTGCCTGTTT CTGGACAATG GAGGCGAAAG 1680 

AGTGGAATTT CATTCAAAGG CATAATAGCA ATGACAGTCT TAAGCCAAAC ATTTTATATA 1740 

AAGTTGCTTT TGTAAAGGAG AATTATATTG TTTTAAGTAA ACACATTTTT AAAAATTGTG 1800 

TTAAGTCTAT GTATAATACT ACTGTGAGTA AAAGTAATAC TTTAATAATG TGGTACAAAT 1860 

TTTAAAGTTT AATATTGAAT AAAAGGAGGA TTATCAAATT CATATATGAT AAAAGTGAAT 1920 

GTTCTAAGTC TCTCAAACTA GCGTTTTATG TAATAATATG TAATATAAAT AAAACTATGG 1980 

TAAATGTGAC AAGCATTTAA TAGGAAAATG CTAAGGAGGC CTCATAAATG ACCCATAATT 2040 

ACCAACGTAG AATTTTTCAG TACATTTAGG GTTGCTGGAT TTAGCAAATA AAAATAAAGA 2100 

TTGCCCAGTT AGATTTGAAT TTCAGATAAA CAATTAGTTT TTTAATATTT TACATGGAAT 2160 

ATTTGGAAAA TACTTATACT AAAAAATTAT TTGTTTGAAA TTCACATTTA ACTGGGAGTC 2220 

TTGTATTTTA TCTGGCAATC CTAAAATACA TTGGTATGAA ACAAATCACT TTTAGAAGTA 2280 

TATTGCTATT TTGATTGGGT TGTTTTTGTG TGTAGAAACG TACAATAACA ACTCAAAGGC 2340 

ACAGGAGATT TCTAAACATT GTGAAAAGTT GAATAGATTA TATATTTATT CTCATAATAC 2400 

TTTCACTAAT ACTAAATAAA ATTTGGGGAA CACTTTTTAT TTTTATATAA TTTCCAATTT 2460 

ACAGAAAAGT TTCAAAAATA GTACAAAGAG CTCTCTTACC CAGATTCACT AATTGTTCAT 2520 

ACGTGCTTTA TCTTTCATGC TTTCTCTGTA CACACACACA CACACACAAA TTTTTCCTCA 2580 

ATCATTTGAA AGTCAGTTAT AGGCATCATG CCCCTTAAAC CCTAAATACT TCAGTGTGTA 2640 

ATACTGAATA ATTACTAAAA ATGATTTTCT CAGAAAAAAA AACTCCCACA ATTCTGGAAC 2700 

TATAATACTG TAAGCCTTAG AATAAATAAT ACTTTCAAGT TCCAATCTAA AGTTCTTTTT 2760 

GAGTTTTGTT GCCCGTTTTA TGCTTGATGT GTATAGTAAT AGGGTAGGCT ATTTATTTTA 2820 

TTAAAATTTT TTTTAGAGAC AAGGTTTTGC TGTGTTGCCC AAGCTGGAAC TTGAACGACT 2880 

GGGCTGAAGT GATCTTCCCA CCTCAGCCTC CCAAGTAGCT GGGAATACAG GTGTCTGCCA 2940 

CCATACCCAG TTTCATTTTT GTTTTTTATA CCCGAAGTTC ATTTCCTTTG TCTCCCTAAA 3000 

ACTGAACTGT AATTTTGGGA GGTTTTCATT AGTGGAAGCT CTTCATTTAT AAAGCTATTT 3060 

GAAGGGGTTT AGGAATTTAT ATCACATGGT AATTGTAGAG AAAAAGAAGC TATATACCTC 3120 

AAAATCGTGC CCTCTTTACA TATGTCTTAT CAGGTATAAC ATGTTGAAAT GTCACATTAG 3180 

TAGTAAAGTG GGGTTTATTT ATATAGTGGT TAAGAAATGT CAGTTTACAC TGCTGTATAC 3240 

TTCTTCTTCT GTGTCCCTAA GGCCTGGTAC AGTGCCAAGC ACATACTTGG TATCCAATAA 3300 

ATATTTGTTG GATGAAAAAA AAAAAAAAAA AAAA 3334 



Seq ID NO: C51 DNA Sequence

Nucleic Acid Accession #: NM_002888.1

Coding sequence: 37..723
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CCACGTCCGG 
CCCGCTCCCT 
CTGTTGCTCG 
CCTCAGGATG 
TTCTTCAACT 
GGCCGCGOGT 
CGCTACAACC 
CGAGTGTTTT 
ATCGAGAAAA 
AAAAACCCCT 
AGACTCATCT 
CAGGTGTCAC 
TGAAAATTAA 
GTGAACTTCT 
TCCTGC 



GGTGCCGAGC 
GGTCCGGGCC 
CCCCGGTGGC 
CTGGGGTCCC 
TCOGGTCCGG 
GGATTAATCC 
CAGAGTCTTT 
TCAAGAATCA 
AGAAAAGACA 
TGGAAATAGT 
GGGATTTGGC 
ACTACTACTT 
CTTGTGCCAC 
AATTTCATAT 



CAACTTTCCT 
CAGGGGCCCG 
GGCGCCCGCG 
GCGCAGGCTC 
CTCGCCCAGC 
AAAAGAGGGA 
ACTTCAGGAA 
GAAACCCAGA 
ACAAGAGGAT 
CAGCATACCT 
TTTCCTTGGA 
GGCACAGCTC 
AAGAGTTACA 
CAAGAGTTGT 



GCGTCCATGC 
CGCCCCACCG 
GGGTCCGGGG 
CTGCAGCAGA 
GCGCTGCGAG 
TGTAAAGTTC 
GGTGAGGGAC 
CCAACCATCA 
TACCTGCTTT 
GATAATCATG 
AGCTCTTACG 
ACTAGTGTGA 
ATCAAAGTGG 
AATCACATTT 



AGCCCCGCCG 
CCCCGCTGCT 
GCCCCGACGA 
AGGCGCGCGC 
TGCTGGCCGA 
ACGTGGTCTT 
GTTTGGGGAA 
ATGTAACTTG 
ACAAGCAAAT 
GACATATTGA 
TGATGTGGGA 
GGCAGTGGGT 
TCTCCTTAGA 
ATTTCAATAA 



GCAACGGCTG 
CGCGCTGCTG 
CCCTGGGCAG 
GGCGCTTCAC 
GGTGCAGGAG 
CAGCACAGAG 
ATGTTCTGCT 
TACACGGCTC 
GAAGCAACTG 



AATGACAACA 
AAGAAAAACC 
CTGAATTCAT 
ATATGTGAGT 
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Seq ID NO: C52 DNA Sequence 

Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94.. 378 



1 
I 

TTCCTTTCAT 
CAACAGCACC 
TTGGCTGTGA 
TGTCTTTGCA 
ATAATGTACC 
AAAGGACAAC 
GAAAGAAAGA 
AACCTAGAAC 
AGACTTTTCT 
GGGTGAAAGG 
CTGCCCAAAG 
GGTTACCATC 
GCATTTCTAG 
GAGAACATTT 
CTGTGGTTAC 
CATCTATGTG 
CCAAATATCA 
TTTATAACCA 
TGGGATACTG 
GATGTTTTTC 
TGTACTTTTT 
TACAAAATGT 
AATCACTTTT 
TTGTTCATGC 
GTCATTTTTT 



11 
I 

GTTCAGCATT 
AGCAGCAACA 
TATTGTGTGC 
TAGGCCCTGG 
CAAGTAACAA 
GATGCCTAAA 
ATTTTTAAAA 
AAGTTTAACT 
ATGGTTTTGT 
ACCAAAAACA 
GAGTCCAGCA 
GGAGTTTACA 
GCTAGAGAAC 
CTGTCTCTAG 
AGTGGAGACA 
TCGTAAAGCA 
TGTAGCACAT 
ATTCATTAAA 
GCAACAGTGC 
AACTTTTATT 
GTTTTGATCC 
TTTTGTCTAC 
ACTTTTTGTA 
CTATATACTG 
TCTCTAATAA 



21 
I 

TCTACTCCTT 
GCAAAAAACA 
TACAGTTGTT 
GGTAAAAGCA 
CTGTGACAAA 
TCCCAAATCG 
ATATCAAAAC 
GTGACTACTG 
GACTTTCAAC 
GAAATACAGT 
ATTAAATGGA 
AAGTGCTTTC 
CTTCTAGATT 
AAGTTATCTG 
TTGACATTAT 
TTCCTCAAAC 
CAATATGTAG 
TGTAATTCAT 
ACATATTTCA 
CATTGAGATG 
GTTTGTATAA 
CAAAGAAAAA 
ATTCTGTCTC 
TAAAATTTAG 
ACTACCACAA 



31 
I 

CCAAGAAGAG 
AACATGAGTG 
CAAGGCTTCC 
GTGAAAGTGG 
ATAGAAGTGA 
AAGCAAGCAA 
ATATGAAGTC 
AAATGACAAG 
TTTTGTACAG 
CTTCCTGAAT 
TTTCTAGGAA 
ACGTTCTTAC 
TGATGCTTAC 
TCTGTATTGA 
TACTGGAGTC 
ATTTTTTCAT 
GGAAACATTC 
AAAATGTACT 
TAACCAAATT 
TTTTGAAGCA 
ATGATAGCAA 
TGTTGAAAAA 
TTAGAAAAAT 
GTATACTCAA 
CCTTTCTTTT 



41 
I 

CAGCAAAGCT 
TGAAGGGCAT 
CCATGTTCAA 
CAGATATTGA 
TTATTACCCT 
GGCTTATAAT 
CTGGAAAAGG 
AATTCTACAG 
TTATGTGAAG 
GAATGACAAT 
AAGCTACCTT 
TTGTTGTATT 
AACTATTCTG 
TCTTTATGCT 
AAGCCCTTAT 
GCAAATACAC 
TTATGCATCA 
ATGAAAAAAA 
AGCAGCACCG 
ATTAGGATAT 
TATCTTGGAC 
TAAGCAAATG 
ACATAATCTA 
GACTAGTTTA 
TTAAAAAAAA 



51 
I 

GAAGTAGCAG 
GGCTATAGCC 
AAGAGGACGC 
GAAAGCCTCC 
GAAAGAAAAT 
CAAAAAAGTT 
GCATCTGAAA 
TAGGAAACTG 
GATGAAAGGT 
CAGAATTCCA 
AAGAAAGGCT 
ATACATTCAT 
TTGTGACTAT 
ATATTACTAT 
AAGTCAAAAG 
ACTTCTTTCC 

TTATACGCTA 
GTCTTAATTT 
GTGTGTTTAC 
ACATTTGAAA 
TATACCTAGC 
ATCAATTTCT 
AAGAATCAAA 
AAA 



Seq ID NO: C53 DNA Sequence
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..609



1 
I 

ATGCTGCGGC 
GTGAGCCGGC 
CTCAGCGCGC 
CACAGCCTGG 
AAAAGCCAGC 
TCCCCAACCG 
GTGCTGGAAA 
TTGAGAAATC 
AGGCAGGCTG 
AAATTAAAGG 
TTGCCCTAA 



11 
I 

AGGTGCTTCG 
ACCCGGTCTT 
TCAACOGCTT 
CCAAGATCGA 
TCTATTCGGA 
GGGACAATAT 
TGAAGGTGAA 
AGGATAAGAA 
CCGTCTCCAA 
TATGCTCCTT 



21 
I 

CAGAGGGCTC 
TTTCCTCACC 
CCAGCCCGAG 
GCGCAGCCTG 
CTTACACACC 
TTTGCTCCAG 
CCACAAGGGC 
ATGCGTGCTG 
TAAGACAACA 
CTGCATGCTT 



31 
I 

CAGTCGTTCT 
GTGCCCGCAG 
GGCGACCTGG 
GCCAGCAGCC 
CCTGGGAGGT 
GCTGAGGGGA 
TATAATTATA 
GATGATATTA 
GCCAGGGTGC 
CTGCCAATTA 



41 
I 

GCCACAGGCT 
TCCTGACAAT 
AGCGCCTGGT 
TTTTCCCCCT 
ATGGCAGGGT 
TCCTGCAGAC 
CTTTTTCCCA 
TTTCAGTGCT 
AAGTGAGGTA 
AAGAGGCAGC 



Seq ID NO: C54 DNA Sequence 

Nucleic Acid Accession #: NM_002438.1

Coding sequence: 104.. 4474 



1 
I 

GGGAACTTGG 
GCCCTCCTGT 
CCTGGTTTTT 
AATCTATAAT 
CGCAGCTTGC 
TATGAGTGTT 
CACTCTCTAT 
ACTTTTGGGG 
TATTATGCTC 
CAATCTGTGC 
AACCTGTGCA 
GCGGTCGGAT 



11 
I 

ATTAGGTGGA 
CCATCAGGAG 
GCCTCTGTCA 
GAAGATCACA 
AACCAGGATG 
GCATTTAAAT 
GCCTGTGACT 
ATCAAAGGAG 
TACAAGGGAT 
TCCAGAGGTT 
TTCCCGTTCA 
GGATGGCTCT 



21 
I 

GAGGCAGTTG 
AAGGAAAGGA 
TTCCGGGTGC 
AGCGCTGCGT 
CCGAATCACA 
TATGCCTGGG 
CAAAAAGTGA 
AAGATTTATT 
CGGGTTTATG 
ATGAAGCCAT 
AGTTTGAAAA 
GGTGCGGAAC 



31 
I 

GGGGGCCTCG 
TAAACCCTGG 
TGTTCTCCTA 
GGATGCAGTG 
GAAATTCCGA 
AGTGCCATCA 
ATTTCAGAAA 
TTTTAACTAC 
GAGCAGGTGG 
GTATACGCTA 
CAAGTGGTAC 
CACTACTGAC 



41 
I 

TTGTTTTGCG 
GCCATGAGGC 
CTGGACACCA 
AGTCCCAGTG 
TGGGTGTCCG 
AAAACAGACT 
TGGGAGTGCA 
GGCAACAGAC 
AAGATCTATG 
CTAGGCAATG 
GCAGATTGCA 
TATGACACAG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1493 



51 
I 

GGGTTTGTGC 
CACCTTCGGC 
CGCTCCCAGC 
GGACCAGTCC 
GATCCTCCTC 
CCACCGAGCC 
TCTGTGTGTG 
AGAGGATCTC 
TCCCAACACT 
ACTTCATTTC 



60 
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180 
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300 

360 
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600 

609 



51 
I 

TCTTAGTTCC 
TACCCCTGCT 
GGCAATTTTT 
CCGTCCAAAC 
AATCTCAGAT 
GGGTTGCTAT 
AAAATGACAC 
AAGAAAAGAA 
GAACCACAGA 
CCAATGGAGC 
CGAGTGCTGG 
ACAAGCTATT 
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TGGATATTGT CCATTGAAAT TTGAGGGCAG TGAAAGCTTA TGGAATAAAG ACCCGCTGAC 780 

CAGCGTTTCC TACCAGATAA ACTCCAAATC CGCTTTAACG TGGCACCAAG CGAGGAAAAG 840 

CTGCCAACAA CAGAACGCTG AGCTCCTGAG CATCACAGAG ATACATGAGC AAACATACCT 900 

GACAGGATTA ACCAGTTCCT TGACCTCAGG ACTCTGGATT GGACTTAACA GTCTGAGCTT 960 

CAACAGCGGT TGGCAGTGGA GTGACCGCAG TCCTTTCCGA TATTTGAACT GGTTACCAGG 1020 

AAGTCCATCA GCTGAACCTG GAAAAAGCTG TGTGTCACTA AATCCTGGAA AAAATGCTAA 1080 

ATGGGAAAAT CTGGAATGTG TTCAGAAACT GGGCTATATT TGCAAAAAGG GCAACACCAC 1140 

TTTAAATTCT TTTGTTATTC CCTCAGAAAG TGATGTGCCT ACTCACTGTC CTAGTCAGTG 1200 

GTGGCCGTAT GCCGGTCACT GTTACAAGAT TCACAGAGAT GAGAAAAAAA TCCAGAGGGA 1260 

TGCTCTGACC ACCTGCAGGA AGGAAGGCGG TGACCTCACA AGTATCCACA CCATCGAGGA 1320 

ATTGGACTTT ATTATCTCCC AGCTAGGATA TGAGCCAAAT GACGAATTGT GGATCGGCTT 1380 

AAATGACATT AAGATTCAAA TGTACTTTGA GTGGAGTGAT GGGACCCCTG TAACGTTTAC 1440 

CAAATGGCTT CGTGGAGAAC CAAGCCATGA AAACAACAGA CAGGAGGATT GTGTGGTGAT 1500 

GAAAGGCAAG GATGGGTACT GGGCAGATCG GGGCTGTGAG TGGCCTCTTG GCTACATCTG 1560 

CAAGATGAAA TCACGAAGCC AAGGTCCAGA AATAGTGGAA GTCGAAAAAG GCTGCAGGAA 1620 

AGGCTGGAAA AAACATCACT TTTACTGCTA TATGATTGGA CATACGCTTT CAACATTTGC 1680 

AGAAGCAAAC CAAACCTGTA ATAATGAGAA TGCTTATTTA ACAACTATTG AAGACAGATA 1740 

TGAACAAGCC TTCCTGACTA GTTTCGTTGG CTTAAGGCCT GAAAAATATT TCTGGACAGG 1800 

ACTTTCAGAT ATACAAACCA AAGGGACTTT TCAGTGGACC ATCGAGGAAG AGGTTOGGTT 1860 

CACCCACTGG AATTCAGATA TGCCAGGGCG AAAGCCAGGG TGTGTTGCCA TGAGAACCGG 1920 

GATTGCAGGG GGCTTATGGG ATGTTTTGAA ATGTGATGAA AAGGCAAAAT TTGTGTGCAA 1980 

GCACTGGGCA GAAGGAGTAA CCCACCCACC GAAGCCCACG ACGACTCCCG AACCCAAATG 2040 

TCCGGAGGAT TGGGGCGCCA GCAGTAGAAC AAGCTTGTGT TTCAAGCTGT ATGCAAAAGG 2100 

AAAACATGAG AAGAAAACGT GGTTTGAATC TCGAGATTTT TGTCGAGCTC TGGGTGGAGA 2160 

CTTAGCTAGC ATCAATAACA AAGAGGAACA GCAAACAATA TGGCGATTAA TAACAGCTAG 2220 

TGGAAGCTAC CACAAACTGT TTTGGTTGGG ATTGACATAT GGAAGCCCTT CAGAAGGTTT 2280 

TACTTGGAGT GATGGTTCTC CTGTTTCATA TGAAAACTGG GCTTATGGAG AACCTAATAA 2340 

TTATCAAAAT GTTGAATACT GTGGTGAGCT GAAAGGTGAC CCTACTATGT CTTGGAATGA 2400 

TATTAATTGT GAACACCTTA ACAACTGGAT TTGCCAGATA CAAAAAGGAC AAACACCAAA 2460 

ACCTGAGCCA ACACCAGCTC CTCAAGACAA TCCACCAGTT ACTGAAGATG GGTGGGTTAT 2520 

TTACAAAGAC TACCAGTATT ATTTCAGCAA AGAGAAGGAA ACCATGGACA ATGCGCGAGC 2580

GTTTTGCAAG AGGAATTTTG GTGATCTTGT TTCTATTCAA AGTGAAAGTG AAAAGAAGTT 2640

TCTATGGAAA TATGTAAACA GAAATGATGC ACAGTCTGCA TATTTTATTG GTTTATTGAT 2700 

CAGCTTGGAT AAAAAGTTTG CTTGGATGGA TGGAAGCAAA GTGGATTACG TGTCTTGGGC 2760 

CACAGGTGAA CCCAATTTTG CAAATGAAGA TGAAAACTGT GTGACCATGT ATTCAAATTC 2820 

AGGGTTTTGG AATGACATTA ACTGTGGCTA TCCAAACGCC TTCATTTGCC AGCGACATAA 2880 

CAGTAGTATC AATGCTACCA CAGTTATGCC TACCATGCCC TOGGTCCCAT CAGGGTGCAA 2940 

GGAAGGTTGG AATTTCTACA GCAACAAGTG TTTCAAAATC TTTGGATTTA TGGAAGAAGA 3000 

AAGAAAAAAT TGGCAAGAGG CACGAAAAGC TTGTATAGGC TTTGGAGGGA ATCTGGTCTC 3060 

CATACAAAAT GAAAAAGAGC AAGCATTTCT TACCTATCAC ATGAAGGACT CCACTTTCAG 3120 

TGCCTGGACT GGGCTGAATG ATGTCAATTC AGAACACACG TTCCTTTGGA CGGATGGACG 3180 

AGGAGTCCAT TACACAAACT GGGGGAAAGG TTACCCTGGT GGAAGAAGAA GCAGTCTTTC 3240 

TTATGAAGAT GCTGACTGTG TTGTTATTAT TGGAGGTGCA TCAAATGAAG CAGGAAAATG 3300 

GATGGATGAT ACCTGCGACA GTAAACGAGG CTACATATGC CAGACACGAT CGGACCCTTC 3360 

CTTGACTAAT CCTCCAGCAA CGATTCAAAC AGATGGCTTT GTTAAATATG GCAAAAGCAG 3420 

CTATTCACTC ATGAGACAAA AATTTCAATG GCATGAAGCG GAGACATACT GCAAGCTTCA 3480 

CAATTCCCTT ATAGCCAGCA TTCTGGATCC CTACAGTAAT GCATTTGCGT GGCTGCAGAT 3540 

GGAAACATCT AATGAACGTG TGTGGATCGC CCTGAACAGT AACTTGACTG ATAATCAATA 3600 

CACTTGGACT GATAAGTGGA GGGTGAGGTA CACTAACTGG GCTGCTGATG AGCCCAAATT 3660 

GAAATCAGCA TGTGTTTATC TGGATCTTGA TGGCTACTGG AAGACAGCAC ATTGCAATGA 3720 

AAGTTTTTAC TTTCTCTGTA AAAGATCAGA TGAAATCCCT GCTACTGAAC CCCCACAACT 3780 

GCCTGGCAGA TGCCCGGAGT CAGATCACAC AGCATGGATT CCTTTCCATG GTCACTGTTA 3840 

CTATATTGAG TCCTCATATA CAAGAAACTG GGGCCAAGCT TCTCTGGAAT GTCTTCGAAT 3900 

GGGTTCCTCT CTGGTTTCCA TTGAAAGTGC TGCAGAATCC AGTTTTCTGT CATATCGGGT 3960 

TGAGCCACTT AAAAGTAAAA CCAATTTTTG GATAGGATTG TTCAGAAATG TTGAAGGGAC 4020 

GTGGCTGTGG ATAAATAACA GTCCGGTCTC CTTTGTCAAC TGGAACACAG GAGATCCCTC 4080 

TGGTGAACGG AATGATTGTG TAGCTTTACA TGCGTCTTCT GGGTTTTGGA GTAATATTCA 4140 

CTGTTCTTCC TACAAAGGAT ATATTTGTAA AAGACCAAAA ATTATTGATG CTAAACCTAC 4200 

TCATGAATTA CTTACAACAA AAGCTGACAC AAGGAAGATG GACCCTTCTA AACCGTCTTC 4260 

CAACGTGGCC GGAGTAGTCA TCATTGTGAT CCTCCTGATT TTAACGGGTG CTGGCCTTGC 4320 

CGCCTATTTC TTTTATAAGA AAAGACGTGT GCACCTACCT CAAGAGGGCG CCTTTGAAAA 4380 

CACTCTGTAT TTTAACAGTC AGTCAAGCCC AGGAACTAGT GATATGAAAG ATCTOGTGGG 4440 

CAATATTGAA CAGAATGAAC ACTCGGTCAT CTAGTACCTC AATGCGATTC TGAGATATTT 4500 

GAATTTCATA AAATTGTAAC TGAAATTTAA AATTTTTAGT TCAATGTGAT TGTTTTCTTT 4560 

AAAATGAGTA CTGAATTGTA CTGGTCTGTC CTTTTTTCCT TTGCCTAATT GAAGAAATAA 4620 

TTGCTTGTTT TCTAGCCTGG CAAGATATTT TCATAAAAGA GGGATAACAA TGCTGATTAC 4680 

TACCTTTTAA AATATTTTAG ATAAATGCAC AGCACCACAG CACCACATCT AAGCATTAGT 4740 

GATGGGTAGC TGATGTCAGC TTCATGTGGA TTTTAAGCAC TCTAGAAACA ATGAAGCTTC 4800 

TTGGCATATT TTAAGGAGCT CCCAAAATGT GTTACCTATT AAATTGTAAC TCAGCAAGTA 4860 

GAAGACCATT TGAAAAGTCA GGTACAAATT TCCTCAAGTG GCATAAAAAT GTAGTCAGTT 4920 

TTCTCTTTTA CCAGTTTTTA TTTCCACTCC AATTATTTAG AACTTTATTT GTACATGTGC 4980 

AGAAGAATAA GGCAGCTGAG AATCTTGTTT CCCCCAAGAG AGTTTTACAG GCTGAGTGTT 5040

GCAAATGTGT TCTTTGTCCT GTTATATGTA TATCAGGAAT ACAAGGATGT GAAATAAAAC 5100 

TGTAAATTTG CATAACTGGA TGTACTTAGA TAATGTGAAA TAAACATTAA AGACAAGGTC 5160 

TATTTTTAAT AAAAAAAAAA AAAAA 5185 






Seq ID NO: C55 DNA Sequence

Nucleic Acid Accession #: NM_024574.2 

Coding sequence: 424.. 2130 



1 11 21 31 41 51 

I I I I I I

AGTGCGACTA GCGGCGGCCG GCGAAGCCCG CGCCAGCCCC GCGGGTCGCT GCGGCATCCT 
CCCGCCGCCC AGCCCCCGCA CGCTCCCTGC AGTTTAAAAG GACCTCCCGC CGCTTCTCGG 
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CGCTGCCCCG GGATTCCCCA GCCCCGCGCG GCTCCCTACT CCACTTCGCA GCAACTTCGG 
CGACCGCGCG CCGCCCGCCC TCGCCCGCCT TTGAAGTTTG CTGTGCCGAC CGCAAAGTTG 
GGACACTTCA GOGGATTGAA TTTTTCTCTT TTATCTGCCT CCGTCCCCGC CCTCCAGGCT 
TCTCGTTCCT GGATATTGGT GCTTAGCATC TTGGCAGGGT CCGGGGACGT GGACTATTTC 
GCACACCACA CCACGGGGAG GGATTTTTTT CTATTTTCCC TACGAAAAAC AGATCTTTTT 
AAGGATGGTG CTGCTCCACT GGTGCCTGCT GTGGCTCCTG TTTCCACTCA GCTCAAGGAC 
CCAGAAGTTA CCCACCCGGG ATGAGGAACT TTTTCAGATG CAGATCCGGG ACAAGGCATT 
TTTTCATGAT TCGTCAGTAA TTCCAGATGG AGCTGAAATT AGCAGTTATC TCTTTAGAGA 
TACACCTAAA AGGTATTTCT TTGTGGTTGA AGAAGACAAT ACTCCATTAT CAGTCACAGT 
GACGCCCTGT GATGCGCCTT TGGAGTGGAA GCTGAGCCTC CAGGAGCTGC CAGAGGACAG 
GAGCGGGGAA GGCTCAGGTG ATCTGGAACC TCTTGAGCAG CAGAAGCAGC AGATCATTAA 
TGAGGAAGGC ACTGAGTTAT TCTCCTACAA AGGCAATGAT GTTGAGTATT TTATATCGTC 
TAGTTCCCCA TCCGGTTTAT ATCAGTTGGA TCTTCTTTCA ACAGAGAAAG ACACACATTT 
CAAAGTATAT GCCACCACAA CTCCAGAATC TGATCAGCCA TACCCTGAGT TACCCTATGA 
CCCAAGAGTA GATGTGACCT CACTGGGGCG CACCACGGTC ACTTTGGCCT GGAAACCAAG 
CCCCACTGCC TCTTTGCTGA AACAACCCAT TCAGTACTGT GTGGTCATCA ACAAAGAGCA 
CAATTTCAAA AGTCTCTGTG CAGTGGAAGC AAAACTGAGT GCAGATGATG CTTTTATGAT 
GGCACCGAAA CCTGGTCTGG ACTTCAGCCC CTTTGACTTT GCCCACTTTG GATTTCCTTC 
TGATAATTCA GGTAAAGAAC GCAGTTTCCA GGCAAAGCCT TCTCCAAAAC TGGGGCGTCA 
TGTCTACTCC AGGCCCAAGG TTGATATTCA GAAAATCTGC ATAGGAAACA AGAACATCTT 
CACCGTCTCT GATCTGAAAC CCGACACGCA GTACTACTTT GACGTATTTG TGGTCAACAT 
CAACAGCAAC ATGAGCACCG CTTATGTAGG TACCTTTGCC AGGACCAAGG AAGAAGCCAA 
ACAGAAGACA GTCGAGCTCA AAGATGGGAA GATAACAGAT GTATTTGTTA AAAGGAAGGG 
AGCAAAGTTT CTACGGTTTG CTCCAGTCTC TTCTCACCAA AAAGTCACCT TCTTTATTCA 
CTCTTGTCTG GATGCTGTCC AAATCCAAGT GAGAAGAGAT GGGAAACTTC TTCTGTCTCA 
GAATGTGGAA GGCATTCAGC AGTTTCAGCT TAGAGGAAAA CCTAAAGCTA AATACCTCGT 
TCGACTGAAA GGAAACAAGA AAGGAGCATC TATGTTGAAA ATTCTAGCTA CCACAAGGCC 
TACTAAGCAG TCATTTCCCT CTCTTCCTGA AGACACAAGA ATCAAAGCCT TTGACAAGCT 
CCGTACCTGT TCCTCGGCCA CCGTGGCTTG GCTAGGCACT CAGGAAAGGA ACAAGTTTTG 
CATCTACAAA AAAGAAGTGG ATGATAACTA CAATGAAGAC CAGAAGAAAA GAGAGCAAAA 
CCAATGTCTA GGACCAGATA TAAGGAAGAA GTCAGAAAAG GTCCTCTGTA AATATTTCCA 
CAGTCAAAAC CTGCAGAAAG CAGTGACCAC AGAAACAATT AAAGGTCTTC AGCCTGGCAA 
ATCTTACCTG CTGGATGTTT ATGTCATAGG ACATGGGGGG CACTCTGTAA AGTATCAGAG 
TAAGGTTGTG AAAACTAGAA AGTTCTGTTA GTTACCTTCT TATAGAGATA TATTATGTAG 
AACTCCAGGA GGGACATTAA ATCACTTTAA GTATAAACTG ACTACTCCCA CAGTTGAGAG 
AAGTTGTGAC CTGTACTTGT ACTATGGAAG GAAGGATATC AACGTGTGTA TATTGATGTT 
TATATAAGTA ACTCTTGAAG GAGACTTGTT CTAGCGTGCC CCATGGTACC TAGTGTGTGT 
CTGATGCCGG TTGGTGTCAA AGATAGAGGG CTTCTTGAAG GAACTTGCCA TTCCTTGCTT 
TGACCACTGC ATGAACTGCT TCTAAATTAT TTTATTACCT AAAAATTTAA AATATGCCAT 
TCATTGCACA CACCCACAAA TGCAAATCAT TCCTCTCTAT AGATGCTAGG ATATATATAA 
ATTATTTTAT AAATTCTTGT TTTAAATGTC AGTGTTTCTA TGATTGTAAA CTATTAAATT 
CTTTTCCTAT TAAAGTACAG ATCTAATCTA AGTATTATTA AGTTGATAGC CCTCTAGTCA 
GTTATATTGC TATTGTAAAT TCTTGTTTGT TGAGTAAAAT GTTTAAATAC TATATGTATC 
TCATGTACAA AGTTGACATA CATTATATTC ATGTACATAA AATTAAAGAG ATTAGATTAT 
ATACTGTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq ID NO: C56 DNA Sequence 

Nucleic Acid Accession #: BC034229.1 

Coding sequence: 373..1422



ATCCGGCGGT 
CTTCCTCAGC 
AGTGTTTACA 
GATTTTGTAQ 
GTAATAATTA 
AAGGCAAATG 
CAAGAACTGC 
CTGCAAGAAT 
TACTGGAAAT 
AAAGGAAATG 
ATTCAAGAGC 
AAACCCAAGA 
TGCCTTGAAT 
GCACTAAAGC 
TTGGATGTTG 
AATTCTACAA 
AGCAGCTCTG 
AGTCTGTTGT 
TTAAGTCAAT 
GTAAAAGACT 
CTTAATATTG 
CACAGGAGAA 
ATGAAACCAC 
AGGAGACCAA 
GACCTGAACC 
AAGCAGAAAA 
GTGCTTTTCA 
ATGTTTTCGA 
ATTATAAAGA 
CTGATGTGTA 
TAAAGGAGAA 



11 
I 

GGTGACGCGA 
CTGCTCAATG 
AATCAGAATA 
TTCTTCCATG 
AAACAAAAGT 
CTAACTTTAA 
GAATGGAGAA 
TCCGATCCAC 
CTGGAAAAGT 
TTGTTAAGTT 
CAGTGAAACC 
TAAATGGGAA 
GTGGAGAAGA 
TCCACAGAAC 
CCCATCAGTT 
AGGAAACCAG 
AGGTAGAAAT 
GTGAAGGGTC 
GGAGAACCGG 
CATTGGAAGA 
AACTTAAAGA 
CTCCACAAGA 
TGGTGATGCA 
AGTACAACAC 
ATCTCTGAAG 
TATTGTGCCT 
TGATTGTCAG 
TAAGGGAAAG 
TAATTCAAAA 
TTCTTCTGAC 
AAATATAGGT 



21 
I 

GAGGCTGGGG 
TGAAGCCCCT 
ACTTTTAGAC 
GGGAAAACCT 
CTAAGATTTG 
AAACCAGATT 
AGTACAGTTA 
AAGAAACAAA 
GGGCAAATTG 
TTCTGCTGGA 
AACAGTTAAT 
AGTTTGTGGA 
TTATTGTTCA 
AACTCTTTTG 
TATAAAGGAT 
TAAAATTCAA 
TACAACGATG 
ATTCGATGAA 
AAATCATGAT 
ATGCGAAGTA 
AGACATTCTA 
GCAACTTTTT 
CAGTGTTCTC 
ACAGCTCTTT 
ATAGTCGAAC 
TACAAAGTTA 
AAGAATAGCT 
AGAGACTTCT 
GGAGAAACTT 
ATTGAAAAAA 
TTAGAAAGTA 



31 
I 

TCTCCAGGAC 
GATCATGATT 
AATATTAAGG 
GGAAATTCTG 
AAGAGATAAT 
TCGGAGAAGT 
GAGTTTGAGA 
GAAAAGGAAG 
GTCAATCAAT 
AAAGTGAAAT 
TATAAAATGG 
CAGTGTGAGA 
GGATGCTTTG 
CAGGCAAGAT 
GTTAATCCAG 
CATAAACCCA 
AAAAGAGCAC 
GAAGCTTCTG 
GACAACAAGA 
CAGACTAATC 
TCCTATATGG 
AAATGCTACC 
AAAATGAAAA 
TATTGCCAGT 
TGGATGATAC 
AATTAGCTGA 
TTCCATATGA 
TAAATCTTTG 
CAAACACAGA 
TTGAGGAAAG 
ATCAAAAGTC 



41 
I 

CAACTCCTCT 
CACTTCCACT 
TGGTAATCAT 
TAAAGCTAAA 
TTGCTTCAGG 
ACAAAAGAAA 
ACCAAGAGAT 
ATAGAGAGTC 
CATATATGAT 
TAAAATTGCT 
CAAATTCTTC 
ACAAAGCTGC 
CTAATGTTCA 
CTCAAATATT 
ATGAACCCAA 
AATCTGTACT 
AACGTACAAA 
CACAGTCCTT 
AACAGAATTT 
TGAAAATTTG 
AAAAATTATG 
AGATACGTTC 
CGATGAAGAT 
AGAAACATTA 
TTATGAAGAG 
TGCAGACAGT 
AAATGGCATC 
TCTGAGAAAC 
TTTTGACAAC 
CACCTCCTTT 
TGATGATTCC 
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51 
I 

TCATCTTCGT 
TAATAAATAA 
GAACAGAAAA 
ATATAGCAAT 
ATTTTGATGG 
TAGAAATGCT 
GGAGAAGAAA 
AAGCGAGTAT 
GTCACAAAAT 
GAAGGAACAG 
AGAATGTGAA 
TCTACTGGTA 
CCAGAAAGGG 
ATTCAATGTA 
AGAGGAGAAT 
TCTCCAGAGG 
ACCAAGAAAG 
TCAGGAAGTG 
ACATGCAGCA 
GAGAGAACCA 
GCTTAAAAAA 
CCACATCCAC 
AGTGATGGTG 
AACATAGAGA 
GAATTTGAAG 
CAACGAAGTT 
CATCAACATC 
AGCTCTACTT 
ATCGTGGATC 
GAAAGAAATT 
TGTGTATCAC 
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WO 03/042661 



TTGAAAGCAA GGACACTTTG CTAGGTAGAG ATTTAGAAAA AGCTCCCATT GAGGAGAAAT 1920 

TATCTCAAGA CATCAAAGAA TCCTTGGAAT TGAGCAATCT GTATAAGAGG CCAAGCTTTG 1980 

AAGAATCAAA AACTACAAAG TCATCACTGT TGTTACAAGA AATAGCCTGC AGAAGTAAGC 2040 

CTATAACAAA ACAATATCAA GGACTTGAGA GATTCTTTAT TTTTGATACA AATGAAAGAC 2100 

TCAACTTACT TCCTTCTCAT CGTTTAGAAT GCAACAATTC CAGTACTAGG ATTACACTTG 2160

CAGGTCAGAA ATCACAGAGA CCTTCAACAG CAAATTTTCC ACTTTCCAAC TCTGTTAAAG 2220 

AAAGCTCCAG TTGCCTTTCA TCCTCTCATC CTCGATCAAG AAGTGCAGCT GCTCAATCAT 2280 

CATCTAGAGC TGCTTCTGAA ATTTCAGAAA TTGAATATAT TGATATTACT GACCAGAATG 2340 

AGCTTTCCTT AGATGACACT ACTGATCAAC ATACTTTAGA CAATTTGGAA AAAGAATTAC 2400 

AAGTGCTGAG ATCTCTTGCA GATACTTCAG AAAAGCTTTA CAGCTTAACC TCAGAAGAGT 2460 

TCCCAGATTT CAGCAGCCAA TCACTGAATA TAAGTCAGAT TTCCACAGAT TTCCTTAAGA 2520 

CCTCACATGT GAGGGGTCCC TGTGGAGTTG AGGAATTGAG CTGTTCTGGA AGAGATACCA 2580 

AAATTCAGTC TTTGCTGTCA CTTTCTGAGA GCAGTACAGA TGAGGAGGAG GAAGATTTTC 2640 

TCAACAAGCA ACATGTCATC ACACTACCGT GGTCAAAGAG TACTTAAAGA TTATTTGTTC 2700

ATTACTGTTT CCATTTTGTA CCCAGAGTAA AGCAAACAAC TGAGAAAAGT AACCAAGTGA 2760 

TTACCTATCC AAGTGCTGGA GATTTTGATT ACTAATGTCT TTGATGTTTC AAGGCTACAA 2820 

ACTAATAAAA GTAAAATTAT AAGTTCAAAA AAATTTTTAA AAAAAAAAAT AAAAAA 2876 

Seq ID NO: C57 DNA Sequence 
Nucleic Acid Accession #: NM_024687.1
Coding sequence: 138.. 1706 

1 11 21 31 41 51

I I I I I I

AAAAACATGA TGACAACAAG AAACAGAATT TACATGCAGC AGTAAAAGAC TCATTGGAAG 60 

AATGCGAAGT ACAGACTAAT CTGAAAATTT GGAGAGAACC ACTTAATATT GAACTTAAAG 120 

AAGACATTCT ATCCTATATG GAAAAATTAT GGCTTAAAAA ACACAGGAGA ACTCCACAAG 180 

AGCAACTTTT TAAAATGCTA TCAGATACGT TCCCACATCC ACATGAAACC ACTGGTGATG 240 

CACAGTGTTC TCAAAATGAA AACGATGAAG ATAGTGATGG TGAGGAGACC AAAGTACAAC 300 

ACACAGCTCT TTTATTGCCA GTAGAAACAT TAAACATAGA GAGACCTGAA CCATCTCTGA 360 

AGATAGTCGA ACTGGATGAT ACTTATGAAG AGGAATTTGA AGAAGCAGAA AATATTGTGC 420 

CTTACAAAGT TAAATTAGCT GATGCAGACA GTCAACGAAG TTGTGCTTTT CATGATTGTC 480 

AGAAGAATAG CTTTCCATAT GAAAATGGCA TCCATCAACA TCATGTTTTC GATAAGGGAA 540

AGAGAGACTT CTTAAATCTT TGTCTGAGAA ACAGCTCTAC TTATTATAAA GATAATTCAA 600 

AAGGAGAAAC TTCAAACACA GATTTTGACA ACATCGTGGA TCCTGATGTG TATTCTTCTG 660 

ACATTGAAAA AATTGAGGAA AGCACCTCCT TTGAAAGAAA TTTAAAGGAG AAAAATATAG 720 

GTTTAGAAAG TAATCAAAAG TCTGATGATT CCTGTGTATC ACTTGAAAGC AAGGACACTT 780 

TGCTAGGTAG AGATTTAGAA AAAGCTCCCA TTGAGGAGAA ATTATCTCAA GACATCAAAG 840 

AATCCTTGGA ATTGAGCAAT CTGTATAAGA GGCCAAGCTT TGAAGAATCA AAAACTACAA 900 

AGTCATCACT GTTGTTACAA GAAATAGCCT GCAGAAGTAA GCCTATAACA AAACAATATC 960 

AAGGACTTGA GAGATTCTTT ATTTTTGATA CAAATGAAAG ACTCAACTTA CTTCCTTCTC 1020 

ATCGTTTAGA ATGCAACAAT TCCAGTACTA GGATTACACT TGCAGAAGAC AGAGAATGGA 1080 

TTCCAGACCA TAGCTTAAGT GAATATGCTG ATAATGCAAT TGTCTTGGGT GTTCTGCAGG 1140 

GTGCTCAGAG TCCATCATCA AGTAGAAAAC AGCAAAAGAT GGGTCAGAAA TCACAGAGAC 1200

CTTCAACAGC AAATTTTCCA CTTTCCAACT CTGTTAAAGA AAGCTCCAGT TGCCTTTCAT 1260 

CCTCTCATCC TCGATCAAGA AGTGCAGCTG CTCAATCATC ATCTAGAGCT GCTTCTGAAA 1320 

TTTCAGAAAT TGAATATATT GATATTACTG ACCAGAATGA GCTTTCCTTA GATGACACTA 1380 

CTGATCAACA TACTTTAGAC AATTTGGAAA AAGAATTACA AGTGCTGAGA TCTCTTGCAG 1440 

ATACTTCAGA AAAGCTTTAC AGCTTAACCT CAGAAGAGTT CCCAGATTTC AGCAGCCAAT 1500 

CACTGAATAT AAGTCAGATT TCCACAGATT TCCTTAAGAC CTCACATGTG AGGGGTCCCT 1560 

GTGGAGTTGA GGAATTGAGC TGTTCTGGAA GAGATACCAA AATTCAGTCT TTGCTGTCAC 1620 

TTTCTGAGAG CAGTACAGAT GAGGAGGAGG AAGATTTTCT CAACAAGCAA CATGTCATCA 1680 

CACTACCGTG GTCAAAGAGT ACTTAAAGAT TATTTGTTCA TTACTGTTTC CATTTTGTAC 1740 

CCAGAGTAAA GCAAACAACT GAGAAAAGTA ACCAAGTGAT TACCTATCCA AGTGCTGGAG 1800 

ATTTTGATTA CTAATGTCTT TGATGTTTCA AGGCTACAAA CTAATAAAAG TAAAATTATA 1860 

AGTTCAAAAA AAAAAAAAAA AAAA 1884 
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Seq ID NO: C58 DNA Sequence 

Nucleic Acid Accession #: NM_005408.1 

Coding sequence: 76..372

1 11 21 31 41 51

I I I I I I 

AAAAGGCCGG CGGAACAGCC AGAGGAGCAG AGAGGCAAAG AAACATTGTG AAATCTCCAA 60 

CTCTTAACCT TCAACATGAA AGTCTCTGCA GTGCTTCTGT GCCTGCTGCT CATGACAGCA 120 

GCTTTCAACC CCCAGGGACT TGCTCAGCCA GATGCACTCA ACGTCCCATC TACTTGCTGC 180 

TTCACATTTA GCAGTAAGAA GATCTCCTTG CAGAGGCTGA AGAGCTATGT GATCACCACC 240 

AGCAGGTGTC CCCAGAAGGC TGrCATCTTC AGAACCAAAC TGGGCAAGGA GATCTGTGCT 300 

GACCCAAAGG AGAAGTGGGT CCAGAATTAT ATGAAACACC TGGGCCGGAA AGCTCACACC 360 

CTGAAGACTT GAACTCTGCT ACCCCTACTG AAATCAAGCT GGAGTACGTG AAATGACTTT 420 

TCCATTCTCC TCTGGCCTCC TCTTCTATGC TTTGGAATAC TTCTACCATA ATTTTCAAAT 480

AGGATGCATT CGGTTTTGTG ATTCAAAATG TACTATGTGT TAAGTAATAT TGGCTATTAT 540 

TTGACTTGTT GCTGGTTTGG AGTTTATTTG AGTATTGCTG ATCTTTTCTA AAGCAAGGCC 600 

TTGAGCAAGT AGGTTGCTGT CTCTAAGCCC CCTTCCCTTC CACTATGAGC TGCTGGCAGT 660 

GGGTTTGTAT TCGGTTCCCA GGGGTTGAGA GCATGCCTGT GGGAGTCATG GACATGAAGG 720 

GATGCTGCAA TGTAGGAAGG AGAGCTCTTT GTGAATGTGA GGTGTTGCTA AATATGTTAT 780 

TGTGGAAAGA TGAATGCAAT AGTAGGACTG CTGACATTTT GCAGAAAATA CATTTTATTT 840 

AAAATCTCCA AAAAAAAAAA 860 

Seq ID NO: C59 DNA Sequence 

Nucleic Acid Accession #: AK097746.1 

Coding sequence: 185..2224
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CTTTCATGAC AGTAACAAAT CCAAGATTTT GGAAAAGCGC CTACGATATT TAAATGACCA 
CTTCACATAC AACTTATATT GTAATATATG CCGATCACTA TTTGAGAAGG ACAAGCTGTT 
ATTTTCCTTT TTATTATGTG CCAATCTTCT TCTGGCAAGG AAAGAGATTG AATACCAGGA 
ACTGATGTTT CTTTTAACTG GAGGAGTAAG TCTTAAAAGT GCTGAGAAAA ATCCTGATCC 
AACTTGGCTA CAGGACAAAA GCTGGGAGGA AATCTGTCGG GCAAGTGAAT TTCCTGCCTT 
CAGAGGACTC AGGCAACATT TTTGTGAACA TATATATGAA TGGCGAGAAA TCTATGACAG 
TAAAGAGCCA CATAATGCTA AATTTCCAGC ACCAATGGAT AAGAACCTAA ATGAACTACA 
GAAAATAATA ATTCTTCGGT GTTTAAGACC TGATAAGATA ACCCCAGCTA TAACAAACTA 
TGTAACTGAC AAACTAGGGA AAAAGTTTGT AGAGCCTCCA CCATTTGATT TGACAAAGAG 
TTACTTGGAT TCAAATTGCA CCATTCCCTT AATTTTTGTT CTATCTCCAG GAGCAGATCC 
TATGGCCAGC CTGCTGAAAT TTGCAAATGA TAAATCTATG TCTGGAAATA AGTTTCAAGC 
TATTTCACTG GGACAGGGAC AAGGACCGAT TGCAGCAAAA ATGATTAAAG CAGCAATTGA 
AGAAGGAACT TGGGTGTGCC TACAGAATTG CCATCTTGCA GTGTCCTGGA TGCCCATGTT 
GGAAAAAATA TGTGAAGATT TTACCTCTGA AACCTGTAAC TCATCCTTTA GGCTTTGGCT 
GACAAGCTAT CCATCTTCAA AATTCCCAGT AACAATTCTA CAGAATGGAG TAAAAATGAC 
TAATGAACCT CCCACGGGTC TTCGGCTGAA TCTCCTTCAA TCATATCTCA CTGATCCAGT 
TTCTGATCCT GAGTTTTTCA AGGGATGCCG TGGAAAGGAA CTGTTATTTA TCAATGAATA 
TGATACAATT CCATTTGAAG CTATATCTTA CCTGACTGGG GAGTGTAATT ATGGAGGAAG 
AGTGACAGAC GATTGGGACA GACGTCTTCT ATTAACCATG CTGGCTGACT TTTATAATCT 
GTACATAGTT GAAAACCCTC ATTATAAGTT TTCTCCCAGT GGAAACTATT TTGCACCTCC 
TAAAGGCACT TATGAGGACT ACATTGAATT CATTAAGAAA CTTCCATTTA CTCAACACCC 
TGAGATATTT GGATTACATG AAAACGTTGA CATCTCCAAG GATCTTCAAC AAACAAAAAC 
CCTCTTTGAG TCCTTGCTCC TCACCCAGGG AGGCTCCAAA CAGACAGGAG CCTCAGGAAG 
TACTGATCAG ATTCTGTTAG AAATTACCAA AGATATCCTC AACAAGCTCC CTAGTGATTT 
CGACATTGAA ATGGCACTAC GGAAGTATCC TGTGAGATAT GAAGAAAGCA TGAATACTGT 
GTTAGTACAA GAAATGGAAA GATTTAACAA TTTAATTATA ACTATACGTA ACACTCTACG 
GGACCTTGAA AAAGCTATTA AGGGTGTGGT TGTGATGGAT TCTGCATTGG AGGCACTCTC 
CAGTAGCTTA CTTGTTGGAA AGGTTCCAGA AATATGGGCC AAACGTTCAT ACCCAAGCCT 
TAAGCCCCTG GGAAGTTACA TCACAGATTT CCTAGCCCGG TTGAACTTTT TACAGGACTG 
GTATAATTCA GGAAAACCTT GTGTGTTTTG GCTGTCAGGT TTCTTTTTCA CTCAGGCCTT 
TTTAACTGGA GCTATGCAGA ATTATGCCAG AAAATATACC ACCCCTATTG ATTTGCTAGG 
ATATGAATTT GAGGTTATCC CATCTGATAC ATCTGACACA TCACCAGAAG ATGGTGTTTA 
TATCCACGGA CTGTATCTCG ATGGCGCACG CTGGGACCGA GAAAGTGGAT TGCTTGCTGA 
ACAATATCCC AAACTTCTGT TTGACCTGAT GCCCATCATA TGGATAAAAC CAACTCAAAA 
ATCTCGGATT ATAAAGTCGG ATGCCTATGT CTGTCCCCTC TACAAGACAA GTGAACGTAA 
AGGAACTCTT TCCACTACGG GACATTCTAC TAACTTTGTC ATTGCAATGT TGTTAAAAAC 
AGACCAACCT ACTCGGCACT GGATCAAGCG CGGGGTTGCT TTGCTTTGTC AGTTGGATGA 
CTAAATTGGA CAAATTTATA AAACATCCAA AAGTTT 

Seq ID NO: C60 DNA Sequence 
Nucleic Acid Accession #: J02761.1 
Coding sequence: 14.. 1159 
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1 11 21 31 41 

I I I I I I

GAATTCCGGT GCCATGGCTG AGTCACACCT GCTGCAGTGG CTGCTGCTGC TGCTGCCCAC 60 

GCTCTGTGGC CCAGGCACTG CTGCCTGGAC CACCTCATCC TTGGCCTGTG CCCAGGGCCC 120 

TGAGTTCTGG TGCCAAAGCC TGGAGCAAGC ATTGCAGTGC AGAGCCCTAG GGCATTGCCT 180 

ACAGGAAGTC TGGGGACATG TGGGAGCCGA TGACCTATGC CAAGAGTGTG AGGACATCGT 240 

CCACATCCTT AACAAGATGG CCAAGGAGGC CATTTTCCAG GACACGATGA GGAAGTTCCT 300 

GGAGCAGGAG TGCAACGTCC TCCCCTTGAA GCTGCTCATG CCCCAGTGCA ACCAAGTGCT 360 

TGACGACTAC TTCCCCCTGG TCATCGACTA CTTCCAGAAC CAGACTGACT CAAACGGCAT 420 

CTGTATGCAC CTGGGCCTGT GCAAATCCCG GCAGCCAGAG CCAGAGCAGG AGCCAGGGAT 480 

GTCAGACCCC CTGCCCAAAC CTCTGCGGGA CCCTCTGCCA GACCCTCTGC TGGACAAGCT 540 

CGTCCTCCCT GTGCTGCCCG GGGCCCTCCA GGCGAGGCCT GGGCCTCACA CACAGGATCT 600 

CTCCGAGCAG CAATTCCCCA TTCCTCTCCC CTATTGCTGG CTCTGCAGGG CTCTGATCAA 660 

GCGGATCCAA GCCATGATTC CCAAGGGTGC GCTAGCTGTG GCAGTGGCCC AGGTGTGCCG 720 

CGTGGTACCT CTGGTGGCGG GCGGCATCTG CCAGTGCCTG GCTGAGCGCT ACTCCGTCAT 780 

CCTGCTCGAC ACGCTGCTGG GCCGCATGCT GCCCCAGCTG GTCTGCCGCC TCGTCCTCCG 840 

GTGCTCCATG GATGACAGCG CTGGCCCAAG GTCGCCGACA GGAGAATGGC TGCCGCGAGA 900 

CTCTGAGTGC CACCTCTGCA TGTCCGTGAC CACCCAGGCC GGGAACAGCA GCGAGCAGGC 960 

CATACCACAG GCAATGCTCC AGGCCTGTGT TGGCTCCTGG CTGGACAGGG AAAAGTGCAA 1020 

GCAATTTGTG GAGCAGCACA CGCCCCAGCT GCTGACCCTG GTGCCCAGGG GCTGGGATGC 1080 

CCACACCACC TGCCAGGCCC TCGGGGTGTG TGGGACCATG TCCAGCCCTC TCCAGTGTAT 1140 

CCACAGCCCC GACCTTTGAT GAGAACTCAG CTGTCCAGCT GCAAAGGAAA AGCCAAGTGA 1200 

GACGGGCTCT GGGACCATGG TGACCAGGCT CTTCCCCTGC TCCCTGGCCC TCGCCAGCTG 1260 

CCAGGCTGAA AAGAAGCCTC AGCTCCCACA CCGCCCTCCT CACCTCCCTT CCTCGGCAGT 1320 

CACTTCCACT GGTGGACCAC GGGCCCCCAG CCCTGTGTCG GCCTTGTCTG TCTCAGCTCA 1380 

ACCACAGTCT GACACCAGAG CCCACTTCCA TCCTCTCTGG TGTGAGGCAC AGCGAGGGCA 1440 

GCATCTGGAG GAGCTCTGCA GCCTCCACAC CTACCACGAC CTCCCAGGGC TGGGCTCAGG 1500 

AAAAACCAGC CACTGCTTTA CAGGACAGGG GGTTGAAGCT GAGCCCCGCC TCACACCCAC 1560 

CCCCATGCAC TCAAAGATTG GATTTTACAG CTACTTGCAA TTCAAAATTC AGAAGAATAA 1620

AAAATGGGAA CATACAGAAC TCTAAAAGAT AGACATCAGA AATTGTTAAG TTAAGCTTTT 1680 

TCAAAAAATC AGCAATTCCC CAGCGTAGTC AAGGGTGGAC ACTGCACGCT CTGGCATGAT 1740 

GGGATGGCGA CCGGGCAAGC TTTCTTCCTC GAGATGCTCT GCTGCTTGAG AGCTATTGCT 1800 

TTGTTAAGAT ATAAAAAGGG GTTTCTTTTT GTCTTTCTGT AAGGTGGACT TCCAGATTTT 1860 

GATTGAAAGT CCTAGGGTGA TTCTATTTCT GCTGTGATTT ATCTGCTGAA AGCTCAGCTG 1920 

GGGTTGTGCA AGCTAGGGAC CCATTCCTGT GTAATACAAT GTCTGCACCA ATGCTAATAA 1980 

AGTCCTATTC TCTTTTAAAA AAAAAAAAAA AAAAAAAACG GAATTC 2026 



Seq ID NO: C61 DNA Sequence 
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1 

I 

GGGGTCTGGG 
CTGCAGGGCT 
GGTGACATAT 
TGCCTGCGTC 
AACGTGCGGA 
AGCTGCAGCA 
TTCCTGGCGG 
ACCAAGAA6A 
GTGGAGGGAG 
GAGGAGGATT 



11 
I 

GAGGTGACAT 
CGGCAGACGG 
GGGACCGGGA 
TCCGGTGCTG 
GGAAGCACAT 

GTCOGTGTGA 
CGCCGTCCAC 
GCACCGAGGG 
AGGGGAGTCC 



21 
I 

GTTGGGCTGT 
AAATGGAATC 
GAGCTGTGGG 
CTACCGCAAT 
GTGGGCGCTG 
CTGGTGGGCC 
CATGTCCAAG 
GGGCAGCGTG 
GGAAGGGACG 
CCGGGGGACT 



31 
I 

GGGATCCCAG 
CAGGGATTCT 
GGCCAGGCGG 
GGGGTCTGCT 
GTCTGGACGT 
AAGCGCCGGG 
TCCGTCTCGC 
CCAGTCGCCC 
GAGGAGGGTG 
GCTCAATACA 



41 
I 

CGCTGGGCCT 
TCTACCCATG 
CCATCGATAG 
ACCACCAGCG 
GCAGCGGCCT 
ACGTGCTGCA 
TGCTCTCCAA 
TGTCCAAAGA 
AGGAGACAGA 
GATACGGTGG 



51 
I 

GCTCCTGCTG 
GAGCTGTGAG 
CCCCAACCTC 
TCCAGACGAA 
CCTCCTCCTG 
TATGCCCGGT 
GCACCGAGGG 
GTCCAGGGAT 
GGGCGAGGAA 
ACG 
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Seq ID NO: C62 DNA Sequence 

Nucleic Acid Accession #: NM_054023.2 

Coding sequence: 98.. 379 



1 
I 

GGGGACACTT 
ATTTTTAAAC 
GCTGGTGACC 
CCTTCCTGTT 
ATTAAAGCTT 
GAAGTGTGTA 
GCTATCACAC 
TGCTCCTATC 
GTAGTGACCC 
AAAAAAAAAA 



11 
I 

TGTATGGCAA 
TCCTGAAAAA 
ATCAGCCTTT 
GACAAGTTGG 
CTTCTGAAAA 
AATGAGCTGG 
TTGGTGTGAC 
CTCCCTGCCT 
GTGAAAAGGA 



21 
I 

GTGGAACCAC 
TATCCCAGAT 
GTAGTTACTC 
CACCTTTACC 
CTCTGGGCAT 
GACCAGAGGC 
ATCAAGATAA 
GAAACCTGTT 
CAAATAAAGC 



31 
I 

TGGCTTGGTG 
AACTGTCATG 
TGCTACTGCC 
TCTGGACAAC 
TTCTGTTGAG 
TTCTGAAGCT 
AGAGCGGAGG 
CTACCAATTA 
AATGAATACT 



41 
I 

GATTTTGCTA 
AAGCTGGTAA 
TTCCTCATCA 
ATTCTTCCCT 
CACCTTGTGG 
GTGAAGAAAC 
TGGATGGGGA 
TAGATCAAAT 
AAAAAAAAAA 



51 
I 

GATTTTTCTG 
CTATCTTCCT 
ACAAAGTGCC 
TTATGGATCC 
AGGGGCTAAG 
TGCTGGAGGC 
TGGAAGATGA 
GCCCTAAAAT 
AAAAAAAAAA 



Seq ID NO: C63 DNA Sequence 

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..2874 



11 
I 



21 



ATGCCCCTGT CCTATGCCTA 
TGGATGTCGA GAGGAGCATA 
TGCACTGACA GAAACTGGCA 
GAGAGATGTT CAGATGTGTT 
TCAGGAGCGA AGCTGCAGAC 
CAGGCTGAAG CGCTCCTCAA 
AGCGAGCGAG CGAGGGATGC 
CCAACGTCCT CTACCATCCC 
AGTTACTCCT GCAGCCTTCC 
GATAGCCTTG ACATTCTGGA 
TACTTTCTGG GGGTTGTCAC 
ATATTTGATT TGCAGATTTA 
ACTGTCCAGG TAACAGATGT 
CATCTCCGTG CAGACCAGCC 
GCTACTGCAT TGGCCAGGCA 
TTCTGTGTTG TGGTGGGCAT 
TCTGCTAATG GCACCCTCTT 
TTCCATCTCA TCGTGGAGGT 
GTGAACATCG TGAACCTCAA 
ACAGTCCTGG AGGAACTGAG 
GATGATGAAG GTTTTCCCAG 
ATGATAAATC AGTTGACTGG 
GAATTGAGAC AAAATCCCAC 
GGTCAGGAGA ATCGCATCCA 
ACATGCCAAA AGTTCACCTT 
ACCTGGATGG ATACCGTATT 
GGGCGATTTA CAAAAGAAAG 
CTCACAATCA TGGCAGAAGG 
CAAAGAGATA GAGCTTGTGT 
CTTAGTCACT ATCACGAGAA 
TACCAGGTCC CTCCCACAAC 
ACAAGCTCTG TCACTGTTAC 
ACTCCAAACT CTTATTTCCT 
AATTTCAAGC TGACATGTAC 
GGCCCAGGTA ACGTCAACAA 
CGCCTGCTGC TTACATCTCG 
AAGCTACTTG TCTACGTAAC 
CTTGTTGAGA CAGGAACAGT 
ATCACCACGA CCCCCAGGCC 
CCATCTGCAT GGTACGTGCC 
CTCGTGTACC TGGTCGTCCT 
GGGAAGAACA AGGAACCTCT 
GTGGTGGAAA CTATCCAGAT 
GAGCAAGCTT CACTCGAGCT 
ACCCTAAGAA AGGTCCAGGT 



31 
I 



TAAAAACGCT GAGACCCTAG 
TCAGCGGAGG AACACACGGG 
TGCTGGCAGA ACACGTGGAA 
CGGAGTTTCT TTCTTCTGGT 
CTTCACGCCA GCCCAGGAAG 
GTGCCGCCAG AGTGGGCGTC 
CAGCATGCTG TCACCTCTCA 
TCCAAGAAGA TCCTACTCTC 
AGACATGAAA ATCTCCATGG 
GGATGGCGAG TCTGGGTCAC 
CACTGGGATG GAACAACTAG 
TGTGAAGGAT GAGGTTGGTG 
GAACGAGCCA CCTCAGTTTC 
ACATTTCAAT GCTCATAGTC 
CAGGCTTAGA TCTAGCATTG 
GCAGTATTTC CTGATTTCTC 
CTCCACAACA GAATTGGACT 
GAGGGACAGT GGAGGCCTCA 
CGACGAAGTC CCTCGCTTTA 
TCCAGGAACC ATCGTGGCCA 
CCACCTCCTC TACAGCATTA 
TACAATCCAA GTGGCCCAAA 
CATTTCCCTG GAAGTTCTAG 
GATAACCTTC ATTGTGGAAG 
CAGATCCAGT CTCCACCCTG 
AGACTGTTTT CATGCTGCTG
AGGTTTAATT GGACTTACAG 
CAAGGAGGAG CAAGTCACAT 
AGGGAAACTC CTCCTTATAA 
CAACTCAGGA AAGACTTGCC 
ATGTAGGAAT TCAAGAATCC 
TGTGAACATC CTTGAAGAAA 
GGCCCTCCCA GTGGATCTGA 
CGACCTTGAT TCCAGCCCCA 
TCATTTCACC TTCTCTCCCA 
CTTTGACTAT GCTGGTGGGT 
TGATGACAAC TTGATGTCTG 
GACACTGAGT ATTAAAGTCA 
CAGGGTCACC TATCAGGTCC 
GTTTGTCATC ACTTTGGGCT 
ATTGGCCAAA GCCATCCACA 
GACAAAGAAA GGAGAAACGA 
GAACACTATC TTTGATGGAG 
CTATGCCCTG CTGCCCAGCT 
GTGTGGGGAG AGTGAAGAGA 



41 
I 

CAGGCAGACA 
CAGCTGGACG 
TTTGGCTGGG 
GGGTTCGTGG 
GGGCTCCCAC 
CAGGCAGAGG 
GTGCTGCCAT 
CAACCGAAAT 
CAGAATCTGG 
CATTTCTTGT 
ATTTTGAAAC 
TCACAGACCT 
AAGGCAACTT 
ACACGTACGT 
GTTCCCCCTT 
CCCCAAAGAG 
TTGAAGCAGG 
AAGCCTCCAC 
CCAGCCCGAC 
ATATCACAGC 
CCACTGTTAG 
GGATAGACCG 
TGAAGGACAG 
ACGTCAACGA 
CTCTGTGCTC 
ATAAAGATAT 
TTCCACATGG 
CTTACATGGA 
AGCCATCAGA 
CCCATGATTC 
AGGCCACCAA 
ATGATGAAAA 
AAGTTGGCAC 
GATCTTTCCG 
ATGCTGGTTC 
TTGATAAGAT 
ACAGGAAGAA 
TTCCCCACCC 
TGAGGAAAAA 
CCATATTGCT 
GACACTGCCC 
AGACTGCAGA 
AAGCCATAGA 
GCTGCGACCC 
CCGGTCAGTG 
300

360 

420 

480 

540 

550 



51 
I 

CACAAGCAGC 
TCCAGAGGAA 
GCAGTTGGAG 
TCTCGCTGGC 
AGTGCAGCGG 
AGGCGCCGAG 
GCGAAACTAC 
TGCTCACAAG 
CCCCTCCTTG 
GACTCATTTG 
AGGACCAAAC 
GCAAGTCCTG 
GGCAGAAGAT 
GAGGGTAGTG 
CCTGGGCACC 
CTTCAGAATG 
ACACAGAAGT 
AGAGCTCCAG 
ACGAGTGTAC 
GGAGGATCCT 
CAAATATTTC 
AGATGCAGGT 
ACCATATGGG 
CAATCCTGCC 
CAAGACGCTG 
ACCTGTGACT 
CTGGGGAAGC 
TGGCAGCAGG 
TCTCATGAGA 
CATTTCCTCC 
CAACGAAGAC 
GCCAATTTGT 
AAATATTCAG 
TTATTCCATT 
CAATGTCACA 
CTGGGACTAC 
AGCGGAGGCT 
AACCACTATC 
CGTTTACTCT 
TCTGGGTCTC 
CTGCAAGACT 
GAGAGACGTC 
TCCAGAGCCT 
TAGTCCAGTA 
TTCCGGCCAC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
1500
1860
2040
2400
2520

2580 
ATCACACTTC CCGGCAAGAT TCCAGTCGAT GACCCAAGGA AACAGGAAAC AGGCCTGCAG
GGTGATTTCG AGGTCTGGAC TCTATGCCCC GCTGTGAAGG TGGTTGTAGG CAGCCCTCAA 
GCTGAACGGT GCATTCGATT GGCTCTCAGT CTGAAAAAGT ACAGTTCTGA TTAA 

Seq ID NO: C64 DNA Sequence 

Nucleic Acid Accession #: XM_168571.1

Coding sequence: 155..988



2760 
2820 
2874 



11 



41 



21 31 

I I I I I

TACACAGTCC TGGAGGAACT GAGTCCAGGA ACCATCGTGG CCAATATCAC 
CCTGATGATG AAGGTTTTCC CAGCCACCTC CTCTACAGCA TTACCACTGT 
TTCATGATAA ATCAGTTGAC TGGTACAATC CAAGTGGCCC AAAGGATAGA 
GGTGAATTGA GACAAAATCC CACCATTTCC CTGGAAGTTC TAGTGAAGGA 
GGGGGTCAGG AGAATCGCAT CCAGATAACC TTCATTGTGG AAGACGTCAA 
GCCACATGCC AAAAGTTCAC CTTCAGCATT ATGGTGCCGG AAAGAACAGC 
TTGCTTCTTG ACCTAAACAA GTTCTGCTTT GATGATGACA GTGAGGCACC 
TTCAACTTCA CCATGCCATC TGGAGTGGGG AGCGGCAGCA GATTTTTACA 
GGCTCTGGGA AGATTGTGCT GATTGGTGAT CTAGACTACG AAAATCCAAG 
GCCGGCAATA AATATACGGT GATAATCCAG GTGCAGGATG TGGCCCCCCC 
AATAACGTCT ACGTTTATAT CCTAACAAGC CCAGAAAATG AGTTTCCTCT 
AGGCCATCCT ATGTATTTGA TGTGTCAGAA AGAAGGCCCG CCCAGGGTCA 
CCTGAGGAAA AACGTTTACT CTCCATCTGC ATGGTACGTG CCGTTTGTCA 
CTCCATATTG CTTCTGGGTC TCCTCGTGTA CCTGGTCGTC CTATTGGCCA 
CAGACACTGC CCCTGCAAGA CTGGGAAGAA CAAGGAACCT CTGACAAAGA 
GAAGACTGCA GAGAGAGACG TCGTGGTGGA AACTATCCAG ATGAACACTA 
AGAAGCCATA GATCCAGTGA CCGGGGAAAC ATATGAATTC AACTCAAAAA 
AAAGTGGAAA GATCCACTAA CCCAAATGCC AAAATGGAAA GAGTCCAGCC 
TGCCCCACGC AGAGTCACTG CTGGGGAAGG GATGGGGTCA CTGAGAAGTG 
AGAAGATGAG CTGAGTGGCA AAGCGTGGGC TGAGGATGCT GGTCTGGGTT 
GGGTGGCAAG CTGGGCAACC CAAAGAACAG AAATCCAGCC TTCATGAACA 
CAAACCACAC CCAGGAAAGT AAACGGGGTC TAAGGAGGGG CCTGTCAATC 
TGCCTCACCC TAAATTCTAT GGGGATGGTG TGGGCATGGT GTAGGGGGGA 
TGAGGGGATT CAGACATCCA GGGTCAAACA TGGGATGTTT GACAAATTTT 
GAAAGGGGTT TGATCACATA GTTGCGTGTT CTGAAATGAT ACAGGAACAT 
ATTTCAGAAC TACCTGTGCT TCTGATAAGC AAGACTGTTA ACTTTGGGGT 
TGTGTTTCTT CTTTGCATTG ACTGCTAGGA AGCTCTATTC TGTTCACCAT 
TAGGAATTCC TGACATAAAT AGTGAAGACT ATCCTTACAT CTGGTTTCCA 
CTGCCCTCGT TTTAACATCA CCCAGATTTC TTCAGTTATA AATATGCCAT 
AAGTCACCTC AAATCTTCTT CAAAAGAAGC AGAACAGTGA AAAAAACAGA 
AAGAGTTGGT CATCTGGAAA GAAGAAAACT CAGTAGGCAC CTTCTTTTGT 
GGTGTCCGGA TCAGCATCCT GCATGTGAGA TTCATCCACG TTGTCCTGTC 
TCAGTTCTCT TCATGGTTAT GTCTGGTTTC ATTCTATGAT TATATCACAA 
CTACACTTGG GTGGCAGCTG CTTCAGATTT TTTACTTTTA AAAAATATAC 
ACTACAGGCA GGGCATGATG GCTCATGCCT GTAATGCCAG CACTTTGCCA 
ATCACCTAAG GTCAGGAGTT CAAGATCAGC CTGGCCTAGA TGGCAAAACC 
TAAAAAATAC AAAAATTAGC TTGGTGTGGT GGTGGGCACA TGTAATCCCA 
AGGCTGAGGT AGGGAGAACT GCTTAAACCT GAGAGGTGGA GGTTACAGTG 
GTGCCACTGC ACTCTAGCCT GGGTGACAAA GCAAGACTCC ATCTCAGAAA 
AAGTGAATTA CAACACT 

Seq ID NO: C65 DNA Sequence 

Nucleic Acid Accession #: NM_005266.3

Coding sequence: 122..1198



51 
I 

AGCGGAGGAT 
TAGCAAATAT 
CCGAGATGCA 
CAGACCATAT 
CGACAATCCT 
CAAGGGGACG 
AAACAACAGA 
GGATCCAGCT 
TAACCTAGCA 
TTACTATAAA 
CATTTTTGAT 
CCTATCAGGT 
TCACTTTGGG 
AAGCCATCCA 
AAGGAGAAAC 
TCTTTGATGG 
CTGGAGCCAG 
ACCAGGGAGC 
CCAACTGGGA 
CCAGAAATGA 
GGGCTTACCC 
ACTGAGATGC 
AAATGTGGGC 
TAAACAAATA 
TTTCTATCAG 
GTGGAATTGT 
AGAAAGTTTG 
CCTTATTTTC 
ACACCTTTGT 
TGAGTAAGTT 
TTTTTCTTGT 
TAGCAGTAGT 
TTTATCTATT 
TTAAAAGTGA 
AGGTGGGCAG 
CTGTCTCTAC 
GCTACTTGGG 
AGTTGAGATT 
AAAAAAATAA 



GGCACGAGGC 
ACACAAGGCA 
GATGGGCGAT 
GGTAGGCAAG 
TGCTGAGTCT 
CTGCCAGAAT 
GCAGATCATC 
GCGCATGCAG 
TGGCTCTTAC 
TGGAAGGATT 
CACCACCATG 
CACCCTGCAT 
GCCCACAGAG 
CCTTAGCCTG 
ACCGCGGCAG 
CTGCACACCA 
CAATCCCTTC 
AGTACGAGGT 
GCCTGAGGTG 
CAAGCGACGT 
CTCCTTTATG 
GTCCCTTCTG 
TCTCAATGAC 
AGGTGACAAC 
TTCAGATTAC 
GGAAGGGATA 
GCGTTCTCTA 



11 
I 

CATTTTCAAA 
GCAAGCACTG 
TGGAGCTTCC 
GTCTGGCTCA 
TCCTGGGGGG 
GTCTGCTACG 
TTCGTCTCCA 
GAGAAGCGCA 
GAGTACCCGG 
GCCCTCCAGG 
GAGGTGGGCT 
GTCTGCCGCA 
AAGAATGTCT 
GCTGAACTCT 
CACATGGCTA 
CCCCCCGACT 
AGCAATAATA 
CAGGAGCAGA 
CCCAATGGAG 
CTTAGTAAGG 
GGAGGATCAG 
AACTGATGCT 
GTTGCTCATT 
CCACCCAGAC 
TCATGAAACA 
GCCAGAGGGA 
AGTTCCTACC 



21 
I 

CAGTCCCTCC 
GGAGACGAAA 
TGGGAAATTT 
CTGTCCTCTT 
ATGAGCAGGC 
ACCAGGCTTT 
CGCCCTCTCT 
AGCTACGGGA 
TGGCAGAGAA 
GCACTCTGCT 
TCATTGTGGG 
GGAGTCCCTG 
TCATTGTCTT 
ACCACCTGGG 
AGTGCCAGCT 
TTAATCAGTG 
TGGCCTCCCA 
CTCCTGGGGA 
TCTCACCAGG 
CCAGCAGCAA 
GACCAGGTGG 
TTCTCACTGT 
AATTCTAGAA 
TGCAGTTCCC 
GGGTAGAGGG 
TAGAATGACT 
TCCTTGACCT 



31 

I 

TGGGAGAACA 
GTTTTGGCAT 
CCTGGAGGAA 
CATATTCCGT 
TGATTTCOGG 
CCCCATCTCC 
GGTGTACATG 
GGCCGAGAGG 
GGCAGAACTG 
CAACACCTAT 
CCAGTACTTC 
TCCCCACCOG 
TATGCTGGCT 
CTGGAAGAAG 
TTCTGGCCCC 
CCTGGAGAAT 
ACAAAACACA 
AGGTTTCATC 
TCACCGCCTT 
GGCAAGGTCA 
GAACAAAGGA 
CATCACTGCT 
ACTATAACCA 
TCCCCACCCT 
AAAGAAGGGA 
CTCTCTCTAC 
GATCACCCTC 



41 
I 

CAGACAGGCA 
CTGTTCCCTG 
GTACACAAGC 
ATGCTCGTGC 
TGTGATACGA 
CACATTCGCT 
GGCCACGCCA 
GCCAAAGAGG 
TCCTGCTGGG 
GTGTGCAGCA 
ATCTACGGAA 
GTCAACTGTT 
GTGGCTGCAC 
ATCAGACAGC 
TCTGTGGGCA 
GGCCCTGGGG 
GACAACCTGG 
CAGGTTCGTT 
CCCCATGGCT 
GATGACCTAT 
GGCTCAGAGA 
TGGCTCCTTT 
GGGCTCTGGG 
CTACCCAGTA 
AGCATGGCAA 
ATACCAGCAG 
CCTCCTCCAA 
720

780 
900

960 

1020 

1080 

1140 
2040
2220

2280 

2340 

2357 



51 
I 

GAGGATTACA 
GCTGTGCCAA 
ACTCGACCGT 
TGGGCACAGC 
TTCAGCCTGG 
ACTGGGTGCT 
TGCACACTGT 
TCCGGGGCTC 
AGGAAGGGAA 
TCCTGATCCG 
TCTTCCTGAC 
ACGTATCCCG 
TGTCCCTCCT 
GATTTGTCAA 
TAGTCCAGAG 
GAAAATTCTT 
TCACCGAGCA 
ATGGCCAGAA 
ATCATAGTGA 
CAGTGTGACC 
GGAAAGACGT 
GAGCCCCGGG 
ATAGTAAGAG 
TACGAAGCCT 
AAGCTGGCCT 
CATACCAAAT 
GGAAGAGCTC 
AAAGTTCCCA
CTGTTGTCTC 
CTTCTCCTCC 
CTTTACAAAT 
GCCCCACCTT 
GGTGTCAGCC 
TCTGTCACCC 
CTCCCTTGGC 
GCTGTTGTCC 
CCAGGCCTTG 



GCCAATAGAC 
CATGGACCAT 
CTCTACTCAG 
GCCTTGGTAT 
GGTGATGTTG 
CTGCTAGGGA 
TGCAGCTCCT 
TGTTCACCCA 
TCTCTCGAAC 
TTAAAAAAAA 



AGCATGAATC 
TCCTCGGAGT 
CCTTAAAAAG 
GTACCTCTGG 
CAACCTTTCC 
GTCACTGTAC 
TTACAGTTCA 
GCCATTCCCT 
CCTGACCAGA 
AAAAAAAAAA 



AAGGAACTTG 
AGTGGTGAGA 
GGCTTCTTGG 
CAAATGCCCT 
TTCTGCTAGG 
ACACAAACTC 
ATCCAATGAT 
GAAGGCCTTA 
TCATCAGCCA 



CATTATATGT 
TGGCCTTGGG 
AACTTTACCA 
GGTATGTACC 
GTGTACACCT 
TACTGGAATT 
AGAAACCATC 
CCAACAGGAA 
CTGAGGCCAG 



Seq ID NO: C66 DNA Sequence 

Nucleic Acid Accession #: NM_014459.2

Coding sequence: 738..3407



11 



21 



GCTCTTGAAT 
TTGCCCTTGG 
GCAGCCTCAG 
TCTGGCAAAT 
AGCCTGTGCA 
CCTGCCAACA 
CCTTCCCTTT 
TATCCAAGAA 
TGGAATTTCC 



51 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2190 



31 41 

I I I

GTAGATGCAG TCCGCCGCCG CCGCTGCCTC AGCCAGCAAT GCAAGATTAG ATCTCTAAAT 60 

GCAGCAAAAC ACTGCCTGAA AACAGACCGG CCCGCGCAGC AAGCAGACAT TTCACGGTGC 120 

GCTGGGGAAG CTTCAAAATA TATCTGTGAC TCTGTCTTCG TTGCTCTTCA TCCCCATCAA 180 

TTTCATCACG GGAGGCGAGC AGCAAGTAAG AATTTCACTT TCGGATCTGC CTAGAGACAC 240 

ACCTCCCTGC TCCCTCCCCC ACTCGATGTG AAGAGTATTC CGGAGTCTCC GGGCGGGAGT 300 

AGATTTGCAG CACCCTAGCG GGAGCGAGGA AAACCTACTG ATTCTTTAGC TCATTATCAT 360 

CTCTCCCAGA CGAGATTTCC TTCTTATCGC CTGCCTCATC GCTCAAGTTT GAGCCTCCCG 420 

AAGTCCGGGC GGGAGAGACG AAACCCCTGG CTCACCCCCA GCCGCAGGAA GCCACCGCCT 480 

TGCTCCAAGC CCCTGCAGCT CTGCTGCACC GCAGCTTCTC ACCCAGTGCG GATGCTGTAG 540 

ATCAACAGGT TCAGGGAACT TGAGCAGAAT AAGGAGAGAC CACCGGGTGC CGCAGCTCGG 600 

GTGCAGAGGG AAAAAAGGAC CCATAGACTT GTGGCTCGCG TCGCGCGCGC ACGCTGCGCC 660 

AGGGCCCCAG GCTGGCGCGC ACTCCCTCTC TGGCTCCTCC AGTCCGATTG CTCCTGCCCC 720 

CACCTTACAG GTCTGGGATG TACCTTTCCA TCTGTTGCTG CTTTCTTCTA TGGGCCCCTG 780 

CCCTCACTCT CAAGAACCTC AACTACTCCG TGCCGGAGGA GCAAGGGGCC GGCACGGTGA 840 

TCGGGAACAT CGGCAGGGAT GCTCGACTGC AGCCTGGGCT TCCGCCTGCA GAGCGCGGCG 900 

GCGGAGGGCG CAGCAAGTCG GGTAGCTACC GGGTGCTGGA GAACTCCGCA CCGCACCTGC 960 

TGGACGTGGA CGCAGACAGC GGGCTCCTCT ACACCAAGCA GCGCATCGAC CGCGAGTCCC 1020 

TGTGCCGCCA CAATGCCAAG TGCCAGCTGT CCCTCGAGGT GTTCGCCAAC GACAAGGAGA 1080 

TCTGCATGAT CAAGGTAGAG ATCCAGGACA TCAACGACAA CGCGCCCTCC TTCTCCTCGG 1140 

ACCAGATCGA AATGGACATC TCGGAGAACG CTGCTCCGGG CACCCGCTTC CCCCTCACCA 1200 

GCGCACATGA CCCCGACGCC GGCGAGAATG GGCTCCGCAC CTACCTGCTC ACGCGCGACG 1260 

ATCACGGCCT CTTTGGACTG GACGTTAAGT CCCGCGGCGA CGGCACCAAG TTCCCAGAAC 1320 

TGGTCATCCA GAAGGCTCTG GACCGCGAGC AACAGAATCA CCATACGCTC GTGCTGACTG 1380 

CCCTGGACGG TGGCGAGCCT CCACGTTCCG CCACCGTACA GATCAACGTG AAGGTGATTG 1440 

ACTCCAACGA CAACAGCCCG GTCTTCGAGG CGCCATCCTA CTTGGTGGAA CTGCCCGAGA 1500

ACGCTCCGCT GGGTACAGTG GTCATCGATC TGAACGCCAC CGACGCCGAT GAAGGTCCCA 1560 

ATGGTGAAGT GCTCTACTCT TTCAGCAGCT AOGTGCCTGA CCGCGTGCGG GAGCTCTTCT 1620 

CCATCGACCC CAAGACOGGC CTAATCCGTG TGAAGGGCAA TCTGGACTAT GAGGAAAACG 1680 

GGATGCTGGA GATTGACGTG CAGGCCOGAG ACCTGGGGCC TAACCCTATC CCAGCCCACT 1740

GCAAAGTCAC GGTCAAGCTC ATCGACCGCA ACGACAATGC GCCGTCCATC GGTTTCGTCT 1800 

CCGTGCGCCA GGGGGCGCTG AGCGAGGCCG CCCCTCCCGG CACGGTCATC GCCCTGGTGC 1860 

GGGTCACTGA CCGGGACTCT GGCAAGAACG GACAGCTGCA GTGTCGGGTC CTAGGCGGAG 1920 

GAGGGACGGG CGGCGGCGGG GGCCTGGGCG GGCCCGGGGG TTCCGTCCCC TTCAAGCTTG 1980 

AGGAGAACTA CGACAACTTC TACACGGTGG TGACTGACCG CCCGCTGGAC CGCGAGACAC 2040 

AAGACGAGTA CAACGTGACC ATCGTGGCGC GGGACGGGGG CTCTCCTCCC CTCAACTCCA 2100 

CCAAGTCGTT CGCGATCAAG ATTCTAGACG AGAACGACAA CCCGCCTCGG TTCACCAAAG 2160 

GGCTCTACGT GCTTCAGGTG CACGAGAACA ACATCCCGGG AGAGTACCTG GGCTCTGTGC 2220 

TCGCCCAGGA TCCCGACCTG GGCCAGAACG GCACCGTATC CTACTCTATC CTGCCCTCGC 2280 

ACATCGGCGA CGTGTCTATC TACACCTATG TGTCTGTGAA TCCCACGAAC GGGGCCATCT 2340 

ACGCCCTGCG CTCCTTTAAC TTCGAGCAGA CCAAGGCTTT TGAGTTCAAG GTGCTTGCTA 2400 

AGGACTCGGG GGCGCCCGCG CACTTGGAGA GCAACGCCAC GGTGAGGGTG ACAGTGCTAG 2460 

ACGTGAATGA CAACGCGCCA GTGATCGTGC TCCCCACGCT GCAGAACGAC ACCGCGGAGC 2520 

TGCAGGTGCC GCGCAACGCT GGCCTGGGCT ATCTGGTGAG CACTGTGCGC GCCCTAGACA 2580 

GCGACTTCGG CGAGAGCGGG CGTCTCACCT ACGAGATCGT GGACGGCAAC GACGACCACC 2640 

TGTTTGAGAT CGACCCGTCC AGCGGCGAGA TCCGCACGCT GCACCCTTTC TGGGAGGACG 2700 

TGACGCCCGT GGTGGAGCTG GTGGTGAAGG TGACCGACCA CGGCAAGCCT ACCCTGTCCG 2760 

CAGTGGCCAA GCTCATCATC CGCTCGGTGA GCGGATCCCT TCCCGAGGGG GTACCACGGG 2820

TGAATGGCGA GCAGCACCAC TGGGACATGT CGCTGCCGCT CATCGTGACT CTGAGCACTA 2880 

TCTCCATCAT CCTCCTAGCG GCCATGATCA CCATCGCCGT CAAGTGCAAG CGCGAGAACA 2940 

AGGAGATCCG CACTTACAAC TGCCGCATCG CCGAGTACAG CCACCCGCAG CTGGGTGGGG 3000 

GCAAGGGCAA GAAGAAGAAG ATCAACAAAA ATGATATCAT GCTGGTGCAG AGCGAAGTGG 3060 

AGGAGAGGAA CGCCATGAAC GTCATGAACG TGGTGAGCAG CCCCTCCCTG GCCACCTCCC 3120 

CCATGTACTT CGACTACCAG ACCCGCCTGC CCCTCAGCTC GCCCCGGTCG GAGGTGATGT 3180 

ATCTCAAACC GGCCTCCAAC AACCTGACTG TCCCTCAGGG GCACGCGGGC TGCCACACCA 3240 

GCTTCACCGG ACAAGGGACT AATGCAAGCG AGACCCCTGC CACTCGGATG TCCATAATTC 3300 

AGACAGACAA TTTTCCCGCA GAGCCCAATT ACATGGGCAG CAGGCAGCAG TTTGTTCAAA 3360 
GTATTTCAGT AGCTCCACGT TTAAGGACCC AGAAAGAGCC AGCCTGAGAG ACAGTGGGCA 3420 
CGGGGACAGT GATCAGGCPG ACAGTGACCA AGACACTAAC AAAGGCTCCT GCTGTGACAT 3480 
GTCTGTTAGG GAGGCACTCA AGATGAAAAC TACTTCAACT AAAAGCCAAC CACTTGAACA 3540 
AGAACCAGAA GAGTGTGTTA ATTGCACAGA TGAATGCCGA GTGCTTGGTC ATTCTGACAG 3600 
GTGCTGGATG CCACAGTTCC CTGCAGCCAA TCAGGCTGAA AATGCAGATT ACCGCACAAA 3660 
TCTCTTTGTA CCTACAGTTG AAGCTAATGT TGAGACTGAG ACTTACGAAA CTGTGAATCC 3720 
CACTGGGAAA AAGACTTTTT GTACATTTGG AAAAGACAAG CGAGAGCACA CTATTCTCAT 3780 
TGCCAACGTT AAACCTTATT TAAAAGCCAA ACGTGCCCTG AGCCCTCTCC TCCAAGAGGT 3840 
CCCCTCAGCA TCAAGCAGCC CAACCAAGGC GTGCATCGAG CCTTGCACCT CAACAAAAGG 3900 
CTCCCTGGAT GGCTGTGAAG CAAAACCAGG AGCCCTGGCT GAAGCAAGCA GTCAGTACTT 3960 
GCCCACTGAC AGTCAATATC TGTCACCTAG TAAGCAACCA AGAGACCCTC CCTTCATGGC 4020 




WO 03/042661 



TTCCGATCAG ATGGCAAGGG TCTTTGCAGA TGTGCATTCC AGAGCCAGCC GGGATTCCAG 4080

TGAGATGGGT GCTGTTCTTG AGCAGCTTGA CCACCCCAAC AGGGATCTGG GCAGAGAGTC 4140 

TGTGGATGCA GAGGAAGTTG TGAGAGAAAT TGATAAGCTT TTGCAAGACT GCCGGGGAAA 4200 

CGACCCTGTG GCTGTGAGAA AGTGAAAAAA GAAAAAAAAA AAGGCATTGG CATTTTCTTG 4260 

TCTCTTCTGT TGATTTAAAA ATGATCCCTC CTGGTGATAA CCCATTTTAC AGGGATGAAG 4320 

AAAGACCAAT GCTGCTTTAA GGCTTTTAGT GAACATCTGA AGTGCCCACA AGTATGTTCT 4380 

TTCCACTGCT GATTTCTTTT TCAGAGATAA CAATGGTTTC GTTTTGACCA AACTTGTATT 4440 

AGGACAGAAT TAATGATGCT TAAAGAGAAA AGAAAAAAAG AGAGAAGAAA AAGGAGAGAT 4500 

GAAAAAGGAG GATGAGGAGA AGAATTACCT TTTGACAATC TGTTAGGAAG GTATGCAGTG 4560

TGAGAACTGA AGTATTTCTG ATCACTCTCA GACTGTCCTC CGTGATTTAT GCTGACTTAA 4620 

CTGTTTACCT ATAAACCCCA TACAAAGCAG GGTCATAATT TGTGATCTGT GGTGGATTTC 4680 

TAGCAGTCAT CACAGGCTTC TACTGAAAGT CCTGAAAAGA CCTTGCAGTA GTCCAAGCTA 4740 

CACCAAACAT TAACACATAT TTGTGGTAAA CATTTCTGTA TAAAGTTACC TGACACACAT 4800 

ATAAACACAA GGAACATTCC ATATCATTAG TCGAAAACAA AAACAAAAAA AAAACCTTTG 4860 

GTCATTTGTA AGACATCTCA TGTCATATAA AAGTTAAATG TAAAAAGATA CAGTCCATTT 4920 

TGTCCTGCAC ACACGTAGAC TAATTCACGT CAAAAAAAAA AAAAAA 4966 

Seq ID NO: C67 DNA Sequence 

Nucleic Acid Accession #: NM_005601.2 

Coding sequence: 101..598

1 11 21 31 41 51

I I I I I I 

CCCAGGAGTC TGGGTGCACA GCCTCCTTCT CTCTGAGATT CAAGAGTCTG ATCAGCAGCC 60 

TCTTCCTCCT CCAGGACCCA GAAGCCCTGA GCTTATCCCC ATGGAGCTCT GCCGGTCCCT 120 

GGCCCTGCTG GGGGGCTCCC TGGGCCTGAT GTTCTGCCTG ATTGCTTTGA GCACCGATTT 180 

CTGGTTTGAG GCTGTGGGTC CCACCCACTC AGCTCACTCG GGCCTCTGGC CAACAGGGCA 240 

TGGGGACATC ATATCAGGCT ACATCCACGT GACGCAGACC TTCAGCATTA TGGCTGTTCT 300 

GTGGGCCCTG GTGTCCGTGA GCTTCCTGGT CCTGTCCTGC TTCCCCTCAC TGTTCCCCCC 360 

AGGCCACGGC CCGCTTGTCT CAACCACCGC AGCCTTTGCT GCAGCCATCT CCATGGTGGT 420 

GGCCATGGCG GTGTACACCA GCGAGCGGTG GGACCAGCCT CCACACCCCC AGATCCAGAC 480 

CTTCTTCTCC TGGTCCTTCT ACCTGGGCTG GGTCTCAGCT ATCCTCTTGC TCTGTACAGG 540 

TGCCCTGAGC CTGGGTGCTC ACTGTGGCGG TCCCCGTCCT GGCTATGAAA CCTTGTGAGC 600 

AGAAGGCAAG AGCGGCAAGA TGAGTTTTGA GCGTTGTATT CCAAAGGCCT CATCTGGAGC 660 

CTCGGGAAAG TCTGGTCCTA CATTTGCCCG CCCTTCCAGC CCTTCCCCAG CCCCTCCTCT 720 

TGTTTCTTCA TTCATTCAAC AAAATTTGGC TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA 780 

AAA 783 



Seq ID NO: C68 DNA Sequence 

Nucleic Acid Accession #: NM_006433.2 

Coding sequence: 129..566

1 11 21 31 41 51 

I I I I I I 

GTATCTGTGG TAAACCCAGT GACACGGGGG AGATGACATA CAAAAAGGGC AGGACCTGAG 60 

AAAGATTAAG CTGCAGGCTC CCTGCCCATA AAACAGGGTG TGAAAGGCAT CTCAGCGGCT 120 

GCCCCACCAT GGCTACCTGG GCCCTCCTGC TCCTTGCAGC CATGCTCCTG GGCAACCCAG 180 

GTCTGGTCTT CTCTCGTCTG AGCCCTGAGT ACTACGACCT GGCAAGAGCC CACCTGCGTG 240 

ATGAGGAGAA ATCCTGCCCG TGCCTGGCCC AGGAGGGCCC CCAGGGTGAC CTGTTGACCA 300 

AAACACAGGA GCTGGGCCGT GACTACAGGA CCTGTCTGAC GATAGTCCAA AAACTGAAGA 360 

AGATGGTGGA TAAGCCCACC CAGAGAAGTG TTTCCAATGC TGCGACCCGG GTGTGTAGGA 420 

CGGGGAGGTC ACGATGGCGC GACGTCTGCA GAAATTTCAT GAGGAGGTAT CAGTCTAGAG 480 

TTACCCAGGG CCTCGTGGCC GGAGAAACTG CCCAGCAGAT CTGTGAGGAC CTCAGGTTGT 540 

GTATACCTTC TACAGGTCCC CTCTGAGCCC TCTCACCTTG TCCTGTGGAA GAAGCACAGG 600 

CTCCTGTCCT CAGATCCCGG GAACCTCAGC AACCTCTGCC GGCTCCTCGC TTCCTCGATC 660 

CAGAATCCAC TCTCCAGTCT CCCTCCCCTG ACTCCCTCTG CTGTCCTCCC CTCTCACGAG 720 

AATAAAGTGT CAAGCAAG 738 



Seq ID NO: C69 DNA Sequence 

Nucleic Acid Accession #: NM_002985.2

Coding sequence: 69..344

1 11 21 31 41 51 

I I I I I I

GCTGCAGAGG ATTCCTGCAG AGGATCAAGA CAGCACGTGG ACCTCGCACA GCCTCTCCCA 60 

CAGGTACCAT GAAGGTCTCC GCGGCAGCCC TCGCTGTCAT CCTCATTGCT ACTGCCCTCT 120 

GCGCTCCTGC ATCTGCCTCC CCATATTCCT CGGACACCAC ACCCTGCTGC TTTGCCTACA 180 

TTGCCCGCCC ACTGCCCCGT GCCCACATCA AGGAGTATTT CTACACCAGT GGCAAGTGCT 240 

CCAACCCAGC AGTCGTCTTT GTCACCCGAA AGAACCGCCA AGTGTGTGCC AACCCAGAGA 300 

AGAAATGGGT TCGGGAGTAC ATCAACTCTT TGGAGATGAG CTAGGATGGA GAGTCCTTGA 360 

ACCTGAACTT ACACAAATTT GCCTGTTTCT GCTTGCTCTT GTCCTAGCTT GGGAGGCTTC 420 

CCCTCACTAT CCTACCCCAC CCGCTCCTTG AAGGGCCCAG ATTCTACCAC ACAGCAGCAG 480 

TTACAAAAAC CTTCCCCAGG CTGGACGTGG TGGCTCACGC CTGTAATCCC AGCACTTTGG 540 

GAGGCCAAGG TGGGTGGATC ACTTGAGGTC AGGAGTTCGA GACCAGCCTG GCCAACATGA 600 

TGAAACCCCA TCTCTACTAA AAATACAAAA AATTAGCCGG GCGTGGTAGC GGGCGCCTGT 660 

AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT GAACCCGGGA GGCGGAGCTT 720 

GCAGTGAGCC GAGATCGCGC CACTGCACTC CAGCCTGGGC GACAGAGCGA GACTCCGTCT 780 

CAAAAAAAAA AAAAAAAAAA AAAATACAAA AATTAGCCGG GCGTGGTGGC CCACGCCTGT 840 

AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA GGTGGAGGCT 900 

GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT GACAAAGTGA GACTCCGTCA 960 

CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA AGAGCTTCTG AGGOGCTGCT 1020 

TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG CCTTGGCTTT GCCAGGGCTC 1080 
TGTGACCAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGG AGGGGAGGAA CACTGCACTC 1140
TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA AAAATCGGCT 1200 
TACCATTAAA GTTCTCAATG CAACCATAAA AAAAAAA 1237 



Seq ID NO: C70 DNA Sequence 

Nucleic Acid Accession #: NM_022154.2

Coding sequence: 1381..1722

1 11 21 31 41 51 

I I I I I I

AGTGTGGTTT TAGTTTTTCC TAAGAAGTGG CGTGGTTTGG GGCTTTATAT CCGGGAGGAG 60 

CATATGTACG CAAATCCTGG GGCGTTTGCA AACCCGGATC CGGGGCGTCT GGCCCCATGC 120 

CCGGCCGGGC GTTTGAGGGC TACTGCCACG CAGCGTTTCT GGAGCCTGCC GGCTGGTGCC 180 

CTGGTGGCCT TTATCTCTGT CCCCCTTTGT CCTCTTTATC TCAGGCTCTC CAGGAGGCCG 240 

GGGGGCCCAC TCCGCCTATC GCTCCCCTCG GCTACGCTGC CACTCCAATG CCCCGCAGGT 300 

CGCGAGCTGC TGTTCTTTCG AAGGCGCCGG AGAACCAGGG GCGTCCCGCG CCACCTCTGA 360 

CTCGGAGCAG CGCCGAGCAC TGACGCTCCC GCCCTTGGGC AAGGACGCCA GTGCGCCCGC 420 

GCGCGTCCCT CTGCGCGGCA GCCCGTCGCG GGCCCTCAAG GGGAAGCCCA GGCCAGGATG 480 

GCCCCGGGTC GOGCGGTGGC CGGGCTCCTG TTGCTGGCGG CCGCCGGCCT CGGAGGAGTG 540 

GCGGAGGGGC CAGGGCTAGC CTTCAGCGAG GATGTGCTGA GCGTGTTCGG CGCGAATCTG 600 

AGCCTGTCGG CGGCGCAGCT CCAGCACTTG CTGGAGCAGA TGGGAGCCGC CTCCCGCGTG 660 

GGCGTCCCGG AGCCTGGCCA GCTGCACTTC AACCAGTGTT TAACTGCTGA AGAGATCTTT 720 

TCCCTTCATG GCTTTTCAAA TGCTACCCAA ATAACCAGCT CCAAATTCTC TGTCATCTGT 780 

CCAGCAGTCT TACAGCAATT GAACTTTCAC CCATGTGAGG ATCGGCCCAA GCACAAAACA 840 

AGACCAAGTC ATTCAGAAGT TTGGGGATAT GGATTCCTGT CAGTGACGAT TATTAATCTG 900 

GCATCTCTCC TCGGATTGAT TTTGACTCCA CTGATAAAGA AATCTTATTT CCCAAAGATT 960 

TTGACCTTTT TTGTGGGGCT GGCTATTGGG ACTCTTTTTT CAAATGCAAT TTTCCAACTT 1020 

ATTCCAGAGG CATTTGGATT TGATCCCAAA GTCGACAGTT ATGTTGAGAA GGCAGTTGCT 1080 

GTGTTTGGTG GATTTTACCT ACTTTTCTTT TTTGAAAGAA TGCTAAAGAT GTTATTAAAG 1140 

ACATATGGTC AGAATGGTCA TACCCACTTT GGAAATGATA ACTTTGGTCC TCAAGAAAAA 1200

ACTCATCAAC CTAAAGCATT ACCTGCCATC AATGGTGTGA CATGCTATGC AAATCCTGCT 1260 

GTCACAGAAG CTAATGGACA TATCCATTTT GATAATGTCA GTGTGGTATC TCTACAGGAT 1320 

GGAAAAAAAG AGCCAAGTTC ATGTACCTGT TTGAAGGGGC CCAAACTGTC AGAAATAGGG 1380

ACGATTGCCT GGATGATAAC GCTCTGCGAT GCCCTCCACA ATTTCATCGA TGGCCTGGCG 1440 

ATTGGGGCTT CCTGCACCTT GTCTCTCCTT CAGGGACTCA GTACTTCCAT AGCAATCCTA 1500 

TGTGAGGAGT TTCCCCACGA GTTAGGAGAC TTTGTGATCC TACTCAATGC AGGGATGAGC 1560 

ACTCGACAAG CCTTGCTATT CAACTTCCTT TCTGCATGTT CCTGCTATGT TGGGCTAGCT 1620 

TTTGGCATTT TGGTGGGCAA CAATTTCGCT CCAAATATTA TATTTGCACT TGCTGGAGGC 1680 

ATGTTCCTCT ATATTTCTCT GGCAGATATG TTTCCAGAGA TGAATGATAT GCTGAGAGAA 1740 

AAGGTAACTG GAAGAAAAAC CGATTTCACC TTCTTCATGA TTCAGAATGC TGGAATGTTA 1800 

ACTGGATTCA CAGCCATTCT ACTCATTACC TTGTATGCAG GAGAAATCGA ATTGGAGTAA 1860 

TAGAAAATGG AAGATGGTGT TGTTAATAAA GGCATTTAAT AGATAAAAAC ATCTCCAAAA 1920 

AGGATTTTGA AGCTGATCCT ATTTAGTTAA AAAGATAATT TTGCTTTCAA CTGTAGGTCC 1980 

AGAAAACTAA TTATTGGCAT CAGTCTGTGA AATAGTCCAT TATTTGTTGT TAAAAATGCT 2040

TCAAAAGGTT TTCAGTGTCA GTCTGAGATG CCTGGTATAT AGGAGCCTTT GGGAAATACT 2100

TATTTTTCAG TATTCCATGC ATATTAGATA TCACCATGAA GCAAGAGACA TGCATTCTAT 2160 

AATCATGTAG ACACTCAGAC TCAGGGGAAA ATACAAGTTA TATCCTGAAA GCCTTTAAAA 2220 

CTCTATGGTA GGATCAAAGA TTCAAATGGT TTCAGAGAGG TTTTATTTCA ATTAATTTGT 2280 

TCTAGTGCTT TCAAGAGCAA GTACATCAAA ATGTAGAAGG TAAAATGTAT GCAACACTAA 2340 

TATAAATTAT TCCAAGTCTT TAAGGAGCCA AAGAAAAAAA AGATTTCTCA CAGCTTTTTG 2400 

TTCTGTTTTG TATTTCAATT AGGAACTTGC AGTATTATTT TGAAAACCAT TCTAAAATAA 2460 

TAGGAGTTAG GAAATAAATA AAGTTTTGCT AGCCCTGCTA AGTTCAGGCT TAGAGGCTTA 2520 

TCGCTAAGTN TAAACTTCAC CAGATTCCAC GAAAAGCTGG ATAGCTTTTT TTCTGACTTA 2580 

TGTTGTGGTT GCACCCCTCA CAAATGGCAG AACAGTATGT AAAGCTGGTA ACACCTCGGT 2640 
TTCAGTGCAC CATGTGTTTG CTTTGTGAAG GTGAAGAATA TGTTGGTTTA GAGAAAGAAA 2700 
TTGGATGTAA TTTTATGCAA TTTACTTTTA AAGACAAACA TAACTATTTA GCAGAGAATA 2760 
TTTTAATAAA TGCAAAACAA CAGCTGGACT GCTGTACATC AAGGACAGAT TAACTGGAAA 2820 
ACATATGTTC CTTATGTGTG ATTGAGAGCC ATTCAGAAAA GACTTCCTTT GTGTTCAGCC 2880 
TATACTTTTC CATATGGTAT ACCTTGAAAA AAATTAGCAC ACCATGGTTA TTTTTCTACC 2940 
TTTTATAAAA GACAGAGCCT GTTTACTCAT TTAGAAGATA GAGAAAATTG GTCTAAAATT 3000 
GAACATCCTA GATTCACACT CCCAAGTCAC TTAAGGTGAT TTGATGGTGA GGAAAATGAT 3060 
TGACAAAGCC CAACAATGAT CTCAGGAATT ACATTTTCCA ACAGACCAAA AAATGTTTTC 3120 
ATGTAGCAGC AATGCAGATT TGGTGAATAT TTAATATATA TTTTAGTATG TATTTCACTT 3180 
TATGACTGAC AATTAAAAAA TATTGTTTGG CCAAATAGTA AACACCCTTT TGAAACCATG 3240 
AAAAAA 3246 



Seq ID NO: C71 DNA Sequence 

Nucleic Acid Accession #: NM_004184.2 

Coding sequence: 188..1603

1 11 21 31 41 51 

I I I I I I

CGAAAAAAGA GGGGAAGAGT ATTAAAGACC ATTTCTGGCT GGGCAGGGCA CTCTCAGCAG 60 

CTCAACTGCC CAGCGTGACC AGTGGCCACC TCTGCAGTGT CTTCCACAAC CTGGTCTTGA 120 

CTCGTCTGCT GAACAAATCC TCTGACCTCA GGCCGGCTGT GAACGTAGTT CCTGAGAGAT 180 

AGCAAACATG CCCAACAGTG AGCCCGCATC TCTGCTGGAG CTGTTCAACA GCATCGCCAC 240 

ACAAGGGGAG CTCGTAAGGT CCCTCAAAGC GGGAAATGCG TCAAAGGATG AAATTGATTC 300 

TGCAGTAAAG ATGTTGGTGT CATTAAAAAT GAGCTACAAA GCTGCCGCGG GGGAGGATTA 360 

CAAGGCTGAC TGTCCTCCAG GGAACCCAGC ACCTACCAGT AATCATGGCC CAGATGCCAC 420 

AGAAGCTGAA GAGGATTTTG TGGACCCATG GACAGTACAG ACAAGCAGTG CAAAAGGCAT 480 

AGACTACGAT AAGCTCATTG TTCGGTTTGG AAGTAGTAAA ATTGACAAAG AGCTAATAAA 540 

CCGAATAGAG AGAGCCACCG GCCAAAGACC ACACCACTTC CTGCGCAGAG GCATCTTCTT 600 
CTCACACAGA GATATGAATC AGGTTCTTGA TGCCTATGAA AATAAGAAGC CATTTTATCT 660

GTACACGGGC CGGGGCCCCT CTTCTGAAGC AATGCATGTA GGTCACCTCA TTCCATTTAT 720 

TTTCACAAAG TGGCTCCAGG ATGTATTTAA CGTGCCCTTG GTCATCCAGA TGACGGATGA 780 

CGAGAAGTAT CTGTGGAAGG ACCTGACCCT GGACCAGGCC TATGGCGATG CTGTTGAGAA 840 

TGCCAAGGAC ATCATCGCCT GTGGCTTTGA CATCAACAAG ACTTTCATAT TCTCTGACCT 900

GGACTACATG GGGATGAGCT CAGGTTTCTA CAAAAATGTG GTGAAGATTC AAAAGCATGT 960 

TACCTTCAAC CAAGTGAAAG GCATTTTCGG CTTCACTGAC AGCGACTGCA TTGGGAAGAT 1020 

CAGTTTTCCT GCCATCCAGG CTGCTCCCTC CTTCAGCAAC TCATTCCCAC AGATCTTCOG 1080 

AGACAGGACG GATATCCAGT GCCTTATCCC ATGTGCCATT GACCAGGATC CTTACTTTAG 1140 

AATGACAAGG GACGTCGCCC CCAGGATCGG CTATCCTAAA CCAGCCCTGT TGCACTCCAC 1200

CTTCTTCCCA GCCCTGCAGG GCGCCCAGAC CAAAATGAGT GCCAGCGACC CAAACTCCTC 1260 

CATCTTCCTC ACCGACACGG CCAAGCAGAT CAAAACCAAG GTCAATAAGC ATGCGTTTTC 1320 

TGGAGGGAGA GACACCATCG AGGAGCACAG GCAGTTTGGG GGCAACTGTG ATGTGGACGT 1380 

GTCTTTCATG TACCTGACCT TCTTCCTCGA GGACGACGAC AAGCTCGAGC AGATCAGGAA 1440 

GGATTACACC AGCGGAGCCA TGCTCACCGG TGAGCTCAAG AAGGCACTCA TAGAGGTTCT 1500

GCAGCCCTTG ATCGCAGAGC ACCAGGCCCG GCGCAAGGAG GTCACGGATG AGATAGTGAA 1560 

AGAGTTCATG ACTCCCCGGA AGCTGTCCTT CGACTTTCAG TAGCACTCGT TTTACATATG 1620 

CTTATAAAAG AAGTGATGTA TCAGTAATGT ATCAATAATC CCAGCCCAGT CAAAGCACCG 1680 

CCACCTGTAG GCTTCTGTCT CATGGTAATT ACTGGGCCTG GCCTCTGTAA GCCTGTGTAT 1740 

GTTATCAATA CTGTTTCTTC CTGTGAGTTC CATTATTTCT ATCTCTTATG GGCAAAGCAT 1800

TGTGGGTAAT TGGTGCTGGC TAACATTGCA TGGTCGGATA GAGAAGTCCA GCTGTGAGTC 1860 

TCTCCCCAAA GCAGCCCCAC AGTGGAGCCT TCGGCTGGAA GTCCATGGGC CACCCTGTTC 1920 

TTGTCCATGG AGGACTTCCG AGGGTTCCAA GTATACTCTT AAGACCCACT CTGTTTAAAA 1960

ATATATATTC TATGTATGCG TATATGGAAT TGAAATGTCA TTATTGTAAC CTAGAAAGTG 2040

CTTTGAAATA TTGATGTGGG GAGGTTTATT GAGCACAAGA TGTATTTCAG CCCATGCCCC 2100

CTCCCAAAAA GAAATTGATA AGTAAAAGCT TCGTTATACA TTTGACTAAG AAATCACCCA 2160 

GCTTTAAAGC TGCTTTTAAC AATGAAGATT GAACAGAGTT CAGCAATTTT GATTAAATTA 2220 

AGACTTGGGG GTGAAACTTT CCAGTTTACT GAACTCCAGA CCATGCATGT AGTCCACTCC 2280 

AGAAATCATG CTCGCTTCCC TTGGCACACC AGTGTTCTCC TGCCAAATGA CCCTAGACCC 2340 

TCTGTCCTGC AGAGTCAGGG TGGCTTTTCC CCTGACTGTG TCCGATGCCA AGGAGTCCTG 2400

GCCTCCGCAG ATGCTTCATT TTGACCCTTG GCTGCAGTGG AAGTCAGCAC AGAGCAGTGC 2460 

CCTGGCTGTG TCCTGGACGG GTGGACTTAG CTAGGGAGAA AGTCGAGGCA GCAGCCCTCG 2520 

AGGCCCTCAC AGATGTCTAG GCAGGCCTCA TTTCATCACG CAGCATGTGC AGGCCTGGAA 2580 

GAGCAAAGCC AAATCTCAGG GAAGTCCTTG GTTGATGTAT CTGGGTCTCC TCTGGAGCAC 2640 

TCTGCCCTCC TGTCACCCAG TAGAGTAAAT AAACTTCCTT GGCTCCTAAA AAA 2693

Seq ID NO: C72 DNA Sequence

Nucleic Acid Accession #: NM_004938.1

Coding sequence: 337..4632

1 11 21 31 41 51 

I I I I I I 

CGGAGGACAG CCGGACCGAG CCAACGCCGG GGACTTTGTT CCCTCCACGG AGGGGACTCG 60 

GCAACTCGCA GCGGCAGGGT CTGGGGCCGG CGCCTGGGAG GGATCTGGGC CCCCCACTCA 120 

CTCCCTAGCT GTGTTCCCGC CGCCGCCCCG GCTAGTCTCC GGCGCTGGCG CCTATGGTCG 180 

GCCTCCGACA GCGCTCCGGA GGGACCGGGG GAGCTCCCAG GCGCCCGGGA CTGGAGACTG 240 

ATGCATGAGG GGCCTACGGA GGCGCAGGAG CGGTGGTGAT GGTCTGGGAA GCGGAGCTGA 300 

AGTCCCCTGG GCTTTGGTGA GGCGTGACAG TTTATCATGA CCGTGTTCAG GCAGGAAAAC 360 

GTGGATGATT ACTACGACAC CGGCGAGGAA CTTGGCAGTG GACAGTTTGC GGTTGTGAAG 420 

AAATGCCGTG AGAAAAGTAC CGGCCTCCAG TATGCCGCCA AATTCATCAA GAAAAGGAGG 480 

ACTAAGTCCA GCCGGCGGGG TGTGAGCCGC GAGGACATCG AGCGGGAGGT CAGCATCCTG 540 

AAGGAGATCC AGCACCCCAA TGTCATCACC CTGCACGAGG TCTATGAGAA CAAGACGGAC 600 

GTCATCCTGA TCTTGGAACT CGTTGCAGGT GGCGAGCTGT TTGACTTCTT AGCTGAAAAG 660 

GAATCTTTAA CTGAAGAGGA AGCAACTGAA TTTCTCAAAC AAATTCTTAA TGGTGTTTAC 720 

TACCTGCACT CCCTTCAAAT OGCCCACTTT GATCTTAAGC CTGAGAACAT AATGCTTTTG 780 

GATAGAAATG TCCCCAAACC TCGGATCAAG ATCATTGACT TTGGGTTGGC CCATAAAATT 840 

GACTTTGGAA ATGAATTTAA AAACATATTT GGGACTCCAG AGTTTGTCGC TCCTGAGATA 900 

GTCAACTATG AACCTCTTGG TCTTGAGGCA GATATGTGGA GTATCGGGGT AATAACCTAT 960 

ATCCTCCTAA GTGGGGCCTC CCCATTTCTT GGAGACACTA AGCAAGAAAC GTTAGCAAAT 1020 

GTATCCGCTG TCAACTACGA ATTTGAGGAT GAATACTTCA GTAATACCAG TGCCCTAGCC 1080 

AAAGATTTCA TAAGAAGACT TCTGGTCAAG GATCGAAAGA AGAGAATGAC AATTCAAGAT 1140 

AGTTTGCAGC ATCCCTGGAT CAAGCCTAAA GATACACAAC AGGCACTTAG TAGAAAAGCA 1200 

TCAGCAGTAA ACATGGAGAA ATTCAAGAAG TTTGCAGCCC GGAAAAAATG GAAACAATCC 1260 

GTTCGCTTGA TATCACTGTG CCAAAGATTA TCCAGGTCAT TCCTGTCCAG AAGTAACATG 1320 

AGTGTTGCCA GAAGCGATGA TACTCTGGAT GAGGAAGACT CCTTTGTGAT GAAAGCCATC 1380 

ATCCATGCCA TCAACGATGA CAATGTCCCA GGCCTGCAGC ACCTTCTGGG CTCATTATCC 1440 

AACTATGATG TTAACCAACC CAACAAGCAC GGGACACCTC CATTACTCAT TGCTGCTGGC 1500 

TGTGGGAATA TTCAAATACT ACAGTTGCTC ATTAAAAGAG GCTCGAGAAT CGATGTCCAG 1560 

GATAAGGGCG GGTCCAATGC CGTCTACTGG GCTGCTCGGC ATGGCCACGT CGATACCTTG 1620 

AAATTTCTCA GTGAGAACAA ATGCCCTTTG GATGTGAAAG ACAAGTCTGG AGAGATGGCC 1680 

CTCCACGTGG CAGCTCGCTA TGGCCATGCT GACGTGGCTC AAGTTACTTG TGCAGCTTCG 1740 

GCTCAAATCC CAATATCCAG GACAAAGGAA GAAGAAACCC CCCTGCACTG TGCTGCTTGG 1800

CACGGCTATT ACTCTGTGGC CAAAGCCCTT TGTGAAGCCG GCTGTAACGT GAACATCAAG 1860 

AACCGAGAAG GAGAGACGCC CCTCCTGACA GCCTCTGCCA GGGGCTACCA CGACATCGTG 1920 

GAGTGTCTGG CCGAACATGG AGCCGACCTT AATGCTTGCG ACAAGGACGG ACACATTGCC 1980

CTTCATCTGG CTGTAAGACG GTGTCAGATG GAGGTAATCA AGACTCTCCT CAGCCAAGGG 2040 

TGTTTCGTCG ATTATCAAGA CAGGCACGGC AATACTCCCC TCCATGTGGC ATGTAAAGAT 2100 

GGCAACATGC CTATCGTGGT GGCCCTCTGT GAAGCAAACT GCAATTTGGA CATCTCCAAC 2160 

AAGTATGGGC GAACGCCTCT GCACCTTGCG GCCAACAACG GAATCCTAGA CGTGGTCCGG 2220 

TATCTCTGTC TGATGGGAGC CAGCGTTGAG GCGCTGACCA CGGACGGAAA GAOGGCAGAA 2280 

GATCTTGCTA GATCGGAACA GCACGAGCAC GTAGCAGGTC TCCTTGCAAG ACTTCGAAAG 2340 

GATACGCACC GAGGACTCTT CATCCAGCAG CTCCGACCCA CACAGAACCT GCAGCCAAGA 2400 

ATTAAGCTCA AGCTGTTTGG CCACTCGGGA TCCGGGAAAA CCACCCTTGT AGAATCTCTC 2460 

AAGTGTGGGC TGCTGAGGAG CTTTTTCAGA AGGCGTCGGC CCAGACTGTC TTCCACCAAC 2520 
TCCAGCAGGT TCCCACCTTC ACCCCTGGCT TCTAAGCCCA CAGTCTCAGT GAGCATCAAC 2580

AACCTGTACC CAGGCTGCGA GAACGTGAGT GTGAGGAGCC GCAGCATGAT GTTCGAGCCG 2640 

GGTCTTACCA AAGGGATGCT GGAGGTGTTT GTGGCCCCGA CCCACCACCC GCACTGCTCG 2700 

GCCGATGACC AGTCCACCAA GGCCATCGAC ATCCAGAACG CTTATTTGAA TGGAGTTGGC 2760 

GATTTCAGCG TGTGGGAGTT CTCTGGAAAT CCTGTGTATT TCTGCTGTTA TGACTATTTT 2820 

GCTGCAAATG ATCCCACGTC AATCCATGTT GTTGTCTTTA GTCTAGAAGA GCCCTATGAG 2880 

ATCCAGCTGA ACCCAGTGAT TTTCTGGCTC AGTTTCCTGA AGTCCCTTGT CCCAGTTGAA 2940 

GAACCCATAG CCTTCGGTGG CAAGCTGAAG AACCCACTCC AAGTTGTCCT GGTGGCCACC 3000 

CACGCTGACA TCATGAATGT TCCTCGACCG GCTGGAGGCG AGTTTGGATA TGACAAAGAC 3060 

ACATCGTTGC TGAAAGAGAT TAGGAACAGG TTTGGAAATG ATCTTCACAT TTCAAATAAG 3120 

CTGTTTGTTC TGGATGCTGG GGCTTCTGGG TCAAAGGACA TGAAGGTACT TCGAAATCAT 3180 

CTGCAAGAAA TACGAAGCCA GATTGTTTCG GTCTGTCCTC CCATGACTCA CCTGTGTGAG 3240 

AAAATCATCT CCACGCTGCC TTCCTGGAGG AAGCTCAATG GACCCAACCA GCTGATGTCG 3300 

CTGCAGCAGT TTGTGTACGA CGTGCAGGAC CAGCTGAACC CCCTGGCCAG CGAGGAGGAC 3360 

CTCAGGCGCA TTGCTCAGCA GCTCCACAGC ACAGGCGAGA TCAACATCAT GCAAAGTGAA 3420 

ACAGTTCAGG ACGTGCTGCT CCTGGACCCC CGCTGGCTCT GCACAAACGT CCTGGGGAAG 3480 

TTGCTGTCCG TGGAGACCCC ACGGGCGCTG CACCACTACC GGGGCCGCTA CACCGTGGAG 3540 

GACATCCAGC GCCTGGTGCC CGACAGCGAC GTGGAGGAGC TGCTGCAGAT CCTCGATGCC 3600 

ATGGACATCT GCGCCCGGGA CCTGAGCAGC GGGACCATGG TGGACGTCCC AGCCCTGATC 3660 

AAGACAGACA ACCTGCACCG CTCCTGGGCT GATGAGGAGG ACGAGGTGAT GGTGTATGGT 3720 

GGCGTGCGCA TCGTGCCCGT GGAACACCTC ACCCCCTTCC CATGTGGCAT CTTTCACAAG 3780 

GTCCAGGTGA ACCTGTGCCG GTGGATCCAC CAGCAAAGCA CAGAGGGCGA CGCGGACATC 3840 

CGCCTGTGGG TGAATGGCTG CAAGCTGGCC AACCGTGGGG CCGAGCTGCT GGTGCTGCTG 3900 

GTCAACCACG GCCAGGGCAT TGAGGTCCAG GTCCGTGGCC TGGAGACGGA GAAGATCAAG 3960 

TGCTGCCTGC TGCTGGACTC GGTGTGCAGC ACCATTGAGA ACGTCATGGC CACCACGCTG 4020 

CCAGGGCTCC TGACCGTGAA GCATTACCTG AGCCCCCAGC AGCTGCGGGA GCACCATGAG 4080 

CCCGTCATGA TCTACCAGCC ACGGGACTTC TTCCGGGCAC AGACTCTGAA GGAAACCTCA 4140 

CTGACCAACA CCATGGGGGG GTACAAGGAA AGCTTCAGCA GCATCATGTG CTTCGGGTGT 4200 

CACGACGTCT ACTCACAGGC CAGCCTCGGC ATGGACATCC ATGCATCAGA CCTGAACCTC 4260 

CTCACTCGGA GGAAACTGAG TCGCCTGCTG GACCCGCCCG ACCCCCTGGG GAAGGACTGG 4320 

TGCCTTCTCG CCATGAACTT AGGCCTCCCT GACCTCGTGG CAAAGTACAA CACCAATAAC 4380 

GGGGCTCCCA AGGATTTCCT CCCCAGCCCC CTCCACGCCC TGCTGCGGGA ATGGACCACC 4440 

TACCCTGAGA GCACAGTGGG CACCCTCATG TCCAAACTGA GGGAGCTGGG TCGCCGGGAT 4500 

GCCGCAGACC TTTTGCTGAA GGCATCCTCT GTGTTCAAAA TCAACCTGGA TGGCAATGGC 4560 

CAGGAGGCCT ATGCCTCGAG CTGCAACAGC GGCACCTCTT ACAATTCCAT TAGCTCTGTT 4620

GTATCCCGGT GAGGGCAGCC TCTGGCTTGG ACAGGGTCTG TTTGGACTGC AGAACCAAGG 4680 

GGGTGATGTA GCCCATCCTT CCCTTTGGAG ATGCTGAGGG TGTTTCTTCC TGCACCCACA 4740 

GCCAGGGGGA TGCCACTCCT CCCTCCGGCT TGACCTGTTT CTCTGCCGCT ACCTCCCTCC 4800 

CCGTCTCATT CCGTTGTCTG TGGATGGTCA TTGCAGTTTA AGAGCAGAAC AGATCTTTTA 4860 

CTTTGGCCGC TTGAAAAGCT AGTGTACCTC CTCTCAGTGT TTTGGACTCC ATCTCTCATC 4920 

CTCCAGTACC TTGCTTCTTA CTGATAATTT TGCTGGAATT CCTAACTTTT CAATGACATT 4980 
TTTTTTAACT ATCATATTGA TTGTCCTTTA AAAAAGAAAA GTGCATATTT ATCCAAAATG 5040 
TGTATTTCTT ATACGCTTTT CTGTGTTATA CCATTTCCTC AGCTTATCTC TTTTATATTT 5100 
GTAGGAGAAA CTCCCATGTA TGGAATCCCA CTGTATGATT TATAAACAGA CAATATGTGA 5160 
GTGCCTTTTG CAGAAGAGGG TGTGTTTGAA ATCATCGGAG TCAGCCAGGA GCTGTCACCA 5220 
AGGAAACGCT ACCTCTCTGT CCCTTGCTGT ATGCTGATCA TCGCCAGAGG TGCTTCACCC 5280
TGAGTTTTGT TTTGTATTGT TTTCTGACAG TTTTTCTGTT TTGTTTGGCA AGGAAAGGGG 5340 
AGAAGGGAAT CCTCCTCCAG GGTGATTTTA TGATCAGTGT TGTTGCTCTA GGAAGACATT 5400 
TTTCCGTTTG CTTTTGTTCC AATGTCAATG TGAACGTCCA CATGAAACCT ACACACTGTC 5460 
ATGCTTCATC ATTCCCTCTC ATCTCAGGTA GAAGGTTGAC ACAGTTGTAG GGTTACAGAG 5520
ACCTATGTAA GAATTCAGAA GACCCCTGAC TCATCATTTG TGGCAGTCCC TTATAATTGG 5580
TGCATAGCAG ATGGTTTCCA CATTTAGATC CTGGTTTCAT AACTTCCTGT ACTTGAAGTC 5640 
TAAAAGCAGA AAATAAAGGA AGCAAGTTTT CTTCCATGAT TTTAAATTGT GATCGAGTTT 5700 
TAAATTGATA GGAGGGAACA TGTCCTAATT CTTCTGTCCT GAGAAGCATG TAATGTTAAT 5760 
GTTATATCAT ATGTATATAT ATATATGCAC TATGTATATA CATATATATT AATACTGGTA 5820 
TTTTTACTTA ATCTATAAAA TGTCGTTAAA AAGTTGTTTG TTTTTTTCTT TTTTTATAAA 5880 
TAAACTGTTG CTCGTTAAAA AAAAAAAAAA 5910 



Seq ID NO: C73 DNA Sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence: 222..1898

1 11 21 31 41 51

| | | | | |

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120 

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 

GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300

GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 360 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420 

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 480 

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540 

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 

GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 

TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200 
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CCTCCAGAGA 
TCTGAGATGA 
CTGCGCCCTT 
GGAGTCTGAG 
GGGGCCACTG 
GGAGGCAGCG 
GCCTTGCTGG 
CCTGCTCCCA 
CAGGGCTCAG 
CCCCGCACTG 
GCACGGGGAC 
TGTCCTTGTT 
CACCTTGGAC 
CTGGACGGGC 
TGTGGTGTTG 
TCCTGAACCG 
CGCACAGTGG 
CTGACTTTAG 
CCCTGCCAGT 
CAGCACTCCC 
TCTCTGGAAG 
CCTTCCTCCA 
TCCTTGTATG 



ACGCTCACGG CCAAGGTCAT 
GGGCCTGAGG AGAAGCGGCG 
ACGCTGGAGA AGCTGGTCTC 
ATCAGCCTCC CAGGGACACT 
CGCTGCTGGA ACGGGATGGC 
GCCAACCAGA TCAACAACCC 
CGGCAGCAGA TCATGCAGCT 
AACGACGTGG ACTTCCAGGA 
TGTCTGGATG ACCTCTGCGG 
TTGACCCATG CCCTCCCAGG 
TGCCCCCAGC CCCCGACCTT 
AGGCCCCGGT GGCGGTAACT 
CCAAAAATAC AACACAGACG 
CAGGGAGGGC CGGCGGCTCT 
CTCGCCTGCC TTTCTGCCTT 
GTGCCCAAAA GCCATGTATT 
CCCAGCTCCC TGCACCGCCG 
CAACCCGCTG GAGCCCACAG 
CCCAGCAGAG CCCACCAGCC 
GGGCTGTCTG GGTGTCCGCC 
CCTCCCCTCA GCGCAGGCTG 
AGCGTCTGCA GGGTGACGCC 
CCACAGACCC TGCAGTGAGG 
GCTTCTGCTG GAGGAGGGGA 
CAATGTGGGC TGCCCCTCGC 
TGTTGGAGGA CCCCGAGGGC 
GATCAGGAAC CAGGGCCTCC 
GGCTGTCACC TGCTCACAGG 
TGCCTAGGTC CCTTCCCGAC 
AAGGGCTTTT CCAAACATGC 
TGTTCGCTCC TCCCAGATGG 
TCCTGTCACT CACTGAGGCC 
TCCTGTGCCC CAGCTGCCAG 
GGAAGGGGTC CTGCAGGGGG AGGAGGACTT 
ACTGACCCTG AGGAGGCCGC TTAGTGCTGC 
ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC 
ATGTTTTGGG ATCAGGAGCC CCCAACACAG 
GCCAGGGTGG GCTGGGGACT CTGGCACAGT 
GCTGCACACA GACGGCCTAG GGGTGGCGCT 
GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC 
CAAGGTCCCC CCACCGCTCA GTGTCAGCGG 
AATAAAAGGC TGGAAACCTA AA 



CAACAGGGAC
CCAGGGCCCT 
ACCTTCAGGC 
GGACTTCTGG 
CAGTGATGAC 
TGACGGCCTG 
CATGACCATC 
CTACAACGGC 
CGGTGATGGC 
CCGGACGCCC 
GGCTGCCAGC 
TACAGTAGCC 
ACTGACTTTG 
TGGGGTGGGA 
CAGGCCTGGC 
GGGAGCCAGT 
TAGCCCTCCC 
GGCCTCAAAG 
CACTGGGACT 
AGCCCCGCAC 
TGCATGATGC 
GAGGGGCCCC 
GACTGTCCTC 
ACCCACCTGC 
TGGGCTCTGC 
GGTCCAGGGC 
TCCTCACCCA 
AGTGACCCTC 
CACACGGGAA 
CTGGATAGTT 
CATGGAGAGC 
CCTGGTGACC 



CCAGGGCTGC 
CCGGGGCAAG 
TGAAGCCAAG 
GTGCAGTGAG 
CAGAGGCCGG 
CGAGGTGGAG 
GAAGATCATG 
CGCCAGTGAC 
CCGGAAGGTC 
CCTGTCAGAG 
CCTCCTGCCC 
GCCCCAAGGC 
ATATTTAATT 
GAGCAGGGGC 
TTAATTTTGT 
TCAGGGACCT 
CAGAAGCAGC 
CGAGCCTGTG 
AGCCCTGGCC 
ATCCAGGGTC 
CAGAGCCCGG 
TGAGACAGCA 
GGCCCTCCAT 
AGCTGGGCCC 
ACACAGGGCT 
TGAGGAGCAG 
CTGTTCACGG 
GATGCTGGTG 
CCAGCCAGCT 
ATCCATTTAC 
CTTCGGAGGC 
ATCAGGGCCC 
GTGGCCCTGG 
GGAGGGTCTG 
TTTGCTTTTC 
CATGGCTTGT 
GCAAGTCCAC 
GATGCCGGGC 
CAGACCCCAC 
AGTGGCCAAG 
GTGACGTGTG 



GGGAACCCCA 
CTGGCCCCGC 
GCCCAGCTCC 
AAGATGGCCC 
TACCTCCCCG 
GTGGACATCA 
ACCAACCGGC 
GACGGCAGCG 
AGCAGGAAGA 
CAGGAAGGAC 
CTCCTCCTCT 
CCCAGGGACA 
CACCTCAGCC 
AGGCGCAGAG 
ATGAGGTCCT 
CAGGGGCACC 
CCCTCGAGGC 
CCTTCCTCCC 
CACCCCCCAG 
TGGCAGAGCC 
CCCCACCTCC 
CCACTGCTGA 
GCGCAGATGA 
AAAGGCCCAG 
CACAGGGCAG 
CCAGGACCCG 
TGACACAGGT 
GCTGGTGAGA 
GCACTGCAGG 
TGACACTTCC 
CCGCAGGGCC 
TGCCCCAGGC 
GGAGGGGTGG 
GGGGCAGCTG 
ATCACCGTCC 
TCTCTGGAAC 
CCCATAATAA 
GCCAGGACAG 
CCTACGCTCA 
CCTGCTGTGT 
TTCTTTTGAG 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3692 



Seq ID NO: C74 DNA Sequence 

Nucleic Acid Accession #: BC030205.1 

Coding sequence: 45..878

1 11 21 31 41 51 

I I I I I I 

GTGAGCAGCC CCTAACAGGC TGTTACTTCA CTACAACTGA CGATATGATC ATCTTAATTT 60 

ACTTATTTCT CTTGCTATGG GAAGACACTC AAGGATGGGG ATTCAAGGAT GGAATTTTTC 120 

ATAACTCCAT ATGGCTTGAA CGAGCAGCCG GTGTGTACCA CAGAGAAGCA CGGTCTGGCA 180 

AATACAAGCT CACCTACGCA GAAGCTAAGG CGGTGTGTGA ATTTGAAGGC GGCCATCTCG 240 

CAACTTACAA GCAGCTAGAG GCAGCCAGAA AAATTGGATT TCATGTCTGT GCTGCTGGAT 300

GGATGGCTAA GGGCAGAGTT GGATACCCCA TTGTGAAGCC AGGGCCCAAC TGTGGATTTG 360 

GAAAAACTGG CATTATTGAT TATGGAATCC GTCTCAATAG GAGTGAAAGA TGGGATGCCT 420 

ATTGCTACAA CCCACACGCA AAGGAGTGTG GTGGCGTCTT TACAGATCCA AAGCAAATTT 480 

TTAAATCTCC AGGCTTCCCA AATGAGTACG AAGATAACCA AATCTGCTAC TGGCACATTA 540 

GACTCAAGTA TGGTCAGCGT ATTCACCTGA GTTTTTTAGA TTTTGACCTT GAAGATGACC 600 

CAGGTTGCTT GGCTGATTAT GTTGAAATAT ATGACAGTTA CGATGATGTC CATGGCTTTG 660 

TGGGAAGATA CTGTGGAGAT GAGCTTCCAG ATGACATCAT CAGTACAGGA AATGTCATGA 720 

CCTTGAAGTT TCTAAGTGAT GCTTCAGTGA CAGCTGGAGG TTTCCAAATC AAATATGTTG 780 

CAATGGATCC TGTATCCAAA TCCAGTCAAG GAAAAAATAC AAGTACTACT TCTACTGGAA 840 

ATAAAAACTT TTTAGCTGGA AGATTTAGCC ACTTATAAAA AAAAAAAAAG GATGATCAAA 900 

ACACACAGTG TTTATGTTGG AATCTTTTGG AACTCCTTTG ATCTCACTGT TATTATTAAC 960 

ATTTATTTAT TATTTTTCTA AATGTGAAAG CAATACATAA TTTAGGGAAA ATTGGAAAAT 1020

ATAGGAAACT TTAAACGAGA AAATGAAACC TCTCATAATC CCACTGCATA GAAATAACAA 1080 

GCGTTAACAT TTTCATATTT TTTTCTTTCA GTCATTTTTC TATTTGTGGT ATATGTATAT 1140

ATGTACCTAT ATGTATTTGC ATTTGAAATT TTGGAATCCT GCTCTATGTA CAGTTTTGTA 1200 

TTATACTTTT TAAATCTTGA ACTTTATAAA CATTTTCTGA AATCATTGAT TATTCTACAA 1260

AAACATGATT TTAAACAGCT GTAAAATATT CTATGATATG AATGTTTTAT GCATTATTTA 1320 

AGCCTGTCTC TATTGTTGGA ATTTCAGGTC ATTTTCATAA ATATTGTTGC AATAAATATC 1380 

CTTGAACACA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1430 



Seq ID NO: C75 DNA Sequence 

Nucleic Acid Accession #: NM_001982.1 

Coding sequence: 199..4227

1 11 21 31 41 51

| | | | | |

CTCTCACACA CACACACCCC TCCCCTGCCA TCCCTCCCCG GACTCCGGCT CCGGCTCCGA 60 
TTGCAATTTG CAACCTCCGC TGCCGTCGCC GCAGCAGCCA CCAATTCGCC AGCGGTTCAG 120 
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GTGGCTCTTG CCTCGATGTC CTAGCCTAGG GGCCCCCGGG CCGGACTTGG CTGGGCTCCC 180 

TTCACCCTCT GCGGAGTCAT GAGGGCGAAC GACGCTCTGC AGGTGCTGGG CTTGCTTTTC 240 

AGCCTGGCCC GGGGCTCCGA GGTGGGCAAC TCTCAGGCAG TGTGTCCTGG GACTCTGAAT 300 

GGCCTGAGTG TGACCGGCGA TGCTGAGAAC CAATACCAGA CACTGTACAA GCTCTACGAG 360 

AGGTGTGAGG TGGTGATGGG GAACCTTGAG ATTGTGCTCA CGGGACACAA TGCCGACCTC 420 

TCCTTCCTGC AGTGGATTCG AGAAGTGACA GGCTATGTCC TCGTGGCCAT GAATGAATTC 480 

TCTACTCTAC CATTGCCCAA CCTCCGCGTG GTGCGAGGGA CCCAGGTCTA CGATGGGAAG 540 

TTTGCCATCT TCGTCATGTT GAACTATAAC ACCAACTCCA GCCACGCTCT GCGCCAGCTC 600 

CGCTTGACTC AGCTCACCGA GATTCTGTCA GGGGGTGTTT ATATTGAGAA GAACGATAAG 660 

CTTTGTCACA TGGACACAAT TGACTGGAGG GACATCGTGA GGGACCGAGA TGCTGAGATA 720

GTGGTGAAGG ACAATGGCAG AAGCTGTCCC CCCTGTCATG AGGTTTGCAA GGGGCGATGC 780 

TGGGGTCCTG GATCAGAAGA CTGCCAGACA TTGACCAAGA CCATCTGTGC TCCTCAGTGT 840 

AATGGTCACT GCTTTGGGCC CAACCCCAAC CAGTGCTGCC ATGATGAGTG TGCCGGGGGC 900 

TGCTCAGGCC CTCAGGACAC AGACTGCTTT GCCTGCCGGC ACTTCAATGA CAGTGGAGCC 960 

TGTGTACCTC GCTGTCCACA GCCTCTTGTC TACAACAAGC TAACTTTCCA GCTGGAACCC 1020 

AATCCCCACA CCAAGTATCA GTATGGAGGA GTTTGTGTAG CCAGCTGTCC CCATAACTTT 1080 

GTGGTGGATC AAACATCCTG TGTCAGGGCC TGTCCTCCTG ACAAGATGGA AGTAGATAAA 1140 

AATGGGCTCA AGATGTGTGA GCCTTGTGGG GGACTATGTC CCAAAGCCTG TGAGGGAACA 1200 

GGCTCTGGGA GCCGCTTCCA GACTGTGGAC TCGAGCAACA TTGATGGATT TGTGAACTGC 1260 

ACCAAGATCC TGGGCAACCT GGACTTTCTG ATCACCGGCC TCAATGGAGA CCCCTGGCAC 1320 

AAGATCCCTG CCCTGGACCC AGAGAAGCTC AATGTCTTCC GGACAGTACG GGAGATCACA 1380 

GGTTACCTGA ACATCCAGTC CTGGCCGCCC CACATGCACA ACTTCAGTGT TTTTTCCAAT 1440 

TTGACAACCA TTGGAGGCAG AAGCCTCTAC AACCGGGGCT TCTCATTGTT GATCATGAAG 1500 

AACTTGAATG TCACATCTCT GGGCTTCCGA TCCCTGAAGG AAATTAGTGC TGGGCGTATC 1560 

TATATAAGTG CCAATAGGCA GCTCTGCTAC CACCACTCTT TGAACTGGAC CAAGGTGCTT 1620 

CGGGGGCCTA CGGAAGAGCG ACTAGACATC AAGCATAATC GGCCGCGCAG AGACTGCGTG 1680 

GCAGAGGGCA AAGTGTGTGA CCCACTGTGC TCCTCTGGGG GATGCTGGGG CCCAGGCCCT 1740 

GGTCAGTGCT TGTCCTGTCG AAATTATAGC CGAGGAGGTG TCTGTGTGAC CCACTGCAAC 1800 

TTTCTGAATG GGGAGCCTCG AGAATTTGCC CATGAGGCCG AATGCTTCTC CTGCCACCCG 1860 

GAATGCCAAC CCATGGGGGG CACTGCCACA TGCAATGGCT CGGGCTCTGA TACTTGTGCT 1920 

CAATGTGCCC ATTTTCGAGA TGGGCCCCAC TGTGTGAGCA GCTGCCCCCA TGGAGTCCTA 1980 

GGTGCCAAGG GCCCAATCTA CAAGTACCCA GATGTTCAGA ATGAATGTCG GCCCTGCCAT 2040 

GAGAACTGCA CCCAGGGGTG TAAAGGACCA GAGCTTCAAG ACTGTTTAGG ACAAACACTG 2100 

GTGCTGATCG GCAAAACCCA TCTGACAATG GCTTTGACAG TGATAGCAGG ATTGGTAGTG 2160 

ATTTTCATGA TGCTGGGCGG CACTTTTCTC TACTGGCGTG GGCGCCGGAT TCAGAATAAA 2220 

AGGGCTATGA GGCGATACTT GGAACGGGGT GAGAGCATAG AGCCTCTGGA CCCCAGTGAG 2280 

AAGGCTAACA AAGTCTTGGC CAGAATCTTC AAAGAGACAG AGCTAAGGAA GCTTAAAGTG 2340 

CTTGGCTCGG GTGTCTTTGG AACTGTGCAC AAAGGAGTGT GGATCCCTGA GGGTGAATCA 2400 

ATCAAGATTC CAGTCTGCAT TAAAGTCATT GAGGACAAGA GTGGACGGCA GAGTTTTCAA 2460 

GCTGTGACAG ATCATATGCT GGCCATTGGC AGCCTGGACC ATGCCCACAT TGTAAGGCTG 2520 

CTGGGACTAT GCCCAGGGTC ATCTCTGCAG CTTGTCACTC AATATTTGCC TCTGGGTTCT 2580 

CTGCTGGATC ATGTGAGACA ACACCGGGGG GCACTGGGGC CACAGCTGCT GCTCAACTGG 2640 

GGAGTACAAA TTGCCAAGGG AATGTACTAC CTTGAGGAAC ATGGTATGGT GCATAGAAAC 2700 

CTGGCTGCCC GAAACGTGCT ACTCAAGTCA CCCAGTCAGG TTCAGGTGGC AGATTTTGGT 2760 

GTGGCTGACC TGCTGCCTCC TGATGATAAG CAGCTGCTAT ACAGTGAGGC CAAGACTCCA 2820 

ATTAAGTGGA TGGCCCTTGA GAGTATCCAC TTTGGGAAAT ACACACACCA GAGTGATGTC 2880 

TGGAGCTATG GTGTGACAGT TTGGGAGTTG ATGACCTTCG GGGCAGAGCC CTATGCAGGG 2940 

CTACGATTGG CTGAAGTACC AGACCTGCTA GAGAAGGGGG AGCGGTTGGC ACAGCCCCAG 3000 

ATCTGCACAA TTGATGTCTA CATGGTGATG GTCAAGTGTT GGATGATTGA TGAGAACATT 3060 

CGCCCAACCT TTAAAGAACT AGCCAATGAG TTCACCAGGA TGGCCCGAGA CCCACCACGG 3120 

TATCTGGTCA TAAAGAGAGA GAGTGGGCCT GGAATAGCCC CTGGGCCAGA GCCCCATGGT 3180 

CTGACAAACA AGAAGCTAGA GGAAGTAGAG CTGGAGCCAG AACTAGACCT AGACCTAGAC 3240 

TTGGAAGCAG AGGAGGACAA CCTGGCAACC ACCACACTGG GCTCCGCCCT CAGCCTACCA 3300

GTTGGAACAC TTAATCGGCC ACGTGGGAGC CAGAGCCTTT TAAGTCCATC ATCTGGATAC 3360 

ATGCCCATGA ACCAGGGTAA TCTTGGGGGG TCTTGCCAGG AGTCTGCAGT TTCTGGGAGC 3420 

AGTGAACGGT GCCCCCGTCC AGTCTCTCTA CACCCAATGC CACGGGGATG CCTGGCATCA 3480 

GAGTCATCAG AGGGGCATGT AACAGGCTCT GAGGCTGAGC TCCAGGAGAA AGTGTCAATG 3540 

TGTAGAAGCC GGAGCAGGAG CCGGAGCCCA CGGCCACGCG GAGATAGCGC CTACCATTCC 3600 

CAGCGCCACA GTCTGCTGAC TCCTGTTACC CCACTCTCCC CACCCGGGTT AGAGGAAGAG 3660 

GATGTCAACG GTTATGTCAT GCCAGATACA CACCTCAAAG GTACTCCCTC CTCCCGGGAA 3720 

GGCACCCTTT CTTCAGTGGG TCTCAGTTCT GTCCTGGGTA CTGAAGAAGA AGATGAAGAT 3780 

GAGGAGTATG AATACATGAA CCGGAGGAGA AGGCACAGTC CACCTCATCC CCCTAGGCCA 3840 

AGTTCCCTTG AGGAGCTGGG TTATGAGTAC ATGGATGTGG GGTCAGACCT CAGTGCCTCT 3900 

CTGGGCAGCA CACAGAGTTG CCCACTCCAC CCTGTACCCA TCATGCCCAC TGCAGGCACA 3960 

ACTCCAGATG AAGACTATGA ATATATGAAT CGGCAACGAG ATGGAGGTGG TCCTGGGGGT 4020 

GATTATGCAG CCATGGGGGC CTGCCCAGCA TCTGAGCAAG GGTATGAAGA GATGAGAGCT 4080 

TTTCAGGGGC CTGGACATCA GGCCCCCCAT GTCCATTATG CCCGCCTAAA AACTCTACGT 4140 

AGCTTAGAGG CTACAGACTC TGCCTTTGAT AACCCTGATT ACTGGCATAG CAGGCTTTTC 4200 

CCCAAGGCTA ATGCCCAGAG AACGTAACTC CTGCTCCCTG TGGCACTCAG GGAGCATTTA 4260 

ATGGCAGCTA GTGCCTTTAG AGGGTACCGT CTTCTCCCTA TTCCCTCTCT CTCCCAGGTC 4320 

CCAGCCCCTT TTCCCCAGTC CCAGACAATT CCATTCAATC TTTGGAGGCT TTTAAACATT 4380 

TTGACACAAA ATTCTTATGG TATGTAGCCA GCTGTGCACT TTCTTCTCTT TCCCAACCCC 4440 

AGGAAAGGTT TTCCTTATTT TGTGTGCTTT CCCAGTCCCA TTCCTCAGCT TCTTCACAGG 4500 

CACTCCTGGA GATATGAAGG ATTACTCTCC ATATCCCTTC CTCTCAGGCT CTTGACTACT 4560 

TGGAACTAGG CTCTTATGTG TGCCTTTGTT TCCCATCAGA CTGTCAAGAA GAGGAAAGGG 4620 

AGGAAACCTA GCAGAGGAAA GTGTAATTTT GGTTTATGAC TCTTAACCCC CTAGAAAGAC 4680 

AGAAGCTTAA AATCTGTGAA GAAAGAGGTT AGGAGTAGAT ATTGATTACT ATCATAATTC 4740 

AGCACTTAAC TATGAGCCAG GCATCATACT AAACTTCACC TACATTATCT CACTTAGTCC 4800 

TTTATCATCC TTAAAACAAT TCTGTGACAT ACATATTATC TCATTTTACA CAAAGGGAAG 4860 

TCGGGCATGG TGGCTCATGC CTGTAATCTC AGCACTTTGG GAGGCTGAGG CAGAAGGATT 4920 

ACCTGAGGCA AGGAGTTTGA GACCAGCTTA GCCAACATAG TAAGACCCCC ATCTC 4975 



Seq ID NO: C76 DNA Sequence 






WO 03/042661 



PCT/US02/36810 



Nucleic Acid Accession #: NM_001216.1
Coding sequence: 43..1422



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1 11 21 

I I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC 
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC 
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT 
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA 
AGTGAAGAGG ATTCACCCAG AGAGGAGGAT 
GAGGATCTAC CTGGAGAGGA GGATCTACCT 
TCCCTGAAGT TAGAGGATCT ACCTACTGTT 
AATAATGCCC ACAGGGACAA AGAAGGGGAT
CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC 
CGCCCCCAGC 

CTCCCGCCGC TCCCAGAACT 
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC 
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC 
CCTGCGGAGA TCCACGTGGT TCACCTCAGC 
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC 
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG 
CAGGTCCCAG GACTGGACAT ATCTGCACTC 
TATGAGGGGT CTCTGACTAC ACCGCCCTGT 
CAGACAGTGA TGCTGAGTGC TAAGCAGCTC 
GGTGACTCTC GGCTACAGCT GAACTTCCGA 
GAGGCCTCCT TCCCTGCTGG AGTGGACAGC 
AATTCCTGCC TGGCTGCTGG TGACATCCTA 
ACCAGCGTCG CGTTCCTTGT GCAGATGAGA 
GTGAGCTACC GCCCAGCAGA GGTAGCCGAG 
TGTGAGAAGC CAGCCAGAGG CATCTGAGGG 
ATGCCACTTC CTTTTAACTG CCAAGAAATT 



31 41 

I I 
ACAGTCAGCC GCATGGCTCC 
CCTGCTCCAG GCCTCACTGT 
CCCCAGAGGT TGCCCCGGAT 
GATGACCCAC TGGGCGAGGA 
CCACCCGGAG AGGAGGATCT 
GAAGTTAAGC CTAAATCAGA 
GAGGCTCCTG GAGATCCTCA 
GACCAGAGTC ATTGGCGCTA 
GCGGGCCGCT TCCAGTCCCC 
CTGCGCCCCC TGGAACTCCT 
AACAATGGCC ACAGTGTGCA 
GGGCGGGAGT ACCGGGCTCT 
TCGGAGCACA CTGTGGAAGG 
ACCGCCTTTG CCAGAGTTGA 
GCCTTTCTGG AGGAGGGCCC 
GAAGAAATCG CTGAGGAAGG 
CTGCCCTCTG ACTTCAGCCG 
GCCCAGGGTG TCATCTGGAC 
CACACCCTCT CTGACACCCT 
GCGACGCAGC CTTTGAATGG 
AGTCCTCGGG CTGCTGAGCC 
GCCCTGGTTT TTGGCCTCCT 
AGGCAGCACA GAAGGGGAAC 
ACTGGAGCCT AGAGGCTGGA 
GGAGCCGGTA ACTGTCCTGT 
TTTTAAAATA AATATTTATA 



51 
I 

GCAACTGCTG 
GCAGGAGGAT 
GGATCTGCCC 
ACCTGGAGAG 
AGAAGAGGGC 
AGAACCCCAG 
TGGAGGCGAC 
GGTGGATATC 
GGGCTTCCAG 
ACTGACCCTG 
GCAGCTGCAT 
CCACCGTTTC 
CGAGGCCTTG 



Seq ID NO: C77 DNA Sequence

Nucleic Acid Accession #: NM_004207.1

Coding sequence: 63..1460



41 



CTCAGAGACT 
CTACTTCCAA 
TGTGTTTAAC 
GTGGGGACCT 
GCGAGTGATT 
AGTCCAGCTG 
TTTTGCTGTC 
CAAAGGGGGT 
TCTTGGAGAA 
CCTGCTCATT 
AT 



51 



60 

120

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140

1200 

1260 

1320 

1380 

1440 

1500 

1552



1 11 21 31 

I I I I I I 

GGCGAGAGGC GGGCTGAGGC GGCCCAGCGG CGGCAGGTGA GGCGGAACCA ACCCTCCTGG 60 

CCATGGGAGG GGCCGTGGTG GACGAGGGCC CCACAGGCGT CAAGGCCCCT GACGGCGGCT 120 

GGGGCTGGGC CGTGCTCTTC GGCTGTTTCG TCATCACTGG CTTCTCCTAC GCCTTCCCCA 180 

AGGCCGTCAG TGTCTTCTTC AAGGAGCTCA TACAGGAGTT TGGGATCGGC TACAGCGACA 240 

CAGCCTGGAT CTCCTCCATC CTGCTGGCCA TGCTCTACGG GACAGGTCCG CTCTGCAGTG 300

TGTGCGTGAA CCGCTTTGGC TGCCGGCCCG TCATGCTTGT GGGGGGTCTC TTTGCGTCGC 360 

TGGGCATGGT GGCTGCGTCC TTTTGCCGGA GCATCATCCA GGTCTACCTC ACCACTGGGG 420 

TCATCACGGG GTTGGGTTTG GCACTCAACT TCCAGCCCTC GCTCATCATG CTGAACCGCT 480 

ACTTCAGCAA GCGGCGCCCC ATGGCCAACG GGCTGGCGGC AGCAGGTAGC CCTGTCTTCC 540 

TGTGTGCCCT GAGCCCGCTG GGGCAGCTGC TGCAGGACCG CTACGGCTGG CGGGGCGGCT 600 

TCCTCATCCT GGGCGGCCTG CTGCTCAACT GCTGCGTGTG TGCCGCACTC ATGAGGCCCC 660 

TGGTGGTCAC GGCCCAGCCG GGCTCGGGGC CGCCGCGACC CTCCCGGCGC CTGCTAGACC 720 

TGAGCGTCTT CCGGGACCGC GGCTTTGTGC TTTACGCCGT GGCCGCCTCG GTCATGGTGC 780

TGGGGCTCTT CGTCCCGCCC GTGTTCGTGG TGAGCTACGC CAAGGACCTG GGCGTGCCCG 840 

ACACCAAGGC CGCCTTCCTG CTCACCATCC TGGGCTTCAT TGACATCTTC GCGCGGCCGG 900 

CCGCGGGCTT CGTGGCGGGG CTTGGGAAGG TGCGGCCCTA CTCCGTCTAC CTCTTCAGCT 960 

TCTCCATGTT CTTCAACGGC CTCGCGGACC TGGCGGGCTC TACGGCGGGC GACTACGGCG 1020 

GCCTCGTGGT CTTCTGCATC TTCTTTGGCA TCTCCTACGG CATGGTGGGG GCCCTGCAGT 1080 

TCGAGGTGCT CATGGCCATC GTGGGCACCC ACAAGTTCTC CAGTGCCATT GGCCTGGTGC 1140 

TGCTGATGGA GGCGGTGGCC GTGCTCGTCG GGCCCCCTTC GGGAGGCAAA CTCCTGGATG 1200 

CGACCCACGT CTACATGTAC GTGTTCATCC TGGCGGGGGC CGAGGTGCTC ACCTCCTCCC 1260 

TGATTTTGCT GCTGGGCAAC TTCTTCTGCA TTAGGAAGAA GCCCAAAGAG CCACAGCCTG 1320 

AGGTGGCGGC CGCGGAGGAG GAGAAGCTCC ACAAGCCTCC TGCAGACTCG GGGGTGGACT 1380

TGCGGGAGGT GGAGCATTTC CTGAAGGCTG AGCCTGAGAA AAACGGGGAG GTGGTTCACA 1440 

CCCCGGAAAC AAGTGTCTGA GTGGCTGGGC GGGGCCGGCA GGCACAGGGA GGAGGTACAG 1500 

AAGCCGGCAA CGCTTGCTAT TTATTTTACA AACTGGACTG GCTCAGGCAG GGCCACGGCT 1560 

GGGCTCCAGC TGCCGGCCCA GCGGATCGTC GCCCGATCAG TGTTTTGAGG GGGAAGGTGG 1620 

CGGGGTGGGA ACCGTGTCAT TCCAGAGTGG ATCTGCGGTG AAGCCAAGCC GCAAGGTTAC 1680

AAGGCATCCT CACCAGGGGC CCCGCCTGCT GCTCCCAGGT GGCCTGCGGC CACTGCTATG 1740 

CTCAAGGACC TGGAAACCCA TGCTTCGAGA CAACGTGACT TTAATGGGAG GGTGGGTGGG 1800 

CCGCAGACAG GCTGGCAGGG CAGGTGCTGC GTGGGGCCCT CTCCAGCCCG TCCTACCCTG 1860

GGCTCACATG GGGCCTGTGC CCACCCCTCT TGAGTGTCTT GGGGACAGCT CTTTCCACCC 1920 

CTGGAAGATG GAAATAAACC TGCGTGTGGG TGGAGTGTTC TCGTGCCGAA TTCAAAAAGC 1980 

TT 1982



Seq ID NO: C78 DNA Sequence

Nucleic Acid Accession #: NM_000358.1
Coding sequence: 48..2099

1 11 21 31 41 51

| | | | | |

GCTTGCCCGT CGGTCGCTAG CTCGCTCGGT GCGCGTCGTC CCGCTCCATG GCGCTCTTCG 60 

TGCGGCTGCT GGCTCTCGCC CTGGCTCTGG CCCTGGGCCC CGCCGCGACC CTGGCGGGTC 120 

CCGCCAAGTC GCCCTACCAG CTGGTGCTGC AGCACAGCAG GCTCCGGGGC CGCCAGCACG 180 

GCCCCAACGT GTGTGCTGTG CAGAAGGTTA TTGGCACTAA TAGGAAGTAC TTCACCAACT 240 

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GCAAGCAGTG GTACCAAAGG AAAATCTGTG GCAAATCAAC AGTCATCAGC TACGAGTGCT 300 

GTCCTGGATA TGAAAAGGTC CCTGGGGAGA AGGGCTGTCC AGCAGCCCTA CCACTCTCAA 360 

ACCTTTACGA GACCCTGGGA GTCGTTGGAT CCACCACCAC TCAGCTGTAC ACGGACCGCA 420 

CGGAGAAGCT GAGGCCTGAG ATGGAGGGGC CCGGCAGCTT CACCATCTTC GCCCCTAGCA 480 

ACGAGGCCTG GGCCTCCTTG CCAGCTGAAG TGCTGGACTC CCTGGTCAGC AATGTCAACA 540

TTGAGCTGCT CAATGCCCTC CGCTACCATA TGGTGGGCAG GCGAGTCCTG ACTGATGAGC 600

TGAAACACGG CATGACCCTC ACCTCTATGT ACCAGAATTC CAACATCCAG ATCCACCACT 660 

ATCCTAATGG GATTGTAACT GTGAACTGTG CCCGGCTCCT GAAAGCCGAC CACCATGCAA 720 

CCAACGGGGT GGTGCACCTC ATCGATAAGG TCATCTCCAC CATCACCAAC AACATCCAGC 780

AGATCATTGA GATCGAGGAC ACCTTTGAGA CCCTTCGGGC TGCTGTGGCT GCATCAGGGC 840 

TCAACACGAT GCTTGAAGGT AACGGCCAGT ACACGCTTTT GGCCCCGACC AATGAGGCCT 900 

TCGAGAAGAT CCCTAGTGAG ACTTTGAACC GTATCCTGGG CGACCCAGAA GCCCTGAGAG 960 

ACCTGCTGAA CAACCACATC TTGAAGTCAG CTATGTGTGC TGAAGCCATC GTTGCGGGGC 1020 

TGTCTGTAGA GACCCTGGAG GGCACGACAC TGGAGGTGGG CTGCAGCGGG GACATGCTCA 1080 

CTATCAACGG GAAGGCGATC ATCTCCAATA AAGACATCCT AGCCACCAAC GGGGTGATCC 1140 

ACTACATTGA TGAGCTACTC ATCCCAGACT CAGCCAAGAC ACTATTTGAA TTGGCTGCAG 1200 

AGTCTGATGT GTCCACAGCC ATTGACCTTT TCAGACAAGC CGGCCTCGGC AATCATCTCT 1260 

CTGGAAGTGA GCGGTTGACC CTCCTGGCTC CCCTGAATTC TGTATTCAAA GATGGAACCC 1320 

CTCCAATTGA TGCCCATACA AGGAATTTGC TTCGGAACCA CATAATTAAA GACCAGCTGG 1380 

CCTCTAAGTA TCTGTACCAT GGACAGACCC TGGAAACTCT GGGCGGCAAA AAACTGAGAG 1440 

TTTTTGTTTA TCGTAATAGC CTCTGCATTG AGAACAGCTG CATCGCGGCC CACGACAAGA 1500 

GGGGGAGGTA CGGGACCCTG TTCACGATGG ACCGGGTGCT GACCCCCCCA ATGGGGACTG 1560 

TCATGGATGT CCTGAAGGGA GACAATCGCT TTAGCATGCT GGTAGCTGCC ATCCAGTCTG 1620 

CAGGACTGAC GGAGACCCTC AACCGGGAAG GAGTCTACAC AGTCTTTGCT CCCACAAATG 1680 

AAGCCTTCCG AGCCCTGCCA CCAAGAGAAC GGAGCAGACT CTTGGGAGAT GCCAAGGAAC 1740 

TTGCCAACAT CCTGAAATAC CACATTGGTG ATGAAATCCT GGTTAGCGGA GGCATCGGGG 1800 

CCCTGGTGCG GCTAAAGTCT CTCCAAGGTG ACAAGCTGGA AGTCAGCTTG AAAAACAATG 1860

TGGTGAGTGT CAACAAGGAG CCTGTTGCCG AGCCTGACAT CATGGCCACA AATGGCGTGG 1920 

TCCATGTCAT CACCAATGTT CTGCAGCCTC CAGCCAACAG ACCTCAGGAA AGAGGGGATG 1980 

AACTTGCAGA CTCTGCGCTT GAGATCTTCA AACAAGCATC AGCGTTTTCC AGGGCTTCCC 2040 

AGAGGTCTGT GCGACTAGCC CCTGTCTATC AAAAGTTATT AGAGAGGATG AAGCATTAGC 2100 

TTGAAGCACT ACAGGAGGAA TGCACCACGG CAGCTCTCCG CCAATTTCTC TCAGATTTCC 2160 

ACAGAGACTG TTTGAATGTT TTCAAAACCA AGTATCACAC TTTAATGTAC ATGGGCCGCA 2220 

CCATAATGAG ATGTGAGCCT TGTGCATGTG GGGGAGGAGG GAGAGAGATG TACTTTTTAA 2280 

ATCATGTTCC CCCTAAACAT GGCTGTTAAC CCACTGCATG CAGAAACTTG GATGTCACTG 2340 

CCTGACATTC ACTTCCAGAG AGGACCTATC CCAAATGTGG AATTGACTGC CTATGCCAAG 2400 

TCCCTGGAAA AGGAGCTTCA GTATTGTGGG GCTCATAAAA CATGAATCAA GCAATCCAGC 2460 

CTCATGGGAA GTCCTGGCAC AGTTTTTGTA AAGCCCTTGC ACAGCTGGAG AAATGGCATC 2520

ATTATAAGCT ATGAGTTGAA ATGTTCTGTC AAATGTGTCT CACATCTACA CGTGGCTTGG 2580 

AGGCTTTTAT GGGGCCCTGT CCAGGTAGAA AAGAAATGGT ATGTAGAGCT TAGATTTCCC 2640 

TATTGTGACA GAGCCATGGT GTGTTTGTAA TAATAAAACC AAAGAAACAT A 2691 

Seq ID NO: C79 DNA Sequence 

Nucleic Acid Accession #: NM_006536.2 

Coding sequence: 109..2940

1 11 21 31 41 51

| | | | | |

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGGAGTGGTT CAACAATTCA CTCCATTGCC 1440

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 
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GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660

TACCTAGGAA A 3671



Seq ID NO: C80 DNA Sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1413

1 11 21 31 41 51

| | | | | |

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380

ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 1413



Seq ID NO: C81 DNA Sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1413

1 11 21 31 41 51

| | | | | |

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGCCATT 660

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140

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GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380

ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 1413



Seq ID NO: C82 DNA Sequence

Nucleic Acid Accession #: NM_006952.1

Coding sequence: 11..793



1 11 21 31 41 51

| | | | | |

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780

AATTGAATAT TAAGAA 796



Seq ID NO: C83 DNA Sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71..2560 



11 



21 



31 



41 



51 



AAAGGGGCAA GAGCTGAGCG 
CTCTGCAGCC ATGGGGCTCC 
CTGGCTGCAG TGCGCGGCCT 
CTTGGAGGCG GGAGGCGCGG 
CTGCCCTGGG CAAGAGCCAG 
TGGCGAGACA GTCCAGGAAA 
ATCCAAACGT ATCTTACGAA 
TGAAAATGGC AAGGGTCCCT 
AGACACCAAG ATTTTCTACA 
CTTCGCTGTA GAGAAGGAGA 
GATTGCCAAG TATGAGCTCT 
CCCCATGAAC ATCTCCATCA 
GGACACCTTC CGAGGGAGTG 
GACAGCCACG GATGAGGATG 
CCATAGCCAA GAACCAAAGG 
CACCATCAGC GTCATCTCCA 
CATCCAGGCC ACAGACATGG 
GATCCTTGAT GCCAATGACA 
GCCTGAGAAT GCAGTGGGCC 
CAACTCACCA GCGTGGCGTG 
TACCATCACC ACCCACCCTG 
TTTTGAGGCC AAAAACCAGC 
GCTGAAGCTC CCAACCTCCA 
ACCTGTGTTT GTCCCACCCT 
GCCTGTGTGT GTCTACACTG 
CATCCTGAGA GACCCAGCAG 
TGTGGGCACC CTCGACCGTG 
GGTCTTGGCC ATGGACAATG 
ACTGATTGAT GTCAATGACC 
CCAAAGCCCT GTGCGCCAGG 
CCCTTTCCAG GCCCAGCTCA 
GGAAGGTGAC ACAGTGGTCT 
GCACCTTTCT CTGTCTGACC 
GTGCGACTGC CATGGCCATG 
CCCTGTGCTG GGGGCTGTCC 
GAGAAAGAAG CGGAAGATCA 
CGTCTTCTAC TATGGCGAAG 
GCTCCACCGA GGTCTGGAGG 
CATCATCCCG ACACCCATGT 
TATAATTGAG AACCTGAAGG 
CTTGGTGTTC GACTATGAGG 
CTCCGCCTCC GACCAAGACC 
GAAGCTGGCA GACATGTACG 
GGGACCAAAC GTCAGGCCAC 
GACTTCGGAG CTTGTCAGGA 
ACGTTAGAGT GGTTGCTTCC 
AGCACTGAAA ACCTCTCCAC 
TCTTACCTGC CGTAAAATGC 
TACAGTGGAC TTTCTCTCTG 
TTTTTTTAAT GCTATCTTCA 
GCTGGGCCCA CTGGCCGTCC 



GAACACCGGC 
CTCGTGGACC 
CCGAGCCGTG 
AGCAGGAGCC 
CTCTGTTTAG 
GAAGGTCACT 
GACACAAGAG 
TCCCCCAGAG 
GCATCACGGG 
CAGGCTGGTT 
TTGGCCACGC 
TCGTGACCGA 
TCTTAGAGGG 
ATGCCATCTA 
ACCCACACGA 
GTGGCCTGGA 
ATGGGGACGG 
ATGCTCCCAT 
ATGAGGTGCA 
CCACCTACCT 
AGAGCAACCA 
ACACCCTGTA 
CAGCCACCAT 
CCAAAGTCGT 
CAGAAGACCC 
GGTGGCTAGC 
AGGATGAGCA 
GAAGCCCTCC 
ATGGCCCAGT 
TGCTGAACAT 
CAGATGACTC 
TGTCCCTGAA 
ATGGCAACAA 
TCGAAACCTG 
TGGCTCTGCT 
AGGAGCCCCT 
AGGGGGGTGG 
CCAGGCCGGA 
ACCGTCCTCG 
CGGCTAACAC 
GCAGCGGCTC 
AAGATTACGA 
GTGGCGGGGA 
AGAGCATCTC 
AGTGGCCGTA 
TTAGCCTTTC 
CTGGGCCAGG 
TCAACCCTGT 
GAATGGAACC 
AAACGTTAGA 
TGCATTTCTG 



CCGCCGTCGC 
TCTCGCGTCT 
CCGGGCGGTC 
CGGCCAGGCG 
CACTGATAAT 
GAAGGAAAGG 
AGATTGGGTG 
ACTGAATCAG 
GCCGGGGGCA 
GTTGTTGAAT 
TGTGTCAGAG 
CCAGAATGAC 
AGTCCTACCA 
CACCTACAAT 
CCTCATGTTC 
CCGGGAAAAA 
CTCCACCACC 
GTTTGACCCC 
GAGGCTGACG 
TATCATGGGC 
GGGCATCCTG 
CGTTGAAGTG 
AGTGGTCCAC 
TGAGGTCCAG 
TGACAAGGAG 
CATGGACCCA 
GTTTGTGAGG 
CACCACTGGC 
CCCTGAGCCC 
CACGGACAAG 
AGACATCTAC 
GAAGTTCCTG 
AGAGCAGCTG 
CCCTGGACCC 
GTTCCTCCTG 
CCTACTCCCA 
CGAAGAGGAC 
GGTGGTTCTC 
GCCAGCCAAC 
AGACCCCACA 
CGACGCCGCG 
TTATCTGAAC 
GGACGACTAG 
CAAGGGGTCT 
GCAACTTGGC 
AGGATGGAGG 
GTTGCCTCAG 
GTCCTGGGCC 
TTCTTAGGCC 
GAAAGTTCTT 
GTTTCCAGAC 



GGCAGCTGCT 
CTCCTCCTTC 
TTCAGGGAGG 
CTGGGGAAAG 
GATGACTTCA 
AATCCATTGA 
GTTGCTCCAA 
CTCAAGTCTA 
GACAGCCCCC 
AAGCCACTGG 
AATGGTGCCT 
CACAAGCCCA 
GGTACTTCTG 
GGGGTGGTTG 
ACCATTCACC 
GTCCCTGAGT 
ACGGCAGTGG 
CAGAAGTACG 
GTCACTGATC 
GGTGACGACG 
ACAACCAGGA 
ACCAACGAGG 
GTGGAGGATG 
GAGGGCATCC 
AATCAAAAGA 
GACAGTGGGC 
AACAACATCT 
ACGGGAACCC 
CGTCAGATCA 
GACCTGTCTC 
TGGACGGCAG 
AAGCAGGATA 
ACGGTGATCA 
TGGAAGGGAG 
CTGGTGCTGC 
GAAGATGACA 
CAGGACTATG 
CGCAATGACG 
CCAGATGAAA 
GCCCCGCCCT 
TCCCTGAGCT 
GAGTGGGGCA 
GCGGCCTGCC 
CAGTTCCCCC 
GGAGACAGGC 
AATGTGGGCA 
AGGCCAAGTT 
TGGGCCTGCT 
TCCTGGTGCA 
CAAAAGTGCA 
CCCAATGCCT 



TCACCCCTCT 
TCCAGGTTTG 
CTGAAGTGAC 
TATTCATGGG 
CTGTGCGGAA 
AGATCTTCCC 
TATCTGTCCC 
ATAAAGATAG 
CTGAGGGTGT 
ACCGGGAGGA 
CAGTGGAGGA 
AGTTTACCCA 
TGATGCAGGT 
CTTACTCCAT 
GGAGCACAGG 
ACACACTGAC 
CAGTAGTGGA 
AGGCCCATGT 
TGGACGCCCC 
GGGACCATTT 
AGGGTTTGGA 
CCCCTTTTGT 
TGAATGAGGC 
CCACTGGGGA 
TCAGCTACCG 
AGGTCACAGC 
ATGAAGTCAT 
TTCTGCTAAC 
CCATCTGCAA 
CCCACACCTC 
AGGTCAACGA 
CATATGACGT 
GGGCCACTGT 
GTTTCATCCT 




60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 






WO 03/042661 



TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

TAAAGAAACT TTTCCCAGAA AAAAA 3205 

Seq ID NO: C84 DNA Sequence 

Nucleic Acid Accession #: NM_005629.1 

Coding sequence: 639..2546 

1 11 21 31 41 51 

| | | | | | 

TAGTCGGAGC GAGGTGGCGA GTCGCTGAGC CCGCCGCGGC CCCGAGAGCG GCTGCAGCCG 60 

CCGCCGCCGG GAAGGAGAGG GCGAGGCGCG CCGGAGCCGC CGCCGCCGCC GCCACCGCCG 120 

CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC GGGCCCOGGC 180 

CGGGGCGGGG GGCGCGGGCC ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 240 

CCGATGTCGC CCGCGCCCCG TTAGGATGAG TCTCGGGTCG GGCGAGGAGC CGCCGCAGCC 300 

GCCGCCGCCC GAGCCGCGGG CAGGAGCCTC GGGAGCOGCC GCCGCCGCCG CCGCCGCCGG 360 

GCCGGGCCCC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 

CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCGCC GCCCGCGCCC CGCACCCCGC 480 

CGTCCGCCCG COGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCCGCGC 540 

CCTCGGGGCC CTCCCCGGTG CCGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 600 

GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGCGCCGAGA 660 

ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGCG CCCGGGCCCG 720 

ACGGGGCCCC GGCCAAGGGC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 780 

CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTCG TGCGTGGGCT 840 

TCGCCGTGGG CTTGGGCAAC GTGTGGCGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 900 

GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 960 

TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 1020 

CCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 1080 

ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCACGCTGC 1140 

CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCCGCCATG 1200 

AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 1260 

CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 1320 

CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 

TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 1440 

CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 1500 

GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG 1560 

ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 1620 

GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 1680 

ACAGTGGGAC CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 

CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 1800 

CCTACCCGCG GGCTGTCACG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT 1860 

TCATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 1920 

GCCTCCTCGA CCTCCTCCOG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 

CCCTCTGTTG TGCCCTCTGC TTTGTCATCG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 2040 

ACGTCTTCCA GCTGTTTGAC TACTACTCGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 

TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 

CCTGTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220 

CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280 

ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 

CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 

CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 

CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 

TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 

GCCATAGCAG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 

CACTTTTGGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 

ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 

TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820 

CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 

CTCTGCAGCA CACCCGTGGG TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 

TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 

GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 

TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAATCGCCA 3120 

CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGOGT GCGTGAGTAC GGAGAGTATA 3180 

TATAGATCTC TATCTCTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 

GGAGGCTTGT ATTTTGCACA TTTTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 

TTTCTAAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 

CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420 

GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGGATG TGCGTGTGTG 3480 

GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCCCAC 3540 

GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCCGTC CTGGGTGTCT 3600 
GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 
GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 
CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 
TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 
GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 
AAAACATGTC ATTTTCC 3917 

Seq ID NO: C85 DNA Sequence 

Nucleic Acid Accession #: NM_006516.1 

Coding sequence: 180..1658 

1 11 21 31 41 51 

| | | | | | 

TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA 60 

GTCAGAGTCG CAGTGGGAGT CCCCGGACCG GAGCACGAGC CTGAGCGGGA GAGCGCCGCT 120 

CGCACGCCOG TCGCCACCCG CGTACCCGGC GCAGCCAGAG CCACCAGCGC AGCGCTGCCA 180 

TGGAGCCCAG CAGCAAGAAG CTGACGGGTC GCCTCATGCT GGCTGTGGGA GGAGCAGTGC 240 

TTGGCTCCCT GCAGTTTGGC TACAACACTG GAGTCATCAA TGCCCCCCAG AAGGTGATCG 300 

AGGAGTTCTA CAACCAGACA TGGGTCCACC GCTATGGGGA GAGCATCCTG CCCACCACGC 360 

TCACCACGCT CTGGTCCCTC TCAGTGGCCA TCTTTTCTGT TGGGGGCATG ATTGGCTCCT 420 

TCTCTGTGGG CCTTTTCGTT AACCGCTTTG GCCGGCGGAA TTCAATGCTG ATGATGAACC 480 

TGCTGGCCTT CGTGTCCGCC GTGCTCATGG GCTTCTCGAA ACTGGGCAAG TCCTTTGAGA 540 

TGCTGATCCT GGGCCGCTTC ATCATCGGTG TGTACTGCGG CCTGACCACA GGCTTCGTGC 600 

CCATGTATGT GGGTGAAGTG TCACCCACAG CCTTTCGTGG GGCCCTGGGC ACCCTGCACC 660 

AGCTGGGCAT CGTOGTCGGC ATCCTCATCG CCCAGGTGTT CGGCCTGGAC TCCATCATGG 720 

GCAACAAGGA CCTGTGGCCC CTGCTGCTGA GCATCATCTT CATCCCGGCC CTGCTGCAGT 780 

GCATCGTGCT GCCCTTCTGC CCCGAGAGTC CCCGCTTCCT GCTCATCAAC CGCAACGAGG 840 

AGAACGGGGC CAAGAGTGTG CTAAAGAAGC TGCGCGGGAC AGCTGACGTG ACCCATGACC 900 

TGCAGGAGAT GAAGGAAGAG AGTCGGCAGA TGATGCGGGA GAAGAAGGTC ACCATCCTGG 960 

AGCTGTTCOG CTCCCCCGCC TACCGCCAGC CCATCCTCAT CGCTGTGGTG CTGCAGCTGT 102 0 

CCCAGCAGCT GTCTGGCATC AACGCTGTCT TCTATTACTC CACGAGCATC TTCGAGAAGG 10 80 

CGGGGGTGCA GCAGCCTGTG TATGCCACCA TTGGCTCCGG TATCGTCAAC ACGGCCTTCA 1140 

CTGTCGTGTC GCTGTTTGTG GTGGAGCGAG CAGGCCGGCG GACCCTGCAC CTCATAGGCC 1200 

TCGCTGGCAT GGCGGGTTGT GCCATACTCA TGACCATCGC GCTAGCACTG CTGGAGCAGC 1260 

TACCCTGGAT GTCCTATCTG AGCATCGTGG CCATCTTTGG CTTTGTGGCC TTCTTTGAAG 1320 

TGGGTCCTGG CCCCATCCCA TGGTTCATCG TGGCTGAACT CTTCAGCCAG GGTCCACGTC 1380 

CAGCTGCCAT TGCCGTTGCA GGCTTCTCCA ACTGGACCTC AAATTTCATT GTGGGCATGT 1440 

GCTTCCAGTA TGTGGAGCAA CTGTGTGGTC CCTACGTCTT CATCATCTTC ACTGTGCTCC 1500 

TGGTTCTGTT CTTCATCTTC ACCTACTTCA AAGTTCCTGA GACTAAAGGC CGGACCTTCG 1560 

ATGAGATCGC TTCCGGCTTC CGGCAGGGGG GAGCCAGCCA AAGTGATAAG ACACCCGAGG 1620 

AGCTGTTCCA TCCCCTGGGG GCTGATTCCC AAGTGTGAGT CGCCCCAGAT CACCAGCCCG 1680 

GCCTGCTCCC AGCAGCCCTA AGGATCTCTC AGGAGCACAG GCAGCTGGAT GAGACTTCCA 1740 

AACCTGACAG ATGTCAGCCG AGCCGGGCCT GGGGCTCCTT TCTCCAGCCA GCAATGATGT 1800 

CCAGAAGAAT ATTCAGGACT TAACGGCTCC AGGATTTTAA CAAAAGCAAG ACTGTTGCTC 1860 

AAATCTATTC AGACAAGCAA CAGGTTTTAT AATTTTTTTA TTACTGATTT TGTTATTTTT 1920 

ATATCAGCCT GAGTCTCCTG TGCCCACATC CCAGGCTTCA CCCTGAATGG TTCCATGCCT 1980 

GAGGGTGGAG ACTAAGCCCT GTCGAGACAC TTGCCTTCTT CACCCAGCTA ATCTGTAGGG 2040 

CTGGACCTAT GTCCTAAGGA CACACTAATC GAACTATGAA CTACAAAGCT TCTATCCCAG 2100 

GAGGTGGCTA TGGCCACCCG TTCTGCTGGC CTGGATCTCC CCACTCTAGG GGTCAGGCTC 2160 

CATTAGGATT TGCCCCTTCC CATCTCTTCC TACCCAACCA CTCAAATTAA TCTTTCTTTA 2220 

CCTGAGACCA GTTGGGAGCA CTGGAGTGCA GGGAGGAGAG GGGAAGGGCC AGTCTGGGCT 2280 

GCCGGGTTCT AGTCTCCTTT GCACTGAGGG CCACACTATT ACCATGAGAA GAGGGCCTGT 2340 

GGGAGCCTGC AAACTCACTG CTCAAGAAGA CATGGAGACT CCTGCCCTGT TGTGTATAGA 2400 

TGCAAGATAT TTATATATAT TTTTGGTTGT CAATATTAAA TACAGACACT AAGTTATAGT 2460 

ATATCTGGAC AAGCCAACTT GTAAATACAC CACCTCACTC CTGTTACTTA CCTAAACAGA 2520 
TATAAATGGC TGGTTTTTAG AAACATGGTT TTGAAATGCT TGTGGATTGA GGGTAGGAGG 2580 
TTTGGATGGG AGTGAGACAG AAGTAAGTGG GGTTGCAACC ACTGCAACGG CTTAGACTTC 2640 

GACTCAGGAT CCAGTCCCTT ACACGTACCT CTCATCAGTG TCCTCTTGCT CAAAAATCTG 2700 
TTTGATCCCT GTTACCCAGA GAATATATAC ATTCTTTATC TTGACATTCA AGGCATTTCT 2760 
ATCACATATT TGATAGTTGG TGTTCAAAAA AACACTAGTT TTGTGCCAGC CGTGATGCTC 2820 
AGGCTTGAAA TCGCATTATT TTGAATGTGA AGGGAA 2856 

Seq ID NO: C86 DNA Sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53..1576 

1 
| 

GCTCGCTGGG 



GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 

CATCATCCTC 
GTGGCTCCTC 
CCCCCAGGAG 



11 21 31 41 51 

| | | | | 

CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

AAGCGGCGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180 

CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240 

GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300 

GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360 

GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420 

CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480 

ATCGTGGCCC TGGTCTTOGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540 

CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600 

TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660 

GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GAAAGGGTGA 720 

CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 780 

TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 840 

ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 900 

CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960 

TCGTCCGAGG CCGTGGCCGT GGACTTCGGG AACTATCACC TGGGCGTCAT 1020 

ATCCCCGTCT TGGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 1080 

TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 1140 

CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 1200 

TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 1260 

TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 1320 

CGGCCCATCA AGGTGAAOCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 1380 

ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 1440 

AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 1500 

CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560 

ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 1609 



Seq ID NO: C87 DNA Sequence 

Nucleic Acid Accession #: NM_005268.1 

WO 03/042661 PCT/US02/36810 

Coding sequence: 168..989 
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TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 
I 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 




Seq ID NO: C88 DNA Sequence 
Nucleic Acid Accession #: NM_005130 
Coding sequence: 98..802 



1 11 21 31 41 51 

| | | | | | 

CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT GCCTGGGATT GCACTGGATC 60 
CGTGTGCTCA GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 120 
GCTCTCCTTC CTCCTACTGG CTGCTCAGGT GCTCCTGGTG GAGGGGAAAA AAAAAGTGAA 180 
GAATGGACTT CACAGCAAAG TGGTCTCAGA ACAAAAGGAC ACTCTGGGCA ACACCCAGAT 240 
TAAGCAGAAA AGCAGGCCCG GGAACAAAGG CAAGTTTGTC ACCAAAGACC AAGCCAACTG 300 
CAGATGGGCT GCTACTGAGC AGGAGGAGGG CATCTCTCTC AAGGTTGAGT GCACTCAATT 360 
GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA AGCTCAAGGA 420 
TGAGAGAGTC TATTGGAAAC AAGTTGCCCG GAATCTGCGC TCACAGAAAG ACATCTGTAG 480 
ATATTCCAAG ACAGCTGTGA AAACCAGAGT GTGCAGAAAG GATTTTCCAG AATCCAGTCT 540 
TAAGCTAGTC AGCTCCACTC TATTTGGGAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 600 
GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACCCA 660 
GACCATGGCC ACCAAAGCTC CCGAGTGTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 720 
GACTGCCCTG GAGTTCTGTG GAGAGACTTG GAGCTCTCTC TGCACATTCT TCCTCAGCAT 780 
AGTGCAGGAC ACGTCATGCT AATGAGGTCA AAAGAGAACG GGTTCCTTTA AGAGATGTCA 840 
TGTCGTAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACAGT CCCCCCAAAA TATGAACTTT 900 
TGTGCTTAGT GAGTGCAACG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 960 
TGGAATTTGC CTTATTTTTC TTGGATGCGA TGTTCAGAGG CTGTTTCCTG CAGCATGTAT 1020 
TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGCGAAGAG TCTTTGAGCT GAATGAGCCA 1080 
GAGTGATAAT TTCAGTGCAA CGAACTTTCT GCTGAATTAA TGGTAATAAA ACTCTGGGTG 1140 
TTTTTCAAAA AAAAAAAAAA AAA 1163 



Seq ID NO: C89 DNA Sequence 
Nucleic Acid Accession #: BC022542 
Coding sequence: 274..927 



1 
I 

ACTTGGTCCC 
CCTTCCTGCG 
TGGCGGCTCG 
GGCTCACGCG 
CCATGTGTTC 
TTTTAATCAA 
TTAAACAGGA 
AGAGAAACAT 
ATTTGTCCAA 
GTTTTCAAGC 
AAGCCTCGAT 
TTTTGAAATG 
TATGCCAATG 
CAGTGGGACT 
TGTGCTCTAC 
TATGTAGTTA 
TTCTTCTAGA 
TAGAGGAAAT 
CCTTGGTAAT 
TTTATTTTGT 
GTAGAGACAA 
CCGTTATAAA 
TTTGGGAGGC 
ACCCTGTCTC 
CAGCTAATTG 



11 
I 

AGCCGATAAA 
TCCGCACCTG 
GG7GGCGGCG 
CGGGCCCGCC 
TGAAATTATT 
AGTGAAGTTT 
CATTCCTGCA 
AACAGAGGCA 
GGAGTCTGAA 
CTTTTTGCCT 
TGTGGTCAAT 
CTGGGCTCAC 
GAACAAGATG 
GACTGTACAT 
ATTGATCCTT 
AATGCTTCCT 
ATTAATTACT 
TTGGGATCAT 
CCTCAAGCAT 
GAGAAGTGAC 
ATTATTCTCA 
TAAGAATAAA 
CGAGGTGGGC 
TACTAAAAAT 
GAAGGGTGAG 



21 
I 

TCTGGGGCAG 
GCCCCGCGCG 
GTTCGGGCGG 
ACGGCCTTCA 
TTGAGGCAAG 
GGGGAAAGCA 
GGACTTTATG 
GTGATGGTTT 
GTTCTCATTT 
GTGCACTGCC 
AACCCAGATT 
TCAGAAGTGG 
AAGTATAAAT 
ACCTCTCTAG 
GTAGCAGTTT 
AGAAACCTAA 
TTTATCTTTT 
TCTCAGCTAA 
CAGATGCCAT 
TTTATCTTCA 
TTTTGCAAGT 
ATAGGCCAGG 
GGATCACCAG 
ACAAAAGTTA 
GCGGGAGGAT 



31 
1 

CGCGCGGTAG 
CCCCTCTCGG 
CCGCCTGGCT 
CCGCCGCGCG 
AAGTTTTGAA 
TTGAGGACTT 
TGGATCCGTA 
CAGAAAATTT 
ATGCCAGACG 
GCTATCATCG 
TGTTGATGTT 
CAGCCCCTTG 
CAGTATATAA 
TATGTTCTGT 
TCAAATATGG 
ATAAGATCTA 
GTCTTCATTT 
TTCCAAAATG 
AAGGGGAAAC 
TTTGGGGTAG 
ACTTTCAATT 
CACAGTGGCT 
AGGTCAAGAG 
GCTGGGGCTG 
CGCTTGAACC 



41 
I 

GAGCTGCGGG 
GCGTCCGGCT 
GCTCCTCGGG 
CTCTGACGCC 
AGATGGTTTC 
GCACACGTGC 
TGAGTTGGCT 
TGATATAGAG 
AGATTCACAG 
GCCGCACAGT 
TTGTGACCAA 
TGCTTTGGAT 
GAATGTGATT 
GACTCTGCTC 
CCATTTTTCC 
TTAATTTCTG 
GTGGCCAAAA 
TAGTGCTCTA 
TTAATTCTGC 
AAAAATTATT 
TAAGCTACAA 
CACACCTGTA 
TTTGAGACCA 
GTGGTGGGCA 
TGGGAGGCGG 




GCGGCGACGG 
GGCATAAGGG 
CACAGAGACC 
CGTCTCTTAA 
TCATTACGAG 
GCCCCTAACT 
TGCATTGACT 
GAAGATGGAG 
GAGTTCCOGA 
AATGAGGATA 
CTACAAGTTC 
ATTACAATCC 
CTATAAGTTT 
AOGAGAGGTG 
TTATGTTTAC 
TTGCATGGAT 
TAAATTAATG 
TCTTTATGTA 
ATTGAGAAAA 
ATCCCAGCAC 
GCTTGGTGAA 
TCTGTAGTCC 
AGGTTCCAGA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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PCT/US02/36810 



GAGCCAAGAT 
GGAAAAACAA 
ATGTCATGAG 
CTAAGAAATT 
ACGCACTCCA 
GGACTTGATG 
TTGCCA1TTT 



CGCACCACTG 
AAAAGAAGAA 
ACTATTAAAG 
AATATTAATA 



AAACTGAGTA 
AAATAAAGTT 



CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 1560 

TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 1620 

ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 1680 

TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 1740 

ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 1800 

CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 1860 

GTACATGAAC AAAAAAAAAA AAAAAA 1906 
Seq ID NO: C90 DNA Sequence 
Nucleic Acid Accession #: NM_004994 
Coding sequence: 20..2143 

1 11 21 31 41 51 

| | | | | | 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGAOGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACOGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 
CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 
CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 
GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 
GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 
GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 
CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 
CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 
GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 
GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 2334 

Seq ID NO: C91 DNA Sequence 

Nucleic Acid Accession #: NM_000213 

Coding sequence: 188..5656 



1 11 21 31 41 51 

| | | | | | 

GCGCTGCCCG CCTCGTCCCC ACCCCCCAAC CCCCCGCGCC CGCCCTCGGA CAGTCCCTGC 60 
TCGCCCGCGC GCTGCAGCCC CATCTCCTAG CGGCAGCCCA GGCGCGGAGG GAGCGAGTCC 120 
GCCCCGAGGT AGGTCCAGGA CGGGCGCACA GCAGCAGCCG AGGCTGGCCG GGAGAGGGAG 180 
GAAGAGGATG GCAGGGCCAC GCCCCAGCCC ATGGGCCAGG CTGCTCCTGG CAGCCTTGAT 240 
CAGCGTCAGC CTCTCTGGGA CCTTGGCAAA CCGCTGCAAG AAGGCCCCAG TGAAGAGCTG 300 
CACGGAGTGT GTCCGTGTGG ATAAGGACTG CGCCTACTGC ACAGACGAGA TGTTCAGGGA 360 
CCGGCGCTGC AACACCCAGG CGGAGCTGCT GGCCGCGGGC TGCCAGCGGG AGAGCATCGT 420 
GGTCATGGAG AGCAGCTTCC AAATCACAGA GGAGACCCAG ATTGACACCA CCCTGCGGCG 480 
CAGCCAGATG TCCCCCCAAG GCCTGCGGGT CCGTCTGCGG CCCGGTGAGG AGCGGCATTT 540 
TGAGCTGGAG GTGTTTGAGC CACTGGAGAG CCCCGTGGAC CTGTACATCC TCATGGACTT 600 
CTCCAACTCC ATGTCCGATG ATCTGGACAA CCTCAAGAAG ATGGGGCAGA ACCTGGCTCG 660 
GGTCCTGAGC CAGCTCACCA GCGACTACAC TATTGGATTT GGCAAGTTTG TGGACAAAGT 720 
CAGCGTCCCG CAGACGGACA TGAGGCCTGA GAAGCTGAAG GAGCCCTGGC CCAACAGTGA 780 
CCCCCCCTTC TCCTTCAAGA ACGTCATCAG CCTGACAGAA GATGTGGATG AGTTCCGGAA 840 
TAAACTGCAG GGAGAGCGGA TCTCAGGCAA CCTGGATGCT CCTGAGGGCG GCTTCGATGC 900 
CATCCTGCAG ACAGCTGTGT GCACGAGGGA CATTGGCTGG CGCCCGGACA GCACCCACCT 960 
GCTGGTCTTC TCCACCGAGT CAGCCTTCCA CTATGAGGCT GATGGCGCCA ACGTGCTGGC 1020 
TGGCATCATG AGCCGCAACG ATGAACGGTG CCACCTGGAC ACCACGGGCA CCTACACCCA 1080 
GTACAGGACA CAGGACTACC CGTCGGTGCC CACCCTGGTG CGCCTGCTCG CCAAGCACAA 1140 
CATCATCCCC ATCTTTGCTG TCACCAACTA CTCCTATAGC TACTACGAGA AGCTTCACAC 1200 
CTATTTCCCT GTCTCCTCAC TGGGGGTGCT GCAGGAGGAC TCGTCCAACA TCGTGGAGCT 1260 
GCTGGAGGAG GCCTTCAATC GGATCCGCTC CAACCTGGAC ATCCGGGCCC TAGACAGCCC 1320 
CCGAGGCCTT CGGACAGAGG TCACCTCCAA GATGTTCCAG AAGACGAGGA CTGGGTCCTT 1380 
TCACATCCGG CGGGGGGAAG TGGGTATATA CCAGGTGCAG CTGCGGGCCC TTGAGCACGT 1440 
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GGATGGGACG CACGTGTGCC AGCTGCCGGA GGACCAGAAG GGCAACATCC ATCTGAAACC 1500 

TTCCTTCTCC GACGGCCTCA AGATGGACGC GGGCATCATC TGTGATGTGT GCACCTGCGA 1560 

GCTGCAAAAA GAGGTGCGGT CAGCTCGCTG CAGCTTCAAC GGAGACTTCG TGTGCGGACA 1620 

GTGTGTGTGC AGCGAGGGCT GGAGTGGCCA GACCTGCAAC TGCTCCACCG GCTCTCTGAG 1680 

TGACATTCAG CCCTGCCTGC GGGAGGGCGA GGACAAGCCG TGCTCCGGCC GTGGGGAGTG 1740 

CCAGTGCGGG CACTGTGTGT GCTACGGCGA AGGCCGCTAC GAGGGTCAGT TCTGCGAGTA 1800 

TGACAACTTC CAGTGTCCCC GCACTTCCGG GTTCCTGTGC AATGACCGAG GACGCTGCTC 1860 

CATGGGCCAG TGTGTGTGTG AGCCTGGTTG GACAGGCCCA AGCTGTGACT GTCCCCTCAG 1920 

CAATGCCACC TGCATCGACA GCAATGGGGG CATCTGTAAT GGACGTGGCC ACTGTGAGTG 1980 

TGGCCGCTGC CACTGCCACC AGCAGTCGCT CTACACGGAC ACCATCTGCG AGATCAACTA 2040 

CTCGGCGATC CACCCGGGCC TCTGCGAGGA CCTACGCTCC TGCGTGCAGT GCCAGGOGTG 2100 

GGGCACCGGC GAGAAGAAGG GGCGCACGTG TGAGGAATGC AACTTCAAGG TCAAGATGGT 2160 

GGACGAGCTT AAGAGAGCCG AGGAGGTGGT GGTGCGCTGC TCCTTCCGGG ACGAGGATGA 2220 

CGACTGCACC TACAGCTACA CCATGGAAGG TGACGGCGCC CCTGGGCCCA ACAGCACTGT 2280 

CCTGGTGCAC AAGAAGAAGG ACTGCCCTCC GGGCTCCTTC TGGTGGCTCA TCCCCCTGCT 2340 

CCTCCTCCTC CTGCCGCTCC TGGCCCTGCT ACTGCTGCTA TGCTGGAAGT ACTGTGCCTG 2400 

CTGCAAGGCC TGCCTGGCAC TTCTCCCGTG CTGCAACCGA GGTCACATGG TGGGCTTTAA 2460 

GGAAGACCAC TACATGCTGC GGGAGAACCT GATGGCCTCT GACCACTTGG ACACGCCCAT 2520 

GCTGCGCAGC GGGAACCTCA AGGGCCGTGA CGTGGTCCGC TGGAAGGTCA CCAACAACAT 2580 

GCAGCGGCCT GGCTTTGCCA CTCATGCCGC CAGCATCAAC CCCACAGAGC TGGTGCCCTA 2640 

CGGGCTGTCC TTGCGCCTGG CCCGCCTTTG CACCGAGAAC CTGCTGAAGC CTGACACTCG 2700 

GGAGTGCGCC CAGCTGCGCC AGGAGGTGGA GGAGAACCTG AACGAGGTCT ACAGGCAGAT 2760 

CTCCGGTGTA CACAAGCTCC AGCAGACCAA GTTCCGGCAG CAGCCCAATG CCGGGAAAAA 2820 

GCAAGACCAC ACCATTGTGG ACACAGTGCT GATGGCGCCC CGCTCGGCCA AGCCGGCCCT 2880 

GCTGAAGCTT ACAGAGAAGC AGGTGGAACA GAGGGCCTTC CAOGACCTCA AGGTGGCCCC 2940 

CGGCTACTAC ACCCTCACTG CAGACCAGGA CGCCCGGGGC ATGGTGGAGT TCCAGGAGGG 3000 

CGTGGAGCTG GTGGACGTAC GGGTGCCCCT CTTTATCCGG CCTGAGGATG ACGACGAGAA 3060 

GCAGCTGCTG GTGGAGGCCA TCGACGTGCC CGCAGGCACT GCCACCCTCG GCCGCCGCCT 3120 

GGTAAACATC ACCATCATCA AGGAGCAAGC CAGAGACGTG GTGTCCTTTG AGCAGCCTGA 3180 

GTTCTCGGTC AGCCGCGGGG ACCAGGTGGC CCGCATCCCT GTCATCCGGC GTGTCCTGGA 3240 

CGGCGGGAAG TCCCAGGTCT CCTACCGCAC ACAGGATGGC ACCGCGCAGG GCAACCGGGA 3300 

CTACATCCCC GTGGAGGGTG AGCTGCTGTT CCAGCCTGGG GAGGCCTGGA AAGAGCTGCA 3360 

GGTGAAGCTC CTGGAGCTGC AAGAAGTTGA CTCCCTCCTG CGGGGCCGCC AGGTCCGCCG 3420 

TTTCCACGTC CAGCTCAGCA ACCCTAAGTT TGGGGCCCAC CTGGGCCAGC CCCACTCCAC 3480 

CACCATCATC ATCAGGGACC CAGATGAACT GGACCGGAGC TTCACGAGTC AGATGTTGTC 3540 

ATCACAGCCA CCCCCTCACG GCGACCTGGG CGCCCCGCAG AACCCCAATG CTAAGGCCGC 3600 

TGGGTCCAGG AAGATCCATT TCAACTGGCT GCCCCCTTCT GGCAAGCCAA TGGGGTACAG 3660 

GGTAAAGTAC TGGATTCAGG GCGACTCCGA ATCCGAAGCC CACCTGCTCG ACAGCAAGGT 3720 

GCCCTCAGTG GAGCTCACCA ACCTGTACCC GTATTGCGAC TATGAGATGA AGGTGTGCGC 3780 

CTAGGGGGCT CAGGGCGAGG GACCCTACAG CTCCCTGGTG TCCTGCCGCA CCCACCAGGA 3840 

AGTGCCCAGC GAGCCAGGGC GTCTGGCCTT CAATGTCGTC TCCTCCACGG TGACCCAGCT 3900 

GAGCTGGGCT GAGCCGGCTG AGACCAACGG TGAGATCACA GCCTACGAGG TCTGCTATGG 3960 

CCTGGTCAAC GATGACAACC GACCTATTGG GCCCATGAAG AAAGTGCTGG TTGACAACCC 4020 

TAAGAACCGG ATGCTGCTTA TTGAGAACCT TCGGGAGTCC CAGCCCTACC GCTACACGGT 4080 

GAAGGCGCGC AACGGGGCCG GCTGGGGGCC TGAGCGGGAG GCCATCATCA ACCTGGCCAC 4140 

CCAGCCCAAG AGGCCCATGT CCATCCCCAT CATCCCTGAC ATCCCTATCG TGGACGCCCA 4200 

GAGCGGGGAG GACTACGACA GCTTCCTTAT GTACAGCGAT GACGTTCTAC GCTCTCCATC 4260 

GGGCAGCCAG AGGCCCAGCG TCTCCGATGA CACTGGCTGC GGCTGGAAGT TCGAGCCCCT 4320 

GCTGGGGGAG GAGCTGGACC TGCGGCGCGT CACGTGGCGG CTGCCCCCGG AGCTCATCCC 4380 

GCGCCTGTCG GCCAGCAGCG GGCGCTCCTC CGACGCCGAG GCCCCCACGG CCCCCCGGAC 4440 

GACGGCGGCG CGGGCGGGAA GGGCGGGAGC CGTGCCCCGC AGTGCGACAC CCGGGCCCCC 4500 

CGGAGAGCAC CTGGTGAATG GCCGGATGGA CTTTGCCTTC CCGGGCAGCA CCAACTCCCT 4560 

GCACAGGATG ACCACGACCA GTGCTGCTGC CTATGGCACC CACCTGAGCC CACACGTGCC 4620 

CCACCGCGTG CTAAGCACAT CCTCCACCCT CACACGGGAC TACAACTCAC TGACCOGCTC 4680 

AGAACACTCA CACTCGACCA CACTGCCCAG GGACTACTCC ACCCTCACCT CCGTCTCCTC 4740 

CCACGACTCT CGCCTGACTG CTGGTGTGCC CGACACGCCC ACCCGCCTGG TGTTCTCTGC 4800 

CCTGGGGCCC ACATCTCTCA GAGTGAGCTG GCAGGAGCCG CGGTGCGAGC GGCCGCTGCA 4860 

GGGCTACAGT GTGGAGTACC AGCTGCTGAA CGGCGGTGAG CTGCATCGGC TCAACATCCC 4920 

CAACCCTGCC CAGACCTCGG TGGTGGTGGA AGACCTCCTG CCCAACCACT CCTACGTGTT 4980 

CCGCGTGCGG GCCCAGAGCC AGGAAGGCTG GGGCCGAGAG CGTGAGGGTG TCATCACCAT 5040 

TGAATCCCAG GTGCACCCGC AGAGCCCACT GTGTCCCCTG CCAGGCTCCG CCTTCACTTT 5100 

GAGCACTCCC AGTGCCCCAG GCCCGCTGGT GTTCACTGCC CTGAGCCCAG ACTCGCTGCA 5160 

GCTGAGCTGG GAGCGGCCAC GGAGGCCCAA TGGGGATATC GTCGGCTACC TGGTGACCTG 5220 

TGAGATGGCC CAAGGAGGAG GGCCAGCCAC CGCATTCCGG GTGGATGGAG ACAGCCCCGA 5280 

GAGCCGGCTG ACCGTGCCGG GCCTCAGCGA GAACGTGCCC TACAAGTTCA AGGTGCAGGC 5340 

CAGGACCACT GAGGGCTTCG GGCCAGAGCG CGAGGGCATC ATCACCATAG AGTCCCAGGA 5400 

TGGAGGACCC TTCCCGCAGC TGGGCAGCCG TGCCGGGCTC TTCCAGCACC CGCTGCAAAG 5460 

CGAGTACAGC AGCATCACCA CCACCCACAC CAGCGCCACC GAGCCCTTCC TAGTGGATGG 5520 

GCTGACCCTG GGGGCCCAGC ACCTGGAGGC AGGCGGCTCC CTCACCCGGC ATGTGACCCA 5580 

GGAGTTTGTG AGCCGGACAC TGACCACCAG CGGAACCCTT AGCACCCACA TGGACCAACA 5640 

GTTCTTCCAA ACTTGACCGC ACCCTGCCCC ACCCCCGCCA TGTCCCACTA GGCGTCCTCC 5700 

CGACTCCTCT CCCGGAGCCT CCTCAGCTAC TCCATCCTTG CACCCCTGGG GGCCCAGCCC 5760 

ACCCGCATGC ACAGAGCAGG GGCTAGGTGT CTCCTGGGAG GCATGAAGGG GGCAAGGTCC 5820 

GTCCTCTGTG GGCCCAAACC TATTTGTAAC CAAAGAGCTG GGAGCAGCAC AAGGACCCAG 5880 

CCTTTGTTCT GCACTTAATA AATGGTTTTG CTACTGCTAA AAAAAAAAAA AAAAAAAAAA 5940 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 5994 

Seq ID NO: C92 DNA Sequence 
Nucleic Acid Accession #: NM_023915 
Coding sequence: 250..1326 

1 11 21 31 41 51 

| | | | | | 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 
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TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 
GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 
ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 
AGGCAATTCA TAAGTCAGTC AAGCOGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 
AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 
ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 
TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 
ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 
GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 
TTCATTATCC TTAAAAAAAA AA 

Seq ID NO: C93 DNA Sequence 
Nucleic Acid Accession #: NM_020789.1
Coding sequence: 208.. 3699 

1 11 21 31 41 51

| | | | | |

GGCACGAGGG TGGAGCCGAG CGGTGCGGAG CAGATCTGGT GGTTCTCCGG AGAGCAGCTT 60 

CCTTGGGTGT TACATGAGCC AAGCCCTCAC TGTACAGAAG AGTGAGAGCT GAAACCTGTT 120 

CCCTGAGCTG ATCAGAAGGA CATCCCTTGG CCCCTCCATC TGGGCTCCTG TGGATAGGAG 180 

GGGCTGGGTG AGCAGGCCAG CTGGGCTATG GTGTGGTGCC TCGGCCTGGC CGTCCTCAGC 240 

CTGGTCATCA GCCAGGGGGC TGACGGTCGA GGGAAGCCTG AGGTGGTATC GGTGGTGGGC 300 

CGGGCTGAGG AGAGTGTGGT GCTGGGCTGT GACCTGCTGC CCCCGGCCGG CCGGCCCCCC 360 

CTGCATGTCA TCGAGTGGCT GCGCTTTGGA TTCCTGCTTC CCATCTTCAT CCAGTTCGGC 420 

CTCTACTCTC CCCGAATTGA CCCTGATTAC GTGGGACGAG TCCGGCTGCA GAAGGGGGCC 480 

TCTCTCCAGA TTGAGGGTCT CCGGGTGGAA GACCAGGGCT GGTACGAGTG CCGCGTGTTC 540 

TTCCTGGACC AGCACATCCC TGAAGACGAT TTTGCTAACG GCTCCTGGGT GCATCTGACA 600 

GTCAATTCAC CCCCTCAATT CCAGGAGACA CCTCCTGCTG TGTTGGAAGT GCAGGAACTG 660 

GAGCCTGTGA CCCTGCGTTG TGTGGCCCGT GGCAGCCCCC TGCCTCATGT GACGTGGAAG 720 

CTCCGAGGAA AGGACCTTGG CCAGGGCCAG GGCCAGGTGC AAGTGCAGAA CGGGACGCTG 780

CGGATCCGCC GGGTAGAGCG AGGCAGCTCT GGGGTCTACA CCTGCCAAGC CTCCAGCACT 840 

GAGGGCAGCG CCACCCACGC CACCCAGCTG CTAGTGCTAG GACCCCCAGT CATCGTGGTG 900 

CCCCCCAAGA ACAGCACAGT CAATGCCTCC CAGGATGTTT CATTGGCCTG CCATGCTGAG 960 

GCATACCCTG CTAACCTCAC CTACAGCTGG TTCCAGGACA ACATCAATGT CTTCCACATT 1020 

AGCCGCCTGC AGCCCCGGGT GCAGATCCTG GTGGACGGGA GCCTGCGGCT GCTGGCCACC 1080 

CAGCCTGATG ATGCCGGCTG CTACACCTGT GTGCCCAGCA ATGGCCTCCT GCATCCACCC 1140 

TCAGCCTCTG CCTACCTCAC TGTGCTCTGC ATGCCGGGGG TGATCCGCTG CCCGGTTCGT 1200 

GCCAACCCCC CACTGCTCTT TGTCAGCTGG ACCAAGGATG GAAAGGCCCT GCAGCTGGAC 1260 

AAGTTCCCTG GCTGGTCCCA GGGCACAGAA GGCTCACTGA TCATCGCCCT GGGGAACGAG 1320 

GATGCCCTGG GAGAATACTC CTGCACCCCC TACAACAGTC TTGGTACCGC CGGGCCCTCT 1380 

CCTGTGACCC GCGTGCTGCT CAAGGCTCCC CCAGCTTTTA TAGAGCGGCC CAAGGAAGAA 1440 

TATTTCCAAG AAGTAGGGCG GGAGCTGCTC ATCCCCTGCT CCGCCCAAGG GGACCCTCCT 1500 

CCTGTTGTCT CTTGGACCAA GGTGGGCCGG GGGCTGCAAG GCCAGGCCCA GGTGGACAGC 1560 

AACAGCAGCC TCATCCTGCG ACCATTGACC AAGGAGGCCC ACGGGCACTG GGAATGCAGT 1620 

GCCAGCAATG CTGTGGCCCG AGTGGCCACC TCCACGAACG TCTACGTGCT GGGCACTAGC 1680 

CCTCATGTTG TCACCAATGT GTCCGTGGTG GCTTTGCCCA AGGGTGCCAA TGTCTCCTGG 1740 

GAGCCTGGCT TTGATGGTGG TTATCTGCAG AGATTCAGTG TCTGGTACAC CCCACTGGCC 1800 

AAGCGTCCTG ACCGAATGCA CCATGACTGG GTGTCCTTGG CAGTGCCTGT GGGGGCTGCT 1860 

CACCTCCTAG TGCCAGGGCT GCAGCCCCAC ACCCAGTACC AGTTCAGCGT GCTAGCTCAG 1920 

AACAAGCTGG GGAGTGGTCC CTTCAGCGAA ATCGTCTTGT CTGCTCCGGA AGGGCTTCCT 1980 

ACCACGCCAG CTGCACCCGG GCTTCCCCCA ACAGAGATAC CGCCTCCCCT GTCCCCTCCG 2040 
CGGGGTCTGG TGGCAGTGAG GACACCCCGG GGGGTACTCC TGCATTGGGA TCCCCCAGAG 2100
CTGGTCCCTA AGAGACTGGA TGGCTACGTC TTGGAAGGCC GGCAAGGCTC CCAGGGCTGG 2160 
GAGGTGCTGG ACCCGGCTGT GGCAGGCACA GAAACAGAGC TGCTGGTGCC AGGCCTCATC 2220 
AAGGATGTTC TCTACGAGTT CCGCCTCGTG GCCTTCGCGG GCAGCTTCGT CAGCGACCCC 2280 
AGCAACACGG CCAACGTCTC CACTTCCGGT CTGGAGGTCT ACCCTTCGCG CACGCAGCTG 2340
CCGGGCCTCC TGCCTCAGCC CGTGCTGGCC GGCGTGGTGG GCGGAGTCTG CTTTCTGGGA 2400 
GTGGCCGTCC TTGTGAGCAT CCTGGCCGGC TGCCTCCTGA ACCGGCGCAG GGCTGCCCGC 2460 
CGCCGCCGCA AGCGCCTCCG CCAAGATCCA CCTCTTATCT TCTCTCCGAC CGGGAAGTCA 2520 
GCTGCACCCT CTGCTCTGGG CTCAGGCAGT CCTGACAGCG TGGCGAAGCT GAAGCTCCAG 2580 
GGATCCCCAG TCCCCAGCCT GCGCCAGAGT CTGCTCTGGG GGGATCCTGC CGGAACTCCC 2640 
AGCCCCCACC CGGATCCTCC ATCTAGCCGG GGACCCTTAC CTCTGGAGCC CATTTGCCGG 2700
GGCCCAGACG GGCGCTTTGT GATGGGGCCC ACTGTGGCGG CCCCCCAGGA AAGGTCAGGC 2760 
CGGGAGCAGG CAGAACCTCG GACTCCAGCC CAGCGTCTGG CCCGGTCCTT TGACTGTAGC 2820 
AGCAGCAGCC CCAGTGGGGC ACCCCAGCCC CTCTGCATTG AAGACATCAG CCCTGTGGCA 2880 
CCCCCTCCAG CAGCCCCACC CAGTCCCTTG CCAGGTCCTG GACCCCTGCT CCAGTACCTG 2940 
AGCCTGCCCT TCTTCCGAGA GATGAATGTG GATGGGGACT GGCCCCCGCT TGAGGAGCCC 3000 
AGCCCTGCTG CACCCCCAGA TTACATGGAT ACCCGGCGCT GTCCCACCTC ATCTTTCCTT 3060 
CGTTCTCCAG AAACCCCTCC TGTATCCCCC AGGGAATCAC TTCCTGGGGC TGTGGTAGGG 3120 
GCTGGGGCCA CTGCAGAGCC CCCTTACACA GCCCTGGCTG ACTGGACACT GAGGGAGCGG 3180 
CTGCTGCCAG GCCTTCTCCC TGCTGCCCCT CGAGGCAGCC TCACCAGCCA GAGCAGCGGG 3240 

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10 
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55 



CGAGGCAGCG 
CTCAGCCCTG 
AGGGAGCATG 
TGGGACTCAG 
AGCTCCCGGC 
TGCCTCCTGA 
GAATTCCTGG 
CAGCCAGTCC 
GAAAAGGCAT 
GGCACTGCCC 
CACCTTGGTA 
TCTTAGGTCT 
GTGTGTGAAG 
AAAA 



CTTCGTTCCT
CTCCAGGAGA 
TGGTGACAGT 
AATTCCCTGG 
TCAGACCTGA 
ACACTGCCCA 
CCTTCCGCCG 
CCCACCCCGA 
ATGGACCTGC 
CTGCCTGCCC 
TGGAATGTAT 
GCCTCTCTCT 
TTTTTTACAG 



GCGGCCCCCC 
CACCAGCAGC 
CAGCAAGAGG 
GGACATGGAA 
AGCTGAGACA 
TGTTACTGGC 
CCGCCGAGAT 
ACAGGCCACT 
AAAGGAGGCC 

GTGCTGACCC 
CTCTCTCTCT 
GTGAATAAAC 



TCCACAGCCC 
TGGGCCAGTG 
AGGAACACAT 
TTGCTGGAGA 
GAGCTAGGTG 
CCTGAGGCCC 
GCTACTAGGG 
CTGCTGTGAA 
CCCAACCAGA 
CAGGCACAGA 
CCTAGGTGAG 
CTCTCTCTCT 
AAAGTTTGAA 



CCTCTGCAGG 
GCCCTGAGAG 
CTGTGGACGA 
CTTTGCACCT 
TGAAGACTCC 
GCTGTGCTGC 
CTCGGCTACC 
CATCCCTAAT 
CAGACCTAGT 
CCCTGATAGT 
TCTGGGGATT 
CTCTCTCTGT 
AGAAAAAAAA 



AGGCAGCTAC 
ATGGCCCCGA 
GAACTATGAG 
GGGCTTGGCC 
AGAGGAGGGC 
CCTTCGGGAG 
AGCCTATCGA 
GTGAGGCTGT 
TTCAAACGAG 



GGAACAGGGA 
GTGTGTGTGT 
AAAAAAAAAA 



Seq ID NO: C94 DNA Sequence 
Nucleic Acid Accession #: NM_006875
Coding sequence: 186.. 1190 



1 11 21 31 41 51

| | | | | |

GAATTCGGCA CGAGCGCGCG GCGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60
CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 120
CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180
CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240
CGCCAGGAGG CAAGGATCGG GAAGCGTTCG AGGCCGAGTA TCGACTCGGC CCCCTCCTGG 300
GTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACCGCCT CACAGATCGA CTCCAGGTGG 360
CCATCAAAGT GATTCCCCGG AATCGTGTGC TGGGCTGGTC CCCCTTGTCA GACTCAGTCA 420
CATGCCCACT CGAAGTCGCA CTGCTATGGA AAGTGGGTGC AGGTGGTGGG CACCCTGGCG 480
TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT CATGCTGGTC CTCGAGCGGC 540
CTTTGCCCGC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTG GGTGAAGGCC 600
CAAGCCGCTG CTTCTTTGGC CAAGTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660
TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 720
AACTCATTGA TTTTGGTTCT GGTGCCCTGC TTCATGATGA ACCCTACACT GACTTTGATG 780
GGACAAGGGT GTACAGCCCC CCAGAGTGGA TCTCTCGACA CCAGTACCAT GCACTCCCGG 840
CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900
AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960
GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020
AGATCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCCCTCC 1080
AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140
TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200
GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260
ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 1320
CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA CTTCTTGCTT 1380
CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 1440
TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500
TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560
AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620
TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680
TTTTTTTTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740
CTGGTGAGAA GAACCTTAAT TCCATAATTT GGGAAGGAAT GGAAGATGGA CACCACCGGA 1800
CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG 1860
GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920
CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1980
TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040
CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 2088



3300 
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3480 
3540 
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4020 
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Seq ID NO: C95 DNA Sequence 

Nucleic Acid Accession #: NM_002510.1 

Coding sequence: 92.. 1774 



1 11 21 31 41 51

| | | | | |

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60
CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120
TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180
CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240
TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA 300
AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360
GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420
CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480
ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540
AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600
ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660
TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720
GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780
CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840
ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900
TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960
CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020
CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 1080
CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140
TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200
AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260
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GCCATGGCCT 
GGAGGTCTGT 
CCCTGTGGAT 
GACGTACTGT 
GATTTCTGTT 
CTCCGTTGGC 
CAAGGAATAC 
TGTCTTTCTC 
ACTCAAAAAC 
TTTTCAGTGC 
TTATTGTTAA 
TTTAGAGATG 
AAAGCAACTT 
TAACTAGTAG 
AGCCTAACCC 
ATGCATAAAG 
TTCAATACAC 
GTGCACACTT 
TGACAACCTA 
GGACATTTAG 
ATTTCCAAAT 
AGATGAGGTC 
ACTAAAACCA 
AAGTGTGGGA 



GAAAGCTCCC 
ACCATCATTT 
GTGGATGAGA 
GTGAACCTCA 
CCTGACAGAG 
TGCTTGGCCA 
AACCCAATAG 
AACCGTGCAA 
CAAGAATTTA 
CATTGATGTG 
ATAGATATTG 
GGGAGAGGGA 
AGCAAGGCTT 
GATAGAAACA 
AGGTTAACTG 
CCAATGTAGT 
ACTCATGAAC 
GCTAGACTCA 
CTTTGCTTGG 
TTAGTGCTTT 
TTTTGTATAG 

TCTACTATAT 
AGAGACAAAA 



TAATAGACTT 
CTGACCCCAC 
TGTGTCTGCT 
CCCTGGGGGA 
ACCCAGCCTC 
TATTTGTCAC 
AAAATAGTCC 
AAGCCGTGTT 
AAGGAGTTTC 
AGATGTGCTG 

TTATACTGCA 
CTTTTCATTA 
CTGTGTCCCG 
CAAGAAGAGG 
CCAGTTTCTA 
TCCTGATGGA 
GAAAAAATAC 
CTGAGTGAAG 
TTATATACCA 
TCGCTGCACA 
CATGGCAACT 
GTTAGACATG 
AAAAAAAAA 



TGTCGTGACC 
CTGCGAGATC 
GACTGTGAGA 
TGACACAAGC 
GCCTTTAAGG 
TGTGATCTCC 
TGGGAATGTG 
CTTCCCGGGA 
TTAAATTTCG 
GAGTGGCTAT 
AAGTTGAATT 
GGCAGCTTCA 
TTTTTTATGT 
AGAGTAAGGA 
CGGGATACTT 
AGATCATGTT 
ACAATAACAG 
TACTCTCATA 
GAATGATATT 
GGCATGATGC 
TATTTGAAAT 
TGATCAGTAA 
ACATTCTTTT 



TGCCAAGGGA 
ACCCAGAACA 
CGAACCTTCA 
CTGGCTCTCA 
ATGGCAAACA 
CTCTTGGTGT 
GTCAGAAGCA 
AACCAGGAAA 
ACCTTGTTTC 
TAACCTTTTT 
TTTTATAGGT 
GCCATGTTGT 
TTCACTTATA 
GAGAAGCTAC 
TCAGCTTTCC 
CCAAGCTAAC 
GCCCAAGCCT 
AATGGGTGGG 
CATATATTCA 
TGAGTGACAC 
CATATATTAA 
GGATTTCACC 
TCTCTCCTTC 



GCATTCCCAC 
CAGTCTGCAG 
ATGGGTCTGG 
CGAGCACCCT 
GTGCCCTGAT 
ACAAAAAACA 
AAGGCCTGAG 
AGGATCCGCT 
TGAAGCTCAC 
TTCCTAAAGA 
TAAATGTCAT 
GAAACTGATA 
AAGTCTTAGG 
TATTGATTAG 
ATGTAACTGT 
TGAATCCCAC 
GTGGTATGAT 
AGTATTTTGG 
TTTATTCCAT 
TCTTGTGTAT 
GACTTTCCAA 
TCTGTTTGTA 
CTGAAAAATA 



Seq ID NO: C96 DNA Sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1-4247 
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2640 
2669 



1 11 21 31 41 51 

| | | | | |

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCA 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320 

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380 

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440 

CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500 

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560 

CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 

GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680 

AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740 

ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800 

GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860 

GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 

GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 

ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 

ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 

CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 

ACTGAGGTAA CACCTCATGC TTTTACCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 

GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 

CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 

CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 

TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460 

GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520 

ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580

TTTGAGGAAG TGCAGAGCTG TACTGTTGAC TTAGGTATTA CAGCAGACAG CTCCAACCAC 2640 

CCAGACAACA AGCACAAGAA TCGATACATA AATATCGTTG CCTATGATCA TAGCAGGGTT 2700 

AAGCTAGCAC AGCTTGCTGA AAAGGATGGC AAACTGACTG ATTATATCAA TGCCAATTAT 2760 

GTTGATGGCT ACAACAGACC AAAAGCTTAT ATTGCTGCCC AAGGCCCACT GAAATCCACA 2820 

GCTGAAGATT TCTGGAGAAT GATATGGGAA CATAATGTGG AAGTTATTGT CATGATAACA 2880 

AACCTCGTGG AGAAAGGAAG GAGAAAATGT GATCAGTACT GGCCTGCCGA TGGGAGTGAG 2940 

GAGTACGGGA ACTTTCTGGT CACTCAGAAG AGTGTGCAAG TGCTTGCCTA TTATACTGTG 3000 

AGGAATTTTA CTCTAAGAAA CACAAAAATA AAAAAGGGCT CCCAGAAAGG AAGACCCAGT 3060 

GGACGTGTGG TCACACAGTA TCACTACACG CAGTGGCCTG ACATGGGAGT ACCAGAGTAC 3120 

TCCCTGCCAG TGCTGACCTT TGTGAGAAAG GCAGCCTATG CCAAGCGCCA TGCAGTGGGG 3180 

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CCTGTTGTCG TCCACTGCAG TGCTGGAGTT GGAAGAACAG GCACATATAT TGTGCTAGAC 3240 

AGTATGTTGC AGCAGATTCA ACACGAAGGA ACTGTCAACA TATTTGGCTT CTTAAAACAC 3300 

ATCCGTTCAC AAAGAAATTA TTTGGTACAA ACTGAGGAGC AATATGTCTT CATTCATGAT 3360

ACACTGGTTG AGGCCATACT TAGTAAAGAA ACTGAGGTGC TGGACAGTCA TATTCATGCC 3420 

TATGTTAATG CACTCCTCAT TCCTGGACCA GCAGGCAAAA CAAAGCTAGA GAAACAATTC 3480

CAGCTCCTGA GCCAGTCAAA TATACAGCAG AGTGACTATT CTGCAGCCCT AAAGCAATGC 3540 

AACAGGGAAA AGAATCGAAC TTCTTCTATC ATCCCTGTGG AAAGATCAAG GGTTGGCATT 3600 

TCATCCCTGA GTGGAGAAGG CACAGACTAC ATCAATGCCT CCTATATCAT GGGCTATTAC 3660 

CAGAGCAATG AATTCATCAT TACCCAGCAC CCTCTCCTTC ATACCATCAA GGATTTCTGG 3720 

AGGATGATAT GGGACCATAA TGCCCAACTG GTGGTTATGA TTCCTGATGG CCAAAACATG 3780

GCAGAAGATG AATTTGTTTA CTGGCCAAAT AAAGATGAGC CTATAAATTG TGAGAGCTTT 3840 

AAGGTCACTC TTATGGCTGA AGAACACAAA TGTCTATCTA ATGAGGAAAA ACTTATAATT 3900 

CAGGACTTTA TCTTAGAAGC TACACAGGAT GATTATGTAC TTGAAGTGAG GCACTTTCAG 3960 

TGTCCTAAAT GGCCAAATCC AGATAGCCCC ATTAGTAAAA CTTTTGAACT TATAAGTGTT 4020 

ATAAAAGAAG AAGCTGCCAA TAGGGATGGG CCTATGATTG TTCATGATGA GCATGGAGGA 4080

GTGACGGCAG GAACTTTCTG TGCTCTGACA ACCCTTATGC ACCAACTAGA AAAAGAAAAT 4140 

TCCGTGGATG TTTACCAGGT AGCCAAGATG ATCAATCTGA TGAGGCCAGG AGTCTTTGCT 4200 

GACATTGAGC AGTATCAGTT TCTCTACAAA GTGATCCTCA GCCTTGTGAG CACAAGGCAG 4260 

GAAGAGAATC CATCCACCTC TCTGGACAGT AATGGTGCAG CATTGCCTGA TGGAAATATA 4320 

GCTGAGAGCT TAGAGTCTTT AGTTTAA 4347

Seq ID NO: C97 DNA Sequence 
Nucleic Acid Accession #: XM_031379
Coding sequence: 148.. 7095 

1 11 21 31 41 51 

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAACTG AAATAATATA TGGAAATGAG 3300 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 
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ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 
GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 
TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 
GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 
AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 7944 

Seq ID NO: C98 DNA Sequence 
Nucleic Acid Accession #: NM_002851
Coding sequence: 77.. 4518 

1 11 21 31 41 51 

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
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CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 
TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 
TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 
TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 
CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 
TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 
GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 
AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 
CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 
TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 
TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 
GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120
AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 
ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 
GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 
GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 
GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 
CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 
CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 
GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 
GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 
AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 
AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 
AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 
GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 
TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 
GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 
ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 
TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 
GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260
GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 
GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 
CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 
TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 
CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 
CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680
TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 
CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 
CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 
TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 
TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 
AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040
ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 
ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 

AAAAAAAAAA AAAAAAAAAA AAAAAAA 5367 



Seq ID NO: C99 DNA Sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 501..4514



1 11 21 31 41 51

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 

ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 
ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380
ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440
CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500
AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620
ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680
TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740
ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800
TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860
CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920
AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980
GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040
TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100
TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160
TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220
AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280
ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340
AAAAAAAAAA AAAAAAAAAA AAA 5363



Seq ID NO: C100 DNA Sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148..4362



1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTTGT GAAGTTCTTA CAATGCAACA 960 

ATCTGGTTAT GTCATGCTGA TGGACTACTT ACAAAACAAT TTTCGAGAGC AACAGTACAA 1020 

GTTCTCTAGA CAGGTGTTTT CCTCATACAC TGGAAAGGAA GAGATTCATG AAGCAGTTTG 1080 

TAGTTCAGAA CCAGAAAATG TTCAGGCTGA CCCAGAGAAT TATACCAGCC TTCTTGTTAC 1140 

ATGGGAAAGA CCTCGAGTCG TTTATGATAC CATGATTGAG AAGTTTGCAG TTTTGTACCA 1200 

GCAGTTGGAT GGAGAGGACC AAACCAAGCA TGAATTTTTG ACAGATGGCT ATCAAGACTT 1260 

GGGTGCTATT CTCAATAATT TGCTACCCAA TATGAGTTAT GTTCTTCAGA TAGTAGCCAT 1320 

ATGCACTAAT GGCTTATATG GAAAATACAG CGACCAACTG ATTGTCGACA TGCCTACTGA 1380 

TAATCCTGAA CTTGATCTTT TCCCTGAATT AATTGGAACT GAAGAAATAA TCAAGGAGGA 1440 

GGAAGAGGGA AAAGACATTG AAGAAGGCGC TATTGTGAAT CCTGGTAGAG ACAGTGCTAC 1500 

AAACCAAATC AGGAAAAAGG AACCCCAGAT TTCTACCACA ACACACTACA ATCGCATAGG 1560 

GACGAAATAC AATGAAGCCA AGACTAACCG ATCCCCAACA AGAGGAAGTG AATTCTCTGG 1620 

AAAGGGTGAT GTTCCCAATA CATCTTTAAA TTCCACTTCC CAACCAGTCA CTAAATTAGC 1680 

CACAGAAAAA GATATTTCCT TGACTTCTCA GACTGTGACT GAACTGCCAC CTCACACTGT 1740 

GGAAGGTACT TCAGCCTCTT TAAATGATGG CTCTAAAACT GTTCTTAGAT CTCCACATAT 1800 

GAACTTGTCG GGGACTGCAG AATCCTTAAA TACAGTTTCT ATAACAGAAT ATGAGGAGGA 1860 

GAGTTTATTG ACCAGTTTCA AGCTTGATAC TGGAGCTGAA GATTCTTCAG GCTCCAGTCC 1920 

CGCAACTTCT GCTATCCCAT TCATCTCTGA GAACATATCC CAAGGGTATA TATTTTCCTC 1980 

CGAAAACCCA GAGACAATAA CATATGATGT CCTTATACCA GAATCTGCTA GAAATGCTTC 2040 

CGAAGATTCA ACTTCATCAG GTTCAGAAGA ATCACTAAAG GATCCTTCTA TGGAGGGAAA 2100 

TGTGTGGTTT CCTAGCTCTA CAGACATAAC AGCACAGCCC GATGTTGGAT CAGGCAGAGA 2160 

GAGCTTTCTC CAGACTAATT ACACTGAGAT ACGTGTTGAT GAATCTGAGA AGACAACCAA 2220 

GTCCTTTTCT GCAGGCCCAG TGATGTCACA GGGTCCCTCA GTTACAGATC TGGAAATGCC 2280 

ACATTATTCT ACCTTTGCCT ACTTCCCAAC TGAGGTAACA CCTCATGCTT TTACCCCATC 2340 

CTCCAGACAA CAGGATTTGG TCTCCACGGT CAACGTGGTA TACTCGCAGA CAACCCAACC 2400 

GGTATACAAT GAGGCCAGTA ATAGTAGCCA TGAGTCTCGT ATTGGTCTAG CTGAGGGGTT 2460 

GGAATCCGAG AAGAAGGCAG TTATACCCCT TGTGATCGTG TCAGCCCTGA CTTTTATCTG 2520

TCTAGTGGTT CTTGTGGGTA TTCTCATCTA CTGGAGGAAA TGCTTCCAGA CTGCACACTT 2580 

TTACTTAGAG GACAGTACAT CCCCTAGAGT TATATCCACA CCTCCAACAC CTATCTTTCC 2640 

AATTTCAGAT GATGTCGGAG CAATTCCAAT AAAGCACTTT CCAAAGCATG TTGCAGATTT 2700 
ACATGCAAGT AGTGGGTTTA CTGAAGAATT TGAGACACTG AAAGAGTTTT ACCAGGAAGT 2760 
GCAGAGCTGT ACTGTTGACT TAGGTATTAC AGCAGACAGC TCCAACCACC CAGACAACAA 2820 
GCACAAGAAT CGATACATAA ATATCGTTGC CTATGATCAT AGCAGGGTTA AGCTAGCACA 2880 
GCTTGCTGAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 

AAAAAAAAAA AAAAAAAAAA A 5481 

Seq ID NO: C101 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3340 



1 11 21 31 41 51 

I I I I I I

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCG 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AGGCCAGTAA TAGTAGCCAT 1260 

GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG GAATCCGAGA AGAAGGCAGT TATACCCCTT 1320 

GTGATCGTGT CAGCCCTGAC TTTTATCTGT CTAGTGGTTC TTGTGGGTAT TCTCATCTAC 1380 

TGGAGGAAAT GCTTCCAGAC TGCACACTTT TACTTAGAGG ACAGTACATC CCCTAGAGTT 1440 

ATATCCACAC CTCCAACACC TATCTTTCCA ATTTCAGATG ATGTCGGAGC AATTCCAATA 1500 

AAGCACTTTC CAAAGCATGT TGCAGATTTA CATGCAAGTA GTGGGTTTAC TGAAGAATTT 1560 

GAGACACTGA AAGAGTTTTA CCAGGAAGTG CAGAGCTGTA CTGTTGACTT AGGTATTACA 1620 

GCAGACAGCT CCAACCACCC AGACAACAAG CACAAGAATC GATACATAAA TATCGTTGCC 1680 

TATGATCATA GCAGGGTTAA GCTAGCACAG CTTGCTGAAA AGGATGGCAA ACTGACTGAT 1740 

TATATCAATG CCAATTATGT TGATGGCTAC AACAGACCAA AAGCTTATAT TGCTGCCCAA 1800 

GGCCCACTGA AATCCACAGC TGAAGATTTC TGGAGAATGA TATGGGAACA TAATGTGGAA 1860 

GTTATTGTCA TGATAACAAA CCTCGTGGAG AAAGGAAGGA GAAAATGTGA TCAGTACTGG 1920 

CCTGCCGATG GGAGTGAGGA GTACGGGAAC TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG 1980 

CTTGCCTATT ATACTGTGAG GAATTTTACT CTAAGAAACA CAAAAATAAA AAAGGGCTCC 2040 

CAGAAAGGAA GACCCAGTGG ACGTGTGGTC ACACAGTATC ACTACACGCA GTGGCCTGAC 2100 

ATGGGAGTAC CAGAGTACTC CCTGCCAGTG CTGACCTTTG TGAGAAAGGC AGCCTATGCC 2160 

AAGCGCCATG CAGTGGGGCC TGTTGTCGTC CACTGCAGTG CTGGAGTTGG AAGAACAGGC 2220 

ACATATATTG TGCTAGACAG TATGTTGCAG CAGATTCAAC ACGAAGGAAC TGTCAACATA 2280 

TTTGGCTTCT TAAAACACAT CCGTTCACAA AGAAATTATT TGGTACAAAC TGAGGAGCAA 2340 

TATGTCTTCA TTCATGATAC ACTGGTTGAG GCCATACTTA GTAAAGAAAC TGAGGTGCTG 2400 

GACAGTCATA TTCATGCCTA TGTTAATGCA CTCCTCATTC CTGGACCAGC AGGCAAAACA 2460 

AAGCTAGAGA AACAATTCCA GCTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 2520 

GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT CCCTGTGGAA 2580 

AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT CAATGCCTCC 2640 
TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CTCAGCACCC TCTCCTTCAT 2700

ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACTGGT GGTTATGATT 2760 

CCTGATGGCC AAAACATGGC AGAAGATGAA TTTGTTTACT GGCCAAATAA AGATGAGCCT 2820 

ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATGGCTGAAG AACACAAATG TCTATCTAAT 2880 

GAGGAAAAAC TTATAATTCA GGACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 2940

GAAGTGAGGC ACTTTCAGTG TCCTAAATGG CCAAATCCAG ATAGCCCCAT TAGTAAAACT 3000 

TTTGAACTTA TAAGTGTTAT AAAAGAAGAA GCTGCCAATA GGGATGGGCC TATGATTGTT 3060 

CATGATGAGC ATGGAGGAGT GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC 3120 

CAACTAGAAA AAGAAAATTC CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG 3180 

AGGCCAGGAG TCTTTGCTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 3240

CTTGTGAGCA CAAGGCAGGA AGAGAATCCA TCTACCTCTC TGGACAGTAA TGGTGCAGCA 3300 

TTGCCTGATG GAAATATAGC TGAGAGCTTA GAGTCTTTAG 3340 

Seq ID NO: C102 DNA Sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..4480 

1 11 21 31 41 51 

I I I I I I

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCA 480

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGTT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320 

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380 

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440

CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560

CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 

GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680

AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740 

ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800

GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860

GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 

GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980

ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 

ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 

CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160

ACTGAGGTAA CACCTCATGC TTTTATCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 

GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 

CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 

CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 

TACTGGAGGA AATGCTTCCA GACTGGACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460

GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520 

ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580 

TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640

ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 

GCCCATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760

GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 

CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880 

GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 

TGGCCTGCCG ATGGGAGTGA GGAGTACGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000 

GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060

TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120 

GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180 
GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACACGAAGG AACTGTCAAC 3300 
ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360
CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420 
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480 
ACAAAGCTAG AGAAACAATT CCAGGGTCTC ACTCTGTCAC CCAGGCTGGA GTGCAGAGGC 3540 
ACAATCTCGG CTCACTGCAA CCTTCCTCTC CCTGGCTTAA CTGATCCTCC TACCTCAGCC 3600
TCCCGAGTGG CTAGGACTAT ACTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 3660
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT CCCTGTGGAA 3720 
AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT CAATGCCTCC 3780 
TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CCCAGCACCC TCTCCTTCAT 3840 
ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACTGGT GGTTATGATT 3900 
CCTGATGGCC AAAACATGGC AGAAGATGAA TTTGTTTACT GGCCAAATAA AGATGAGCCT 3960

ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATGGCTGAAG AACACAAATG TCTATCTAAT 4020 

GAGGAAAAAC TTATAATTCA GGACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 4080

GAAGTGAGGC ACTTTCAGTG TCCTAAATGG CCAAATCCAG ATAGCCCCAT TAGTAAAACT 4140

TTTGAACTTA TAAGTGTTAT AAAAGAAGAA GCTGCCAATA GGGATGGGCC TATGATTGTT 4200

CATGATGAGC ATGGAGGAGT GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC 4260 

CAACTAGAAA AAGAAAATTC CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG 4320 

AGGCCAGGAG TCTTTGCTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 4380 

CTTGTGAGCA CAAGGCAGGA AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA 4440 

TTGCCTGATG GAAATATAGC TGAGAGCTTA GAGTCTTTAG 4480

Seq ID NO: C103 DNA Sequence
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..4220

1 11 21 31 41 51

I I I I I I

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 
TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 
ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180
CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 
AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 
ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 
TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420
GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCG 480
GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 
ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 
GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 
CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC TCCTCCCTGC 720 
ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780
GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 
TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 
ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 
GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 
ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080
CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 
AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 
AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 
TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320 
GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380
ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440 
CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500 
AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560 
CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 
GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680
AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740 
ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800 
GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860 
GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 
GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980
ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 
ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 
CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 
ACTGAGGTAA CACCTCATGC TTTTACCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 
GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280
CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 
CTTGTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 
TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460 
GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520 
ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580
TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640 
ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 
GCCTATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760
GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 
CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880
GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 
TGGCCTGCCG ATGGGAGTGA GGAGTACGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000 
GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060 
TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120 
GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180
GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACACGAAGG AACTGTCAAC 3300 
ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360 
CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420 
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480
ACAAAGCTAG AGAAACAATT CCAGCTCCTG AGCCAGTCAA ATATACAGCA GAGTGACTAT 3540 
TCTGCAGCCC TAAAGCAATG CAACAGGGAA AAGAATCGAA CTTCTTCTAT CATCCCTGTG 3600 
GAAAGATCAA GGGTTGGCAT TTCATCCCTG AGTGGAGAAG GCACAGACTA CATCAATGCC 3660 
TCCTATATCA TGGGCTATTA CCAGAGCAAT GAATTCATCA TTACCCAGCA CCCTCTCCTT 3720 
CATACCATCA AGGATTTCTG GAGGATGATA TGGGACCATA ATGCCCAACT GGTGGTTATG 3780
ATTCCTGATG GCCAAAACAT GGCAGAAGAT GAATTTGTTT ACTGGCCAAA TAAAGATGAG 3840 
CCTATAAATT GTGAGAGCTT TAAGGTCACT CTTATGGCTG AAGAACACAA ATGTCTATCT 3900 
AATGAGGAAA AACTTATAAT TCAGGACTTT ATCTTAGAAG CTACACAGGC ATGGAGGAGT 3960 
GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC 4020 
CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA 4080

CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA 4140 

AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC 4200 

TGAGAGCTTA GAGTCTTTAG 4220 

Seq ID NO: C104 DNA Sequence 

Nucleic Acid Accession #: XM_002914.6 

Coding sequence: 1..4314 

1 11 21 31 41 51 

I I I I I I 

ATGAAGGATA TCGACATAGG AAAAGAGTAT ATCATCCCCA GTCCTGGGTA TAGAAGTGTG 60 

AGGGAGAGAA CCAGCACTTC TGGGACGCAC AGAGACCGTG AAGATTCCAA GTTCAGGAGA 120 

ACTCGACCGT TGGAATGCCA AGATGCCTTG GAAACAGCAG CCCGAGCCGA GGGCCTCTCT 180

CTTGATGCCT CCATGCATTC TCAGCTCAGA ATCCTGGATG AGGAGCATCC CAAGGGAAAG 240 

TACCATCATG GCTTGAGTGC TCTGAAGCCC ATCCGGACTA CTTCCAAACA CCAGCACCCA 300 

GTGGACAATG CTGGGCTTTT TTCCTGTATG ACTTTTTCGT GGCTTTCTTC TCTGGCCCGT 360 

GTGGCCCACA AGAAGGGGGA GCTCTCAATG GAAGACGTGT GGTCTCTGTC CAAGCACGAG 420 

TCTTCTGACG TGAACTGCAG AAGACTAGAG AGACTGTGGC AAGAAGAGCT GAATGAAGTT 480 

GGGCCAGACG CTGCTTCCCT GCGAAGGGTT GTGTGGATCT TCTGCCGCAC CAGGCTCATC 540 

CTGTCCATCG TGTGCCTGAT GATCACGCAG CTGGCTGGCT TCAGTGGACC AGCCTTCATG 600 

GTGAAACACC TCTTGGAGTA TACCCAGGCA ACAGAGTCTA ACCTGCAGTA CAGCTTGTTG 660 

TTAGTGCTGG GCCTCCTCCT GACGGAAATC GTGCGGTCTT GGTCGCTTGC ACTGACTTGG 720 

GCATTGAATT ACCGAACCGG TGTCCGCTTG CGGGGGGCCA TCCTAACCAT GGCATTTAAG 780 

AAGATCCTTA AGTTAAAGAA CATTAAAGAG AAATCCCTGG GTGAGCTCAT CAACATTTGC 840 

TCCAACGATG GGCAGAGAAT GTTTGAGGCA GCAGCCGTTG GCAGCCTGCT GGCTGGAGGA 900 

CCCGTTGTTG CCATCTTAGG CATGATTTAT AATGTAATTA TTCTGGGACC AACAGGCTTC 960 

CTGGGATCAG CTGTTTTTAT CCTCTTTTAC CCAGCAATGA TGTTTGCATC ACGGCTCACA 1020 

GCATATTTCA GGAGAAAATG CGTGGCCGCC ACGGATGAAC GTGTCCAGAA GATGAATGAA 1080 

GTTCTTACTT ACATTAAATT TATCAAAATG TATGCCTGGG TCAAAGCATT TTCTCAGAGT 1140 

GTTCAAAAAA TCCGCGAGGA GGAGCGTCGG ATATTGGAAA AAGCTGGGTA CTTCCAGAGC 1200 

ATCACTGTGG GTGTGGCTCC CATTGTGGTG GTGATTGCCA GCGTGGTGAC CTTCTCTGTT 1260

CATATGACCC TGGGCTTCGA TCTGACAGCA GCACAGGCTT TCACAGTGGT GACAGTCTTC 1320 

AATTCCATGA CTTTTGCTTT GAAAGTAACA CCGTTTTCAG TAAAGTCCCT CTCAGAAGCC 1380 

TCAGTGGCTG TTGACAGATT TAAGAGTTTG TTTCTAATGG AAGAGGTTCA CATGATAAAG 1440 

AACAAACCAG CCAGTCCTCA CATCAAGATA GAGATGAAAA ATGCCACCTT GGCATGGGAC 1500 

TCCTCCCACT CCAGTATCCA GAACTCGCCC AAGCTGACCC CCAAAATGAA AAAAGACAAG 1560 

AGGGCTTCCA GGGGCAAGAA AGAGAAGGTG AGGCAGCTGC AGCGCACTGA GCATCAGGCG 1620 

GTGCTGGCAG AGCAGAAAGG CCACCTCCTC CTGGACAGTG ACGAGCGGCC CAGTCCCGAA 1680 

GAGGAAGAAG GCAAGCACAT CCACCTGGGC CACCTGCGCT TACAGAGGAC ACTGCACAGC 1740 

ATCGATCTGG AGATCCAAGA GGGTAAACTG GTTGGAATCT GCGGCAGTGT GGGAAGTGGA 1800 

AAAACCTCTC TCATTTCAGC CATTTTAGGC CAGATGACGC TTCTAGAGGG CAGCATTGCA 1860 

ATCAGTGGAA CCTTCGCTTA TGTGGCCCAG CAGGCCTGGA TCCTCAATGC TACTCTGAGA 1920 

GACAACATCC TGTTTGGGAA GGAATATGAT GAAGAAAGAT ACAACTCTGT GCTGAACAGC 1980 

TGCTGCCTGA GGCCTGACCT GGCCATTCTT CCCAGCAGCG ACCTGACGGA GATTGGAGAG 2040 

CGAGGAGCCA ACCTGAGCGG TGGGCAGCGC CAGAGGATCA GCCTTGCCCG GGCCTTGTAT 2100 

AGTGACAGGA GCATCTACAT CCTGGACGAC CCCCTCAGTG CCTTAGATGC CCATGTGGGC 2160 

AACCACATCT TCAATAGTGC TATCCGGAAA CATCTCAAGT CCAAGACAGT TCTGTTTGTT 2220 

ACCCACCAGT TACAGTACCT GGTTGACTGT GATGAAGTGA TCTTCATGAA AGAGGGCTGT 2280 

ATTAGGGAAA GAGGCACCCA TGAGGAACTG ATGAATTTAA ATGGTGACTA TGCTACCATT 2340 

TTTAATAACC TGTTGCTGGG AGAGACACCG CCAGTTGAGA TCAATTCAAA AAAGGAAACC 2400 

AGTGGTTCAC AGAAGAAGTC ACAAGACAAG GGTCCTAAAA CAGGATCAGT AAAGAAGGAA 2460 

AAAGCAGTAA AGCCAGAGGA AGGGCAGCTT GTGCAGCTGG AAGAAAAAGG GCAGGGTTCA 2520 

GTGCCCTGGT CAGTATATGG TGTCTACATC CAGGCTGCTG GGGGCCCCTT GGCATTCCTG 2580 

GTTATTATGG CCCTTTTCAT GCTGAATGTA GGCAGCACCG CCTTCAGCAC CTGGTGGTTG 2640 

AGTTACTGGA TCAAGCAAGG AAGCGGGAAC ACCACTGTGA CTCGAGGGAA CGAGACCTCG 2700 

GTGAGTGACA GCATGAAGGA CAATCCTCAT ATGCAGTACT ATGCCAGCAT CTACGCCCTC 2760 

TCCATGGCAG TCATGCTGAT CCTGAAAGCC ATTCGAGGAG TTGTCTTTGT CAAGGGCACG 2820 

CTGCGAGCTT CCTCCCGGCT GCATGACGAG CTTTTCCGAA GGATCCTTCG AAGCCCTATG 2880

AAGTTTTTTG ACACGACCCC CACAGGGAGG ATTCTCAACA GGTTTTCCAA AGACATGGAT 2940 

GAAGTTGACG TGCGGCTGCC GTTCCAGGCC GAGATGTTCA TCCAGAACGT TATCCTGGTG 3000 

TTCTTCTGTG TGGGAATGAT CGCAGGAGTC TTCCCGTGGT TCCTTGTGGC AGTGGGGCCC 3060 

CTTGTCATCC TCTTTTCAGT CCTGCACATT GTCTCCAGGG TCCTGATTCG GGAGCTGAAG 3120 

CGTCTGGACA ATATCACGCA GTCACCTTTC CTCTCCCACA TCACGTCCAG CATACAGGGC 3180 

CTTGCCACCA TCCACGCCTA CAATAAAGGG CAGGAGTTTC TGCACAGATA CCAGGAGCTG 3240 

CTGGATGACA ACCAAGCTCC TTTTTTTTTG TTTACGTGTG CGATGCGGTG GCTGGCTGTG 3300 

CGGCTGGACC TCATCAGCAT CGCCCTCATC ACCACCACGG GGCTGATGAT CGTTCTTATG 3360 

CACGGGCAGA TTCCCCCAGC CTATGCGGGT CTCGCCATCT CTTATGCTGT CCAGTTAACG 3420

GGGCTGTTCC AGTTTACGGT CAGACTGGCA TCTGAGACAG AAGCTCGATT CACCTCGGTG 3480 

GAGAGGATCA ATCACTACAT TAAGACTCTG TCCTTGGAAG CACCTGCCAG AATTAAGAAC 3540 

AAGGCTCCCT CCCCTGACTG GCCCCAGGAG GGAGAGGTGA CCTTTGAGAA CGCAGAGATG 3600 

AGGTACCGAG AAAACCTCCC TCTCGTCCTA AAGAAAGTAT CCTTCACGAT CAAACCTAAA 3660 

GAGAAGATTG GCATTGTGGG GCGGACAGGA TCAGGGAAGT CCTCGCTGGG GATGGCCCTC 3720 

TTCCGTCTGG TGGAGTTATC TGGAGGCTGC ATCAAGATTG ATGGAGTGAG AATCAGTGAT 3780 

ATTGGCCTTG CCGACCTCCG AAGCAAACTC TCTATCATTC CTCAAGAGCC GGTGCTGTTC 3840 

AGTGGCACTG TCAGATCAAA TTTGGACCCC TTCAACCAGT ACACTGAAGA CCAGATTTGG 3900 

GATGCCCTGG AGAGGACACA CATGAAAGAA TGTATTGCTC AGCTACCTCT GAAACTTGAA 3960

TCTGAAGTGA TGGAGAATGG GGATAACTTC TCAGTGGGGG AACGGCAGCT CTTGTGCATA 4020 

GCTAGAGCCC TGCTCCGCCA CTGTAAGATT CTGATTTTAG ATGAAGCCAC AGCTGCCATG 4080 

GACACAGAGA CAGACTTATT GATTCAAGAG ACCATCCGAG AAGCATTTGC AGACTGTACC 4140 

ATGCTGACCA TTGCCCATCG CCTGCACACG GTTCTAGGCT CCGATAGGAT TATGGTGCTG 4200 

GCCCAGGGAC AGGTGGTGGA GTTTGACACC CCATCGGTCC TTCTGTCCAA CGACAGTTCC 4260 

CGATTCTATG CCATGTTTGC TGCTGCAGAG AACAAGGTCG CTGTCAAGGG CTAG 4314 
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Nucleic Acid Accession #: NM_005688.1 
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1 11 21 31 41 

| | | | | |

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 
TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 
TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 
TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 
CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 
TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 
TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 
GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 
GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 
TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA
ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 
AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 
AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT
CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 
TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 
AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 
TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 
GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 
ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 
AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 
CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 
ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 
GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 
TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 
TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 
ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 
GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 
TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 
TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 
TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 
GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 
CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 
AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 
TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 
GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 
CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 
CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 
CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 
TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 
GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 
AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 
CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 
TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 
CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 
CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 
CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 
TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 
GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 
GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 
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GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 
TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 
TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 
CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 
ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 
CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 
CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 
GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 
ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: C106 DNA Sequence 
Nucleic Acid Accession #: NM_005562
Coding sequence: 90..3671

1 11 21 31 41 51

| | | | | |

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGGGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 
ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 
AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CGGGACGGTG 2520 
CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 
CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 
AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 
AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 
AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 
AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 
AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 
GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 
TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 
CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 
AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480
AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540
AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600
AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660
AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720
GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780
TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840
TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900
ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960
ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020
ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080
ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140
ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200
ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320
GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380
ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560
ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620
CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740
GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800
AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860
GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920
CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160
CAATTGTTAG ATGCC 5175



Seq ID NO: C107 DNA Sequence 
Nucleic Acid Accession #: NM_021101
Coding sequence: 221..856

1 11 21 31 41 51

| | | | | |

GAATTCGGCA CGAGGCCTCG TGCCGGGGAG CAACCTCAGC TTCTAGTATC CAGACTCCAG 60
CGCCGCCCCG GGCGCGGACC CCAACCCCGA CCCAGAGCTT CTCCAGCGGC GGCGCAGCGA 120
GCAGGGCTCC CCGCCTTAAC TTCCTCCGCG GGGCCCAGCC ACCTTCGGGA GTCCGGGTTG 180
CCCACCTGCA AACTCTCCGC CTTCTGCACC TGCCACCCCT GAGCCAGCGC GGGCGCCCGA 240
GCGAGTCATG GCCAACGCGG GGCTGCAGCT GTTGGGCTTC ATTCTCGCCT TCCTGGGATG 300
GATCGGCGCC ATCGTCAGCA CTGCCCTGCC CCAGTGGAGG ATTTACTCCT ATGCCGGCGA 360
CAACATCGTG ACCGCCCAGG CCATGTACGA GGGGCTGTGG NNNNNNNNNN TGTCGCAGAG 420
CACCGGGCAG ATCCAGTGCA AAGTCTTTGA CTCCTTGCTG AATCTGAGCA GCACATTGCA 480
AGCAACCCGT GCCTTGATGG TGGTTGGCAT CCTCCTGGGA GTGATAGCAA TCTTTGTGGC 540
CACCGTTGGC ATGAAGTGTA TGAAGTGCTT GGAAGACGAT GAGGTGCAGA AGATGAGGAT 600
GGCTGTCATT GGGGGCGCGA TATTTCTTCT TGCAGGTCTG GCTATTTTAG TTGCCACAGC 660
ATGGTATGGC AATAGAATCG TTCAAGAATT CTATGACCCT ATGACCCCAG TCAATGCCAG 720
GTACGAATTT GGTCAGGCTC TCTTCACTGG CTGGGCTGCT GCTTCTCTCT GCCTTCTGGG 780
AGGTGCCCTA CTTTGCTGTT CCTGTCCCCG AAAAACAACC TCTTACCCAA CACCAAGGCC 840
CTATCCAAAA CCTGCACCTT CCAGCGGGAA AGACTACGTG TGACACAGAG GCAAAAGGAG 900
AAAATCATGT TGAAACAAAC CGAAAATGGA CATTGAGATA CTATCATTAA CATTAGGACC 960
TTAGAATTTT GGGTATTGTA ATCTAAAGTA TGTTATTACA AAACAAACAA ACAAACAAAA 1020
AACCCATGTG TTAAAATACT CAGTGCTAAA CATGGCTTAA TCTTATTTTA TCTTCTTTCC 1080
TCAATATAGG AGGGAAGATT TTTCCATTTG TATTAGTGCT TCCCATTGAG TAATCATACT 1140
CAAATGGGGG AAGGGGTGCT CCTTAAATAT ATATAGATAT GTATATATAC ATGTTTTTCT 1200
ATTAAAAATA GCCAGTAAAA AAAAAAAAAA AAAAAAA 1237



Seq ID NO: C108 DNA Sequence
Nucleic Acid Accession #: AF508964
Coding sequence: 98..1531

1 11 21 31 41 51

| | | | | |

CAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGTGGG GGAAAGCGGC CGGTGCAGCG 60

CGGGGACAGG CACTCGGGCT GGCACTGGCT GCTAGGGATG TCGTCCTGGA TAAGGTGGCA 120

TGGACCCGCC ATGGCGCGGC TCTGGGGCTT CTGCTGGCTG GTTGTGGGCT TCTGGAGGGC 180

NNNNNNNNNN TGTCCCACGT CCTGCAAATG CAGTGCCTCT CGGATCTGGT GCAGCGACCC 240

TTCTCCTGGC ATCGTGGCAT TTCCGAGATT GGAGCCTAAC AGTGTAGATC CTGAGAACAT 300

CACCGAAATT TTCATCGCAA ACCAGAAAAG GTTAGAAATC ATCAACGAAG ATGATGTTGA 360

AGCTTATGTG GGACTGAGAA ATCTGACAAT TGTGGATTCT GGATTAAAAT TTGTGGCTCA 420

TAAAGCATTT CTGAAAAACA GCAACCTGCA GCACATCAAT TTTACCCGAA ACAAACTGAC 480

GAGTTTGTCT AGGAAACATT TCCGTCACCT TGACTTGTCT GAACTGATCC TGGTGGGCAA 540

TCCATTTACA TGCTCCTGTG ACATTATGTG GATCAAGACT CTCCAAGAGG CTAAATCCAG 600

TCCAGACACT CAGGATTTGT ACTGCCTGAA TGAAAGCAGC AAGAATATTC CCCTGGCAAA 660

CCTGCAGATA CCCAATTGTG GTTTGCCATC TGCAAATCTG GCCGCACCTA ACCTCACTGT 720

GGAGGAAGGA AAGTCTATCA CATTATCCTG TAGTGTGGCA GGTGATCCGG TTCCTAATAT 780

GTATTGGGAT GTTGGTAACC TGGTTTCCAA ACATATGAAT GAAACAAGCC ACACACAGGG 840

CTCCTTAAGG ATAACTAACA TTTCATCCGA TGACAGTGGG AAGCAGATCT CTTGTGTGGC 900

GGAAAATCTT GTAGGAGAAG ATCAAGATTC TGTCAACCTC ACTGTGCATT TTGCACCAAC 960

TATCACATTT CTCGAATCTC CAACCTCAGA CCACCACTGG TGCATTCCAT TCACTGTGAA 1020

AGGCAACCCC AAACCAGCGC TTCAGTGGTT CTATAACGGG GCAATATTGA ATGAGTCCAA 1080

ATACATCTGT ACTAAAATAC ATGTTACCAA TCACACGGAG TACCACGGCT GCCTCCAGCT 1140






WO 03/042661 



GGATAATCCC ACTCACATGA ACAATGGGGA CTACACTCTA ATAGCCAAGA ATGAGTATGG 1200 

GAAGGATGAG AAACAGATTT CTGCTCACTT CATGGGCTGG CCTGGAATTG ACGATGGTGC 1260 

AAACCCAAAT TATCCTGATG TAATTTATGA AGATTATGGA ACTGCAGCGA ATGACATCGG 1320 

GGACACCAGG AACAGAAGTA ATGAAATCCC TTCCACAGAC GTCACTGATA AAACCGGTCG 1380 

GGAACATCTC TCGGTCTATG CTGTGGTGGT GATTGCGTCT GTGGTGGGAT TTTGCCTTTT 1440 

GGTAATGCTG TTTCTGCTTA AGTTGGCAAG ACACTCCAAG TTTGGCATGA AAGGTTTTGT 1500 

TTTGTTTCAT AAGATCCCAC TGGATGGGTA GCTGAAATAA AGGAAAAGAC AGAGAAAGGG 1560 

GCTGTGGTGC TTGTTGGTTG ATGCTGCCAT GTAAGCTGGA CTCCTGGGAC TGCTGTTGGC 1620 

TTATCCCGGG AAGTGCTGCT TATCTGGGGT TTTCTGGTAG ATGTGGGCGG TGTTTGGAGG 1680

CTGTACTATA TGAAGCCTGC ATATACTGTG AGCTGTGATT GGGGAACACC AATGCAGAGG 1740 

TAACTCTCAG GCAGCTAAGC AGCACCTCAA GAAAACATGT TAAATTAATG CTTCTCTTCT 1800 

TACAGTAGTT CAAATACAAA ACTGAAATGA AATCCCATTG GATTGTACTT CTCTTCTGAA 1860 

AAGTGTGCTT TTTGACCCTA CTGGACATTT ATTGACTTAA TTGCTTCTGT TTATTAAAAT 1920 

TGACCTGCAA AGTTAAAAAA AAATTAAAGT TGAGAACAGG TATAAGTGCA CACTGAATAG 1980 

TCTAATCTAC ATGTAACACA TATTTTAGTA TGATTTTCTA TACTCTAATC AGCACTGAAT 2040 

TCAGAGGGTT TGACTTTTTC ATCTATAACA CAGTGACTAA AAGAGTTAAG GGTATATATA 2100 

CCATCACTTT GGGACTTGGT AGTATTATTA AAAGGTTATT TCCTTCACTG TCAATAAAAG 2160 

TCCAAATGTT TAGCTTAGGT CTGAGAGTCA AACAATGTTA AGGATTGTCT TAAAGTTCCT 2220 

TAGCCAGCAA AACAAAACAA AACAAAACAA ACAAATGAAA AACGTTTAAA AAGAAGAAGA 2280 

AGAAAAAAAA CAAGAACAAG CAGCAACAGC TGTTTTGTTG GGGCTATAGA TTTAAGTTAG 2340 

GCATAGTCAA TTTCAGAATA ACTAAGAGTG GAATATATGC ATATGGTGAA ATTATAACCT 2400 

TGCCCTTTTT TATTTGCCCT CTGCGATCCA CCTGGCTTTT TAGAAGTCTG CCGAGTGAGA 2460 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 2520 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 2580 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 2640 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 2700 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 2760 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 2820 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CCACCCCACC CACCCACCCT 2880 

GTTCCTTTTT TATCAGGAGG ACTTCAGAGC CAGGCCTGCA GCATTTTGTT TGAAAACACA 2940 

ATCAGCTCTG ACAGTTAGAC ATGCACACAG ACGCCATAGC TGGATTGGAA ACATTGATGT 3000 

TTTAAAAATT TATTTTTTTT GGAAATAGTT GCACAAATGC TGCAATTTAG CTTTAAGGTT 3060 

CTATAGATTT TTAACTAGTC CAACACAGTC AGAAACATTG TTTTGAATCC TCTGTAAACC 3120 

AAGGCATTAA TCTTAATAAA CCAGGATCCA TTTAGGTACC ACTTGATATA AAAAGGATAT 3180 

CCATAATGAA TATTTTATAC TGCATCCTTT ACATTAGCCA CTAAATACGT TATTGCTTGA 3240 

TGAAGACCTT TCACAGAATC CTATGGATTG CAGCATTTCA CTTGGCTACT TCATACCCAT 3300 

GCCTTAAAGA GGGGCAGTTT CTCAAAAGCA GAAACATGCC GCCAGTTCTC AAGTTTTCCT 3360 

CCTAACTCCA TTTGAATGTA AGGGCAGCTG GCCCCCAATG TGGGGAGGTC CGAACATTTT 3420 

CTGAATTCCC ATTTTCTTGT TCGCGGCTAA ATGACAGTTT CTGTCATTAC TTAGATTCCG 3480 

ATCTTTCCCA AAGGTGTTGA TTTACAAAGA GGCCAGCTAA TAGCAGAAAT CATGACCCTG 3540 

AAAGAGAGAT GAAATTCAAG CTGTGAGCCA GGCAGGAGCT CAGTATGGCA AAGGTTCTTG 3600 

AGAATCAGCC ATTTGGTACA AAAAAGATTT TTAAAGCTTT TATGTTATAC CATGGAGCCA 3660 

TAGAAAGGCT ATGGATTGTT TAAGAACTAT TTTAAAGTGT TCCAGACCCA AAAAGGAAAA 3720 

ATAAAAAAAA AGGAATATTT GTACCCAACA GCTAGAAGGA TTGCAAGGTA GATTTTTGTT 3780 

TTAAAATGGA GAGAAGTGGA CAGATAAGGC CATTTAATAT ATCAAAGATC AGTTGACATC 3840

TCTAGGGAAT GATGAAACAG CAGGCTATTA GAAAATTATT TCATATAGTT CTCGTGTTCT 3900

TTTCTTTTTT TTAATCCCTG AAGGGAGATC AGTAACATAG CTTCTCTTTT CTGTACTCTA 3960

GACCACCCCT TTTCATCATT TTGCTTTTTA TGTCTCCCAT AAGAAATGTG CTTTTTAGAG 4020 

CTTCCTAATG CATGTGTTGC ATTATTGCAG CATTAGAAAA GGAGAGGTAG CATTTTTGCT 4080 

GAAATCGGGC CTGTCACTCT CCAATAAAGG TTCTGGCACT TCAATGCCAG GCAGGTCTCC 4140 

TAAATGAACA GAATGATCTG TGTGAGCCGA TGCCTGCCCT TCCAGAGGGG CCACTGTCCC 4200 

CAGCCGCAGC CAACTGTGTC CCACAGGAAT GGGAGCCTAG GTTTCCAAAT CTTGTGATTC 4260 

TTTAGGAGAA ACATGAAACC TGGATTTCGT GTGAAATGTC CCGATTGTTA AAAAGTTGGC 4320 

TCAATTATTT TTAAAACATT TTGTAAGCCA ACAAAAGTCT GTGGGCTGCC AGTTTATTAC 4380 

TTTTGTCTTA AAACATGATC ATTGTTCTCT CACGGTATCC TTCTGTCTTC CCGTTGCAAA 4440 

TTCACTTTTC TTTCTTCCTG ACATTGCCAT TGAGGGCTTT GTTACCACAA GCTAAGAAAC 4500 

TGAGTTTAAC AGCCCAGTTA TCTGCAACAT GTCAATTACC TTTGCTCCTC TCCTGTGATT 4560 

CCCACCATGC TGTGACCCTC AGCTGTCTCC CTTTGCTGGG AATTCTGCAC CAATGTCTTC 4620 

CCCTCAACCC ATTCCCTGGT TGGTCCTACT CCCGTGTGGC CAGAGACATC CTAGCAAATC 4680 

CTTCCTCCTA TTATATCTGA CACTAATTTC TTTTCAACAG CGCTCATGTC TCTTGGCCCA 4740 

GTCAGGTGCT GCCAGGTTTA GATAGGAAAG TACATGTCCC ATTTTCATGG GTGCCCTTAA 4800

TGTGGTCCAC GTCCTATATC TTATTATATT TACTCATGGC TCAATGGGGG CCTCCAGAGA 4860 

CCCTCTCAGG CTGCTGAGCT AGACTAAGGA ATGCATCCAC TGTCATCACA TGAGACACTG 4920 

ACTCTGTGAC GACAAAAGTA CAAACAGTCT GAGGCTAAGA AAGGTTCATC TCACAACAGG 4980 

AAAAACAAAT CTCAACACAC ATTAGAGATA ATTGATTCAG GGGTTTTCTC TCCCAGTCTC 5040 

CCAGCAGGGA CTGATTTCAT TTCTGACCCA CTAGGTTTTC TTTCCAGAAA TAGGTAGCAA 5100 

GGACAAGAAC TAAACAATCC CAGCCCCACC CAGCAACACA GAACACAGGA GTTTGCTTTT 5160 

GGCTTCTCAC TCTCCAAGTA ACCCTGAATT AGGCCCAGAA TGGCTGAGGC TTGGAGCATC 5220 

TCCTCAGACA GAGCAGAGGC GACACCTCTT CAGGGGTGTG TGGAGTAAAT AGCTCGAAGA 5280 

GCTGAAGACA GAAAACCAGT TTCACGCCAG GTGCGAGAGA GAGCATAATG GAGGGAAGCC 5340 

CGCTTTCTCT CTCCTCTTCT TTTCTCTTTA TTTCTTTAGA GCACTTGACT TTTTTTTCTC 5400 

TCTCTCTCTA GTATTCTAAA CTGACCCCAT GACCAACTGA GAATTTATTT TTGTTTCATT 5460 

GGTTGTTTCA CAGAATTAGA ACACACACGA CTTTTTATTC CTCCATTGCA AAATGGAATC 5520 

AAGATACTAC ACAAGACCTG TGCTTTCTTC CTTTGCATGA TTTACACCTC CGCCTGTTTT 5580

GGTGCTAGCT GTCTAGAACT TCTCTCTTGG TTTGAATCTG ATTCCTTCAC ACTACACTAG 5640 

AAGTTTATTT CATCTTGTTT TGTCTAGACT CCAGATACAG AGGGACAGCT GGACTGAGGA 5700 

CAAGCAATTC CATCTAGCAT AGGGTCTCTC AGGGTTGGTG CATCCAGCCA CATGGGCAGG 5760 

GCCAGTCACA TCTAGTCTAT GTCCCCAGAG CCCTTGGAGT TGCGCAGCTT AGCTGACTTG 5820 

ACTCCAAGGA AATTAGTACA GAAGTAACCA CTCTATTAAG TGTGTTCTGC TATGTTCACA 5880 

TGCCTGTAGT ACCTGCAAAC CATGCCAGGT TCATCTAAAG ACATAGGGGA AGATTAAGGA 5940

CTCTTTTGGA CAGACCATGA ATTGAATTTG CTGCCAGGTG CTGCCAGACT GAATTTGGCT 6000 
GACAGAACTC CCAGCCCAGG AAAGTTCCAT GACAATGACT GTCGCAGAAG GAAATTTCCC 6060 
ACTAAAGTCA GTCCATTTTC AAGTTTTGGT CTTCAGAGAC AAAAGAACGT CCCAGCCACC 6120 
TGATTTTGAT GGTGAGGTAA CTCTAAGTTG AATTCAGGCT AGTGTTGCAG TATAGCTTTG 6180 
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GCATGTTCAT GAGTGAGCAC CCAGAATGTG TTGAACCAAC CCCCACCCCT AACTACTGAC 6240 

TATGACTGCA GTGGGTTTTT ATGGGGAAAA AAAGTGTGAA AAGCAAAAAG AAAGGAACAG 6300 

AGATTTTTTA TCACCTTTAT TGTAAGACAG TCCATTTATG AATTGAGTAT AAACACATAC 6360 

AAAGTAACAA GAGATTCCTA AGAAACGCAA ATCCTTGAGT TTCACGCACT TCATGTTCAA 6420 

CCATTTGCTG TAATCCAGAG GCAGCCTGTG AATCATTCTC ATGCCCTGTT TTTTTTTTTT 6480 

TTTTCCTATA ATGTTCTGGG TTTAAAAGCC ATCTTTTCCA CATTTTCTGT AAATAATGGA 6540 

TAATCATTTT AAAAAGTTTT ATTTTTAGTG CTGTTTTAAC AATGTAGATA GATCATAAAT 6600 

GTACTTGCTG AATTCAATCA TTTTTAACAA GCCAATAAAG TTTGATAATT CAAAAAAAAA 6660 

AAAAAAAAAA AAAAAAAAA 6679 

Seq ID NO: C109 DNA Sequence

Nucleic Acid Accession #: NM_006180.1

Coding sequence: 352..2820

1 11 21 31 41 51

| | | | | |

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 

TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 3707 

Seq ID NO: C110 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 939..2372

1 11 21 31 41 51 
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I I I I I I 

AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC 60 

AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300

CTGTGGTGTG AATTAGGGAC CGGGAGGCGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 360

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 480

CGCTCGCGCT CTACGCGCTC AGTCCCGGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC 540

GGGGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 780

ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

CCGCTTTCGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080

CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1380 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440

GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1500 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACACGGA GTACCACGGC TGCCTCCAGC 1980 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220

GGGAACATCT CTGGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080

ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 
TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC AGAAACATGC CGCCAGTTCT CAAGTTTTCC 4200 
TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT GTGGGGAGGT CGGAACATTT 4260 
TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 4320 
GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TCATGACCCT 4380 
GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC TCAGTATGGC AAAGGTTCTT 4440 
GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 4500 
ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 
AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 4620 
TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 4680 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 4711 



Seq ID NO: C111 DNA Sequence

Nucleic Acid Accession #: NM_130830.1

Coding sequence: 1.. 1746

ATGCCACTGA
GCCTACCATG 
GGGGCACGCA 
CTCAACACGC 
GCCCTGAGGA 
GGCTCGCTGC 
TTCCAGGGCC 
CAGCCGGCCC 
CTGGAATACA 
GGCAAGAATA 
GTCCTCCGGC 
GTTAACCTGC 
TTCCACAACA 
CCACCCAGCA 
CTGAAGGAGC 
TATGACAACC 
GTCCTGATTC 
ACGGAGCTTC 
TTCCGCATGT 
CCAGGGAATA 
CTGGAGAACT 
TATGACAATC 
AACCAGCCTA 
GGCCAGTCCC 
GTGCCTAGTT 
TCCGTCTCTT 
ATTCAGGTCA 
ATTGCCGCCA 
TGTTGCTGCT 
TGTTAA 




ACATCTCTTC 
TTAGCCGCAA 
GGGAGCTGTC 
TGGCCAACCT 
TCTTCGCCAA 
TGCCCCTCGG 
CCTGGAGGTG 
GGTTAGGGAC 
TCATTATCAT 
ACCCAGAAAC 
CTACCACTGA 
CTGATGACCG 
TTGTAATTGG 
GCAAGAAGAG 



21 
I 

CCTTTTGCTG 
CGAGTGTACC 
GCCCACCCCT 
ACTCAATGAG 
TGAGCTGTCG 
CCTCGCCAAC 
TGAGTCTCTC 
GTGCAGCAAC 
AGCCTTCGAC 
CATCTCACCC 
CAGGCTCACG 
TCTACAGCAG 
CCAGAGACTC 
GCTGCCCCAG 
GATCTTCGGG 
TCTACCCGAC 
TCAGATCAGC 
CCTCCACACC 
GCAGAACATC 
CGTCAATGGC 
CATCTTCGAT 
TGACTCAGAC 
GGACACTGTA 
CAATGTCAAC 
ACCATGGTAC 
GCTAACCAGC 
CAGCGTTTGG 
CATTGTCGCC 
GAGCCAAGCT 



31 
I 

GTGGGCTGCC 
TGCTCCAGGG 
CTGCCCTGGA 
TCCCCGTTCC 
CGCATCACGC 
AACAAGCTGC 
CTTCTGTCCA 
CTCAAGGAGC 
CACCTGGTAG 
AGGGTCTTCC 
GATATCCCCA 
AACCAGATTG 
TACCTGTCCA 
CTCAACCGTC 
CCCATGCCCA 
AATGTCTTCA 
TTCATCTCCC 
AACGCACTGC 
TCCCTGCAGA 
CTCATGGCCA 
CACCTGGGGA 
ATCCTTCCGC 
CCTGTGTGTT 
GTTGCTGTTC 
CCAGACACAC 
CCTGTGGAAG 
GGCATGACCC 
CTGGCCTGCT 
GTCCTGATGC 




AGGACCTGGA 
ACAATCGCCT 
TCCAGCTGCA 
AACTGTGTGA 
TCCGCAACTG 
TCAGCCCAGC 
CAAGCGTCCA 
CCAGTTACCC 
ACTACACTGA 
AGGCCCAGAG 
CCCTGGCTGC 
AGATGAAGGC 



51 
I 

TGCAGGGTTG 
GGAGTGCACC 
CCTGCAGATC 
AGCCCTCATC 
CCGAAACCTG 
CATCGGCCTC 
GTTGCAGATC 
CGGCAACCAC 
GCTCAATCTG 
CAATCTCCAG 
TGATGGGCTT 
CCCTGGTCTC 
CTCCCAGCTG 
TGGGAATTCC 
GCTTTGGCTC 
CCAGTTGCAG 
CAACGGGCTA 
CGGGAATGTC 
CAGACAGCTC 
GAACAACCAG 
GCTGCGGCTG 
GCTCCTGCTC 
CAATGTCCGA 
TGTCCCTGAG 
TGACACCACA 
TCTGACTACC 
GGGGCTGGCC 
CTGCGTCGGC 
ACCCAATGAG 



Seq ID NO: C112 DNA Sequence

Nucleic Acid Accession #: NM_002658.1

Coding sequence: 77.. 1372 



GTCCCCGCAG 
CCCCGACCTC 
GAGCGACTCC 
TGGAGGAACA 
GAAATTCGGA 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTGGGC 
AAAGCCCTCC 
CGGCTTTAAG 
CATCTACAGG 
CCCTTGCTGG 
CATCGTCTAC 
GGTGGAAAAC 
CATTGCCTTG 
ACAGACCATC 
CACTGGCTTT 
TGTTGTGAAG 
CACCACCAAA 
CTCAGGGGGA 
CTGGGGCCGT 
CTTACCCTGG 
AGGGAGGAAA 
TCCATCAGCT 
CACCACCAGG 
CAGACCCTCT 
TGTCTTTTTC 
GGCTCGAAGG 
AATGAATAAT 
AATGTGGGAG 
ATTCCATGAA 
GCTGTGAGTG 
AAACTGTGTG 
CTGGGGCCTC 
ACCTGTGACC 
ATCCCTTCCT 
ACACTGAATA 
ATCAATAAAA 



11 
I

CGCCGTCGCG 
GCCACCATGA 
AAAGGCAGCA 
TGTGTGTCCA 
GGGCAGCACT 
CGAGGAAAGG 
GTCCTTCAGC 
CATAATTACT 
CTAAAGCCGC 
TCTCCTCCAG 
ATTATTGGGG 
AGGCACCGGG 
GTGATCAGCG 
CTGGGTCGCT 
CTCATCCTAC 
CTGAAGATCC 
TGCCTGCCCT 
GGAAAAGAGA 
CTGATTTCCC 
ATGCTATGTG 
CCCCTCGTCT 
GGATGTGCCC 
ATCCGCAGTC 
CGGGCACCAC 
GTAAGAAGAG 
GTGAACGACA 
GGCCAGGATG 
TGGACTGAAG 
GAGAGCCAGC 
TTCCCAATTA 
CAGCGGTTTG 
TGTATCAGGA 
TAAGTGTGAG 
GACTGTGATG 
TTGGGTCCCC 
AGCACTGTCT 
TTTAGCCTAG 
TTTATATTTC 
TGTGATTTTT 



21 
I 

CCCTCCTGCC 
GAGCCCTGCT 
ATGAACTTCA 
ACAAGTACTT 
GTGAAATAGA 
CCAGCACTGA 
AAACGTACCA 
GCAGGAACCC 
TTGTCCAAGA 
AAGAATTAAA 
GAGAATTCAC 
GGGGCTCTGT 
CCACACACTG 
CAAGGCTTAA 
ACAAGGACTA 
GTTCCAAGGA 
CGATGTATAA 
ATTCTACCGA 
ACCGGGAGTG 
CTGCTGACCC 
GTTCCCTCCA 
TGAAGGACAA 
ACACCAAGGA 
CCGCTTTCTT 
ACTGGGAAGA 
ATAGCTTTAC
GAGGGGTGGT 
CCTGCAGGAG 
TCCCCCGACC 
GGAAGTGTAA 
GGGAGCAGAG 
AATATATATG 
TAAGAGCTGG 
CCACACAGAG 
CACGTGACAG 
CAGTTTCACT 
TTCATCCAAT 
ACTATTTTTA 
CTGA 



31 
I 

GCAGGCCACC 
GGCGCGCCTG 
TCAAGTTCCA 
CTCCAACATT 
TAAGTCAAAA 
CACCATGGGC 
TGCCCACAGA 
AGACAACCGG 
GTGCATGGTG 
ATTTCAGTGT 
CACCATCGAG 
CACCTACGTG 
CTTCATTGAT 
CTCCAACACG 
CAGCGCTGAC 
GGGCAGGTGT 
CGATCCCCAG 
CTATCTCTAT 
TCAGCAGCCC 
CCAATGGAAA 
AGGCCGCATG 
GCCAGGCGTC 
AGAGAATGGC 
GCTGGTTGTC 
TAGGCTCTGC 
CCTCACGGAT 
CCTGACTCAA 
TTAAAAAGGG 
GGTGGGCATT 
GCAGCTGAGG 
ACACTAACGA 
TGTGTGTATG 
TGTCTGATTG 
TGGTCTTTCT 
TGCCTGGGAA 
TTCACATAGA 
CCTCACTGGG
TTTATATTTT 



41 

I 

GAGGCCGCCG
CTTCTCTGCG 
TCGAACTGTG 
CACTGGTGCA 
ACCTGCTATG 
CGGCCCTGCC 
TCTGATGCTC 
AGGCGACCCT 
CATGACTGCG 
GGCCAAAAGA 
AACCAGCCCT 
TGTGGAGGCA 
TACCCAAAGA 
CAAGGGGAGA 
ACGCTTGCTC 
GCGCAGCCAT 
TTTGGCACAA 
CCGGAGCAGC 
CACTACTACG 
ACAGATTCCT 
ACTTTGACTG 
TACACGAGAG 
CTGGCCCTCT 
ATTTTTGCAG 
ACAGATGGAT 
AGGCCTGGGT 
CATGTTACTG 
CAGGGCATCT 
TGTGAGGCCC 
TCTCTTGAGG 
CTTCAGGGCA 
TTTGCACACT 
TTAAGTCTAA 
GGAGAGGTTA 
TGTACTTATT 
TGTCCCTTTC 
TGGGGTGAGG 
TGTAATTTTA 



51 

I 

CCGTCTAGCG 
TCCTGGTCGT 
ACTGTCTAAA 
ACTGCCCAAA 
AGGGGAATGG 
TGCCCTGGAA 
TTCAGCTGGG 
GGTGCTATGT 
CAGATGGAAA 
CTCTGAGGCC 
GGTTTGCGGC 
GCCTCATCAG 
AGGAGGACTA 
TGAAGTTTGA 
ACCACAACGA 
CCCGGACTAT 
GCTGTGAGAT 
TGAAAATGAC 
GCTCTGAAGT 
GCCAGGGAGA 
GAATTGTGAG 
TCTCACACTT 
GAGGGTCCCC 
TAGAGTCATC 

GCTGGCTGCC 
ACCAGCAACT 
CCTGTGCATG 
ATGGTTGAGA 
GAGCTTAGCC 
GGGCTCTGAT 
TGTTGTGTGG 
ATATTTCCTT 
TAGGTCACTC 
CTGCAGCATG 
TTGGCCAGTT 
ACCACTCCTT 
AATAAAAGTG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2304 



Seq ID NO: C113 DNA Sequence 

Nucleic Acid Accession #: XM_087254.1

Coding sequence: 47.. 2332



1 11 21 31 41 51 

I I I I I I 

AGAGTACGTG TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTTCGGGA 60 

ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 120

AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 180 

CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 240 

AATTAAAGAA CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300 

CAATGTTCAA ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACCTGG CACCATCGCA 360 

GTTGGAGTAC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420 

TGGTATTGTG TTTATTGGCA ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480 

GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT 540 

AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600 

TCTCCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660 

TTTGAAAGGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720

GGAAATAGAT AAACGCATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAGAGAAATT 780

GGCAGCTGTT TTCCAGTTCA TAGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840 

AGACAGACTA CAAGATAAAG TTCGAGAAAC TATTGAAGCA TTGAGAATGG CTGGTATCAA 900 

AGTATGGGTA CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960 

CCATTTTCAT AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020 

TGCTGAACAA TTGAGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TTCAGCATGG 1080 

GCTGGTAGTG GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140 

GGAAGTTTGC AGAAATTGTT CAGCTGTATT ATGCTGTCGT ATGGCTCCAC TGCAGAAAGC 1200 

AAAAGTAATA AGACTAATAA AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260 

TGGTGCTAAT GACGTAAGCA TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320 

AGAAGGAAGA CAGGCTGCAA GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCTCTC 1380 

CAAATTGCTT TTTGTTCATG GTCATTTTTA TTATATTAGA ATAGCTACCC TTGTACAGTA 1440 

TTTTTTTTAT AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTGTTT 1500 

GTTTTCTCAG CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 1560

TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 1620 

AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA TTAAAACATT 1680 

TCTTTATTGG ACCATCCTGG GCTTCAGTCA TGCCTTTATT TTCTTTTTTG GATCCTATTT 1740 

ACTAATAGGG AAAGATACAT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800 

TGGCACTTTG GTCTTCACAG TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860 

TCATTTTTGG ACTTGGATCA ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920 

ATTTTCCTTG TTTTATGGAG GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT 1980 

TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT GCCATAATCC TCATGGTTGT 2040 

TACATGTCTA TTTCTTGATA TCATAAAGAA GGTCTTTGAC CGACACCTCC ACCCTACAAG 2100 

TACTGAAAAG GCACAGCTTA CTGAAACAAA TGCAGGTATC AAGTGCTTGG ACTCCATGTG 2160 

CTGTTTCCCG GAAGGAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAGTTAT 2220 

AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTCTATAC 2280 

CAACGACAGG AGCATCTTGA CTCTCTCCAC AATGGACTCA TCTACTTGTT AAAGGGGCAG 2340

TAGTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400 

ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAATT 2520 

TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2640 

TTTAATATAA ATGTAGAAAA AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820

ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880

TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCCGT TCACCGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180 

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

TATGTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

AATTTATCAA GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGTC 3720 

TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080

GATATTTCAT AGCTGGATTT AGGAAGATGT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA CTGACACCAG ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620

CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920

GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4980

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640

AATCATGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT 5700

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820

GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 5855



Seq ID NO: C114 DNA Sequence

Nucleic Acid Accession #: XM_087461.1 

Coding sequence: 236.. 1138 



CCGGCGCCGG
GAGCCAGCCG 
CCCGCACTGC 
CCGCTCGCCA 
GCCCCTGCTC 
CGCGGACGGG 
GGCGAGCCCA 
GGCCCGGAGG 
GCTGAGCTCG 
CTTCTCCACC 
GCCGCTGCTC 
CTGCGTGGGC 
CCCCAGCGCC 
CGAGCCGCCC 
CAGCCTGGAG 
CACGCTGGTG 
CTGGCCGGTG 
CGCCAGCACC 
CGCCGCCGCC 
CGCTCCGCTC 
GACTCGGCCG 
GCCGAGAGAC 
GCAGATGTTT 
AAATGGGGCC 
GCCCTAGATT 
CTGTTTTAAA 
CTGTAGTAAG 
AATTTATTTA 
CTGAAAATGA 
GCGTTTCTTC 



11 
I 

GGGCGGCGGG 
CCGTCACCGC 
GCCCGGCCGC 
TGCAACCGCC 
GCCGCGCTCC 
CCCGGCCTCC 
TCGCCCCGCG 
AGGCGGCGGC 
CTGCTCGTGC 
AACGCGCACG 
ATCGAGCACC 
TGCGGCTGGG 
GCCGCCGCCA 
GGGCCGCTGT 
GAGCTGCAGG 
GCCTGCTTCA 
CCCATCATCG 
ACCGCAGCCA 
GCTGCCGCCG 
CTCCCTGTGT 
GTGTGCTTCG 
CCCAGCGCCT 
AAGCCCTTGG 



CATGGCAGAA 
TGAAAGTTTT 
AGAAGCTCTT 
AAATTGGGGG 
ACTTTATGAA 
TTGTACTTAT 



21 
I 

GAACCCCAAA 
CTCCGCGCCG
CGGCTCCCGC 
GCCGGCCTCG 
TGGCCGCCGC 
TCGGGGTGCC 
GCTGCTGGCC 
GGCGGCGGCG 
GCTGGGGCCG 
GCCGCGCTTT 
TGGGGCTGGC 
TGCGCGGTCG 
CCGCCGGGGC 
GGCTGCAGGG 
GCGAGCCGGG 
TGACCCTGGT 
CCGGCTTCCT 
CCCCCGCCGC 
CCGCCGCGGC 

TGCTGTAGTT 
TTGAAAAGCA 
ACTGAGGGTG 
TATTGCATCC 
AATGACCAAA 
AGTAAAAAAA 
TGTATCTGAA 
GAGGGCATGG 
CCTTTTCCAA 
GTGGTTTTTT 



31 
I

CGCAACCGGG 
CCCCTGCGGG 
GGTCCCACCG 
CGCGCGTAGG 
CTGCCCGCTG 
CTCCAATGCT 
TCGGCGGCCC 
CCTGTGCAAC 
CCCGCGGGGC 
CTTCGCCGCC 
GGCGGGCGGC 
CCGCACCGGC 
GCCCACCGCG 
CGAGCCGCTG 
CTGGCGGCTG 
CATCGTGGTG 
GCCCAACGGC 
AGTGCCCGCA 
CGTCACTTCG 
TCCGCGCGCG
ATCGTTAGTT 
AGGTTTGTGC 
GGATCGCAGC 
CCCTGCCCGA 
TCCTGTGTAT 
ATACAAAACA 
CATAGTTGTA 
GAAGGATTTA 
GTTGATCTAT 
GGCTTTTAAT 



41 
I 

TCTGGAGGGA 
CTTGGCAGGC 
TGAGCTCGCC 
CGCCCGCCGC 
CCGCCCGTCC 
TCAGTCAACG 
CCGGGCCCCC 
ATCAGCGTGC
TTCCAGTGCG 
GCCTTCCACC 
GCGCAGCAGG 
CGCCTCCGGC 
CTGCCAGCCT 
CATTTCTGCT 
AACCGTAAGC 
TGGAGCGTGG 
ATGGAACAGC 
GGGACCACCG 
GGGGTGGCGA 
CGGGTGCCTT 
CCTCTTCCCG 
TGCGCTTCCA 
TCCGAAGACG 

TTGTTTTATA 
AAAAGATTAA 
TTTGAAATTT 
ACACCGATAT 
CCAGTGACGT 
ACAGACATTT 



51 
I 

TCCCCGCGCC 
GCCCGGCGCG 
GGCCCGTCGC 
AGGCCATGCT 
GCGGCGGGGC 
CGTCCTCCGC 
CGAGCGCCCG 
AGCGGCAGAT 
ACCTACTGCT 
GCGTCGGGCC 
ACCTGCGCCT 
CCGCCGCCGC 
ACCCCGCGGC 
GCCTAGACTT 
CCATTGAGTC 
CCGCCCTCAT 
GCCGGACCAC 
CAGCCGCCGC 
CCAAGTGACC 
TCCCGCCGGA
AGATGGGGCC 
GTTCCGAAAA 
GAGAGGAGGG 
GCACCCACGT 
TATTTAATAA 
ATTGCTATTG 
GTGGTTTTTT 
ATTGTTACCG 
GGCCTGGTGG 
TCCTCC 



4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5855 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1796 



Seq ID NO: C115 DNA Sequence 

Nucleic Acid Accession #: XM_051522.4 

Coding sequence: 127.. 1215 



1 11 21 31 41 51 

I I I I I I 

ACAGTTGTTG CAAAGTGCTC AGCACTAAGG GAGCCAGCGC ACAGCACAGC CAGGAAGGCG 60

AGCGAGCCCA GCCAGCCCAG CCAGCCCAGC CAGCCCGGAG GTCATTTGAT TGCCCGCCTC 120 

AGAACGATGG ATCTGCATCT CTTCGACTAC TCAGAGCCAG GGAACTTCTC GGACATCAGC 180 

TGGCCATGCA ACAGCAGCGA CTGCATCGTG GTGGACACGG TGATGTGTCC CAACATGCCC 240 

AACAAAAGCG TCCTGCTCTA CACGCTCTCC TTCATTTACA TTTTCATCTT CGTCATCGGC 300 

ATGATTGCCA ACTCCGTGGT GGTCTGGGTG AATATCCAGG CCAAGACCAC AGGCTATGAC 360 

ACGCACTGCT ACATCTTGAA CCTGGCCATT GCCGACCTGT GGGTTGTCCT CACCATCCCA 420 

GTCTGGGTGG TCAGTCTCGT GCAGCACAAC CAGTGGCCCA TGGGCGAGCT CACGTGCAAA 480 

GTCACACACC TCATCTTCTC CATCAACCTC TTCGGCAGCA TTTTCTTCCT CACGTGCATG 540

AGCGTGGACC GCTACCTCTC CATCACCTAC TTCACCAACA CCCCCAGCAG CAGGAAGAAG 600

ATGGTACGCC GTGTCGTCTG CATCCTGGTG TGGCTGCTGG CCTTCTGCGT GTCTCTGCCT 660 

GACACCTACT ACCTGAAGAC CGTCACGTCT GCGTCCAACA ATGAGACCTA CTGCCGGTCC 720 

TTCTACCCCG AGCACAGCAT CAAGGAGTGG CTGATCGGCA TGGAGCTGGT CTCCGTTGTC 780 

TTGGGCTTTG CCGTTCCCTT CTCCATTATC GCTGTCTTCT ACTTCCTGCT GGCCAGAGCC 840 

ATCTCGGCGT CCAGTGACCA GGAGAAGCAC AGCAGCCGGA AGATCATCTT CTCCTACGTG 900

GTGGTCTTCC TTGTCTGCTG GCTGCCCTAC CACGTGGCGG TGCTGCTGGA CATCTTCTCC 960 

ATCCTGCACT ACATCCCTTT CACCTGCCGG CTGGAGCACG CCCTCTTCAC GGCCCTGCAT 1020 

GTCACACAGT GCCTGTCGCT GGTGCACTGC TGCGTCAACC CTGTCCTCTA CAGCTTCATC 1080 

AATCGCAACT ACAGGTACGA GCTGATGAAG GCCTTCATCT TCAAGTACTC GGCCAAAACA 1140 

GGGCTCACCA AGCTCATCGA TGCCTCCAGA GTCTCAGAGA CGGAGTACTC TGCCTTGGAG 1200

CAGAGCACCA AATGATCTGC CCTGGAGAGG CTCTGGGACG GGTTTACTTG TTTTTGAACA 1260

GGGTGATGGG CCCTATGGTT TTCTAGAGCA AAGCAAAGTA GCTTCGGGTC TTGATGCTTG 1320

AGTAGAGTGA AGAGGGGAGC ACGTGCCCCC TGCATCCATT CTCTCTTTCT CTTGATGACG 1380 

CAGCTGTCAT TTGGCTGTGC GTGCTGACAG TTTTGCAACA GGCAGAGCTG TGTCGCACAG 1440

CAGTGCTGTG CGTCAGAGCC AGCTGAGGAC AGGCTTGCCT GGACTTCTGT AAGATAGGAT 1500

TTTCTGTGTT TCCTGAATTT TTTATATGGT GATTTGTATT TAAATTTTAA GACTTTATTT 1560 

TCTCACTATT GGTGTACCTT ATAAATGTAT TTGAAAGTTA AATATATTTT AAATATTGTT 1620 

TGGGAGGCAT AGTGCTGACA TATATTCAGA GTGTTGTAGT TTTAAGGTTA GCGTGACTTC 1680 

AGTTTTGACT AAGGATGACA CTAATTGTTA GCTGTTTTGA AATTATATAT ATATAAATAT 1740 

ATATAAATAT ATAAATATAT GCCAGTCTTG GCTGAAATGT TTTATTTACC ATAGTTTTAT 1800 

ATCTGTGTGG TGTTTTGTAC CGGCACGGGA TATGGAACGA AAACTGCTTT GTAATGCAGT 1860 

TTGTGACATT AATAGTATTG TAAAGTTACA TTTTAAAATA AACAAAAAAC TGTTCTGGAC 1920 

TGCAAATCTG CACACACAAC GAACAGTTGC ATTTCAGAGA GTTCTCTCAA TTTGTAAGTT 1980 

ATTTTTTTTT AATAAAGATT TTTGTTTCCT 2010 



Seq ID NO: C116 DNA Sequence

Nucleic Acid Accession #: NM_000350.1

Coding sequence: 82.. 6903



1 11 21 31 41 51 

I I I I I I 

CTGGCTCTTA ACGGCGTTTA TGTCCTTTGC TGTCTGAGGG GCCTCAGCTC TGACCAATCT 60 

GGTCTTCGTG TGGTCATTAG CATGGGCTTC GTGAGACAGA TACAGCTTTT GCTCTGGAAG 120 

AACTGGACCC TGCGGAAAAG GCAAAAGATT CGCTTTGTGG TGGAACTCGT GTGGCCTTTA 180 

TCTTTATTTC TGGTCTTGAT CTGGTTAAGG AATGCCAACC CGCTCTACAG CCATCATGAA 240 

TGCCATTTCC CCAACAAGGC GATGCCCTCA GCAGGAATGC TGCCGTGGCT CCAGGGGATC 300 

TTCTGCAATG TGAACAATCC CTGTTTTCAA AGCCCCACCC CAGGAGAATC TCCTGGAATT 360 

GTGTCAAACT ATAACAACTC CATCTTGGCA AGGGTATATC GAGATTTTCA AGAACTCCTC 420 

ATGAATGCAC CAGAGAGCCA GCACCTTGGC CGTATTTGGA CAGAGCTACA CATCTTGTCC 480 

CAATTCATGG ACACCCTCCG GACTCACCCG GAGAGAATTG CAGGAAGAGG AATACGAATA 540 

AGGGATATCT TGAAAGATGA AGAAACACTG ACACTATTTC TCATTAAAAA CATCGGCCTG 600 

TCTGACTCAG TGGTCTACCT TCTGATCAAC TCTCAAGTCC GTCCAGAGCA GTTCGCTCAT 660 

GGAGTCCCGG ACCTGGCGCT GAAGGACATC GCCTGCAGCG AGGCCCTCCT GGAGCGCTTC 720 

ATCATCTTCA GCCAGAGACG CGGGGCAAAG ACGGTGCGCT ATGCCCTGTG CTCCCTCTCC 780 

CAGGGCACCC TACAGTGGAT AGAAGACACT CTGTATGCCA ACGTGGACTT CTTCAAGCTC 840 

TTCCGTGTGC TTCCCACACT CCTAGACAGC CGTTCTCAAG GTATCAATCT GAGATCTTGG 900 

GGAGGAATAT TATCTGATAT GTCACCAAGA ATTCAAGAGT TTATCCATCG GCCGAGTATG 960 

CAGGACTTGC TGTGGGTGAC CAGGCCCCTC ATGCAGAATG GTGGTCCAGA GACCTTTACA 1020 

AAGCTGATGG GCATCCTGTC TGACCTCCTG TGTGGCTACC CCGAGGGAGG TGGCTCTCGG 1080 

GTGCTCTCCT TCAACTGGTA TGAAGACAAT AACTATAAGG CCTTTCTGGG GATTGACTCC 1140 

ACAAGGAAGG ATCCTATCTA TTCTTATGAC AGAAGAACAA CATCCTTTTG TAATGCATTG 1200 

ATCCAGAGCC TGGAGTCAAA TCCTTTAACC AAAATCGCTT GGAGGGCGGC AAAGCCTTTG 1260 

CTGATGGGAA AAATCCTGTA CACTCCTGAT TCACCTGCAG CACGAAGGAT ACTGAAGAAT 1320 

GCCAACTCAA CTTTTGAAGA ACTGGAACAC GTTAGGAAGT TGGTCAAAGC CTGGGAAGAA 1380 

GTAGGGCCCC AGATCTGGTA CTTCTTTGAC AACAGCACAC AGATGAACAT GATCAGAGAT 1440 

ACCCTGGGGA ACCCAACAGT AAAAGACTTT TTGAATAGGC AGCTTGGTGA AGAAGGTATT 1500 

ACTGCTGAAG CCATCCTAAA CTTCCTCTAC AAGGGCCCTC GGGAAAGCCA GGCTGACGAC 1560 

ATGGCCAACT TCGACTGGAG GGACATATTT AACATCACTG ATCGCACCCT CCGCCTGGTC 1620 

AATCAATACC TGGAGTGCTT GGTCCTGGAT AAGTTTGAAA GCTACAATGA TGAAACTCAG 1680 

CTCACCCAAC GTGCCCTCTC TCTACTGGAG GAAAACATGT TCTGGGCCGG AGTGGTATTC 1740

CCTGACATGT ATCCCTGGAC CAGCTCTCTA CCACCCCACG TGAAGTATAA GATCCGAATG 1800 

GACATAGACG TGGTGGAGAA AACCAATAAG ATTAAAGACA GGTATTGGGA TTCTGGTCCC 1860 

AGAGCTGATC CCGTGGAAGA TTTCCGGTAC ATCTGGGGCG GGTTTGCCTA TCTGCAGGAC 1920 

ATGGTTGAAC AGGGGATCAC AAGGAGCCAG GTGCAGGCGG AGGCTCCAGT TGGAATCTAC 1980 

CTCCAGCAGA TGCCCTACCC CTGCTTCGTG GACGATTCTT TCATGATCAT CCTGAACCGC 2040 

TGTTTCCCTA TCTTCATGGT GCTGGCATGG ATCTACTCTG TCTCCATGAC TGTGAAGAGC 2100 

ATCGTCTTGG AGAAGGAGTT GCGACTGAAG GAGACCTTGA AAAATCAGGG TGTCTCCAAT 2160 

GCAGTGATTT GGTGTACCTG GTTCCTGGAC AGCTTCTCCA TCATGTCGAT GAGCATCTTC 2220 

CTCCTGACGA TATTCATCAT GCATGGAAGA ATCCTACATT ACAGCGACCC ATTCATCCTC 2280 

TTCCTGTTCT TGTTGGCTTT CTCCACTGCC ACCATCATGC TGTGCTTTCT GCTCAGCACC 2340 

TTCTTCTCCA AGGCCAGTCT GGCAGCAGCC TGTAGTGGTG TCATCTATTT CACCCTCTAC 2400 

CTGCCACACA TCCTGTGCTT CGCCTGGCAG GACCGCATGA CCGCTGAGCT GAAGAAGGCT 2460 

GTGAGCTTAC TGTCTCCGGT GGCATTTGGA TTTGGCACTG AGTACCTGGT TCGCTTTGAA 2520 

GAGCAAGGCC TGGGGCTGCA GTGGAGCAAC ATCGGGAACA GTCCCACGGA AGGGGACGAA 2580 

TTCAGCTTCC TGCTGTCCAT GCAGATGATG CTCCTTGATG CTGCGTGCTA TGGCTTACTC 2640 

GCTTGGTACC TTGATCAGGT GTTTCCAGGA GACTATGGAA CCCCACTTCC TTGGTACTTT 2700 

CTTCTACAAG AGTCGTATTG GCTTAGCGGT GAAGGGTGTT CAACCAGAGA AGAAAGAGCC 2760 

CTGGAAAAGA CCGAGCCCCT AACAGAGGAA ACGGAGGATC CAGAGCACCC AGAAGGAATA 2820 

CACGACTCCT TCTTTGAACG TGAGCATCCA GGGTGGGTTC CTGGGGTATG CGTGAAGAAT 2880 

CTGGTAAAGA TTTTTGAGCC CTGTGGCCGG CCAGCTGTGG ACCGTCTGAA CATCACCTTC 2940 

TACGAGAACC AGATCACCGC ATTCCTGGGC CACAATGGAG CTGGGAAAAC CACCACCTTG 3000 
TCCATCCTGA CGGGTCTGTT GCCACCAACC TCTGGGACTG TGCTCGTTGG GGGAAGGGAC 3060 
ATTGAAACCA GCCTGGATGC AGTCCGGCAG AGCCTTGGCA TGTGTCCACA GCACAACATC 3120 
CTGTTCCACC ACCTCACGGT GGCTGAGCAC ATGCTGTTCT ATGCCCAGCT GAAAGGAAAG 3180 
TCCCAGGAGG AGGCCCAGCT GGAGATGGAA GCCATGTTGG AGGACACAGG CCTCCACCAC 3240 
AAGCGGAATG AAGAGGCTCA GGACCTATCA GGTGGCATGC AGAGAAAGCT GTCGGTTGCC 3300 
ATTGCCTTTG TGGGAGATGC CAAGGTGGTG ATTCTGGACG AACCCACCTC TGGGGTGGAC 3360 
CCTTACTCGA GACGCTCAAT CTGGGATCTG CTCCTGAAGT ATCGCTCAGG CAGAACCATC 3420 
ATCATGCCCA CTCACCACAT GGACGAGGCC GACCACCAAG GGGACCGCAT TGCCATCATT 3480
GCCCAGGGAA GGCTCTACTG CTCAGGCACC CCACTCTTCC TGAAGAACTG CTTTGGCACA 3540 
GGCTTGTACT TAACCTTGGT GCGCAAGATG AAAAACATCC AGAGCCAAAG GAAAGGCAGT 3600 
GAGGGGACCT GCAGCTGCTC GTCTAAGGGT TTCTCCACCA CGTGTCCAGC CCACGTCGAT 3660 
GACCTAACTC CAGAACAAGT CCTGGATGGG GATGTAAATG AGCTGATGGA TGTAGTTCTC 3720 
CACCATGTTC CAGAGGCAAA GCTGGTGGAG TGCATTGGTC AAGAACTTAT CTTCCTTCTT 3780 
CCAAATAAGA ACTTCAAGCA CAGAGCATAT GCCAGCCTTT TCAGAGAGCT GGAGGAGACG 3840 
CTGGCTGACC TTGGTCTCAG CAGTTTTGGA ATTTCTGACA CTCCCCTGGA AGAGATTTTT 3900 
CTGAAGGTCA CGGAGGATTC TGATTCAGGA CCTCTGTTTG CGGGTGGCGC TCAGCAGAAA 3960
AGAGAAAACG TCAACCCCCG ACACCCCTGC TTGGGTCCCA GAGAGAAGGC TGGACAGACA 4020
CCCCAGGACT CCAATGTCTG CTCCCCAGGG GCGCCGGCTG CTCACCCAGA GGGCCAGCCT 4080

CCCCCAGAGC CAGAGTGCCC AGGCCCGCAG CTCAACACGG GGACACAGCT GGTCCTCCAG 4140 

CATGTGCAGG CGCTGCTGGT CAAGAGATTC CAACACACCA TCCGCAGCCA CAAGGACTTC 4200 

CTGGCGCAGA TCGTGCTCCC GGCTACCTTT GTGTTTTTGG CTCTGATGCT TTCTATTGTT 4260

ATCCTTCCTT TTGGCGAATA CCCCGCTTTG ACCCTTCACC CCTGGATATA TGGGCAGCAG 4320 

TACACCTTCT TCAGCATGGA TGAACCAGGC AGTGAGCAGT TCACGGTACT TGCAGACGTC 4380 

CTCCTGAATA AGCCAGGCTT TGGCAACCGC TGCCTGAAGG AAGGGTGGCT TCCGGAGTAC 4440 

CCCTGTGGCA ACTCAACACC CTGGAAGACT CCTTCTGTGT CCCCAAACAT CACCCAGCTG 4500 

TTCCAGAAGC AGAAATGGAC ACAGGTCAAC CCTTCACCAT CCTGCAGGTG CAGCACCAGG 4560 

GAGAAGCTCA CCATGCTGCC AGAGTGCCCC GAGGGTGCCG GGGGCCTCCC GCCCCCCCAG 4620 

AGAACACAGC GCAGCACGGA AATTCTACAA GACCTGACGG ACAGGAACAT CTCCGACTTC 4680

TTGGTAAAAA CGTATCCTGC TCTTATAAGA AGCAGCTTAA AGAGCAAATT CTGGGTCAAT 4740 

GAACAGAGGT ATGGAGGAAT TTCCATTGGA GGAAAGCTCC CAGTCGTCCC CATCACGGGG 4800 

GAAGCACTTG TTGGGTTTTT AAGCGACCTT GGCCGGATCA TGAATGTGAG CGGGGGCCCT 4860 

ATCACTAGAG AGGCCTCTAA AGAAATACCT GATTTCCTTA AACATCTAGA AACTGAAGAC 4920 

AACATTAAGG TGTGGTTTAA TAACAAAGGC TGGCATGCCC TGGTCAGCTT TCTCAATGTG 4980 

GCCCACAACG CCATCTTACG GGCCAGCCTG CCTAAGGACA GGAGCCCCGA GGAGTATGGA 5040 

ATCACCGTCA TTAGCCAACC CCTGAACCTG ACCAAGGAGC AGCTCTCAGA GATTACAGTG 5100

CTGACCACTT CAGTGGATGC TGTGGTTGCC ATCTGCGTGA TTTTCTCCAT GTCCTTCGTC 5160

CCAGCCAGCT TTGTCCTTTA TTTGATCCAG GAGCGGGTGA ACAAATCCAA GCACCTCCAG 5220

TTTATCAGTG GAGTGAGCCC CACCACCTAC TGGGTGACCA ACTTCCTCTG GGACATCATG 5280 

AATTATTCCG TGAGTGCTGG GCTGGTGGTG GGCATCTTCA TCGGGTTTCA GAAGAAAGCC 5340 

TACACTTCTC CAGAAAACCT TCCTGCCCTT GTGGCACTGC TCCTGCTGTA TGGATGGGCG 5400

GTCATTCCCA TGATGTACCC AGCATCCTTC CTGTTTGATG TCCCCAGCAC AGCCTATGTG 5460

GCTTTATCTT GTGCTAATCT GTTCATCGGC ATCAACAGCA GTGCTATTAC CTTCATCTTG 5520 

GAATTATTTG ATAATAACCG GACGCTGCTC AGGTTCAACG CCGTGCTGAG GAAGCTGCTC 5580

ATTGTCTTCC CCCACTTCTG CCTGGGCCGG GGCCTCATTG ACCTTGCACT GAGCCAGGCT 5640

GTGACAGATG TCTATGCCCG GTTTGGTGAG GAGCACTCTG CAAATCCGTT CCACTGGGAC 5700

CTGATTGGGA AGAACCTGTT TGCCATGGTG GTGGAAGGGG TGGTGTACTT CCTCCTGACC 5760

CTGCTGGTCC AGCGCCACTT CTTCCTCTCC CAATGGATTG CCGAGCCCAC TAAGGAGCCC 5820

ATTGTTGATG AAGATGATGA TGTGGCTGAA GAAAGACAAA GAATTATTAC TGGTGGAAAT 5880

AAAACTGACA TCTTAAGGCT ACATGAACTA ACCAAGATTT ATCTGGGCAC CTCCAGCCCA 5940 

GCAGTGGACA GGCTGTGTGT CGGAGTTCGC CCTGGAGAGT GCTTTGGCCT CCTGGGAGTG 6000 

AATGGTGCCG GCAAAACAAC CACATTCAAG ATGCTCACTG GGGACACCAC AGTGACCTCA 6060

GGGGATGCCA CCGTAGCAGG CAAGAGTATT TTAACCAATA TTTCTGAAGT CCATCAAAAT 6120 

ATGGGCTACT GTCCTCAGTT TGATGCAATC GATGAGCTGC TCACAGGACG AGAACATCTT 6180 

TACCTTTATG CCCGGCTTCG AGGTGTACCA GCAGAAGAAA TCGAAAAGGT TGCAAACTGG 6240 

AGTATTAAGA GCCTGGGCCT GACTGTCTAC GCCGACTGCC TGGCTGGCAC GTACAGTGGG 6300 

GGCAACAAGC GGAAACTCTC CACAGCCATC GCACTCATTG GCTGCCCACC GCTGGTGCTG 6360

CTGGATGAGC CCACCACAGG GATGGACCCC CAGGCACGCC GCATGCTGTG GAACGTCATC 6420

GTGAGCATCA TCAGAAAAGG GAGGGCTGTG GTCCTCACAT CCCACAGCAT GGAAGAATGT 6480 

GAGGCACTGT GTACCCGGCT GGCCATCATG GTAAAGGGCG CCTTTGGATG TATGGGCACC 6540 

ATTCAGCATC TCAAGTCCAA ATTTGGAGAT GGCTATATCG TCACAATGAA GATCAAATCC 6600

CCGAAGGACG ACCTGCTTCC TGACCTGAAC CCTGTGGAGC AGTTCTTCCA GGGGAACTTC 6660

CCAGGCAGTG TGCAGAGGGA GAGGCACTAC AACATGCTCC AGTTCCAGGT CTCCTCCTCC 6720

TCCCTGGCGA GGATCTTCCA GCTCCTCCTC TCCCACAAGG ACAGCCTGCT GATCGAGGAG 6780 

TACTCAGTCA CACAGACCAC ACTGGACCAG GTGTTTGTAA ATTTTGCTAA ACAGCAGACT 6840 

GAAAGTCATG ACCTCCCTCT GCACCCTCGA GCTGCTGGAG CCAGTCGACA AGCCCAGGAC 6900 

TGATCTTTCA CACCGCTCGT TCCTGCAGCC AGAAAGGAAC TCTGGGCAGC TGGAGGCGCA 6960

GGAGCCTGTG CCCATATGGT CATCCAAATG GACTGGCCCA GCGTAAATGA CCCCACTGCA 7020

GCAGAAAACA AACACACGAG GAGCATGCAG CGAATTCAGA AAGAGGTCTT TCAGAAGGAA 7080 

ACCGAAACTG ACTTGCTCAC CTGGAACACC TGATGGTGAA ACCAAAGAAA TACAAAATCC 7140 

TTCTCCAGAC CCCAGAACTA GAAACCCCGG GCCATCCCAC TAGCAGCTTT GGCCTCCATA 7200 

TTGCTCTCAT TTCAAGCAGA TCTGCTTTTC TGCATGTTTG TCTGTGTGTC TGCGTTGTGT 7260

GTGATTTTCA TGGAAAAATA AAATGCAAAT GCACTCATCA CAAAAAAAAA AAAAAAAA 7318
Seq ID NO: C117 DNA Sequence

Nucleic Acid Accession #: NM_006671.2

Coding sequence: 138..1820

1 11 21 31 41 51

| | | | | |

GGCACGAGGC TGGTGTTTAG CAACTCCGAC CACCTGCCTG CTGAGGGGCT AGAGCCCTCA 60 

GCCCAGACCC TGTGCCCCCG GCCGGGCTCT CATGCGTGGA ATGGTGCTGT GCCCCTTGCC 120

AGCAGGCCAG GCTCACCATG GTGCCGCATG CCATCTTGGC ACGGGGGAGG GACGTGTGCA 180

GGCGGAATGG ACTCCTCATC CTGTCTGTGC TGTCTGTCAT CGTGGGCTGC CTCCTCGGCT 240 

TCTTCTTGAG GACCCGGCGC CTCTCACCAC AGGAAATTAG TTACTTCCAG TTCCCTGGAG 300 

AGCTCCTGAT GAGGATGCTG AAGATGATGA TCCTGCCACT GGTGGTCTCC AGCTTGATGT 360 

CCGGACTTGC CTCCCTGGAT GCCAAGACCT CTAGCCGCCT GGGCGTCCTC ACCGTGGCGT 420

ACTACCTGTG GACCACCTTC ATGGCTGTCA TCGTGGGCAT CTTCATGGTC TCCATCATCC 480

ACCCAGGCAG CGCGGCCCAG AAGGAGACCA CGGAGCAGAG TGGGAAGCCC ATCATGAGCT 540 

CAGCCGATGC CCTGTTGGAC CTCATCCGGA ACATGTTCCC AGCCAACCTA GTAGAAGCCA 600 

CATTCAAACA GTACCGCACC AAGACCACCC CAGTTGTCAA GTCCCCCAAG GTGGCACCAG 660 

AGGAGGCCCC TCCTCGGCGG ATCCTCATCT ACGGGGTCCA GGAGGAGAAT GGCTCCCATG 720

TGCAGAACTT CGCCCTGGAC CTGACCCCGC CGCCCGAGGT CGTTTACAAG TCAGAGCCGG 780 

GCACCAGCGA TGGCATGAAT GTGCTGGGCA TCGTCTTCTT CTCTGCCACC ATGGGCATCA 840 

TGCTGGGCCG CATGGGTGAC AGCGGGGCCC CCCTGGTCAG CTTCTGCCAG TGCCTCAATG 900 

AGTCGGTCAT GAAGATCGTG GCGGTGGCTG TGTGGTATTT CCCCTTCGGC ATTGTGTTCC 960 

TCATTGCGGG TAAGATCCTG GAGATGGACG ACCCCAGGGC CGTOGGCAAG AAGCTGGGCT 1020

TCTACTCAGT CACCGTGGTG TGCGGGCTGG TGCTCCAGGG GCTCTTTATC CTGCCCCTGC 1080

TCTACTTCTT CATCACCAAG AAGAATCCCA TCGTCTTCAT CCGCGGCATC CTGCAGGCTC 1140 

TGCTCATCGC GCTGGCCACC TCCTCCAGCT CAGCCACACT GCCCATCACC TTCAAGTGCC 1200 

TGCTGGAGAA CAACCACATC GACCGGCGCA TCGCTCGCTT CGTGCTGCCC GTGGGTGCCA 1260 

CCATCAACAT GGACGGCACT GCGCTCTACG AGGCTGTGGC CGCCATCTTC ATCGCCCAGG 1320 




WO 03/042661 



TCAACAACTA CGAGCTGGAC TTTGGCCAGA TCATCACCAT CAGTATCACA GCCACTGCAG 
CCAGCATTGG GGCAGCTGGC ATCCCCCAGG CCGGCCTCGT CACCATGGTC ATCGTGCTCA 
CCTCCGTGGG ACTGCCCACC GATGACATCA CCCTCATCAT TGCCGTTGAC TGGGCTCTGG 
ACCGTTTCCG CACCATGATT AACGTGCTGG GTGATGCGCT GGCAGCGGGG ATCATGGCCC 
ATATATGTCG GAAGGATTTT GCCCGGGACA CAGGCACCGA GAAACTGCTG CCCTGCGAGA 
CCAAGCCAGT GAGCCTCCAG GAGATCGTGG CAGCCCAGCA GAATGGCTGT GTGAAGAGTG 
TAGCCGAGGC CTCCGAGCTC ACCCTGGGCC CCACCTGCCC CCACCACGTC CCCGTTCAAG 
TGGAGCGGGA TGAGGAGCTG CCCGCTGOGA GTCTGAACCA CTGCACCATC CAGATCAGCG 
AGCTGGAGAC CAATGTCTGA GCCTGCGGAG CTGCAGGGGC AGGCGAGGCC TCCAGGGGCA 
GGGTCCTGAG GCAGGAACTC GACTCTCCAA CCCTCCTGAG CAGCCGGCAG GGGCCAGGAT 
CACACATTCT TCTCACCCTT GAGAGGCTGG AATTAACCCC GCTTGACGGA AAATGTATCT 
CAGAGAAGGG AAAGGCTGCA TGGGGGAGCC CCATCCAGGG AGTGATGGGC CCGGCATTGC 
CTGAGGCCCC GCTGTGACAG TTTCCCCGGT GTGAGCCCGG TGAGGGCGGC AGGCAGGGGT 
TATCCGGCCC CACTTTCTGG ATGACAGACT TGAGGCTCTG AGAGCTGAAA ACACTTGTCC 
AAGGTCTCAC GTTAAGGTCA AGACACTAAC TCAAATCTTT CAAGCCCCGC CTCTCCTCTT 
GGAGGACAGG GCAGCCTGCA GCTGTGTCCA GGCCCAGGCC CCACCCCATA ACAGGTGGCC 
TCAGCCACAC AGTTCTCCCC AAGGGGAGCA GCCCAGGGCC AAGCCCCGCT GCCTTCCCCA
GGCCACAGTG CGTCCAGTCT CCTGTCCTGC CACGTGTCTT TTGCAAAGCT CCTTGGATGT 
GGAGACAGAT GTCTTTACTA GAGCTGAAAG GCCCCCTTGA CACATCCAGG CCAACCTCCC 
ATGGAATAGG TAGGCAAGCC AGGACTCCGG GAAGGAGGTG CAGCCAGGAT GCTCTGGTGG 
AGCTGCCGAT GGGGCCCTGG TGTCAGAACT CCCCAAAGGC CTGTGCGTCC AAGTGGAGTC
AGGTTTTCTA TTCCTTTCTG TGTTTGCAAA TTCAGTGTTA ACTAAATAAA GGTATTTTGT 
TTTTCAAAAA AAAAAAAAAA AAA 

Seq ID NO: C118 DNA Sequence
Nucleic Acid Accession #: NM_005689
Coding sequence: 278..2806

1 11 21 31 41 51

| | | | | |

GGGCCTGCAG TTGGCAGAAG GGTCCCGGGC CCAGAGCCAG CGGGGCCGTG CTGAGACGGC 60 

GTACGTGCCC TGCGTGAGTG CGTGGCGGCG GCGCGTGCGC TAGGGGAGTG GGCGGTGAGG 120 

CCTGGTCCAC GTGCGTCCCT TCCCGGGACC CCCGCAGCTT GGCGCCCAGC GGCTACGTGA 180 

GCCAAGGCAC CCGGATGTCC GCGCCCCTCT CCGAGTGACA AGTCCCGGCC TCCGGTCCCG 240 

CAGTGCCCGC AGCCTCGGCC GGCGTCCACG CATTGCCATG GTGACTGTGG GCAACTACTG 300 

CGAGGCCGAA GGGCCCGTGG GTCCGGCCTG GATGCAGGAT GGCCTGAGTC CCTGCTTCTT 360 

CTTCACGCTC GTGCCCTCGA CGCGGATGGC TCTAGGGACT CTGGCCTTGG TGCTGGCTCT 420 

TCCCTGCAGA CGCCGGGAGC GGCCCGCTGG TGCTGATTCG CTGTCTTGGG GGGCCGGCCC 480 

TCGCATCTCT CCCTACGTGC TGCAGCTGCT TCTGGCCACA CTTCAGGCGG CGCTGCCCCT 540 

GGCCGGCCTG GCTGGCCGGG TGGGCACTGC CCGGGGGGCC CCACTGCCAA GCTATCTACT 600 

TCTGGCCTCC GTGCTGGAGA GTCTGGCCGG CGCCTGTGGC CTGTGGCTGC TTGTCGTGGA 660 

GCGGAGCCAG GCACGGCAGC GTCTGGCAAT GGGCATCTGG ATCAAGTTCA GGCACAGCCC 720 

TGGTCTCCTG CTCCTCTGGA CTGTGGCGTT TGCAGCTGAG AACTTGGCCC TGGTGTCTTG 780 

GAACAGCCCA CAGTGGTGGT GGGCAAGGGC AGACTTGGGC CAACAGGTTC AGTTTAGCCT 840 

GTGGGTGCTG CGGTATGTGG TCTCTGGAGG GCTGTTTGTC CTGGGTCTCT GGGCCCCTGG 900 

ACTTCGTCCC CAGTCCTATA CATTGCAGGT TCATGAAGAG GACCAAGATG TGGAAAGGAG 960 

CCAGGTTCGG TCAGCAGCCC AACAGTCTAC CTGGCGAGAT TTTGGCAGGA AGCTCCGCCT 1020 

CCTGAGTGGC TACCTGTGGC CTCGAGGGAG TCCAGCTCTG CAGCTGGTGG TGCTCATCTG 1080 

CCTGGGGCTC ATGGGTTTGG AACGGGCACT CAATGTGTTG GTGCCTATAT TCTATAGGAA 1140 

CATTGTGAAC TTGCTGACTG AGAAGGCACC TTGGAACTCT CTGGCCTGGA CTGTTACCAG 1200 

TTACGTCTTC CTCAAGTTCC TCCAGGGGGG TGGCACTGGC AGTACAGGCT TCGTGAGCAA 1260 

CCTGCGCACC TTCCTGTGGA TCCGGGTGCA GCAGTTCACG TCTCGGCGGG TGGAGCTGCT 1320 

CATCTTCTCC CACCTGCACG AGCTCTCACT GCGCTGGCAC CTGGGGCGCC GCACAGGGGA 1380 

GGTGCTGCGG ATCGCGGATC GGGGCACATC CAGTGTCACA GGGCTGCTCA GCTACCTGGT 1440 

GTTCAATGTC ATCCCCACGC TGGCCGACAT CATCATTGGC ATCATCTACT TCAGCATGTT 1500

CTTCAACGCC TGGTTTGGCC TCATTGTGTT CCTGTGCATG AGTCTTTACC TCACCCTGAC 1560 

CATTGTGGTC ACTGAGTGGA GAACCAAGTT TCGTCGTGCT ATGAACACAC AGGAGAACGC 1620 

TACCCGGGCA CGAGCAGTGG ACTCTCTGCT AAACTTCGAG ACGGTGAAGT ATTACAACGC 1680 

CGAGAGTTAC GAAGTGGAAC GCTATCGAGA GGCCATCATC AAATATCAGG GTTTGGAGTG 1740 

GAAGTCGAGC GCTTCACTGG TTTTACTAAA TCAGACCCAG AACCTGGTGA TTGGGCTCGG 1800 

GCTCCTCGCC GGCTCCCTGC TTTGCGCATA CTTTGTCACT GAGCAGAAGC TACAGGTTGG 1860 
GGACTATGTG CTCTTTGGCA CCTACATTAT CCAGCTGTAC ATGCCCCTCA ATTGGTTTGG 1920 
CACCTACTAC AGGATGATCC AGACCAACTT CATTGACATG GAGAACATGT TTGACTTGCT 1980 
GAAAGAGGAG ACAGAAGTGA AGGACCTTCC TGGAGCAGGG CCCCTTCGCT TTCAGAAGGG 2040 
CCGTATTGAG TTTGAGAACG TGCACTTCAG CTATGCCGAT GGGCGGGAGA CTCTGCAGGA 2100 
CGTGTCTTTC ACTGTGATGC CTGGACAGAC ACTTGCCCTG GTGGGCCCAT CTGGGGCAGG 2160 
GAAGAGCACA ATTTTGCGCC TGCTGTTTCG CTTCTAOGAC ATCAGCTCTG GCTGCATCCG 2220 
AATAGATGGG CAGGACATTT CACAGGTGAC CCAGGCCTCT CTCCGGTCTC ACATTGGAGT 2280 
TGTGCCCCAA GACACTGTCC TCTTTAATGA CACCATCGCC GACAATATCC GTTACGGCCG 2340 
TGTCACAGCT GGGAATGATG AGGTGGAGGC TGCTGCTCAG GCTGCAGGCA TCCATGATGC 2400 
CATTATGGCT TTCCCTGAAG GGTACAGGAC ACAGGTGGGC GAGCGGGGAC TGAAGCTGAG 2460 
CGGCGGGGAG AAGCAGCGCG TCGCCATTGC CCGCACCATC CTCAAGGCTC CGGGCATCAT 2520 
TCTGCTGGAT GAGGCAACGT CAGCGCTGGA TACATCTAAT GAGAGGGCCA TCCAGGCTTC 2580 
TCTGGCCAAA GTCTGTGCCA ACCGCACCAC CATCGTAGTG GCACACAGGC TCTCAACTGT 2640 
GGTCAATGCT GACCAGATCC TCGTCATCAA GGATGGCTGC ATCGTGGAGA GGGGACGACA 2700 
CGAGGCTCTG TTGTCCCGAG GTGGGGTGTA TGCTGACATG TGGCAGCTGC AGCAGGGACA 2760 
GGAAGAAACC TCTGAAGACA CTAAGCCTCA GACCATGGAA CGGTGACAAA AGTTTGGCCA 2820 
CTTCCCTCTC AAAGACTAAC CCAGAAGGGA ATAAGATGTG TCTCCTTTCC CTGGCTTATT 2880 
TCATCCTGGT CTTGGGGTAT GGTGCTAGCT ATGGTAAGGG AAAGGGACCT TTCCGAAAAA 2940 
CATCTTTTGG GGAAATAAAA ATGTGGACTG TGAAAAAAAA AAAAAAAAAA AAA 2993 

Seq ID NO: C119 DNA Sequence
Nucleic Acid Accession #: NM_000676
Coding sequence: 333..1331

1 11 21 31 41 51

| | | | | |

GGGCAATTTG TTAGTTATCC GCCGCCACCA AGACGCGGCA CGGCGCCTGG ACCGGAGGGG 60 

CCCCGCGCGG GCGCGAACTT TGGGCTCGGG CGAGTGGGTG GTGCTCCGCC CAGCCCGAGA 120 

CGGGCGGGCG CGCGGGCCAA TGGGTGCCGC CTCTTGGCCG CGGGGGGCCC CGACCCGTGG 180 

GTCCCGGCCA CCAGCGCCCC AGCCCCGAGG CTCAGAAGCG GCAGGCGGAG GCGCGGTCCG 240 

GGCGCTATGG CCATGCCCGG CGGGTCTCAC GCGGCTGCCC CTCGCCCGGC GCGCCTTCGG 300

TAGGGGGCGC CCGGGGCCCA GCTGGCCCGG CCATGCTGCT GGAGACACAG GACGCGCTGT 360 

ACGTGGCGCT GGAGCTGGTC ATCGCCGCGC TTTCGGTGGC GGGCAACGTG CTGGTGTGCG 420 

CCGCGGTGGG CACGGCGAAC ACTCTGCAGA CGCCCACCAA CTACTTCCTG GTGTCCCTGG 480

CTGCGGCCGA CGTGGCCGTG GGGCTCTTCG CCATCCCCTT TGCCATCACC ATCAGCCTGG 540

GCTTCTGCAC TGACTTCTAC GGCTGCCTCT TCCTOGCCTG CTTCGTGCTG GTGCTCACGC 600 

AGAGCTCCAT CTTCAGCCTT CTGGCCGTGG CAGTCGACAG ATACCTGGCC ATCTGTGTCC 660 

CGCTCAGGTA TAAAAGTTTG GTCACGGGGA CCCGAGCAAG AGGGGTCATT GCTGTCCTCT 720 

GGGTCCTTGC CTTTGGCATC GGATTGACTC CATTCCTGGG GTGGAACAGT AAAGACAGTG 780 

CCACCAACAA CTGCACAGAA CCCTGGGATG GAACCACGAA TGAAAGCTGC TGCCTTGTGA 840 

AGTGTCTCTT TGAGAATGTG GTCCCCATGA GCTACATGGT ATATTTCAAT TTCTTTGGGT 900 

GTGTTCTGCC CCCACTGCTT ATAATGCTGG TGATCTACAT TAAGATCTTC CTGGTGGCCT 960 

GCAGGCAGCT TCAGCGCACT GAGCTGATGG ACCACTCGAG GACCACCCTC CAGCGGGAGA 1020 

TCCATGCAGC CAAGTCACTG GCCATGATTG TGGGGATTTT TGCCCTGTGC TGGTTACCTG 1080 

TGCATGCTGT TAACTGTGTC ACTCTTTTCC AGCCAGCTCA GGGTAAAAAT AAGCCCAAGT 1140 

GGGCAATGAA TATGGCCATT CTTCTGTCAC ATGCCAATTC AGTTGTCAAT CCCATTGTCT 1200 

ATGCTTACCG GAACCGAGAC TTCCGCTACA CTTTTCACAA AATTATCTCC AGGTATCTTC 1260 

TCTGCCAAGC AGATGTCAAG AGTGGGAATG GTCAGGCTGG GGTACAGCCT GCTCTCGGTG 1320 

TGGGCCTATG ATCTAGGCTC TCGCCTCTTC CAGGAGAAGA TACAAATCCA CAAGAAACAA 1380 

AGAGGACACG GCTGGTTTTC ATTGTGAAAG ATAGCTACAC CTCACAAGGA AATGGACTGC 1440 

CTCTCTTGAG CACTTCCCTG GAGCTACCAC GTATCTAGCT AATATGTATG TGTCAGTAGT 1500 

AGGCTCCAAG GATTGACAAA TATATTTATG ATCTATTCAG CTGCTTTTAC TGTGTGGATT 1560 

ATGCCAACAG CTTGAATGGA TTCTAACAGA CTCTTTTGTT TTTAAAAGTC TGCCTTGTTT 1620 

ATGGTGGAAA ATTACTGAAA CTATTTTACT GTGAAACAGT GTGAACTATT ATAATGCAAA 1680 

TACTTTTTAA CTTAGAGGCA ATGGAAAAAT AAAAGTTGAC TGTACTAAAA ATG 1733 

Seq ID NO: C120 DNA Sequence
Nucleic Acid Accession #: NM_052932
Coding sequence: 217..786

1 11 21 31 41 51

| | | | | |

CCCAGCCCGG CCCCGCCGCC CCGGCTGCGC ACGCGACGCC CCCTCCAGGC CCCGCTCCTG 60 

CGCCCTATTT GGTCATTCGG GGGGCAAGCG GCGGGAGGGG AAACGTGCGC GGCCGAAGGG 120 

GAAGCGGAGC CGGCGCCGGC TGCGCAGAGG AGCCGCTCTC GCCGCCGCCA CCTCGGCTGG 180 

GAGCCCACGA GGCTGCCGCA TCCTGCCCTC GGAACAATGG GACTCGGCGC GCGAGGTGCT 240 

TGGGCCGCGC TGCTCCTGGG GACGCTGCAG GTGCTAGCGC TGCTGGGGGC CGCCCATGAA 300 

AGCGCAGCCA TGGCGGAGAC TCTCCAACAT GTGCCTTCTG ACCATACAAA TGAAACTTCC 360 

AACAGTACTG TGAAACCACC AACTTCAGTT GCCTCAGACT CCAGTAATAC AACGGTCACC 420 

ACCATGAAAC CTACAGCGGC ATCTAATACA ACAACACCAG GGATGGTCTC AACAAATATG 480 

ACTTCTACCA CCTTAAAGTC TACACCCAAA ACAACAAGTG TTTCACAGAA CACATCTCAG 540 

ATATCAACAT CCACAATGAC CGTAACCCAC AATAGTTCAG TGACATCTGC TGCTTCATCA 600 

GTAACAATCA CAACAACTAT GCATTCTGAA GCAAAGAAAG GATCAAAATT TGATACTGGG 660 

AGCTTTGTTG GTGGTATTGT ATTAACGCTG GGAGTTTTAT CTATTCTTTA CATTGGATGC 720 

AAAATGTATT ACTCAAGAAG AGGCATTCGG TATCGAACCA TAGATGAACA TGATGCCATC 780 

ATTTAAGGAA ATCCATGGAC CAAGGATGGA ATACAGATTG ATGCTGCCCT ATCAATTAAT 840 

TTTGGTTTAT TAATAGTTTA AAACAATATT CTCTTTTTGA AAATAGTATA AACAGGCCAT 900 

GCATATAATG TACAGTGTAT TACGTAAATA TGTAAAGATT CTTCAAGGTA ACAAGGGTTT 960 

GGGTTTTGAA ATAAACATCT GGATCTTATA GACCGTTCAT ACAATGGTTT TAGCAAGTTC 1020 

ATAGTAAGAC AAACAAGTCC TATCTTTTTT TTTTTGGCTG GGGTGGGGGC ATTGGTCACA 1080 

TATGACCAGT AATTGAAAGA CGTCATCACT GAAAGACAGA ATGCCATCTG GGCATACAAA 1140 

TAAGAAGTTT GTCACAGCAC TCAGGATTTT GGGTATCTTT TGTAGCTCAC ATAAAGAACT 1200 

TCAGTGCTTT TCAGAGCTGG ATATATCTTA ATTACTAATG CCACACAGAA ATTATACAAT 1260 

CAAACTAGAT CTGAAGCATA ATTTAAGAAA AACATCAACA TTTTTTGTGC TTTAAACTGT 1320 

AGTAGTTGGT CTAGAAACAA AATACTCCAA GAAAAAGAAA ATTTTCAAAT AAAACCCAAA 1380 

ATAATAGCTT TGCTTAGCCC TGTTAGGGAT CCATTGGAGC ATTAAGGAGC ACATATTTTT 1440 

ATTAACTTCT TTTGAGCTTT CAATGTTGAT GTAATTTTTG TTCTCTGTGT AATTTAGGTA 1500 

AACTGCAGTG TTTAACATAA TAATGTTTTA AAGACTTAGT TGTCAGTATT AAATAATCCT 1560 

GGCATTATAG GGAAAAAACC TCCTAGAAGT TAGATTATTT GCTACTGTGA GAATATTGTC 1620 

ACCACTGGAA GTTACTTTAG TTCATTTAAT TTTAATTTTA TATTTTGTGA ATATTTTAAG 1680 

AACTGTAGAG CTGCTTTCAA TATCTAGAAA TTTTTAATTG AGTGTAAACA CACCTAACTT 1740 

TAAGAAAAAG AACCGCTTGT ATGATTTTCA AAAGAACATT TAGAATTCTA TAGAGTCAAA 1800 

ACTATAGCGT AATGCTGTGT TTATTAAGCC AGGGATTGTG GGACTTCCCC CAGGCAACTA 1860 

AACCTGCAGG ATGAAAATGC TATATTTTCT TTCATGCACT GTCGATATTA CTCAGATTTG 1920 

GGGAAATGAC ATTTTTATAC TAAAACAAAC ACCAAAATAT TTTAGAATAA ATTCTTAGAA 1980 

AGTTTTGAGA GGAATTTTTA GAGAGGACAT TTCCTCCTTC CTGATTTGGA TATTCCCTCA 2040 

AATCCCTCCT CTTACTCCAT GCTGAAGGAG AAGTACTCTC AGATGCATTA TGTTAATGGA 2100 

GAGAAAAAGC ACAGTATTGT AGAGACACCA ATATTAGCTA ATGTATTTTG GAGTGTTTTC 2160 

CATTTTACAG TTTATATTCC AGCACTCAAA ACTCAGGGTC AAGTTTTAAC AAAAGAGGTA 2220 

TGTAGTCACA GTAAATACTA AGATGGCATT TCTATCTCAG AGGGCCAAAG TGAATCACAC 2280 

CAGTTTCTGA AGGTCCTAAA AATAGCTCAG ATGTCCTAAT GAACATGCAC CTACATTTAA 2340 

TAGGAGTACA ATAAAACTGT TGTCAGCTTT TGTTTTACAG AGAACGCTAG ATATTAAGAA 2400 

TTTTGAAATG GATCATTTCT ACTTGCTGTG CATTTTAACC AATAATCTGA TGAATATAGA 2460 

AAAAAATGAT CCAAAATATG GATATGATTG GATGTATGTA ACACATACAT GGAGTATGGA 2520 

GGAAATTTTC TGAAAAATAC ATTTAGATTA GTTTAGTTTG AAGGAGAGGT GGGCTGATGG 2580 

CTGAGTTGTA TGTTACTAAC TTGGCCCTGA CTGGTTGTGC AACCATTGCT TCATTTCTTT 2640 

GCAAAATGTA GTTAAGATAT ACTTTATTCT AATGAAGGCC TTTTAAATTT GTCCACTGCA 2700 
TTCTTGGTAT TTCACTACTT CAAGTCAGTC AGAACTTOGT AGACCGACCT GAAGTTTCTT 2760

TTTGAATACT TGTTTCTTTA GCACTTTGAA GATAGAAAAA CCACTTTTTA AGTACTAAGT 2820

CATCATTTGC CTTGAAAGTT TCCTCTGCAT TGGGTTTGAA GTAGTTTAGT TATGTCTTTT 2880

TCTCTGTATG TAAGTAGTAT AATTTGTTAC TTTCAAATAC CCGTACTTTG AATGTAGGTT 2940

TTTTTGTTGT TGTTATCTAT AAAAATTGAG GGAAATGGTT ATGCAAAAAA ATATTTTGCT 3000

TTGGACCATA TTTCTTAAGC ATAAAAAAAT GCTCAGTTTT GCTTGCATTC CTTGAGAATG 3060

TATTTATCTG AAGATCAAAA CAAACAATCC AGATGTATAA GTACTAGGCA GAAGCCAATT 3120

TTAAAATTTC CTTGAATAAT CCATGAAAGG AATAATTCAA ATACAGATAA ACAGAGTTGG 3180

CAGTATATTA TAGTGATAAT TTTGTATTTT CAAMAAAAAA AAAGTTAAAC TCTTCTTTTC 3240

TTTTTATTAT AATGACCAGC TTTTGGTATT TCATTGTTAC CAAGTTCTAT TTTTAGATAA 3300

AATTGTTCTC CTTCTAAAAA AAAAAAAAAA AAAAAAAA 3338

Seq ID NO: C121 DNA Sequence
Nucleic Acid Accession #: NM_004195
Coding sequence: 1..726

1 11 21 31 41 51

| | | | | |

ATGGCACAGC ACGGGGCGAT GGGCGCGTTT CGGGCCCTGT [missing] GCTGCTGTGC 60

GCGCTCAGCC TGGGTCAGCG CCCCACCGGG GGTCCCGGGT [missing] GCGCCTCCTG 120

CTTGGGACGG GAACGGACGC GCGCTGCTGC CGGGTTCACA [missing] CTGCCGCGAT 180

TACCCGGGCG AGGAGTGCTG TTCCGAGTGG GACTGCATGT [missing] TGAATTCCAC 240

TGCGGAGACC CTTGCTGCAC GACCTGCCGG CACCACCCTT [missing] CCAGGGGGTA 300

CAGTCCCAGG GGAAATTCAG TTTTGGCTTC CAGTGTATCG [missing] GGGGACCTTC 360

TCCGGGGGCC ACGAAGGCCA CTGCAAACCT TGGACAGACT [missing] CGGGTTTCTC 420

[missing] CTGGGAACAA GACCCACAAC GCTGTGTGCG [missing] CCCGCCGGCA 480

GAGCCGCTTG GGTGGCTGAC CGTCGTCCTC CTGGCCGTGG [missing] CCTCCTCCTG 540

ACCTCGGCCC AGCTTGGACT GCACATCTGG CAGCTGAGGA [missing] GTGGCCCCGA 600

GAGACCCAGC TGCTGCTGGA GGTGCCGCCG TCGACCGAAG [missing] CTGCCAGTTC 660

CCCGAGGAAG AGCGGGGCGA GCGATCGGCA GAGGAGAAGG [missing] AGACCTGTGG 720

GTGTGA 726

Seq ID NO: C122 DNA Sequence
Nucleic Acid Accession #: AK091896.1
Coding sequence: 28..1572

1 11 21 31 41 51

| | | | | |

AGATCCGCGA GCCCGTCAGC CTGCGCCATG GGCTGCGACG GCCGCGTGTC GGGGCTGCTC 60 

CGCCGCAACC TGCAGCCCAC GCTCACCTAC TGGAGCGTCT TCTTCAGCTT CGGCCTGTGC 120 

ATCGCCTTCC TGGGGCCCAC GCTGCTGGAC CTGCGCTGTC AGACGCACAG CTCGCTGCCC 180 

CAGATCTCCT GGGTCTTCTT CTCGCAGCAG CTCTGCCTCC TGCTGGGCAG CGCCCTCGGG 240 

GGCGTCTTCA AAAGGACCCT GGCCCAGTCA CTATGGGCCC TGTTCACCTC CTCTCTGGCC 300 

ATCTCCCTGG TGTTTGCCGT CATCCCCTTC TGCCGCGACG TGAAGGTGCT GGCCTCAGTC 360 

ATGGCGCTGG CGGGCTTGGC CATGGGCTGC ATCGACACCG TGGCCAACAT GCAGCTGGTA 420 

AGGATGTACC AGAAGGACTC GGCCGTCTTC CTCCAGGTGC TCCATTTCTT CGTGGGCTTT 480 

GGTGCTCTGC TGAGCCCCCT TATTGCTGAC CCTTTCCTGT CTGAGGCCAA CTGCTTGCCT 540 

GCCAATAGCA CGGCCAACAC CACCTCCCGA GGCCACCTGT TCCATGTCTC CAGGGTGCTG 600 

GGCCAGCACC ACGTAGATGC CAAGCCTTGG TCCAACCAGA CGTTCCCAGG GCTGACTCCA 660 

AAGGACGGGG CAGGGACCCG AGTGTCCTAT GCCTTCTGGA TCATGGCCCT CATCGATCTT 720 

CCAGTGCCCA TGGCTGTGCT GATGCTGCTG TCCAAGGAGC GGCTGCTGAC CTGCTGTCCC 780 

CAGAGGAGGC CCCTGCTTCT GTCTGCTGAT GAGCTTGCCT TGGAGACACA GCCTCCTGAG 840 

AAGGAAGATG CCTCCTCACT GCCCCCAAAG TTTCAGTCAC ACCTAGGGCA TGAGGACCTG 900 

TTCAGCTGCT GCCAAAGGAA GAACCTCAGA GGAGCCCCTT ATTCCTTCTT TGCCATCCAC 960 

ATCACGGGCG CCCTGGTACT GTTCATGACG GATGGGTTGA CGGGTGCCTA TTCCGCCTTC 1020 

GTGTACAGCT ATGCTGTGGA GAAGCCCCTG TCTGTGGGAC ACAAGGTGGC TGGCTACCTC 1080 

CCCAGCCTCT TCTGGGGCTT CATCACACTG GGCCGGCTCC TCTCCATTCC CATATCCTCA 1140 

AGAATGAAGC CGGCCACCAT GGTTTTCATC AACGTGGTTG GCGTGGTGGT GACGTTCCTG 1200 

GTGCTGCTTA TTTTCTCCTA CAACGTCGTC TTCCTGTTCG TGGGGACGGC AAGCCTGGGC 1260 

CTGTTTCTCA GCAGCACCTT CCCCAGCATG CTGGCCTACA CGGAGGACTC GCTGCAGTAC 1320 

AAAGGCTGTG CAACCACAGT GCTGGTGACA GGGGCAGGAG TTGGCGAGAT GGTGCTGCAG 1380 

ATGCTGGTTG GTTCGATATT CCAGGCTCAG GGCAGCTATA GTTTCCTGGT CTGTGGCGTG 1440 

ATCTTTGGTT GTCTGGCTTT TACCTTCTAT ATCTTGCTCC TGTTTTTCCA CAGGATGCAC 1500 

CCTGGACTCC CATCAGTTCC TACCCAAGAC AGATCAATTG GAATGGAAAA CTCTGAGTGC 1560 

TACCAGAGGT AAAACTGGGT GAAGAAGGCA AGAGAAGACT TTCAGCCTCT TGATCACCAG 1620 

CACGACCATA CTGTTTCAGA AAGCTGGGTG GTGGTGGAGG CGCTCTCTCA ATGGCTATTC 1680 

AAGTCTTCTC CACTAAAACT TGGTTGGGTA GAGGAAATTA AATTGAGTCC TGGTACCTGG 1740 

TCAAAATCAT TAGAAGTTTA CCTGGCTTCT CAAGTTATCT TCTTCCCTGG TTCAGACTGT 1800 

TGGTAAGAGC TGTCCAGATA CCCAGATGGG AAGGAAGGAG ACAGCCGCGC GCTTCACTCC 1860 

ATTTGTCACC TCATGCATGG ACCATACTCT GGGTTTGAGA TCATTCTTCA TTGAAGTTTG 1920 

TAAAAATAGG TTGAAATTGT AAAGCTCCAT GATCACTGCT ATATGTAGAT ATATTTCAAT 1980 

TTAAGCAAAA CAAGCTGCAA GTTATTCCCT GGCATGCTCA AAGGATTTTC GTGCTTTTCA 2040 

CTTAATAGTC CAAAGTCTCT TAAATTCCTG CTGCAGATAT CAATAGCTTA TCTATATTCT 2100 

CAAACACCAA AAGGAAAAGT TGAATCTTGC TCTCTTTGGT ATACTAATGT AGTGGTATGC 2160 

TAAGCTGGCT CATACCAACT TAGAAAAGCT GATTGTAAAA TTTTCATTTT GACAGCTGGT 2220 

TATTAAATGC AGCCATTATT AAAAATCAAA TCATACAAAC TTATAATTAA ATCAATTACA 2280 

TTTAAAACAA AGGTAATAAA TATTCAAAGC ATATCACTTC CT 2322 

Seq ID NO: C123 DNA Sequence

Nucleic Acid Accession #: NM_002203.2

Coding sequence: 43..3588

CTGCAAACCC AGCGCAACTA CGGTCCCCCG GTCAGACCCA GGATGGGGCC AGAACGGACA 60

GGGGCCGCGC CGCTGCCGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 

TGTTTGGCCT ACAATGTTGG TCTCCCAGAA GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 

CAGTTTGGGT ATGCAGTGCA GCAGTTTATA AATCCAAAAG GCAACTGGTT ACTGGTTGGT 240 

TCACCCTGGA GTGGCTTTCC TGAGAACCGA ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 

CTATCCACTG CCACATGTGA AAAACTAAAT TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

ACTGAGATGA AAACCAACAT GAGCCTCGGC TTGATCCTCA CCAGGAACAT GGGAACTGGA 420 

GGTTTTCTCA CATGTGGTCC TCTGTGGGCA CAGCAATGTG GGAATCAGTA TTACACAACG 480 

GGTGTGTGTT CTGACATCAG TCCTGATTTT CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 540 

CAGCCCTGCC CTTCCCTCAT AGATGTTGTG GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600 

CCTTGGGATG CAGTAAAGAA TTTTTTGGAA AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 

ACAAAGACAC AGGTGGGGTT AATTCAGTAT GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

AACACATATA AAACCAAAGA AGAAATGATT GTAGCAACAT CCCAGACATC CCAATATGGT 780 

GGGGACCTCA CAAACACATT CGGAGCAATT CAATATGCAA GAAAATATGC CTATTCAGCA 840 

GCTTCTGGTG GGCGACGAAG TGCTACGAAA GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

CATGATGGTT CAATGTTGAA AGCTGTGATT GATCAATGCA ACCATGACAA TATACTGAGG 960 

TTTGGCATAG CAGTTCTTGG GTACTTAAAC AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

AAAGAAATAA AAGCGATCGC TAGTATTCCA ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

GAAGCAGCTC TACTAGAAAA GGCTGGGACA TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

ACTGTTCAAG GAGGAGACAA CTTTCAGATG GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

TACTCTTCTC AAAATGATAT TCTGATGCTG GGTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 

ACCATTGTCC AGAAGACATC TCATGGCCAT TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

ATTCTGCAGG ACAGAAATCA CAGTTCATAT TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380

GGAGAAAGCA CTCACTTTGT TGCTGGTGCT CCTCGGGCAA ATTATACCGG CCAGATAGTG 1440

CTATATAGTG TGAATGAGAA TGGCAATATC ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

GACGTGCTCT TGGTAGGTGC ACCAATGTAC ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 

GTCTACCTGT TTACTATCAA AAAGGGCATT TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GAGGGCATTG AAAACACTCG ATTTGGTTCA GCAATTGCAG CTCTTTCAGA CATCAACATG 1740 

GATGGCTTTA ATGATGTGAT TGTTGGTTCA CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 

TACATTTACA ATGGTCATCA GGGCACTATC CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860

TCCGATGGAG CCTTTAGGAG CCATCTCCAG TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

GATTTAAATG GGGATTCCAT CACCGATGTG TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

CTCTGGTCAC AAAGTATTGC TGATGTAGCT ATAGAAGCTT CATTCACACC AGAAAAAATC 2040 

ACTTTGGTCA ACAAGAATGC TCAGATAATT CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 

CCTACTAAGC AAAACAATCA AGTGGCCATT GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTTCATCCA GAGTAACCTC CAGGGGGTTA TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 2220 

AAGAATATGG TAGTAAATCA AGCACAGAGT TGCCCCGAGC ACATCATTTA TATACAGGAG 2280 

CCCTCTGATG TTGTCAACTC TTTGGATTTG CGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340 

ACTAGCCCTG CCCTTGAAGC CTATTCTGAG ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

AAAGACTGTG GTGAGGATGG ACTTTGCATT TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

CCAGCTGCTC AAGAACAACC CTTTATTGTC AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

GTAACACTGA AAAATAAAAG GGAAAGTGCA TACAACACTG GAATTGTTGT TGATTTTTCA 2580 

GAAAACTTGT TTTTTGCATC ATTCTCCCTA CCGGTTGATG GGACAGAAGT AACATGCCAG 2640 

GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 

CAACAGGTGA CTTTTACTAT TAACTTTGAC TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

AACCTCAAAA TTCCTCTCCT GTATGATGCT GAAATTCACT TAACAAGATC TACCAACATA 2880 

AATTTTTATG AAATCTCTTC GGATGGGAAT GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 

GTTGGTCCAA AATTCATCTT CTCCCTGAAG GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

GCAACTGTAA TCATCCACAT CCCTCAGTAT ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 

ACTGGGGTGC AAACAGACAA GGCTGGTGAC ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120 

AAAATAGGAC AAACATCTTC TTCTGTATCT TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

GAATTGAACT GCAGAACTGC TTCCTGTAGT AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

ATGAAAGGAG AATACTTTGT TAATGTGACT ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300 

TCAACGTTCC AGACAGTACA GCTAACGGCA GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 

ATATATGTGA TTGAAGATAA CACTGTTACG ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420 

AAAGCCGAAG TACCAACAGG AGTTATAATA GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

TTAGCTCTGG TTGCAATTTT ATGGAAGCTC GGCTTCTTCA AAAGAAAATA TGAAAAGATG 3540 

ACCAAAAATC CAGATGAGAT TGATGAGACC ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

ACCTGCAGTG GGAACCGGCA GCATCCCAGC CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

TTTTTAAATC CCATATTTTT TTTATCATGT CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

AAAACTGCAG GTCAGTTTGG ATGAAGAAAT TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

GTAGGGAAAT AATAGGGAAA ATACCTATTT TATATGATGG GGGAAAAAAA GTAATCTTTA 3840 

AACTGGCTGG CCCAGAGTTT ACATTCTAAT TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 

CAAGCATGAC AACTTTTAAA GAAAAATATG ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

TCTCTTTAAA ATATTTGTCT TTAAACAGCA ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020 

GTACTTCCAC TTGTGTATAT TTTAATGAAT ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 

ACAGGTTTTT TCAATTTATG CTGCTCATCC AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 

TAATTTTATT TATAAACTAG GTAAAATTTG TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

CTTCCACACC CCATCTTGCT CTAATGATCA AAACATGCTT GAATAACTGA GCTTAGAGTA 4260 

TACCTCCTAT ATGTCCATTT AAGTTAGGAG AGGGGGCGAT ATAGAGACTA AGGCACAAAA 4320 

TTTTGTTTAA AACTCAGAAT ATAACATTTA TGTAAAATCC CATCTGCTAG AAGCCCATCC 4380 

TGTGCCAGAG GAAGGAAAAG GAGGAAATTT CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 

CTTCTAGGAT TTGTTTGGCT GACTGGCAGT AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 

TTCTTTGGCA ACCTTCCTCC TCCCTTACTG AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 

TATTATAGAA GCCCTCTACA GCCTGACTTT CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 4620 

TTACCCCTCA TCCAAAGTTC CCACTCCTTC AGGACAGCTG CTGTGCATTA GATATTAGGG 4680 

GGGAAAGTCA TCTGTTTAAT TTACACACTT GCATGAATTA CTGTATATAA ACTCCTTAAC 4740 

TTCAGGGAGC TATTTTCATT TAGTGCTAAA CAAGTAAGAA AAATAAGCTA GAGTGAATTT 4800 

CTAAATGTTG GAATGTTATG GGATGTAAAC AATGTAAAGT AAAACACTCT CAGGATTTCA 4860 

CCAGAAGTTA CAGATGAGGC ACTGGAAACC ACCACCAAAT TAGCAGGTGC ACCTTCTGTG 4920 

GCTGTCTTGT TTCTGAAGTA CTTTTTCTTC CACAAGAGTG AATTTGACCT AGGCAAGTTT 4980 

GTTCAAAAGG TAGATCCTGA GATGATTTGG TCAGATTGGG ATAAGGCCCA GCAATCTGCA 5040 
TTTTAACAAG CACCCCAGTC ACTAGGATGC AGATGGACCA CACTTTGAGA AACACCACCC 5100

ATTTCTACTT TTTGCACCTT ATTTTCTCTG TTCCTGAGCC CCCACATTCT CTAGGAGAAA 5160 

CTTAGATTAA AATTCACAGA CACTACATAT CTAAAGCTTT GACAAGTCCT TGACCTCTAT 5220 

AAACTTCAGA GTCCTCATTA TAAAATGGGA AGACTGAGCT GGAGTTCAGC AGTGATGCTT 5280 

TTTAGTTTTA AAAGTCTATG ATCTGATCTG GACTTCCTAT AATACAAATA CACAATCCTC 5340 

CAAGAATTTG ACTTGGAAAA G 5361 

Seq ID NO: C124 DNA Sequence
Nucleic Acid Accession #: NM_031460
Coding sequence: 103..1101



1 
I 

AGCAGGCGTT 
CTAGTCCACC 
CGGGCGGCTC 
CTCGCCTACC 
GCGGCGCAGG 
ACGTGTCTGG 
AACGGAGCCA 
TTCTTCTTTT 
ATGGCTGCCC 
GTGCTCAACC 
GGGGGCACCT 
TCGGGCCTCC 
AGCTACACAG 
TACGTGATTG 
CTGTGGATCC 
CTGGAGACGC 
TCCCAAAGCT 
TGCTATCCAG 
GGCTGTGGCA 
CCCCTGATTT 
CTAGAGGCTG 
CTGGGTGACA 
CTTACTTTAG 
GCACCATGAA 
TGATGATCAC 
GAAATAAACA 



11 

I 

TGCGAGAGGA 

CCGAGGGCAG 
TGGCTTACCT 
ACTCCAGCCG 
ACCGCCCGGC 
GCCTCCTCAG 
CTGTGTCCAC 
GCCTCTTCTG 
GACTGGGGCA 
GGCAGGATCC 
TGCTCTTCCT 
AGGGCTTCTA 
GAATGAACCC 
TCTTTGGGAT 
CAGGGAGGGT 
GGAGACAGGG 
AGGGACCCAT 
AGGACAGCTA 
TAAGCTTTGC 
GGTGCAGCTA 
TGGGATGTGA 
CGGGCTGCAA 
GTAGCGGCAA 
AGAGCCATCC 
TCTATGAAA 



21 
I 

GATACGAGCT 
CCGGCTCCCC 
GGTCCGGGGC 
GGCGCTGGGC 
CAGCTTCCAG 
GCTGGACTCG 
CAACACCACC 
CATCACCACC 
CATCTTCTTT 
TCTCATGCAG 
TGACAAGGCG 
GCTGCTGCCA 
CTTCGCCTTC 
CTCCCAGAGG 
GGCATGGCTG 
ATGTTCCTGC 
ACCTGACCGG 
GGGAATCATA 
GTTATACTCC 
ACATGTCCAC 
TATGATTAAT 
CTTTCGGGTG 
TGCCGCCGAT 
TGTTTGAGCG 
TAACAAACGG 



31 

I 

GGACGCCTGG 

TGCGCGGTGC 
ACCGGCGTGT 
CGCGACAAGT 
CTGATCCGGG 
AGCATGGGGC 
ATTGGCTATG 
GCCCTTGTGG 
CAGGGAGTAA 
CGGTGGCTGG 
CCGCTGCTCT 
ATCACCCTCA 
TACCCACTGT 
GCCTTGATCA 
TGCCACCACA 
GAGCCAGAGT 
CAGCATCTGG 
ATTCTTTGGT 
CCAAACTAAA 
TCTGCCCAAT 
TCGGGGCAGC 
ATGATGGCTG 
GCACAATTAG 
AATATCACCG 



41 
I 

CCCTTCCCTC 
CTATGTACCG 
CCAGCACCGT 
TCTGGACGCT 
GGGAGCTGTT 
ATGTCGTCCA 
GCTGGGAGCT 
GCAACCTGAG 
GGATCCCACT 
ACCACTGGGC 
CGGGCTCTGG 
TCTCCCACAT 
GCACCGTGGG 
GGTACAAGAA 
TCAAACTCAT 
GCTCTAAGGA 
CCCACTCCCC 
AACCTTCTGC 
CGTCGTCCTC 
GACTACATTT 
AGGGTATACA 
ATGCCCTTCT 
GGAGCTCTGG 
ATAGGAAGAG 
ACCCTCCTTT 



51 
I 

CCACCGGGTC 
ACCGCGAGCC 
GCTCCTGCTG 
GGAGGGCCGC 
GCAGAACTTC 
AGCATACAAA 
CGTGGGCTCC 
CCCCAACACG 
CAACCTCGTG 
CAGCAGGCTG 
CGCCCTCCTC 



CTTCGGCGAC 
CATGGTGTCC 
CCTCTCCCAG 
AGACTTCAAG 
ACAGCAAGGA 
TCACGCTGCA 
GGTAGCAAGA 
TCCATCCACC 
GAGACATGTC 
CCCCCACTTC 
CAGCCATACG 
TCTGGATCTC 
ATGTGAGAGA 



Seq ID NO: C125 DNA Sequence
Nucleic Acid Accession #: NM_004154
Coding sequence: 309..1295



AAGGACAGAG 
CGGAAGAACT 
TGGGGCTACC 
CCTGTCATCT 
TTCTTCATGA 
TGCCAGAACA 
CACGAGTGGG 
GGATAACAAG 
GATGGGTGCG 
CCATGGAATG 
ACCGCGAGAA 



GCACGGCOGT 
CCCTGCTCAT 
GCCTGGTCCG 
TCAGCTTCCA 
GCCGCCGGGC 
TGCCCACAGC 
TCAGCCCGCC 
GCTTCCTGCT 
GCCGCCAGGA 
TGGCCGTGGT 
CAGCCTACCT 
CAGCGGCCTA 
TCTTCTACTT 
CAGCCAAATG 
GCCATTGTGT 
TCAGCTCAGC 
ATTTCTTCAG 
CTGGCTCTTG 
ACAAAAATAC 



11 
I 

GAGGGGCCCT 
GGTTCTGTGG 
TCAGGGCCCC 
GGATAGTGTC 
CACTCCTGAT 
TTGCACGCGA 
AATTTGCTCC 
ACCTCTGCCA 
GTCCTCAGTG 
GGACAATGGC 
CTTCAAGCAA 
GAACATCTGT 
GTACACCCTA 
CTACAACTAT 
CTTCCTCTTC 
GCGCTACCTG 
TGCCTGGCTA 
CATCTTCGCT 
TGCCCTGGCC 
GCCCTTTGCT 
TGGCCCGGCA 
GGTGGCTGCT 
GGCAGTGCGC 
CAAAGGCACG 
CACCCAGAAG 
GCAGAGGCAG 
CCGGGGCACC 
TGGGCATGGA 

AGAGGTCCCA 
AGTGTGACGT 



21 
I 

TCCTGTCAGC 
AATTTGTGCT 
ACAGGATGAG 
TAAAAATTTG 
ATGTCTCTCA 
CAGTTTCAGG 
AGCACTTCAC 
GAAGAACCAT 
AGCCCCTGCC 
ACAGGCCAGG 
CTGCTGCTGC 
GTCATTACCC 
AACCTTGCTC 
GCCCAAGGTG 
TATGCCAACC 
GGCATCTGCC 
GTGTGTGTAG 
GCCACAGGCA 
ACCCACTATA 
GCCCTGCTGG 
GAGCCTGTGG 
GCCTTTGCCA 
TCGACGCCGG 

AAGTTCCGCC 
GGTCGCTGAG 
AGGAGCCCCA 
GTTAAGATCC 
GGCCCAGACC 
GTCAGCCATG 
GTACTGTCAA 



31 

I 

TGGCTGGGAG 
TATTTCCCAT 
GGGCTGGTTT 
CAAACTGCCT 
GTTTCCTCAT 
CACAGAACTG 
GGACTGCAAG 
GGCTTTGGAA 
TCCCTGAACA 
CTCTGGGCTT 
CACCTGTGTA 
AGATCTGCAC 
TGGCTGACCT 
ATCACTGGCC 
TGCACGGCAG 
ACCCGCTGGC 
CGGTGTGGCT 
TCCAGCGTAA 
TGCCCTATGG 
CCTGCTACTG 
CCCAGGAGCG 
TCAGCTTCCT 



CCAGTGCCAA 
GGCGACCACA 
TCCTCCAGGT 
CCAACCCCAA 
CTCACAGGAC 
CTGTGGGCAT 
GAGAGCTGGG 
AA 



41 

1 

CAGAGGTGGC 
CAAGGATCAA 
TCAGATGAGT 
TCTTGTCAGT 
CTGCTGCCTC 
ACTGGCAGCA 
CGAGGCACTT 
GGCGGAGTTC 
TAGGAAACCC 
GCCACCCACC 
TTCGGCGGTG 
GTCCCGCCGG 
GCTATATGCC 
CTTTGGCGAC 
CATCCTCTTC 
CCCCTGGCAC 
GGCCGTGACA 
CCGCACTGTC 
CATGGCTCTC 
TCTCCTGGCC 
GCGTGGCAAG 
GCCTTTTCAC 
CACTGTATTG 
CAGCGTGCTG 
TGAGCTCCTA 
CCTGGGCAGC 
ACCATGCGGA 
CCAGAAGCTC 
GGAGATGGAC 
GAAACCACAT 



Seq ID NO: C126 DNA Sequence
Nucleic Acid Accession #: NM_007197
Coding sequence: 18..1763



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1519 



51 
I 

TTTGTCTTTT 
GGACCTGCTC 
TTTCTGCTTG 
GTCTTGCTCA 
TCCAGACTTC 
GGGGCTGCTC 
GCTAACTCTT 
AGGCTGAGGA 
ACCTGGGCAG 
ACCTGTGTCT 
CTGGOGGCTG 
GCCCTGACCC 
TGCTCCCTGC 
TTCGCCTGCC 
CTCACCTGCA 
AAACGTGGGG 
ACCCAGTGCC 
TGCTATGACC 
ACTGTCATCG 
TGCCGCCTGT 
GCGGCCCGCA 
ATCACCAAGA 
GAGGCCTTTG 
GACCCCATCC 
CAGAAACTCA 
CTTCATATTT 
GAATTAGAGT 
ACCAAAAACT 
AGACCTGGGC 
TAAGGTGCTC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1832 



1 11 21 31 41 51

| | | | | |


ACACGTCCAA CGCCAGCATG CAGCGCCCGG GCCCCCGCCT GTGGCTGGTC CTGCAGGTGA 60 

TGGGCTCGTG CGCCGCCATC AGCTCCATGG ACATGGAGCG CCCGGGCGAC GGCAAATGCC 120 

AGCCCATCGA GATCCCGATG TGCAAGGACA TOGGCTACAA CATGACTCGT ATGCCCAACC 180 

TGATGGGCCA CGAGAACCAG CGCGAGGCAG CCATCCAGTT GCACGAGTTC GCGCCGCTGG 240 

TGGAGTACGG CTGCCACGGC CACCTCCGCT TCTTCCTGTG CTCGCTGTAC GCGCCGATGT 300 

GCACCGAGCA GGTCTCTACC CCCATCCCCG CCTGCCGGGT CATGTGCGAG CAGGCCCGGC 360 

TCAAGTGCTC CCCGATTATG GAGCAGTTCA ACTTCAAGTG GCCCGACTCC CTGGACTGCC 420 

GGAAACTCCC CAACAAGAAC GACCCCAACT ACCTGTGCAT GGAGGCGCCC AACAACGGCT 480 

CGGACGAGCC CACCCGGGGC TCGGGCCTGT TCCCGCCGCT GTTCCGGCCG CAGCGGCCCC 540 

ACAGCGCGCA GGAGCACCCG CTGAAGGACG GGGGCCCCGG GCGCGGCGGC TGCGACAACC 600 

CGGGCAAGTT CCACCACGTG GAGAAGAGCG CGTCGTGCGC GCCGCTCTGC ACGCCCGGCG 660

TGGACGTGTA CTGGAGCCGC GAGGACAAGC GCTTCGCAGT GGTCTGGCTG GCCATCTGGG 720 

CGGTGCTGTG CTTCTTCTCC AGCGCCTTCA CCGTGCTCAC CTTCCTCATC GACCCGGCCC 780 

GCTTCCGCTA CCCCGAGCGC CCCATCATCT TCCTCTCCAT GTGCTACTGC GTCTACTCCG 840 

TGGGCTACCT CATCCGCCTC TTCGCCGGCG CCGAGAGCAT CGCCTGCGAC CGGGACAGCG 900 

GCCAGCTCTA TGTCATCCAG GAGGGACTGG AGAGCACCGG CTGCACGCTG GTCTTCCTGG 960 

TCCTCTACTA CTTCGGCATG GCCAGCTCGC TGTGGTGGGT GGTCCTCACG CTCACCTGGT 1020 

TCCTGGCCGC CGGCAAGAAG TGGGGCCACG AGGCCATCGA AGCCAACAGC AGCTACTTCC 1080 

ACCTGGCAGC CTGGGCCATC CCGGCGGTGA AGACCATCCT GATCCTGGTC ATGCGCAGGG 1140 

TGGCGGGGGA CGAGCTCACC GGGGTCTGCT ACGTGGGCAG CATGGACGTC AACGCGCTCA 1200 

CCGGCTTCGT GCTCATTCCC CTGGCCTGCT ACCTGGTCAT CGGCACGTCC TTCATCCTCT 1260 

CGGGCTTCGT GGCCCTGTTC CACATCCGGA GGGTGATGAA GACGGGCGGC GAGAACACGG 1320

ACAAGCTGGA GAAGCTCATG GTGCGTATCG GGCTCTTCTC TGTGCTGTAC ACCGTGCCGG 1380 

CCACCTGTGT GATCGCCTGC TACTTTTACG AACGCCTCAA CATGGATTAC TGGAAGATCC 1440 

TGGCGGCGCA GCACAAGTGC AAAATGAACA ACCAGACTAA AACGCTGGAC TGCCTGATGG 1500 

CCGCCTCCAT CCCCGCCGTG GAGATCTTCA TGGTGAAGAT CTTTATGCTG CTGGTGGTGG 1560 

GGATCACCAG CGGGATGTGG ATTTGGACCT CCAAGACTCT GCAGTCCTGG CAGCAGGTGT 1620 

GCAGCCGTAG GTTAAAGAAG AAGAGCCGGA GAAAACCGGC CAGCGTGATC ACCAGCGGTG 1680 

GGATTTACAA AAAAGCCCAG CATCCCCAGA AAACTCACCA CGGGAAATAT GAGATCCCTG 1740 

CCCAGTCGCC CACCTGCGTG TGAACAGGGC TGGAGGGAAG GGCACAGGGG CGCCCGGAGC 1800 

TAAGATGTGG TGCTTTTCTT GGTTGTGTTT TTCTTTCTTC TTCTTCTTTT TTTTTTTTTT 1860 

ATAAAAGCAA AAGAGAAATA CATAAAAAAG TGTTTACCCT GAAATTCAGG ATGCTGTGAT 1920 

ACACTGAAAG GAAAAATGTA CTTAAAGGGT TTTGTTTTGT TTTGGTTTTC CAGCGAAGGG 1980 

AAGCTCCTCC AGTGAAGTAG CCTCTTGTGT AACTAATTTG TGGTAAAGTA GTTGATTCAG 2040 

CCCTCAGAAG AAAACTTTTG TTTAGAGCCC TCCGTAAATA TACATCTGTG TATTTGAGTT 2100 

GGCTTTGCTA CCCATTTACA AATAAGAGGA CAGATAACTG CTTTGCAAAT TCAAGAGCCT 2160 

CCCCTGGGTT AACAAATGAG CCATCCCCAG GGCCCACCCC CAGGAAGGCC ACAGTGCTGG 2220 

GCGGCATCCC TGCAGAGGAA AGACAGGACC CGGGGCCCGC CTCACACCCC AGTGGATTTG 2280 

GAGTTGCTTA AAATAGACTC TGGCCTTCAC CAATAGTCTC TCTGCAAGAC AGAAACCTCC 2340 

ATCAAACCTC ACATTTGTGA ACTCAAACGA TGTGCAATAC ATTTTTTTCT CTTTCCTTGA 2400 

AAATAAAAAG AGAAACAAGT ATTTTGCTAT ATATAAAGAC AACAAAAGAA ATCTCCTAAC 2460 

AAAAGAACTA AGAGGCCCAG CCCTCAGAAA CCCTTCAGTG CTACATTTTG TGGCTTTTTA 2520 

ATGGAAACCA AGCCAATGTT ATAGACGTTT GGACTGATTT GTGGAAAGGA GGGGGGAAGA 2580 

GGGAGAAGGA TCATTCAAAA GTTACCCAAA GGGCTTATTG ACTCTTTCTA TTGTTAAACA 2640 

AATGATTTCC ACAAACAGAT CAGGAAGCAC TAGGTTGGCA GAGACACTTT GTCTAGTGTA 2700 

TTCTCTTCAC AGTGCCAGGA AAGAGTGGTT TCTGCGTGTG TATATTTGTA ATATATGATA 2760 

TTTTTCATGC TCCACTATTT TATTAAAAAT AAAATATGTT CTTTAAAAAA A 2811 



Seq ID NO: C127 DNA Sequence 

Nucleic Acid Accession #: NM_005761.1 

Coding sequence: 250..4956 



1 11 21 31 41 51 

I I I I I I 

GCGAGGAGGA AACGGTGCCG GAGCGCGCAG GGCTTGCTGC CGCCACCGCC GCTGCACAGG 60 

CTGCCGGAGC GAGCCTGCCG CGCGCCGCCC TCCCCGCTCT CCTTCCTGGG CGAGCTGCGG 120 

GGATGGGGCG GCCGCGGGAG CCCGAGCGCG CGCAGGAACC GCCGCCGCCG CCGCCOGOGT 180 

CTCCGTTGCC GCGCGCCTGA GCCGCCGTCG CCGCCGCGCG CCCTGCCCGG GGGCGGCCCC 240 

CCCAGCCCCA TGGAGGTCTC CCGGAGGAAG GCGCCGCCGC GCCCCCCGCG CCCCGCAGCG 300 

CCACTGCCCC TGCTCGCCTA TCTGCTGGCA CTGGCGGCTC CCGGCCGGGG CGCGGACGAG 360 

CCCGTGTGGC GGTCGGAGCA AGCCATCGGA GCCATCGCGG CGAGCCAGGA GGACGGCGTG 420 

TTTGTGGCGA GCGGCAGCTG CCTGGACCAG CTGGACTACA GCCTGGAGCA CAGCCTCTCG 480 

CGCCTGTACC GGGACCAAGC GGGCAACTGC ACAGAGCOGG TCTCGCTGGC GCCCCCCGCG 540 

CGGCCCCGGC CCGGGAGCAG CTTCAGCAAG CTGCTGCTGC CCTACCGCGA GGGGGCGGCC 600 

GGCCTCGGGG GGCTGCTGCT CACCGGCTGG ACCTTCGACC GGGGCGCCTG CGAGGTGCGG 660 

CCCCTGGGCA ACCTGAGCCG CAACTCCCTG CGCAACGGCA CCGAGGTGGT GTCGTGCCAC 720 

CCGCAGGGCT OGACGGCOGG CGTGGTGTAC CGCGCGGGCC GGAACAACCG CTGGTACCTG 780 

GCGGTGGCCG CCACCTACGT GCTGCCTGAG CCGGAGACGG CGAGCCGCTG CAACCCCGCG 840 

GCATCCGACC ACGACACGGC CATCGCGCTC AAGGACACGG AGGGGCGCAG CCTGGCCACG 900 

CAGGAGCTGG GGCGCCTCAA GCTGTGCGAG GGCGCGGGCA GCCTGCACTT CGTGGACGCC 960 

TTTCTCTGGA ACGGCAGCAT CTACTTCCCC TACTACCCCT ACAACTATAC GAGCGGCGCT 1020 

GCCACCGGCT GGCCCAGCAT GGCGCGCATC GCGCAGAGCA CCGAGGTGCT GTTCCAGGGC 1080 

CAGGCATCCC TCGACTGCGG CCACGGCCAC CCCGACGGCC GCCGCCTGCT CCTCTCCTCC 1140 

AGCCTAGTGG AGGCCCTGGA CGTCTGGGCG GGAGTGTTCA GCGCGGCCGC TGGAGAGGGC 1200 

CAGGAGCGGC GCTCCCCCAC CACCACGGCG CTCTGCCTCT TCAGAATGAG TGAGATCCAG 1260 

GCGCGCGCCA AGAGGGTCAG CTGGGACTTC AAGACGGCCG AGAGCCACTG CAAAGAAGGG 1320 

GATCAACCTG AAAGAGTCCA ACCAATCGCA TCATCTACCT TGATCCATTC CGACCTGACA 1380 

TCCGTTTATG GCACCGTGGT AATGAACAGG ACTGTTTTAT TCTTGGGGAC TGGAGATGGC 1440 

CAGTTACTTA AGGTTATTCT TGGTGAGAAT TTGACTTCAA ATTGTCCAGA GGTTATCTAT 1500 

GAAATTAAAG AAGAGACACC TGTTTTCTAC AAACTCGTTC CTGATCCTGT GAAGAATATC 1560 

TACATTTATC TAACAGCTGG GAAAGAGGTG AGGAGAATTC GTGTTGCAAA CTGCAATAAA 1620 

CATAAATCCT GTTCGGAGTG TTTAACAGCC ACAGACCCTC ACTGCGGTTQ GTGCCATTCG 1680 

CTACAAAGGT GCACTTTTCA AGGAGATTGT GTACATTCAG AGAACTTAGA AAACTGGCTG 1740 

GATATTTCGT CTGGAGCAAA AAAGTGCCCT AAAATTCAGA TAATTCGAAG CAGTAAAGAA 1800 






AAGACTACAG TGACTATGGT GGGAAGCTTC TCTCCAAGAC ACTCAAAGTG CATGGTGAAG 1860 

AATGTGGACT CTAGCAGGGA GCTCTGCCAG AATAAAAGTC AGCCCAACCG GACCTGCACC 1920 

TGTAGCATCC CAACCAGAGC AACCTACAAA GATGTTTCAG TTGTCAACGT GATGTTCTCC 1980 

TTCGGTTCTT GGAATTTATC AGACAGATTC AACTTTACCA ACTGCTCATC ATTAAAAGAA 2040 

TGCCCAGCAT GCGTAGAAAC TGGCTGCGCG TGGTGTAAAA GTGCAAGAAG GTGTATCCAC 2100 

CCCTTCACAG CTTGCGACCC TTCTGATTAT GAGAGAAACC AGGAACAGTG TCCAGTGGCT 2160 

GTCGAGAAGA CATCAGGAGG AGGAAGACCC AAGGAGAACA AGGGGAACAG AACCAACCAG 2220 

GCTTTACAGG TCTTCTACAT TAAGTCCATT GAGCCACAGA AAGTATCGAC ATTAGGGAAA 2280 

AGCAACGTGA TAGTAACGGG AGCAAACTTT ACCCGGGCAT CGAACATCAC AATGATCCTG 2340 

AAAGGAACCA GTACCTGTGA TAAGGATGTG ATACAGGTTA GCCATGTGCT AAATGACACC 2400 

CACATGAAAT TCTCTCTTCC ATCAAGCCGG AAAGAAATGA AGGATGTGTG TATCCAGTTT 2460 

GATGGTGGGA ACTGCTCTTC TGTGGGATCC TTATCCTACA TTGCTCTGCC ACATTGTTCC 2520 

CTTATATTTC CTGCTACCAC CTGGATCAGT GGTGGTCAAA ATATAACCAT GATGGGCAGA 2580 

AATTTTGATG TAATTGACAA CTTAATCATT TCACATGAAT TAAAAGGAAA CATAAATGTC 2640 

TCTGAATATT GTGTGGCGAC TTACTGCGGG TTTTTAGCCC CCAGTTTAAA GAGTTCAAAA 2700 

GTGCGCACGA ATGTCACTGT GAAGCTGAGA GTACAAGACA CCTACTTGGA TTGTGGAACC 2760 

CTGCAGTATC GGGAGGACCC CAGATTCACG GGGTATCGGG TGGAATCCGA GGTGGACACA 2820 

GAACTGGAAG TGAAAATTCA AAAAGAAAAT GACAACTTCA ATATTTCCAA AAAAGACATT 2880 

GAAATTACTC TCTTCCATGG GGAAAATGGG CAATTAAATT GCAGTTTTGA AAATATTACT 2940 

AGAAATCAAG ATCTTACCAC CATCCTTTGC AAAATTAAAG GCATCAAGAC TGCAAGCACC 3000 

ATTGCCAACT CTTCTAAGAA AGTTCGGGTC AAGCTGGGAA ACCTGGAGCT CTACGTCGAG 3060 

CAGGAGTCAG TTCCTTCCAC ATGGTATTTT CTGATTGTGC TCCCTGTCTT GCTAGTGATT 3120 

GTCATTTTTG CGGCCGTGGG GGTGACCAGG CACAAATCGA AGGAGCTGAG TCGCAAACAG 3180 

AGTCAACAAC TAGAATTGCT GGAAAGCGAG CTCCGGAAAG AGATACGTGA CGGCTTTGCT 3240 

GAGCTGCAGA TGGATAAATT GGATGTGGTT GATAGTTTTG GAACTGTTCC CTTCCTTGAC 3300 

TACAAACATT TTGCTCTGAG AACTTTCTTC CCTGAGTCAG GTGGCTTCAC CCACATCTTC 3360 

ACTGAAGATA TGCATAACAG AGACGCCAAC GACAAGAATG AAAGTCTCAC AGCTTTGGAT 3420 

GCCCTAATCT GTAATAAAAG CTTTCTTGTT ACTGTCATCC ACACCCTTGA AAAGCAGAAG 3480 

AACTTTTCTG TGAAGGACAG GTGTCTGTTT GCCTCCTTCC TAACCATTGC ACTGCAAACC 3540 

AAGCTGGTCT ACCTGACCAG CATCCTAGAG GTGCTGACCA GGGACTTGAT GGAACAGTGT 3600 

AGTAACATGC AGCCGAAACT CATGCTGAGA CGCACGGAGT CCGTCGTCGA AAAACTCCTC 3660 

ACAAACTGGA TGTCCGTCTG CCTTTCTGGA TTTCTCCGGG AGACTGTCGG AGAGCCCTTC 3720 

TATTTGCTGG TGACGACTCT GAACCAGAAA ATTAACAAGG GTCCCGTGGA TGTAATCACT 3780 

TGCAAAGCCC TGTACACACT TAATGAAGAC TGGCTGTTGT GGCAGGTTCC GGAATTCAGT 3840 

ACTGTGGCAT TAAACGTCGT CTTTGAAAAA ATCCCGGAAA ACGAGAGTGC AGATGTCTGT 3900 

CGGAATATTT CAGTCAATGT TCTCGACTGT GACACCATTG GCCAAGCCAA AGAAAAGATT 3960 

TTCCAAGCAT TCTTAAGCAA AAATGGCTCT CCTTATGGAC TTCAGCTTAA TGAAATTGGT 4020 

CTTGAGCTTC AAATGGGCAC ACGACAGAAA GAACTTCTGG ACATCGACAG TTCCTCCGTG 4080 

ATTCTTGAAG ATGGAATCAC CAAGCTAAAC ACCATTGGCC ACTATGAGAT ATCAAATGGA 4140 

TCCACTATAA AAGTCTTTAA GAAGATAGCA AATTTTACTT CAGATGTGGA GTACTCGGAT 4200 

GACCACTGCC ATTTGAfTTT ACCAGATTCG GAAGCATTCC AAGATGTGCA AGGAAAGAGA 4260 

CATCGAGGGA AGCACAAGTT CAAAGTAAAA GAAATGTATC TGACAAAGCT GCTGTCGACC 4320 

AAGGTGGCAA TTCATTCTGT GCTTGAAAAA CTTTTTAGAA GCATTTGGAG TTTACCCAAC 4380 

AGCAGAGCTC CATTTGCTAT AAAATACTTT TTTGACTTTT TGGACGCCCA GGCTGAAAAC 4440 

AAAAAAATCA CAGATCCTGA CGTCGTACAT ATTTGGAAAA CAAACAGCCT TCCTCTTCGC 4500 

TTCTGGGTAA ACATCCTGAA GAACCCTCAG TTTGTCTTTG ACATTAAGAA GACACCACAT 4560 

ATAGACGGCT GTTTGTCAGT GATTGCCCAG GCATTCATGG ATGCATTTTC TCTCACAGAG 4620 

CAGCAACTAG GGAAGGAAGC ACCAACTAAT AAGCTTCTCT ATGCCAAGGA TATCCCAACC 4680 

TACAAAGAAG AAGTAAAATC TTATTACAAA GCAATCAGGG ATTTGCCTCC ATTGTCATCC 4740 

TCAGAAATGG AAGAATTTTT AACTCAGGAA TCTAAGAAAC ATGAAAATGA ATTTAATGAA 4800 

GAAGTGGCCT TGACAGAAAT TTACAAATAC ATCGTAAAAT ATTTTGATGA GATTCTAAAT 4860 

AAACTAGAAA GAGAACGAGG GCTGGAAGAA GCTCAGAAAC AACTCTTGCA TGTAAAAGTC 4920 

TTATTTGATG AAAAGAAGAA ATGCAAGTGG ATGTAAGCAC TCTGGGGCCT GGCTTAATCT 4980 

GGCAAAGTTC TTCAGACGAC TTGGGAGCAA AATGGCTGCT TGAGCTACTC TGTGTCGTTA 5040 

ATTTGTTGTT TGCACATAGG TTCCACTTTG GGCACTGTCT TTTTAAGAGA CCAAGGCACA 5100 

TGCACAGCTT TTAGAAAGCA A 5121 

Seq ID NO: C128 DNA Sequence 

Nucleic Acid Accession #: NM_002185.1 

Coding sequence: 23..1402 



1 
I 

CTCTCTCTCT 
TTTACTTCAA 
AGAACTGGAT 
TTCACTGACC 
ATGTGGGGCC 
CATCGAGACA 
AAAGAGTCTA 
TGACCTGAGT 
ACACTTGCAA 
GGATGAAAAC 
AAAGCTCCAA 
TAAAGGCTTC 
TAGCTCAGGG 
CGCTCTGTTG 
GCCCAGTCTC 
TTTAAATGTG 
CATTCAAGCT 
AGAATCTGAG 
TGTAGTCGTC 
TGTCAGTGCA 
TGGCAAGAAT 



11 
I 

ATCTCTCTCA 
GTOGTTTCTG 
GACTACTCAT 
TGTGCTTTTG 
CTCGTGGAGG 
AAGAAATTCT 
ACCTGCAAAA 
GTCATCTATC 
AAGAAGTATG 
AAATGGACGC 
CCGGCAGCAA 
TGGAGTGAAT 
GAGATGGATC 
GTCATCTTGG 
CCCGATCATA 
AGTTTCAATC 
AGAGATGAAG 
AAGCAGAGGC 
ACTCCAGAAA 
TGTGACGCCC 
GGGCCTCATG 



21 
I 

GAATGACAAT 
GAGAAAGTGG 
TCTCATGCTA 
AGGACCCAGA 
TAAAGTGCCT 
TACTGATTGG 
AAATAGACCT 
GGGAAGGAGC 
TAAAAGTTTT 
ATGTGAATTT 
TGTATGAGAT 
GGAGTCCAAG 
CTATCTTACT 
CCTGTGTGTT 
AGAAGACTCT 
CTGAAAGTTT 
TGGAAGGTTT 
TTGGAGGGGA 
GCTTTGGAAG 
CTATTCTCTC 
TGTACCAGGA 



31 
I 

TCTAGGTACA 
CTATGCTCAA 
TAGCCAGTTG 
TGTCAACACC 
GAATTTCAGG 
AAAGAGCAAT 
AACCACTATA 
CAATGACTTT 
AATGCATGAT 
ATCCAGCACA 
TAAAGTTCGA 
TTATTACTTC 
AACCATCAGC 
ATGGAAAAAA 
GGAACATCTT 
CCTGGACTGC 
TCTGCAAGAT 
TGTGCAGAGC 
AGATTCATCC 
CTCTTCCAGG 
CCTCCTGCTT 



41 
I 

ACTTTTGGCA 
AATGGAGACT 
GAAGTGAATG 
ACCAATCTGG 
AAACTACAAG 
ATATGTGTGA 
GTTAAACCTG 
GTGGTGACAT 
GTAGCTTACC 
AAGCTGACAC 
TCCATCCCTG 
AGAACTCCAG 
ATTTTGAGTT 
AGGATTAAGC 
TGTAAGAAAC 
CAGATTCATA 
ACGTTTCCTC 
CCCAACTGCC 
CTCACATGCC 
TCCCTAGACT 
AGCCTTGGGA 



51 

I 

TGGTTTTTTC 
TGGAAGATGC 
GATCGCAGCA 
AATTTGAAAT 
AGATATATTT 
AGGTTGGAGA 
AGGCTCCTTT 
TTAATACATC 
GCCAGGAAAA 
TCCTGCAGAG 
ATCACTATTT 
AGATCAATAA 
TTTTCTCTGT 
CTATCGTATG 
CAAGAAAAAA 
GGGTGGATGA 
AGCAACTAGA 
CATCTGAGGA 
TGGCTGGGAA 
GCAGGGAGAG 
CTACAAACAG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
CACGCTGCCC 
GGGTCAGCCC 
CAGCTTCTAC 
AAGATGATTT 
AATTAGCAAA 
CCTGAGTTCA 
GTCACAAGGT 



CCTCCATTTT 
ATTCTTACTT 
CAAAACCAGT 
AAAAGGGAAG 
ACCCCACTAC 
GTGGCACTCA 
TTAAGGTGAC 



CTCTCCAATC 
CCCTGGGATC 
GAAGTGTAAG 
TCTAGAGTTC 
ACAGTCTGCA 
ACATGAGTCA 
CCAATGATTC 



TGGAATCCTG 
AAATCAAGAA 
AAACCCAGAC 
CTAGTCTCCC 
AGATTCTGAA 
AGAGCATCCT 
AGCTATTT 



ACATTGAACC 


CAGTTGCTCA 


1320 


GAAGCATATG 


TCACCATGTC 


1380 


TGAACTTACC 


GTGAGCGACA 


1440 


TCACAGCACA 


GAGAAGACAA 


1500 


ACATTGCTTT 


GACCACTCTT 


1560 


GCTTCTACCA 


TGTGGATTTG 


1620 






1658 



Seq ID NO: C129 DNA Sequence 

Nucleic Acid Accession #: NM_002722.1 

Coding sequence: 15..302 



1 
I 

ACTCTGGACT 
GCGTGGCTCT 
ACCCAGGGGA 
ACATCAACAT 
TCTCGGAGTG 
AATGCCACCT 
ACCCTTGGCT 
AAGCC 



11 

I 

CCGGATGGCT 
GTTACTACAG 
CAATGCCACA 
GCTGACCAGG 
GGGGTCCCCG 
TCTGTCTCCT 
CTGGCCAAAG 



21 
I 

GCCGCACGCC 
CCACTGCTGG 
CCAGAGCAGA 
CCTAGGTATG 
CATGCTGCTG 
ACGACTCCAT 
CTTGCTCCCT 



31 
I 

TCTGCCTCTC 
GTGCCCAGGG 
TGGCCCAGTA 
GGAAAAGACA 
TCCCCAGGGA 
GAGCAGCGCC 
GCTCCCACAC 



41 
I 

CCTGCTGCTC 
AGCCCCACTG 
TGCAGCTGAT 
CAAAGAGGAC 
GCTCAGCCCG 
AGCCCAGCTC 
AGGCTCAATA 



51 
I 

CTGTCCACCT 
GAGCCAGTGT 
CTCCGTAGAT 
ACGCTGGCCT 
CTGGACTTAT 
TCCCCTCTGC 
AAGCAAGTCA 



60 

120 

180 

240 

300 

360 

420 

425 



Seq ID NO: C130 DNA Sequence 

Nucleic Acid Accession #: NM_032545.1 

Coding sequence: 47..718 



AAACTGATCT 
CCATGTCAGG 
CTATCAAAGA 
GCACCGACAG 
CGAGGGCTGG 
CGCGCGGCCG 
CCCGGCCCAC 
CCTGGAGCAC 
CCTGCACTGC 
CCACGCTCAC 
ACTCCTGCAC 
CGTCCTCCAG 
TTCTATGTTG 
TTTTTATTGG 
AAAAAA 



11 
I 

TCAATGCACT 
CTTCTGTTTA 
GAGAAACATA 
TCACCGCTCA 
GGGCCGGAGG 
CGCTGCTGCA 
TTCACCGGCC 
GGAGCCTGGA 
CTCCCCCTCC 
GGGCCGAGCG 
CGCCTCCTGC 
CGGGAGCGGC 
TAAATAATAG 
GTAATAAATA 



21 
I 

AAGAGAAGGA 
CGGTCAGTTT 
ACGGCGGTAG 
ACTGGACCTC 
AGCCGCTCCC 
GGAACGGCGG 
GCTACTGCGA 
CCCTCCGCGC 
AGACGCCTGA 
CCGGGGGCGC 
GCCCGGATGC 
GCCCCTGOGG 
ATGTGTTTAG 
TTTTCATGAA 



31 
I 

GACTCTCAAA 

GGCATTACAG 

AGAGGAAGTC 

CAGTCATTTC 

CTACTCCCGG 

TACCTGCGTG 

GCATGACCAG 

CTGCCACCTC 

CCGCTGTGAC 

GCCCAGCCTG 

GCCCGCGCAC 

AAGGCCGGGA 

TTTACCGTAA 

AGCGCCAAAA 



41 
I 

CCAAAAATGA 
ATCATCAATT 
ACCAAGGTTG 
GGAGAGGTGA 
GCTTTCGGAG 
CTGGGCAGCT 
AGGCGCAGTG 
TGCAGGTGCA 
CCGAAAGACT 
CTACTCTTGC 
CCTCGGTCCC 
CTTGGGCATC 
GCTGAAGCAC 
AAAAAAAAAA 



51 
I 

CCTGGAGGCA 
TGGGAAACAG 
CCACTCAGAA 
CTGGGAGCGC 
AGGGTGCGTC 
TCTGCGTGTG 
AATGCGGCGC 
TCTTCGGGGC 
TCCTGGCCTC 
TGCCCTGCGC 
TGGTCCCTTC 
GCCTTTAATT 
TGGGTGAATA 
AAAAAAAAAA 



Seq ID NO: C131 DNA Sequence 

Nucleic Acid Accession #: NM_006533.1 

Coding sequence: 72..467 



AGGGAGAGAG 
CACAGTCCAC 
TCTCCGGACC 
CGGACCAGGA 
CCGACTGCCG 
AGGGCCGTGG 
CTGCTCGCCT 
GCAAAGTCGA 
CTGGCCCTGC 



11 
I 

GGAGGGGAGG 
GATGGCCCGG 
TGGTGTCAGG 
GTGCAGCCAC 
ATTCCTGACC 
GOGGCTCTTC 
GGGCTATTTC 
TGTGAAGACA 
CGTTTCCCCT 



21 
I 

AAATTGGAGA 
TCCCTGGTGT 
GGTGGTCCTA 
CCTATCTCCA 
ATTCACCGGG 
TGGGGAGGCA 
CCCAGTAGCA 
GACAAATGGG 
CCTTGGGTTT 




TGCCCAAGCT 
TGGCTGTGGC 
GCCAAGTGGT 
GCGTTCAGGG 
TTGTCCGAGA 
ATTTCTACTG 
ATGCAAATAC 



41 
I 

CCCTTGCTCA 
CATCATCTTG 
GGCTGACCGG 
CCTTCAGGAC 
GTATGTCTTC 
AGATTACTAT 
GGACCAGACC 
CCAGTGAGCT 
AATCAGCCCA 



Seq ID NO: C132 DNA Sequence 
Nucleic Acid Accession #: AB064272 
Coding sequence: 1..708 



ATGACACAAG 
AAAACCACAA 
GGGAAAAACA 
ACTGAGACCA 
ACAAAGACTA 
TCTCATCTAA 
ACATCTAGAA 
TACCTCAATA 
TCATTCCCTG 
TTCCTTGGCC 
AACACCCAGT 
ATGGAGCAGC 



11 

I 

TCACAGAAAA 
GAACCCCAGA 
CACCAGTCCC 
TAAAAGCCCC 
TAAAACCTTC 
ATAAAACTGA 
CGAAGCTGAG 
AAGATGGCTC 
CATGGGCCAT 
TGATCTTCTT 
ACAATGATGC 
AGAATCTTGG 



21 
I 

GTCCACAGAA 
AAAGCCTACG 
AGAAAAGCCT 
AGTAAAGTCC 
AGTCAAGGTC 
AGTTACTCAT 
TTCTATCACA 
ACAGAAAGGT 
AGTTATTGTG 
GGTCTCCTAT 
AGAGGATGAG 
CATGGGCCAG 



31 
I 

CACCCAGAAA 
CTATACTCAG 
ACAGAAAACC 
ACAGAAAACC 
ACAGGAGACA 
CAGGTGCCCA 
TCAGAAGCCA 
ATCCACGCTG 
GTCCTGGTGG 
ATGATGCGGA 
GGTGGCCCCA 
ATCCCTTCCC 



41 
I 

AGACCACGTC 
AGAAGACCAT 
TGGGGAACAC 
CAGAAAAAAC 
AATCTCTCAC 
CTGGTTCTTT 
CAGGAAACGA 
GACAGATGGG 
CTGTGATTCT 
CACGCCGCAC 
ATTCCTACCC 
CACGGTGA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

846 



51 
I 

CTCTCTTGCT 
CTGTCTGCCT 
AAGCTGTGTG 
TACATGGCCC 
TCCAAGCTGA 
GGAGATCTGG 
CTGAAACCTG 
CAGCCTACCG 
GTGCAAAC 



Seq ID NO: C133 DNA Sequence 

Nucleic Acid Accession #: NM_080870.1 

Coding sequence: 3..710 



60 

120 

180 

240 

300 

360 

420 

480 

538 



51 
I 

AACCACAGAG 
ATGCACCAAA 
CACACTGACC 
AGCAGCAGTC 
TACTACCTCT 
CACCCTCATT 
GAGCCATCCA 
AGAGAATGAT 
CCTCCTGGTG 
ACTAACCCAG 
GGTCTACCTG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

708 



AGATGACACA 
AGAAAACCAC 
AAGGGAAAAA 
CCACTGAGAC 
TCACAAAGAC 
CTTCTCATCT 
TTACATCTAG 
CATACCTCAA 
ATTCATTCCC 
TGTTCCTTGG 
AGAACACCCA 
TGATGGAGCA 
GGCGCCCAGC 
AATTTCTATT 
CTTCATCTGT 
AGGGGACAAA 
CTGAGAATGA 
TGAGTCCTAA 
GGTTTATGGG 
TTATTTCCAT 
AGGACCCTTG 
AAGAAGAAAA 
CCAGAAGAAA 
TTAGTTTTCC 
GGACACGCAA 
GACCTTCCCT 
GTAGCTTTAT 
CCTTCACCCC 
ATTTTCTCTC 
AGGAGAATGA 
ATGTT 



11 
I 

AGTCACAGAA 
AAGAACCCCA 
CACACCAGTC 
CATAAAAGCC 
TATAAAACCT 
AAATAAAACT 
AACGAAGCTG 
TAAAGATGGC 
TGCATGGGCC 
CCTGATCTTC 
GTACAATGAT 
GCAGAATCTT 
CCTGGCTCTT 
TCCGGGACTA 
TCTTGAAACT 
GAAGAAAGAA 
AAAGGTGTTT 
AGGAAGGACA 
GAAAGGGAGG 
TCACTATTAC 
CCCCCACCCC 
GAGAATTTAT 
TCATAAATAT 
TAAATGTCTA 
GGAGGGGATT 
GATTGGTGTC 
CTCGTAAAAG 
AGATTCAAGT 
TCTCTTCTTA 
TGATGATAGT 



21 
I 

AAGTCCACAG 
GAAAAGCCTA 
CCAGAAAAGC 
CCAGTAAAGT 
TCAGTCAAGG 
GAAGTTACTC 
AGTTCTATCA 
TCACAGAAAG 
ATAGTTATTG 
TTGGTCTCCT 
GCAGAGGATG 
GGCATGGGCC 
CCATGCTCTG 
CAGGAAGGGC 
GGTTGGGGAA 
TGAATAATAC 
GATGGACATG 
GGAGCCTTAT 
GACTGAGGGC 
TTAAGAGTTT 
CATTTTTTTA 
TTOCTTCTCC 
CTCTCATCTA 
CAGTGGACGC 
TTTATTTGGC 
TCAGCATTTA 
TTACCCATCT 
TTTCCTCCTT 
TGGCCATTTC 
TTCCTTCTGT 



31 
I 

AACACCCAGA 
CGCTATACTC 
CTACAGAAAA 
CCACAGAAAA 
TCACAGGAGA 
ATCAGGTGCC 
CATCAGAAGC 
GTATCCACGC 



ATATGATGCG 
AGGGTGGCCC 
AGATCCCTTC 
CCCCTTTCCT 
AGAGAATACT 
TGAGGTGATA 
GAGCAGACAT 
TTGTGGGGGC 
AGGCAATGCC 
AGAGTCTCTG 
GTGTGTAAAC 
ATGAAAAAAA 
ACTCTCTCCA 
CATGGTTGCT 
CCTGTTGGTT 
CAGCAGTCTC 
TTTTCCTGTC 
TCTCTACTGT 
GTAGGCATTT 
ACCTTATTAC 
CTATTGACCT 



41 
I 

AAAGACCACG 
AGAGAAGACC 
CCTGGGGAAC 
CCCAGAAAAA 
CAAATCTCTC 
CACTGGTTCT 
CACAGGAAAC 
TGGACAGATG 
GGCTGTGATT 
GACACGCCGC 
CAATTCCTAC 
CCCACGGTGA 
GGATGAGGAA 
GACGGTTACC 
AGCAAGGAGG 
TCTCTGTAGA 
ACCAATGCAG 
CCAGACTGAC 
GGTTTCAGGA 
AGGCTCATCT 
AAAACAAAAA 
TGCCCTGGAG 
TCCTCTTCCT 
TGGCTTGCTG 
ACCCACTGAT 
TCTTCCACCA 
CCCCATTCTC 
CATCTGTGTG 
TGATTGGGTA 
TTTTTATAAT 



51 

I 

TCAACCACAG 
ATATGCACCA 
ACCACACTGA 
ACAGCAGCAG 
ACTACTACCT 
TTCACCCTCA 
GAGAGCCATC 
GGAGAGAATG 
CTCCTCCTGG 
ACACTAACCC 
CCGGTCTACC 
TCTTGGAGTA 
CCGGACTCAC 
AGTATTAACC 
GTGTAAGTTT 
AGGTAATGGT 
AACACTGCAC 
TTGTGAGTGG 
CAGCATTATG 
CTGAGTTCTC 
AAACGGATCC 
AAAAAAAAGT 
CCCAAATCCC 
GGTTGTGGGT 
CTCCACCCCA 
AAAGCCAGCT 
TCTCCTCCCA 
TGTTTTCTGG 
GAGGGGGAAA 
AAAGTATAAC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1805 



Seq ID NO: C134 DNA Sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..10674 



1 11 21 31 41 51 

I I I I I I 

ATGTGGCCTC GCCTGGCCTT TTGTTGCTGG GGTCTGGCGC TCGTTTCGGG CTGGGCGACC 60 

TTTCAGCAGA TGTCCCCGTC GCGCAATTTC AGCTTCCGCC TCTTCCCCGA GACCGCGCCC 120 

GGGGCCCCCG GGAGTATCCC CGCGCCGCCC GCTCCTGGCG ACGAAGCGGC GGGGAGCAGA 180 

GTGGAGCGGC TGGGCCAGGC GTTCCGGCGA CGCGTGCGGC TGCTGCGGGA GCTCAGCGAG 240 

CGCCTGGAGC TTGTCTTCCT GGTGGATGAT TCGTCCAGCG TGGGCGAAGT CAACTTCCGC 300 

AGCGAGCTCA TGTTCGTCCG CAAGCTGCTG TCCGACTTCC CCGTGGTGCC CACGGCCACG 360 

CGGGTGGCCA TCGTGACCTT CTCGTCCAAG AACTACGTGG TGCCGCGCGT CGATTACATC 420 

TCCACCCGCC GCGCGCGCCA GCACAAGTGC GCGCTGCTCC TCCAAGAGAT CCCTGCCATC 480 

TCCTACCGAG GTGGCGGCAC CTACACCAAG GGCGCCTTCC AGCAAGCCGC GCAAATTCTT 540 

CTTCATGCTA GAGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 600 

GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGCGAGATT CAGGAGTGGA GATCTTCACT 660 

TTTGGCATAT GGCAAGGGAA CATTCGAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG 720 

GAGCACTGTT ACCTGCTACA CAGTTTTGAA GAATTTGAGG CTTTAGCTCG CCGGGCATTG 780 

CATGAAGATC TACCTTCTGG GAGTTTTATT CAAGATGATA TGGTCCACTG CTCATATCTT 840 

TGTGATGAAG GCAAGGACTG CTGTGACCGA ATGGGAAGCT GCAAATGTGG GACACACACA 900 

GGCCATTTTG AGTGCATCTG TGAAAAGGGG TATTACGGGA AAGGTCTGCA GTATGAATGC 960 

ACAGCTTGCC CATCGGGGAC ATACAAACCT GAAGGCTCAC CAGGAGGAAT CAGCAGTTGC 1020 

ATTCCATGTC CTGATGAAAA TCACACCTCT CCACCTGGAA GCACATCCCC TGAAGACTGT 1080 

GTCTGCAGAG AGGGATACAG GGCATCTGGC CAGACCTGTG AACTTGTCCA CTGCCCTGCC 1140 

CTGAAGCCTC CCGAAAATGG TTACTTTATC CAAAACACTT GCAACAACCA CTTCAATGCA 1200 

GCCTGTGGGG TCCGATGTCA CCCTGGATTT GATCTTGTGG GAAGCAGCAT CATCTTATGT 1260 

CTACCCAATG GTTTGTGGTC CGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT 1320 

CTCCGCCAGC CGAAACATGG CCACATCAGC TGTTCTACAA GGGAAATGTT ATATAAGACA 1380 

ACATGTTTGG TTGCCTGTGA TGAAGGGTAC AGACTAGAAG GCAGTGATAA GCTTACTTGT 1440 

CAAGGAAACA GCCAGTGGGA TGGGCCAGAA CCCCGGTGTG TGGAGCGCCA CTGTTCCACC 1500 

TTTCAGATGC CCAAAGATGT CATCATATCC CCCCACAACT GTGGCAAGCA GCCAGCCAAA 1560 

TTTGGGACGA TCTGCTATGT AAGTTGCCGC CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 1620 

ATGCTGAGAT GTACCACTTC TGGAAAATGG AATGTCGGAG TTCAGGCAGC TGTGTGTAAA 1680 

GACGTGGAGG CTCCTCAAAT CAACTGTCCT AAGGACATAG AGGCTAAGAC TCTGGAACAG 1740 

CAAGATTCTG CCAATGTTAC CTGGCAGATT CCAACAGCTA AAGACAACTC TGGTGAAAAG 1800 

GTGTCAGTCC ACGTTCATCC AGCTTTCACC CCACCTTACC TTTTCCCAAT TGGAGATGTT 1860 

GCTATCGTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT TTTCCATATC 1920 

AAGGTTATTG ATGCAGAACC ACCTGTCATA GACTGGTGCA GATCTCCACC TCCCGTCCAG 1980 

GTCTCGGAGA AGGTACATGC CGCAAGCTGG GATGAGCCTC AGTTCTCAGA CAACTCAGGG 2040 

GCTGAATTGG TCATTACCAG AAGTCATACA CAAGGAGACC TTTTCCCTCA AGGGGAGACT 2100 

ATAGTACAGT ATACAGCCAC TGACCCCTCA GGCAATAACA GGACATGTGA TATCCATATT 2160 

GTCATAAAAG GTTCTCCCTG TGAAATTCCA TTCACACCTG TAAATGGGGA TTTTATATGC 2220 

ACTCCAGATA ATACTGGAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 2280 

GAAGGGTCTA CTGACAAGTA TTATTGTGCT TATGAAGATG GCGTCTGGAA ACCAACATAT 2340 

ACCACTGAAT GGCCAGACTG TGCCAAAAAA CGTTTTGCAA ACCACGGGTT CAAGTCCTTT 2400 

GAGATGTTCT ACAAAGCAGC TCGTTGTGAT GACACAGATC TGATGAAGAA GTTTTCTGAA 2460 

GCATTTGAGA CGACCCTGGG AAAAATGGTC CCATCATTTT GTAGTGATGC AGAGGACATT 2520 






GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACGATGACT TCCTGGACAC TGTGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 

TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

ACAGCTAGTG TGCCATTACC CGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA 2820 

CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC 2880 

CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940 

ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGCG TATGTGTGTC 3000 

AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCCGGATC 3060 

GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120 

ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180 

TACTCATACA GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 

TTTGGTTCCC GGAGCTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 

GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 3360 

ATGCCCTGTC ACCCATGTCC TCGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 

CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT CCAGATCCAT CACAGAATGT 3480 

TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3540 

TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 

CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 

GAAACAGACA TCGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 3720 

CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 3780 

AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 3840 

GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA AACAGAAGTC 3900 

AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 

TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCCGAT GTGGAAAGAA CGTCGATGAG 4020 

TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG ACGGTGCCAA TAGCTTCAGA 4080 

TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 4140 

TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 4200 

TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTACAGG CTTTAACCTG 4260 

GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320 

CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 

ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 4440 

TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 

AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4560 

GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 

GGTGCGTTAG TTCTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 

TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 

GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800 

CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 4860 

TCTGATTGCC CACGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920 

AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 4980 

GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTCCTCACTG TGAACGCATT 5040 

AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCCGATGA CTTCTATGCT 5100 

GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160 

TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG 5220 

TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 5280 

ATATGTTCAT GTGTCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 

TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 

GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460 

TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520 

AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 

GTGACATATA GGTGTAATAA AGGATATACT CTGGCOGGTG ATAAAGAATC ATCCTGTCTT 5640 

GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 5700 

GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 

TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CACGGCTTCT 5820 

GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 

ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 

TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC GGACGGCAAG 6000 

TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 

CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 

GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 

CCCCCAGAAG GTCAAGACAT GCCCCGTTGT ATAGCTCATT TCTGTGAAAA ACCTCCATCG 6240 

GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 6300 

AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 6360 

GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCG GTGTGGAGAG 6420 

CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 6480 

GCTTACAGCT GCAACAAGGG GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC 6540 

ACAGGGCAGT GGAGTAGTCC TATACCGACG TGCCACCCGG TATCTTGTGG TGAACCACCT 6600 

AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 

TATCAGTGTA ACCCGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG CCAAGCCAAT 6720 

CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TCGACTGTGG AAAACCTCCC 6780 

CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 6840 

TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 

AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCCGCCCCTC 6960 

TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 7020 

TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCCCAGCAA 7080 

TGGAATGACT CTTTCCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTCC CCTAATTTCC 7140 

TTTGGTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 

GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 

AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCCCC AACCTGAGGA AATCCCCAAT 7320 

GGAATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 7380 

GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 7440 

GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 

TTCTCTTACA CGGACCTACA CTATGGACAG ACCGTTACCT ACTCTTGCAA CCGAGGCTTT 7560 




CGGCTCGAAG GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 7620 

CCATCTTGCA ATGCCATCCA CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 7680 

GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 7740 

GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 7800 

ATGCCAATAG ACTGTGGCCT CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 7860 

GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 7920 

CACCCTCCTT ATCATTTGGG AGCAGTGGCT AAAACCTGGG AAAATACAAA GGAGTCTCCT 7980 

GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 8040 

TATGAACTTC TGGGGAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 8100 

GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 8160 

CGTTTTACAG AGACTAGCAT GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 8220 

CTAGCAGGCT CTGACTTAAG GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 8280 

CGCTGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 8340 

AGCAACTACA CATACCTGAG CACGTTGTAC TATGAGTGTG ACCCCGGATA TGTGCTGAAT 8400 

GGCACTGAGA GGAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 8460 

ATTCCTGTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 8520 

TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 8580 

AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 8640 

GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 8700 

TTCATGAAGG AAGTAACATT CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 8760 

CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 8820 

TGTGGACCTC CTGAAGATCT TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 8880 

GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 8940 

TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 9000 

ACACCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 9060 

CGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 9120 

GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 9180 

ATGATACCAA ATGCGTTCAT CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 9240 

AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 9300 

GTATGGAGCC AGCCTTATCC AGTCTGTGAG CCCTTGTCCT GTGGGTCCCC ACCGTCTGTC 9360 

GCCAATGCAG TGGCAACTGG AGAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 9420 

CTGGAAGGTT ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 9480 

TGGTTCCCTG AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 9540 

CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 9600 

GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 9660 

CCACCATTCT CCGATGAATC TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 9720 

CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 9780 

CCTGGCTATG AACTAGAGGG GAACAGGGAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840 

GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 9900 

AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 9960 

TACAGTCTTG AAGGGCCATC TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 10020 

GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATGCTCTG 10080 

CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 10140 

CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCCGACG AGACGTGGAC ACAGACAAGC 10200 

GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 10260 

GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 10320 

GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 10380 

AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 10440 

TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 10500 

CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 10560 

TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 10620 

CCAAACCGAT GTCACTGTCT TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 10674 



Seq ID NO: C135 DNA Sequence 
Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..390 



1 

I 

ATGAGGTTCA 
TATGTGTCAG 
GAACCATGGC 
CAGTGCTGTT 
TGCACCTTCT 
TTTGTTGTGA 
AGTAAATGTG 



11 
I 

GTGTCTCAGG 

TGTGCCAGCC 
ACAATGACGC 
GGCCCTGCTT 
AGCTGAAGGT 
AAAGAGGCCG 



21 

I 

CATGAGGACC 
CCTCTTGTGT 
GGCACCCAGG 
CATCGTGTCC 
TGAGCTCTGC 
TCAGGGTGTG 
GATATGTTAG 



31 
I 

GACTACCCCA 
CCAAGGGAAG 
TGTGGAGACA 
CTGAGCGAGA 
TGTCTTGATT 
AATTCCCAGT 



41 
I 

GGAGTGTGCT 
TCATCGCTCC 
AGATCTACAA 
CCCGCCAATG 
CCTTTGGCCT 
GCCACTCATC 



51 
I 

GGCTCCTGCT 
CGCTGGCTCA 
CCCCTTGGAG 
TGGTCCCCCC 
CACAAACGAT 
TCCCATCTCC 



60 

120 

180 

240 

300 

360 

390 



Seq ID NO: C136 DNA Sequence 
Nucleic Acid Accession #: BC035671.1 
Coding sequence: 126..1745 

1 11 21 31 41 51 

I I I I I I 

GGCAGCGACT GCGCCCCGTC CCGGCGCCGC GCTCGTCCGC AGAGGAGGCG GCCCGGCCCG 60 

GGCAGCTGCG GCTCGGGATC CGTCGAGGGG AGGCCGAGCT TGCCAAGCTG GCGCCCAGCG 120 

GGGTCATGGT GCCCGGCGCC CGCGGCGGCG GCGCACTGGC GCGGGCTGCC GGGCGGGGCC 180 

TCCTGGCTTT GCTGCTCGCG GTCTCOGCCC CGCTCCGGCT GCAGGCGGAG GAGCTGGGTG 240 

ATGGCTGTGG ACACCTAGTG ACTTATCAGG ATAGTGGCAC AATGACATCT AAGAATTATC 300 

CCGGGACCTA CCCCAATCAC ACTGTTTGCG AAAAGACAAT TACAGTACCA AAGGGGAAAA 360 

GACTGATTCT GAGGTTGGGA GATTTGGATA TCGAATCCCA GACCTGTGCT TCTGACTATC 420 

TTCTCTTCAC CAGCTCTTCA GATCAATATG GTCCATACTG TGGAAGTATG ACTGTTCCCA 480 

AAGAACTCTT GTTGAACACA AGTGAAGTAA CCGTCCGCTT TGAGAGTGGA TCCCACATTT 540 

CTGGCCGGGG TTTTTTGCTG ACCTATGCGA GCAGCGACCA TCCAGATTTA ATAACATGTT 600 






TGGAACGAGC 
GAGACGTAGC 
TGTGCAAAGC 
TGCTTCAGCG 
GGGATGGTTC 
TGAGTTTTGA 
GTGGAGACCA 
GGGCTTCGGG 
GGGAGAAAAA 
TTTATGTTAA 
AAGGAATTGT 
TGCAAAACAA 
GGCACCAGAG 
ATTCATTGGT 
AGACAATCAC 
TGGCTATTCC 
CCTTTAGAAA 
ACTGTTGGAA 
GCTATGATAA 
GTTAACTCCG 
AGGTTGCCCC 
ACAACAACAA 
AGAGATAAAA 
AAAAAAAAAA 



TAGCCATTAT 
AGGAGACATT 
TGCCATCCAT 
CAAAGGGATC 
CCTGTCAGAC 
ACCTGACGGG 
AGTTCACTGG 
CGACAGTAGC 
GAAAATAACA 
GAGTTTTGTG 
GAATAATGAA 
TTTCATCCCT 
GATAGCCTTG 
GTGGCGCAAG 
AAGGCCCATC 
ATTGGTGCTC 
GAAGAAGAAG 
GCAGATTAAA 
TGAGAAGGAG 
TTGACTGCCA 
GGATGGATCT 
AGGGCAGTAA 
TATTTTCTTA 
AAAAAAAAAA 



TTGAAGACAG 
TCTGGGAATA 
GCAGGAATAA 
AGTCGATATG 
AAGCGATTTC 
CAAATCAGAG 
TCTCCTGGCC 
AACAACCACA 
GGAATTAGGA 
ATGAACTTCA 
GAAAAGGTGT 
CCCATCGTGG 
AAGGTGGAGC 
ACAAGTCAAA 
CCCTCGGAAG 
CTTGTTGTCC 
AAAGGAAGTC 
TATCCCTTTG 
ATGACACAAA 
AAATAGCATC 
CAGAGATGAG 
ATTAAAGTAC 
AAAATATATT 
AAAAAAAAAA 



AATACAGCAA 
TGGTAGATGG 
TTGCTGATGA 
AAGGGATTCT 
TGTTTACCTC 
CTTCTTCCTC 
AAGCCCGACT
AACCACGAGA 
CCACAGGATC 
AAAACAATAA 
TTCAGGGTAA 
CCAGATATGT 
TCATTGGTTG 
GCACCAGTGT 
AAACATCCAC 
TGGTGTTTGC 
CGTATGGATC 
CCAGACATCA
AGTTAGATCT 
CCCAACGTGC 
GATCGGAACA 
TCTTTGTAAG 
TCATTAAACA 



ATTCTGCCCA 
ATATAGAGAT 
ACTAGGTGGC 
GGCCAATGGT 
CAATGGTTGC 
ATGGCAGTCG 
TCAGGACCAA 
GTGGCTGGAG 
TACACAGTCG 
TTCTAAGTGG 
CTCTAACTTT 
GCGGGTTGTC 
CCAGATTACA 
TTCAACTAAG 
AGGAATAAAC 
TGGAATGGGG 
AGCAGAGGCT 
GTCAGCTGAG 
CATCACAAGT 
AGCCCTCCGC 
CCATGTTCTT 
GTACAGTTAC 
CCTATGCTGT 



GCTGGTTGTA 
ACCTCTTTAT 
CAGATCAGTG 
GTTCTTTCGA 
AGCAGATCCT 
GTCAATGAGA 
GGCCCATCAT 
ATCGATTTGG 
AACTTCAACT 
AAGACCTATA 
CGGGACCCAG 
CCCCAGACAT 
CAAGGTAATG 
AAAGAAGATG 
ATTACAACGG 
ATCTTTGCAG 
CAGAAAACAG 
TTTACCATCA 
GATATGGCAG 
ATCTATCAGC 
TCCCACCCTA 
CGATTAATCT 
CTCTATAAAA 



Seq ID NO: C137 DNA Sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1761 



ATGGGATTCG 
GAGGCGGCCC 
AAGCTGGCGC 
GCTGCCGGGC 
GCGGAGGAGC 
ACATCTAAGA 
GTACCAAAGG 
TGTGCTTCTG 
AGTATGACTG 
AGTGGATCCC 
GATTTAATAA 
TGCCCAGCTG 
AGAGATACCT 
GGTGGCCAGA 
AATGGTGTTC 
GGTTGCAGCA 
CAGTCGGTCA 
GACCAAGGCC 
CTGGAGATCG 
CAGTCGAACT 
AAGTGGAAGA 
AACTTTCGGG 
GTTGTCCCCC 
ATTACACAAG 
ACTAAGAAAG 
ATAAACATTA 
ATGGGGATCT 
GAGGCTCAGA 
GCTGAGTTTA 
ACAAGTGATA 



11 

I 

GTGCGGGGCA 
GGCCCGGGCA 
CCAGCGGGGT 
GGGGCCTCCT 
TGGGTGATGG 
ATTATCCCGG 
GGAAAAGACT 
ACTATCTTCT 
TTCCCAAAGA 
ACATTTCTGG 
CATGTTTGGA 
GTTGTAGAGA 
CTTTATTGTG 
TCAGTGTGCT 
TTTCGAGGGA 
GATCCTTGAG 
ATGAGAGTGG 
CATCATGGGC 
ATTTGGGGGA 
TCAACTTTTA 
CCTATAAAGG 
ACCCAGTGCA 
AGACATGGCA 
GTAATGATTC 
AAGATGAGAC 
CAACGGTGGC 
TTGCAGCCTT 
AAACAGACTG 
CCATCAGCTA 
TGGCAGGTTA 




TTTTGAACCT 
AGACCAAGTT 
TTCGGGCGAC 
GAAAAAGAAA 
TGTTAAGAGT 
AATTGTGAAT 
AAACAATTTC 
CCAGAGGATA 
ATTGGTGTGG 
AATCACAAGG 
TATTCCATTG 
TAGAAAGAAG 
TTGGAAGCAG 
TGATAATGAG 
A 



31 
I 

CCCGTCCCGG 
GGGATCCGTC 
GGCGCCCGCG 
CTCGCGGTCT 
CTAGTGACTT 
AATCACACTG 
TTGGGAGATT 
TCTTCAGATC 
AACACAAGTG 
TTGCTGACCT 
CATTATTTGA 
GACATTTCTG 
ATCCATGCAG 
GGGATCAGTC 
TCAGACAAGC 
GACGGGCAAA 
CACTGGTCTC 
AGTAGCAACA 
ATAACAGGAA 
TTTGTGATGA 
AATGAAGAAA 
ATCCCTCCCA 
GCCTTGAAGG 
CGCAAGACAA 
CCCATCCCCT 
GTGCTCCTTG 
AAGAAGAAAG 
ATTAAATATC 
AAGGAGATGA 



41 
I 

CGCCGCGCTC 
GAGGGGAGGC 
GCGGCGGCGC 
CCGCCCCGCT 
ATCAGGATAG 
TTTGCGAAAA 
TGGATATCGA 
AATATGGTCC 
AAGTAACCGT 
ATGCGAGCAG 
AGACAGAATA 
GGAATATGGT 
GAATAATTGC 
GATATGAAGG 
GATTTCTGTT 
TCAGAGCTTC 
CTGGCCAAGC 
ACCACAAACC 
TTAGGACCAC 
ACTTCAAAAA 
AGGTGTTTCA 
TCGTGGCCAG 
TGGAGCTCAT 
GTCAAAGCAC 
CGGAAGAAAC 
TTGTCCTGGT 
GAAGTCCGTA 
CCTTTGCCAG 
CACAAAAGTT 



Seq ID NO: C138 DNA Sequence

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..2310 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800

1860
1920 
1980 
2010 



51 
I 

GTCCGCAGAG 
CGAGCTTGCC 
ACTGGCGCGG 
CCGGCTGCAG 
TGGCACAATG 
GACAATTACA 
ATCCCAGACC 
ATACTGTGGA 
CCGCTTTGAG 
CGACCATCCA 
CAGCAAATTC 
AGATGGATAT 
TGATGAACTA 
GATTCTGGCC 
TACCTCCAAT 
TTCCTCATGG 
CCGACTTCAG 
ACGAGAGTGG 
AGGATCTACA 
CAATAATTCT 
GGGTAACTCT 
ATATGTGCGG 
TGGTTGCCAG 
CAGTGTTTCA 
ATCCACAGGA 
GTTTGCTGGA 
TGGATCAGCA 
ACATCAGTCA 
AGATCTCATC 



ATGTTCCAGC 
ATATTACATC 
TATCAGGATA 
GTTTGCGAAA 
TTGGATATCG 
CAATATGGAA 
CAGAGAGTGG 
GCTGATGCAA 
ACAAGTGAAG 
CTGACCTATG 
TATTTGAAGA 
ATTTCTGGGA 
CATGCAGGAA 
ATCAGTCGAT 
GACAAGCGAT 
GGGCAAATCA 



11 
I 

GGCAGGAAAG 
AACATCCTGA 
GTGGCACAAT 
AGACAATTAC 
AATCCCAGAC 
TGCAGAAGGA 
ACATTCCTGT 
GAGGTCCATA 
TAACCGTCCG 
CGAGCAGCGA 
CAGAATACAG 
ATATGGTAGA 
TAATTGCTGA 
ATGAAGGGAT 
TTCTGTTTAC 
GAGCTTCTTC 




GGAGGAGACA 
GCAGCTGTTG 
CTGTGGAAGT 
CTTTGAGAGT 
CCATCCAGAT 
CAAATTCTGC 
TGGATATAGA 
TGAACTAGGT 
TCTGGCCAAT 
CTCCAATGGT 
CTCATGGCAG 



31 
I 

TTATCTTCAG 
AAAGGTGATG 
AATTATCCCG 
GGGAAAAGAC 
GACTATCTTC 
GAAGTGCTTT 
CCCAGCTTCC 
ATGACTGTTC 
GGATCCCACA 
TTAATAACAT 
CCAGCTGGTT 
GATACCTCTT 
GGCCAGATCA 
GGTGTTCTTT 
TGCAGCAGAT 
TCGGTCAATG 



41 
I 

CTGAAGCAGT 
GCTGTGGACA 
GGACCTACCC 
TGATTCTGAG 
TCTTCACCAG 
GTCTTTCAGT 
TGGAAGGGTG 
CCAAAGAACT 
TTTCTGGCCG 
GTTTGGAACG 
GTAGAGACGT 
TATTGTGCAA 
GTGTGCTTCA 
CGAGGGATGG 
CCTTGAGTTT 
AGAGTGGAGA 



51 



GGCAGCTTGG 
CCTAGTGACT 
CAATCACACT 
GTTGGGAGAT 
CTCTTCAGAT 



GAAGGGTCAT 
CTTGTTGAAC 
GGGTTTTTTG 
AGCTAGCCAT 
AGCAGGAGAC 
AGCTGCCATC 
GCGCAAAGGG 
TTCCCTGTCA 
TGAACCTGAC 
CCAAGTTCAC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1761 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 






TGGTCTCCTG 
AGCAACAACC 
ACAGGAATTA 
GTGATGAACT 
GAAGAAAAGG 
CCTCCCATCG 
TTGAAGGTGG 
AAGACAAGTC 
ATCCCCTCGG 
CAGATGATCT 
ACAGCGGAAA 
TTCTTAGCAC 
CCCTTTGCCA 
ACACAAAAGT 
GGCACCGGGA 
GAGGCAGGGG 
CACGAGTAGG 
CGGCACGTGC 
CAGCCCGGCC
GCCCAGGACG 
CGGCCCAAAG 
CCAACGCATC 
AACCAGACGG 



GCCAAGCCCG 
ACAAACCACG 
GGACCACAGG 
TCAAAAACAA 
TGTTTCAGGG 
TGGCCAGATA 
AGCTCATTGG 
AAAGCACCAG 
AAGAAACATC 
CACAAAGGGA 
GCATGGTTAG 
ACACACCTGA 
GACATCAGTC 
TAGATCTCAT 
CAGTCACGAG 
TGAGCACCGA 
CGCTGCCCCT 
TGCGCGCCCA 
ACAAACACTC 
GAGACTATCA 
CTGTCAGCGC 
CCGGGACGAG 
CCATGACTGC 



ACTTCAGGAC 
AGAGTGGCTG 
ATCTACACAG 
TAATTCTAAG 
TAACTCTAAC 



TTGCCAGATT 
TGTTTCAACT 
CACAGATGCC 
GAATCTGGGA 
AGTAGTGTTT 
GGAGGACATT 
AGCTGAGTTT 
CACAAGTGAT 
GAAGGGCTCC 
TGCCGGCGGC 
GGCGCCCCCG
CACGTTCTCT
CCTCTCCTCG
AAGGCCACAC 
CCTCGCCACC 
TGACAGCTAT 
CCTTTTGTGA 



CAAGGCCCAT 
GAGATCGATT 
TCGAACTTCA 
TGGAAGACCT 
TTTCGGGACC 
GTCCCCCAGA 
ACACAAGGTA 
AAGAAAGAAG 
ATGCCAGTGC 
CCTGATGAGG 
GCTGTTGTGG 
GATCACTACT 
ACCATCAGCT 
ATGGCAGATT 
ACCTTCCGGC 
CACTATGACT 
GAGCCCGAGT 
GCGCAGAGCG 
GGCGGCTTCT 
AGCGCACAGC 
GAAAGCGGGC 
TCTGCCCCCA 



CATGGGCTTC 
TGGGGGAGAA 
ACTTTTATGT 
ATAAAGGAAT 
CAGTGCAAAA 
CATGGCACCA 
ATGATTCATT 
ATGAGACAAT 
AGATTGTCGG 
GCAAAATACC 
TTAATGACCT 
GTTGGAAGCA 
ATGATAATGA 
ACCAGCAGCC 
CCATGGACAC 
GCCCGCAGGG 
ACGCCACGCC 
GCTACCGCGT 
CCCCCGTAGC 
CTGCGGACAG 
ACCCTGACTC 
GAGACTGCCT 



GGGCGACAGT 
AAAGAAAATA 
TAAGAGTTTT 
TGTGAATAAT 
CAATTTCATC 
GAGGATAGCC 
GGTGTGGCGC 
CACAAGGCCC 
AGACCATACC 
TTTTAAAGGC 
TGGCATGCTG 
GATTAAATAT 
GAAGGAGATG 
CCTCATGATT 
GGATGCCGAG 
GGCCGGCCGC 
CATCGTGGAG 
CCCAGGGCCC 
GGGTGTGGGC 
GGGCTACGAC 
TCAGAAGCCC 
CACACCCCTC 



1020 

1080

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2310 



Seq ID NO: C139 DNA Sequence
Nucleic Acid Accession #: NM_004616.2
Coding sequence: 180..893

1 11 21 31 41 51

| | | | | |

AGTGCCCCAG GAGCTATGAC AAGCAAAGGA ACATACTTGC CTGGAGATAG CCTTTGCGAT 60
ATTTAAATGT CCGTGGATAC AGAAATCTCT GCAGGCAAGT TGCTCCAGAG CATATTGCAG 120
GACAAGCCTG TAACGAATAG TTAAATTCAC GGCATCTGGA TTCCTAATCC TTTTCCGAAA 180
TGGCAGGTGT GAGTGCCTGT ATAAAATATT CTATGTTTAC CTTCAACTTC TTGTTCTGGC 240
TATGTGGTAT CTTGATCCTA GCATTAGCAA TATGGGTACG AGTAAGCAAT GACTCTCAAG 300
CAATTTTTGG TTCTGAAGAT GTAGGCTCTA GCTCCTACGT TGCTGTGGAC ATATTGATTG 360
CTGTAGGTGC CATCATCATG ATTCTGGGCT TCCTGGGATG CTGCGGTGCT ATAAAAGAAA 420
GTCGCTGCAT GCTTCTGTTG TTTTTCATAG GCTTGCTTCT GATCCTGCTC CTGCAGGTGG 480
CGACAGGTAT CCTAGGAGCT GTTTTCAAAT CTAAGTCTGA TCGCATTGTG AATGAAACTC 540
TCTATGAAAA CACAAAGCTT TTGAGCGCCA CAGGGGAAAG TGAAAAACAA TTCCAGGAAG 600
CCATAATTGT GTTTCAAGAA GAGTTTAAAT GCTGCGGTTT GGTCAATGGA GCTGCTGATT 660
GGGGAAATAA TTTTCAACAC TATCCTGAAT TATGTGCCTG TCTAGATAAG CAGAGACCAT 720
GCCAAAGCTA TAATGGAAAA CAAGTTTACA AAGAGACCTG TATTTCTTTC ATAAAAGACT 780
TCTTGGCAAA AAATTTGATT ATAGTTATTG GAATATCATT TGGACTGGCA GTTATTGAGA 840
TACTGGGTTT GGTGTTTTCT ATGGTCCTGT ATTGCCAGAT CGGGAACAAA TGAATCTGTG 900
GATGCATCAA CCTATCGTCA GTCAAACCCC TTTAAAATGT TGCTTTGGCT TTGTAAATTT 960
AAATATGTAA GTGCTATATA AGTCAGGAGC AGCTGTCTTT TTAAAATGTC TCGGCTAGCT 1020
AGACCACAGA TATCTTCTAG ACATATTGAA CACATTTAAG ATTTGAGGGA TATAAGGGAA 1080
AATGATATGA ATGTGTATTT TTACTCAAAA TAAAAGTAAC TGTTTACGTT AAAAAAAAAA 1140
AAAAAAAAAA AAAAAAAAA 1159



Seq ID NO: C140 DNA Sequence

Nucleic Acid Accession #: NM_004617.2

Coding sequence: 232..840

1 11 21 31 41 51

| | | | | |

CTTCAGGTCA GGGAGAATGT ATAAATGTCC ATTGCCATCG AGGTTCTGCT ATTTTTGAGA 60 

AGCTGAAGCA ACTCCAAGGA CACAGTTCAC AGAAATTTGG TTCTCAGCCC CAAAATACTG 120 

ATTGAATTGG AGACAATTAC AAGGACTCTC TGGCCAAAAA CCCTTGAAGA GGCCCCGTGA 180 

AGGAGGCAGT GAGGAGCTTT TGATTGCTGA CCTGTGTCGT ACCACCCCAG AATGTGCACT 240 

GGGGGCTGTG CCAGATGCCT GGGGGGGACC CTCATTCCCC TTGCTTTTTT TGGCTTCCTG 300 

GCTAACATCC TGTTATTTTT TCCTGGAGGA AAAGTGATAG ATGACAACGA CCACCTTTCC 360 

CAAGAGATCT GGTTTTTCGG AGGAATATTA GGAAGCGGTG TCTTGATGAT CTTCCCTGCG 420 

CTGGTGTTCT TGGGCCTGAA GAACAATGAC TGCTGTGGGT GCTGCGGCAA CGAGGGCTGT 480 

GGGAAGCGAT TTGCGATGTT CACCTCCACG ATATTTGCTG TGGTTGGATT CTTGGGAGCT 540 

GGATACTCGT TTATCATCTC AGCCATTTCA ATCAACAAGG GTCCTAAATG CCTCATGGCC 600 

AATAGTACAT GGGGCTACCC CTTCCACGAC GGGGATTATC TCAATGATGA GGCCTTATGG 660 

AACAAGTGCC GAGAGCCTCT CAATGTGGTT CCCTGGAATC TGACCCTCTT CTCCATCCTG 720 

CTGGTCGTAG GAGGAATCCA GATGGTTCTC TGCGCCATCC AGGTGGTCAA TGGCCTCCTG 780 

GGGACCCTCT GTGGGGACTG CCAGTGTTGT GGCTGCTGTG GGGGAGATGG ACCCGTTTAA 840

ACCTCCGAGA TGAGCTGCTC AGACTCTACA GCATGACGAC TACAATTTCT TTTCATAAAA 900 

CTTCTTCTCT TCTTGGAATT ATTAATTCCT ATCTGCTTCC TAGCTGATAA AGCTTAGAAA 960 

AGGCAGTTAT TCCTTCTTTC CAACCAGCTT TGCTCGAGTT AGAATTTTGT TATTTTCAAA 1020 

TAAAAAATAG TTTGGCCACT TAACAAATTT GATTTATAAA TCTTTCAAAT TAGTTCCTTT 1080 

TTAGAATTTA CCAACAGGTT CAAAGCATAC TTTTCATGAT TTTTTTATTA CAAATGTAAA 1140 

ATGTATAAAG TCACATGTAC TGCCATACTA CTTCTTTGTA TATAAAGATG TTTATATCTT 1200 

TGGAAGTTTT ACATAAATCA AAGGAAGAAA GCACATTTAA AATGAGAAAC TAAGACCAAT 1260 

TTCTGTTTTT AAGAGGAAAA AGAATGATTG ATGTATCCTA AGTATTGTTA TTTGTTGTCT 1320 

TTTTTTGCTG CCTTGCTTGA GTTGCTTGTG ACTGATCTTT TGAGGCTGTC ATCATGGCTA 1380 

GGGTTCTTTT ATGTATGTTA AATTAAAACC TGAATTCAGA GGTAACGT 1428 



Seq ID NO: C141 DNA Sequence

Nucleic Acid Accession #: NM_002381.2 






Coding sequence: 64..1524




GAGCATGTTT 
GAAACCTTCT 
ATCAGTGATG 
GACAAGAAAA 
ATCTGTGTGA 
AATGAAGACA 
CAGCACATTT 
ACTCTGAATG 
GGTTGCCAGC 
GGCTACACCT 
GTTTCCACTG 
TCGTATCTTC 
GAATATGGAC 
TGGTGTACTT 
TAATTTGCCA 
CTGCAGAATC 
CTTTAGTGTC 
GTGTTTCTTT 
ATTCACAATG 
TTATTACTGA 
TGAATGTAAT 
TTGAAGATGT 
TGAACATTAC 
AGCACTAGTT 
CTTTATTATT 
ATCGACAAAA 
AGTGCTGGGA 
AATTCTTTTG 
TGCTCAATTC 
TATTATCATG 
TTAAAAAAAA 



11 
I 

CTCGCGTGGG 
GCCCGGCCCC 
TGCCCTCCGC 
GTCCCGGGGG 
CCGGGACCAG 
ACCTGGTGTT 
AAACTTTTGT 
CAGTGGTGAA 
AGCAGTCCCT 
GCCTAGCCAT 
CCTCTTCTAA 
TGAATGAAGT 
ACCGGGCAGA 
TCTACGTGGA 
GTGCGCTGGA 
GGGAAGGCAA 
CGTGTTCAGC 
ATGACAGAAG 
GGAAAACTTG 
GTGTGAATGA 
CAGATAAAAA 
ACATTTGTGT 
TAAATGAGGA 
AAGATGCTTG 
AAAGACTGAA 
AAATACATCG 
AATACTCATG 
TTATCTGTAT 
AGCATGATTT 
TCTAAGTTAT 
ATCTACTAAT 
GAAACTTTAG 
GAGTGCAAAT 
TTTGAACTGT 
GATCAATAGA 
ATTGCCATTT 
GTGAATTTAG 
TTGCTTCAGG 
TCTAATGTTG 
TTACAGGCTT 
GCTTTGAAAC 
TGTTTTTCGT 
TCTATTTTTG 
AAAAAAAAA 



21 
I 

CTCCTGGCCC 
CGCGCGCCGC
CGCCCCCGAC 
CAGCCCTGGA 
CGAGCCTGGC 
TATCATTGAT 
CTCCCGGATA 
CTATGCTAGC 
GAAGCAGGCT 
CCAGACAGCA 
CATCCCTAAG 
GGCGGCTCGG 
CATGGCGTCC 
GACCTATGGG 
CCCCTGTGTG 
GCACCACTGT 
TCTTGATAGG 
TGGCTCTTAT 
TTCAGCTCAA 
CAGAACAGGG 
AACATGTTCA 
GAGTGATGGG 
CAAGAAAACA 
TGGATGTGAA 
CACTAAACTT 
TTAAATTGCT 
CATTCTTTTG 
TAATGCTTGA 
TTCCAAGGAA 
GACTGTGAAA 
TGAGCCATTT 
GTCTAGTTTC 
TGTACAAGGT 
TTAACACTTT 
TTGTAATACA 
TTAATTCATT 
GGTGTTAAAC 
ATCCAAGTGA 
TCTTTTTAAT 
GAAAGTCTAA 
TTGCAACTTT 
TTGCATTGTC 
ATGACTCATC 



31 
I 

CCGACGGACA 
CTCCCGGGAC 
CCCGTGGCCC 
CGCCGCCCCT
CGCGCCCGCG 
AGTTCTCGTA 
ATCGACACTC 
ACTGTGAAGA 
GTGGGTCGAA 
ATGGACGAAG 
GTGGCCATCA 
GCCCAAGCAT 
CTCAAGATGA 
GTCATTGAGA 
CTTGGAACAC 
GAGTGTAGCC 
TGTGCTCTTA 
CATTGTGAGT 
GATAAATGTG 
TCCCATCATT 
GTCCGTGACA 
GCCGCATCCT 
TGTTCAGCCA 
GCTACACTGG 
GATGACATTT 
CCAATTTCTC 
CACACCTGTT 
ATATTACTGG 
ATACATATGC 
TGATTGGTAG 
AATTTTTAAA 
TTTTGATAGT 
ATTTACACAT 
TTGTTTTTTG 
CATATCTAAA 
CTGGTCTTTG 
TTTTTACCAA 
CAAAGTTATA 
GTTAGTGATC 
CTTTTTTTTA 
GAGAACAAAA 
TTTAATATAA 
AATTTTGTCT 



41 
I 

CCACCAGGCC 
TCCTCCTGCT 
GCCCGGGCTT 
CTCCTGCGGC 
GTGCAGGTGT 
GCGTACGGCC 
TGGACATTGG 
TCGAGTTCCA 
TCACACCCTT 
CCTTCACAGT 
TTGTTACAGA 
CTGGTATTGA 
TGGCCAGTGA 
AACTTTCCTC 
ACCAGTGCCA 
AAGGATACAC 
ACACCCACGG 
GCTATGAAGG 
CTTTGGGTAC 
GTGAATGCTA 
AGTGTGCCCT 
ACCACTGTGA 
CTGAGGAAGC 
CATTCCAGGA 
TGGAGAAGTT 
ACCTGAAAAT 
ATTGCCAATG 
ATAAATTGTA 
AGATACTTAT 
GAAATAGAAT 
TGTTTATATT 
ATTTATAATA 
ACAACTTCAT 
CTTATTTTGT 
AATAGTTAAC 
AAAGAAATGT 
GTACAAAAAT 
TATTTATAAA 
CACCTGCCTC 
CTTATATATT 
CAGTCCTTTA 
TAAAAGTTAT 
ATTAAAGATA 



51
I 

CACGGAGCCC 
GCTCTGGCCG 
CCGGAGGCTG 
TCCCGACGGC 
TTGCAAGAGC 
CCTGGAATTC 
GCCAGCCGAC 
ACTCCAGGCC 
GTCAACAGGC 
GGAGGCAGGG 
TGGGAGGCCC 
GCTCTATGCT 
GCCCCTAGAG 
TAGATTCCAG 
GCACGTCTGC 
CTTGAATGCC 
ATGTGAGCAC 
TTATACCTTG 
CCATGGGTGT 
TGAGGGCTAC 
AGGCTCTCAT 
TTGCTATCCT 
ACGAAGACTT 
CAAGGTCAGC 
GAAAATAAAT 
GTGGACAGCT 
TTCCTGCTAA 
TGAAGATCTT 
TAAGAGCAAA 
GAAAAGTTTA 
AGATAACCAT 
TAAATCAATC 
ATAACTGAGA 
TGGAGTATTA 
ACAGATCAAG 
ACTACTAAAG 
CCCAAATTCA 
ATTGCTATAA 
AGCCTCCCAA 
TGATACATAT 
AATTTTGCAC 
TACCTTTACA 
TTTCTTTAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2599 



Seq ID NO: C142 DNA Sequence 

Nucleic Acid Accession #: NM_016639.1 

Coding sequence: 40..429



1 
I 

GCGGCGGGCG 
CGGTTGCTGC 
GAGCAAGCGC 
AAGTGCATGG 
GCTGCAGCAC 
CTGACCTTCG 
GAGAAGTTCA 
ATCCAGTGAC 
TTCTAGAGCC 



AGGTGTCTGG 
ACAAAACAGC 
CCTTCCTTAG 
TCACTCAGAT 
TTAACACTAG 
CCCAAAGCGG 
AATAAAAGAA 




GGAGGAGATA 
TCTTTAACTT 



31 
1 

ACGTGCACTA 
TGGCTGGCGT 
CGCGGCAGCT 
CGACCGCACA 
CTTTGGCCCA 
TTTTTGGTCT 
GGCGGAGAGG 
GGGCTCGCCC 
GGCGGGAGCC 
CCTGGGCCCA 
GAACAGAAAG 
CAGCATTTGC 
TTGGGGGGCA 
GGGTCACCCT 
CTGGCCCTAA 
GGAGAGTTTG 



41 
I 

TGGCTCGGGG
TGCTGCGCTC 
CCTGGAGCGC 
GCGACTTCTG 
TCCTTGGGGG 
GGAGACGATG 
GCTGCCCAGC 
ACTCATCATT 
AAGCTCCTCC 
GGGTTCAGGG 
GGAGCCTCAC 



GACTTGACAC 
GGGGGGTTAG 
GATACAGACC 
GAGGGGAGGG 



51 
I 

CTCGCTGCGC 
CGTGGCCGGG 
GGACCTGGAC 
CCTGGGCTGC 
CGCTCTGAGC 
CCGCAGGAGA 

CATTCATCCA 

AACCACAAGG 

GAACCTTCCA 

GCTGGCTCAC

GGGGTGCCCT 

TAGGCCCCAC 

GGACCTATTT 

CCCCCAACTC 

AGAATTTATT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

998 



Seq ID NO: C143 DNA Sequence 
Nucleic Acid Accession #: NM_001819
Coding sequence: 113..2146

1 11 21 31 41 51 

I I I I I I 

CCAGGAGGCA CGCTGGTTTT CCGGGGCCGC TCCATCGCGC CTTCCTCCTG CGCCTCGCTT 60 

CTCCGGTCCA GCCGCCATCT TCCTTTCCGC ACAGGGGCCG CCGAGCGGGG CCATGCAGCC 120 

AACGCTGCTT CTCAGCCTCC TGGGAGCCGT GGGGCTGGCG GCTGTCAATT CCATGCCAGT 180 

GGATAACAGG AACCACAATG AAGGAATGGT GACTCGCTGC ATCATTGAGG TCCTCTCAAA 240 






TGCCTTGTCG 
GAGTAGAAAA 
AAGATTGTTA 
GGCAGGAGCC 
GGCAGAGGGA 
CTCCGACAGC 
GGATGAGGAG 
TGAAGAGAAA 
GCAGGCTTCA 
GCATTCTCAG 
AGGGAGCCAG 
TGAGGAAGGT 
CCACCACGGG 
AAAGGGACAC 
AAAGAGGGAC 
AGAAATAAAG 
GGGCAGAGGA 
GGAGGACAAG 
AGAAAGTGAG 
AGGGGAGCCA 
AGGACACCAC 
GTGGAAAGAG 
GCAGCAGCAG 
TAAACAATAT 
CCCATACTAC 
TGACAATTTT 
CCCAGAATAC 
GTATGAGAAG 
GGTGGCCCAA 
CTATGATTCT 
TGACCAGACA 
TTTGGAACTA 
GTGGGCACTG 
ACCATTTATA 
TTGGAATTGT 
GCATATTCTT 
TCTTTGGAAA 



AAGTCCAGCG 
GACGTCAAAG 
AGAGACCCAG 
CCAGGGGAGG 
GGCGGGCACA 
CAAGTCTCTG 
GAGGAGGAGG 
CACCTTGAAG 
GCTATAAAAA 
GAGAAGACAC 
GAGAATCACC 
GAGGAAGATG 
AGGAGCAGGC 
CCCCAGGAGG 
CACCATTCAA 
GGTTATCCAG 
AGTGAAGAAT 
AGAAACTACC 
GAAGAGAGGG 
CGTGCCTATT 
CGTGTCCAAG 
CTGGACAGAA 
GGAGACCTGC 
AGCTCCCATC 
GACCCTCTCC 
CTCGAGGGTG 
AACTATGACT 
AGAAACCTCG 
CTGGACCAGC 
GAGGAGCCGG 
GTCCTGACAG 
CAGAAGATAG 
TTAAGAAGCA 
TACCCAAGGG 
CTTTAATTTC 
TTCTGCAAAA 
AATGTTTGTC 



CTCCACCCAT 
ACAAAGAGAC 
CTGATGCCTC 
AGGACATCCA 
GCCGAGAGCG 
AAGAAGTGAA 
GAGAGAACTA 
AGCCAGGAGA 
AAGAGGAGTT 
ATAGCCGAGA 
CCCAGGAGTC 
CCACCTCTGA 
CCGACAGGTC 
AATCTGAGGA 
CCCACTACAG 
GCGTCCAGGC 
ACAGGGCTCC 
CCAGCTTAGA 
GCCTTGAGCC 
TCATGTCTGA 
AAAACCAGAT 
ATTATCTCAA 
AGGACACTAA 
ACACAGCTGA 
AGTGGAAGAG 
AGGAGGAAAA 
GGTGGGAGAA 
CCAGGGTCCC 
TCCTTCACTA 
TGAGCACCCA 
AGGACGAGAA 
CTGAGAAATT 
GCCATCACAT 
CAGAAAGTAG 
TGTCAGAATG 
TAGACATATT 
TCAGTTGGAA 



CACCCCTGAG 
AACTGAAAAT 
GGAAGCCCAC 
AGGCCCAACA 
AGCGGATGAG 
GACACGCCAT 
TCAAAAAGGG 
GACACAAAAC 
AGTGGCCAGA 
GAAGAGTAGC 
TAAAGGCCAA 
GGTGGACAAA 
CTCTCAAGGA 
GTCAAACGTC 
GGCTTCAGAG 
CCCTGAGGAC 
AAGACCTCAG 
GCTTGATAAG 
GGGAAAGGGA 
CACCAGAGAA 
GGACAAGGCA 
CTACGGTGAG 
AGAAAACAGG 
AAAGAGGAAG 
CAGCCATTTT 
TGAGCTGACC 
AAAGCCCTTC 
CAAGCTGGAC 
CAGGAAGAAG 
CCAGGAGGCA 
AAAAGAACTC 
CAGCCAAAGG 
GATCTGTTTT 
AACTTACTAT 
CTATTGAAAA 
AACATGCTTA 
ATAATAAAAG 



TGCCGCCAAG 
GAAAACACAA 
GAGTCCTCCA 
AAGGCAGACA 
CCCCAGTGGA 
TCTGAGAAGA 
GAGCGAGGGG 
GCTTTTCTCA 
TCGGAAACAC 
CAGGAGAGTG 
CCCCGAAGCC 
CGACGCACGA 
GGGAGTCTTC 
AGCATGGCCA 
GAAGAACCTG 
CTGGAGTGGG 
AGTGAGGAGA 
ATGGCACATG 
CGCCATCACA 
GAGAAAAGGT 
AGGAGGCATC 
GAAGGAGCCC 
GAGGAAGCTA 
AGATTAGGGG 
GAAAGAAGAG 
TTGAACGAGA 
TCTGAGGATG 
CTGAAAAGGC 
TCAGCTGAGT 
GAAAATGAAA 
GAAAACTTGG 
GGCTGACTGT 
TCACCACTTC 
TCATTAAATG 
TGTGAATTGC 
TGACAATGAC 
ATTCACCTGA 



TCCTGAAGAC 
AGTTTGAAGT 
GCAGGGGAGA 
CAGAGAAATG 
GCCTCTATCC 
GCCAGAGAGA 
AAGATAGCAG 
ATGAAAGAAA 
ATGCTGCCGG 
GAGAGGAGGC 
AGGAAGAATC 
GGCCCAGACA 
CCTCTGAGGA 
GTTTAGGGGA 
AATATGGAGA 
AGCGCTATAG 
GTTGGGATGA 
GATATGGTGA 
GAGGCAGGGG 
TCTTGGGTGA 
CACAAGGTGC 
CAGGGAAGTG 
GGTTTCAAGA 
AACTGTTCAA 
ACAACATGAA 
AGAATTTCTT 
TGAACTGGGG 
AATATGACAG 
TTCCAGACTT 
AGGACAGGGC 
CTGCAATGGA 
CATTGGAGCG 
ACTGAAAGAC 
TTTGACACAA 
ATGACTTGTA 
TGTGCTACTG 
GACC 



Seq ID NO: C144 DNA Sequence

Nucleic Acid Accession #: XM_093082.1

Coding sequence: 93..1988



1 
I 

CTTCTTGTGG 
AATTTGTAAA 
AGCCCAACTG 
GTGGGGCTCT 
AGACAAAACT 
GCAAATTACA 
TTGGAATGGT 
ACTGTCAGGA 
TTGCTGCTGC 
CAGGAAGACT 
ACCCAGCATC 
ACTTAGAAGT 
CCCTACAAGT 
TGCCTAATTC 
ACGTGACTGT 
GAGAACAGCT 
CTCCTTGGGA 
AGCAGGCTGA 
TTAAAGATCG 
TGCAGCCAGC 
GCCAAAACCA 
GCGTTCAAGG 
GAACACCTTC 
AAGAAAACTT 
AAGGTTATTA 
TCACTTCTTC 
GTGCCGCCAT 
AAGAAAGAAA 
GAGAAAGCGA 
CCATTCATGA 
AGGAGCCTGC 
TCGAGCTGGA 
CAGAGTCAGA 
AGGCATAG 



11 
I 

TAGGGACCTC 
AAATACCCTC 
TGTATTACAG 
AGCAGCTGTT 
GGAAGCTTCA 
GAAGCTCAAC 
TGGAAATTGT 
CAGTTCTTTC 
AAAGGATTGT 
TGAGGCAGTG 
TCAGGGTGGG 
CAAGGCAGAA 



TCCTGGTCAG 
TGGATCTAAT 
TTCCATCCAG 
GGAGGGGAAG 
ACTCCAGATT 
AATTACAGGG 
AGACAGTGGA 
AGGCATCCTC 
AAGACCAGAA 
CCCTGTGTAC 
CAACCCAACC 
CCAGTGTACT 
ACATCCAGAA 
CATCATCTCT 
TTCTAAGACC 
AGCAATGCCA 
GACTGGCCCT 
CCCAGAGCCT 
GCTGGAGCCA 
GCCTGGGGTT 



21 

I 

TCCTCAGTAT 
GAAGATTCAG 
ACATTGAGGT 
CTTAGCACCA 
GCTTTGAAAT 
TTGCAGTTTT 
TCTGGTTTCT
AAATGTGATC 
GGGAGTCCTA 
GAGGAGGTTT 
GGGCATTGTG 
CCAAGCCTGA 
TGCAAAATCT 
GTTAGTGTGG 
GTCACTCTCA 

TGGCCAGATG 
TACTTTTCTC 
TCCAACGATC 
ATTTACATCT 
AACGTCAGTG 
ACTGGCCACA 
TACTGGCATA 
ACCGGGATTT 
GCCATCAACA 
GTTGGAATCA 
GTTGTGTGCT 
ATCGCGGAAC 
AGAGAAGACG 
GATACCATCC 
GCCCCAGGAT 
GAAACGCAGT 
GTAGTTGAGC 




AAAACTATGG 
GAAAAGGTGG 
TCTCCCTCAA 
TGCAAGTGAC 
TCTGCATCTA 
TCCATAAGAA 
TTGAGGCTGT 
AAGGTGGACA 
CAGGTAATGC 
GCGATGTTAA 
TGTTAGTGAA 
CTATTTCCCT 
AACTTGAGGG 
TGGTCATTGG 
GACTTGGCAA 
TTGTTGGGGC 
TCGCAAGGAA 
TTGAGCCAAT 
CTACCCAACT 
AAGAACCAGA 
CAGAGCCTAT 
CGGAATTGGA 
CCTTAAGTGA 



41 

I 

CCAGCATCTG 
TCTGTGTGAA 
CCTTATCTCT 
CACTGAACTG 
AGGCTTAAAA 
AACGGCTGCA 
GGTGCAATCT 
GCTCTGGCCT 
ATCAGAAGGG 
GGTGCTTGTA 
GTCTTTTAGA 
TATGGATCTC 
ACTATTTCTC 
CATCCCAGAC 
CACCACCACT 
GGAGATGGAG 
GAAGGGCACT 
AGCTGTAGCC 
ATCTATCACT 
CAACCCCCCA 
ACCTTCTAAG 
TTCCTGTCTC 
AAGAGACATC 
AAATCTGACA 
TAGTTCCTGC 
CTTGATTGGT 
TAAGGCAAAA 
GACAAAGATA 
AGAAGTAACT 
CTATGAGCCA 
GGCAGTGCCT 
GCCAGAGCCA 
AGATGAAAAG 




GAATTTAGTG 
GATCCAAATT 
AAACTTCCAG 
CATTTTGGCT 
TCCACCAGAG 
CTGAACTGGG 
CAGCCCGGTG 
GACTTGGTGG 
CAGAGACCCA 
TTTATTGCAT 
GGTTTCGTGA 
GTGGCCTCCC 
CCAATTTCTT 
CTTGATGGAC 
ATCGGGCAAT 
ATCTCGCATA 
GACTTTCTCG 
CCCCTTTGTA 
TCTGCGCTTG 
GTGCCAGTGA 
AATTTTGAAC 
GAAATCGATC 
AGCCTGGTAG 
GCAAAGGCAA 
AACCCAAGGG 
CTACCATCTT 
AAGCCTACTC 
GACCTTGACA 
GAGCCAGAGC 
GGAGTGGTTA 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2454 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

1988 



Seq ID NO: C145 DNA Sequence
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..1242

1 11 21 31 41 51

| | | | | |

ATGGTGTTCG CATTTTGGAA GGTCTTTCTG ATCCTAAGCT GCCTTGCAGG TCAGGTTAGT 60
GTGGTGCAAG TGACCATCCC AGACGGTTTC GTGAACGTGA CTGTTGGATC TAATGTCACT 120
CTCATCTGCA TCTACACCAC CACTGTGGCC TCCCGAGAAC AGCTTTCCAT CCAGTGGTCT 180
TTCTTCCATA AGAAGGAGAT GGAGCCAATT TCTTCTCCTT GGGAGGAGGG GAAGTGGCCA 240
GATGTTGAGG CTGTGAAGGG CACTCTTGAT GGACAGCAGG CTGAACTCCA GATTTACTTT 300
TCTCAAGGTG GACAAGCTGT AGCCATCGGG CAATTTAAAG ATCGAATTAC AGGGTCCAAC 360
GATCCAGGTA ATGCATCTAT CACTATCTCG CATATGCAGC CAGCAGACAG TGGAATTTAC 420
ATCTGCGATG TTAACAACCC CCCAGACTTT CTCGGCCAAA ACCAAGGCAT CCTCAACGTC 480
AGTGTGTTAG TGAAACCTTC TAAGCCCCTT TGTAGCGTTC AAGGAAGACC AGAAACTGGC 540
CACACTATTT CCCTTTCCTG TCTCTCTGCG CTTGGAACAC CTTCCCCTGT GTACTACTGG 600
CATAAACTTG AGGGAAGAGA CATCGTGCCA GTGAAAGAAA ACTTCAACCC AACCACCGGG 660
ATTTTGGTCA TTGGAAATCT GACAAATTTT GAACAAGGTT ATTACCAGTG TACTGCCATC 720
AACAGACTTG GCAATAGTTC CTGCGAAATC GATCTCACTT CTTCACATCC AGAAGTTGGA 780
ATCATTGTTG GGGCCTTGAT TGGTAGCCTG GTAGGTGCCG CCATCATCAT CTCTGTTGTG 840
TGCTTCGCAA GGAATAAGGC AAAAGCAAAG GCAAAAGAAA GAAATTCTAA GACCATCGCG 900
GAACTTGAGC CAATGACAAA GATAAACCCA AGGGGAGAAA GCGAAGCAAT GCCAAGAGAA 960
GACGCTACCC AACTAGAAGT AACTCTACCA TCTTCCATTC ATGAGACTGG CCCTGATACC 1020
ATCCAAGAAC CAGACTATGA GCCAAAGCCT ACTCAGGAGC CTGCCCCAGA GCCTGCCCCA 1080
GGATCAGAGC CTATGGCAGT GCCTGACCTT GACATCGAGC TGGAGCTGGA GCCAGAAACG 1140
CAGTCGGAAT TGGAGCCAGA GCCAGAGCCA GAGCCAGAGT CAGAGCCTGG GGTTGTAGTT 1200
GAGCCCTTAA GTGAAGATGA AAAGGGAGTG GTTAAGGCAT AG 1242

Seq ID NO: C146 DNA Sequence
Nucleic Acid Accession #: NM_003020.
Coding sequence: 29..664

1 11 21 31 41 51

| | | | | |

CGCTCCTCGG GCTGCCCCTC GGTTGACAAT GGTCTCCAGG ATGGTCTCTA CCATGCTATC 60
TGGCCTACTG TTTTGGCTGG CATCTGGATG GACTCCAGCA TTTGCTTACA GCCCCCGGAC 120
CCCTGACCGG GTCTCAGAAG CAGATATCCA GAGGCTGCTT CATGGTGTTA TGGAGCAATT 180
GGGCATTGCC AGGCCCCGAG TGGAATATCC AGCTCACCAG GCCATGAATC TTGTGGGCCC 240
CCAGAGCATT GAAGGTGGAG CTCATGAAGG ACTTCAGCAT TTGGGTCCTT TTGGCAACAT 300
CCCCAACATC GTGGCAGAGT TGACTGGAGA CAACATTCCT AAGGACTTTA GTGAGGATCA 360
GGGGTACCCA GACCCTCCAA ATCCCTGTCC TGTTGGAAAA ACAGATGATG GATGTCTAGA 420
AAACACCCCT GACACTGCAG AGTTCAGTCG AGAGTTCCAG TTGCACCAGC ATCTCTTTGA 480
TCCGGAACAT GACTATCCAG GCTTGGGCAA GTGGAACAAG AAACTCCTTT ACGAGAAGAT 540
GAAGGGAGGA GAGAGACGAA AGCGGAGGAG TGTCAATCCA TATCTACAAG GACAGAGACT 600
GGATAATGTT GTTGCAAAGA AGTCTGTCCC CCATTTTTCA GATGAGGATA AGGATCCAGA 660
GTAAAGAGAA GATGCTAGAC GAAAACCCAC ATTACCTGTT AGGCCTCAGC ATGGCTTATG 720
TGCACGTGTA AATGGAGTCC CTGTGAATGA CAGCATGTTT CTTACATAGA TAATTATGGA 780
TACAAAGCAG CTGTATGTAG ATAGTGTATT GTCTTCACAC CGATGATTCT GCTTTTTGCT 840
AAATTAGAAT AAGAGCTTTT TTGTTTCTTG GGTTTTTAAA ATGTGAATCT GCAATGATCA 900
TAAAAATTAA AATGTGAATG TCAACAATAA AAAGCAAGAC TATGAAAGGC TCAGATTTCT 960
TGCAGTTTAA AATGGTGTCT GAGGTTGTAC TATTTTGGCC AAGTCTGTAG AAAGCTGTCA 1020
TTTGATTTTG ATTATGTAGT TCATCCAGCC CTTGGGCATT GTTATACACC AGTAAAGAAG 1080
GCTGTACTCA AGAGGAGGAG CTGACACATT TCACTTGGCT GCGTCTTAAT AAACATGAAT 1140
GCAAGCATTG GC 1152



Seq ID NO: C147 DNA Sequence 
Nucleic Acid Accession #: NM_024021.2 
Coding sequence: 144..806

1 11 21 31 41 51 

I I I I I I 

AACATTCCTG CAAATGGTTT CAATATATGC AGATGTCTCG ATATAGGAAT GAAATTACGT 60 

CTTTGGAACA ACTTAAATAA GTCAAATATA CTTGGAGCTT TAAAAATTAA AAGGAGAGAG 120

ATTCGAGCAC CTTTTCTGCT GCCATGACAA CCATGCAAGG AATGGAACAG GCCATGCCAG 180

GGGCTGGCCC TGGTGTGCCC CAGCTGGGAA ACATGGCTGT CATACATTCA CATCTGTGGA 240 

AAGGATTGCA AGAGAAGTTC TTGAAGGGAG AACCCAAAGT CCTTGGGGTT GTGCAGATTC 300 

TGACTGCCCT GATGAGCCTT AGCATGGGAA TAACAATGAT GTGTATGGCA TCTAATACTT 360 

ATGGAAGTAA CCCTATTTCC GTGTATATCG GGTACACAAT TTGGGGGTCA GTAATGTTTA 420 

TTATTTCAGG ATCCTTGTCA ATTGCAGCAG GAATTAGAAC TACAAAAGGC CTGGTCCGAG 480

GTAGTCTAGG AATGAATATC ACCAGCTCTG TACTGGCTGC ATCAGGGATC TTAATCAACA 540 

CATTTAGCTT GGCGTTTTAT TCATTCCATC ACCCTTACTG TAACTACTAT GGCAACTCAA 600 

ATAATTGTCA TGGGACTATG TCCATCTTAA TGGGTCTGGA TGGCATGGTG CTCCTCTTAA 660 

GTGTGCTGGA ATTCTGCATT GCTGTGTCCC TCTCTGCCTT TGGATGTAAA GTGCTCTGTT 720 

GTACCCCTGG TGGGGTTGTG TTAATTCTGC CATCACATTC TCACATGGCA GAAACAGCAT 780

CTCCCACACC ACTTAATGAG GTTTGAGGCC ACCAAAAGAT CAACAGACAA ATGCTCCAGA 840 

AATCTATGCT GACTGTGACA CAAGAGCCTC ACATGAGAAA TTACCAGTAT CCAACTTCGA 900 

TACTGATAGA CTTGTTGATA TTATTATTAT ATGTAATCCA ATTATGAACT GTGTGTGTAT 960 

AGAGAGATAA TAAATTCAAA ATTATGTTCT CATTTTTTTC CCTGGAACTC AATAACTCAT 1020 

TTCACTGGCT CTTTATCGAG AGTACTAGAA GTTAAATTAA TAAATAATGC ATTTAATGAG 1080

GCAACAGCAC TTGAAAGTTT TTCATTCATC ATAAGAACTT TATATAAAGG CATTACATTG 1140 

GCAAATAAGG TTTGGAAGCA GAAGAGCAAA AAAAAGATAT TGTTAAAATG AGGCCTCCAT 1200 

GCAAAACACA TACTTCCCTC CCATTTATTT AACTTTTTTT TTCTCCTACC TATGGGGACC 1260 

AAAGTGCTTT TTCCTTCAGG AAGTGGAGAT GCATGGCCAT CTCCCCCTCC CTTTTTCCTT 1320 

CTCCTGCTTT TCTTTCCCCA TAGAAAGTAC CTTGAAGTAG CACAGTCCGT CCTTGCATGT 1380

GCACGAGCTA TCATTTGAGT AAAAGTATAC ATGGAGTAAA AATCATATTA AGCATCAGAT 1440 

TCAACTTATA TTTTCTATTT CATCTTCTTC CTTTCCCTTC TCCCACCTTC TACTGGGCAT 1500 

AATTATATCT TAATCATATA TGGAAATGTG CAACATATGG TATTTGTTAA ATACGTTTGT 1560 

TTTTATTGCA GAGCAAAAAT AAATCAAATT AGAAGCAATA AAAAAAAAAA AAAAAAAAA 1619 




Seq ID NO: C148 DNA Sequence 

Nucleic Acid Accession #: NM_002091.1

Coding sequence: 56..502






1 
I 

AGTCTCTGCT 
CGGCAGTGAG 
AGCGGTCCCG 
CCACTGGGCG 
TGAGAGAGGG 
GAATTTGCTG 
GGCCTTGGGC 
AGGTTCAAAA 
CCCCCAGCTG 
TAAGAGACTG 
AAATATTTGA 
CTTCTGGTTT 
TTTTTATATC 
TAAAAGCTTA 



11 



CTTCCCAGCC 



CTGCCTGCGG 
GTGGGGCACT 
AGCCTGAAGC 
GGTCTCATAG 
AATCAGCAGC 
GGCAAAGTTG 
AACCAGCAAT 
AGTTCTGCAA 
CTATTCTGTA 
AAACTTGTTT 
TAGGCTACCT 
AACACAT 



21 
I 

TCTCCGGCGC
TCCTGCTGGC 
GCGGAGGGAC 
TAATGGGGAA 
AGCAGCTGAG 
AAGCAAAGGA 
CTTCGTGGGA 
GTAGACTCTC 
GATAATGATG 
GCATCAGTTC 
TCTTTCATCC 
GCTGTGAACA 
GTTGGTTAGA 



31 
I 

GCTCCAAGGG 
GCTGGTCCTC 
CGTGCTGACC 
AAAGAGCACA 
AGAGTACATC 
GAACAGAAAC 
TTCAGAGGAT 
TGCTCCAGGT 
GCCTCTCTCA 
TACGGATCAT 
TTGACTAAAT 
ATTGTCGAAA 
TTCAAGGCCC 



Seq ID NO: C149 DNA Sequence 

Nucleic Acid Accession #: NM_012261.1 

Coding sequence: 203..1045



11 



41 

I 

CTTCCCGTCG 
TGCCTAGCGC 
AAGATGTACC 
GGGGAGTCTT 
AGGTGGGAAG 
CACCAGCCAC 
AGCAGCAACT 
TCTCAACGTG 
AAAGAGAAAA 
CAACAAGATT 
TCGTGATTTT 
AGAGTCTTCC 
CGAGCTGTTA 



51 
I 

GGACCATGCG 
CCCGGGGGCG 
CGCGCGGCAA 
CTTCTGTTTC 
AAGCTGCAAG 
CTCAACCCAA 
TCAAAGATGT 
AAGGAAGGAA 
ACAAAACCCC 
TCCTTGTGCA 
CAAGCAGCAT 
AATTAATGCT 
CCATTCACAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

797 



1 11 21 31 41 51

| | | | | |

GATTTGCTCT GCCAGCAGCT GTCGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 60
ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 120
CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180
CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240
ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300
GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360
TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420
GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480
TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540
CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600
GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 660
CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 720
CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 780
TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840
TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900
GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960
CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020
ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 1080
CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 1140
CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 1200
AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260
ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 1320
TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380
TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 1440
TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500
TCATGCTCCC TGCAGCAAGA CCCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560
GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCGCA TCCAGCGACT GCAGCACCAG 1620
AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680
GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740
TTCTCTGGC 1749

Seq ID NO: C150 DNA Sequence
Nucleic Acid Accession #: NM_003226.1
Coding sequence: 2..226

1 11 21 31 41 51

| | | | | |

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 60
GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 120
CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180
TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240
CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300
CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 360
CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCG 398






Seq ID NO: C151 DNA Sequence 

Nucleic Acid Accession #: NM_002993.1

Coding sequence: 64..408



1 11 21 31 41 51

| | | | | |

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAC 240 

CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300 

GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360 

AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420 

ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480 

CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540 

GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600 

CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 660 

CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 720 

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780 

ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840 

ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 

ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320 

CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 1380 

TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 1500 

TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 1547 



Seq ID NO: C152 DNA Sequence 
Nucleic Acid Accession #: NM_005242.2
Coding sequence: 148.. 1341 

1 11 21 31 41 51 

| | | | | |

CGGCCCGCCC TGGGGAGGCG CGCAGCAGAG GCTCCGATTC GGGGCAGGTG AGAGGCTGAC 
TTTCTCTCGG TGCGTCCAGT GGAGCTCTGA GTTTCGAATC GGTGGCGGCG GATTCCCCGC 
GCGCCCGGCG TCGGGGCTTC CAGGAGGATG CGGAGCCCCA GCGCGGCGTG GCTGCTGGGG 
GCCGCCATCC TGCTAGCAGC CTCTCTCTCC TGCAGTGGCA CCATCCAAGG AACCAATAGA 
TCCTCTAAAG GAAGAAGCCT TATTGGTAAG GTTGATGGCA CATCCCACGT CACTGGAAAA 
GGAGTTACAG TTGAAACAGT CTTTTCTGTG GATGAGTTTT CTGCATCTGT CCTCACTGGA 
AAACTGACCA CGGTCTTCCT TCCAATTGTC TACACAATTG TGTTTGTGGT GGGTTTGCCA 
AGTAACGGCA TGGCCCTGTG GGTCTTTCTT TTCCGAACTA AGAAGAAGCA CCCTGCTGTG 
ATTTACATGG CCAATCTGGC CTTGGCTGAC CTCCTCTCTG TCATCTGGTT CCCCTTGAAG 
ATTGCCTATC ACATACATGC CAACAACTGG ATTTATGGGG AAGCTCTTTG TAATGTGCTT 
ATTGGCTTTT TCTATGGCAA CATGTACTGT TCCATTCTCT TCATGACCTG CCTCAGTGTG 
CAGAGGTATT GGGTCATCGT GAACCCCATG GGGCACTCCA GGAAGAAGGC AAACATTGCC 
ATTGGCATCT CCCTGGCAAT ATGGCTGCTG ATTCTGCTGG TCACCATCCC TTTGTATGTC 
GTGAAGCAGA CCATCTTCAT TCCTGCCCTG AACATCACGA CCTGTCATGA TGTTTTGCCT 
GAGCAGCTCT TGGTGGGAGA CATGTTCAAT TACTTCCTCT CTCTGGCCAT TGGGGTCTTT 
CTGTTCCCAG CCTTCCTCAC AGCCTCTGCC TATGTGCTGA TGATCAGAAT GCTGCGATCT 
TCTGCCATGG ATGAAAACTC AGAGAAGAAA AGGAAGAGGG CCATCAAACT CATTGTCACT 
GTCCTGGCCA TGTACCTGAT CTGCTTCACT CCTAGTAACC TTCTGCTTGT GGTGCATTAT 
TTTCTGATTA AGAGCCAGGG CCAGAGCCAT GTCTATGCCC TGTACATTGT AGCCCTCTGC 
CTCTCTACCC TTAACAGCTG CATCGACCCC TTTGTCTATT ACTTTGTTTC ACATGATTTC 
AGGGATCATG CAAAGAACGC TCTCCTTTGC CGAAGTGTCC GCACTGTAAA GCAGATGCAA 
GTATCCCTCA CCTCAAAGAA ACACTCCAGG AAATCCAGCT CTTACTCTTC AAGTTCAACC 
ACTGTTAAGA CCTCCTATTG AGTTTTCCAG GTCCTCAGAT GGGAATTGCA CAGTAGGATG 
TGGAACCTGT TTAATGTTAT GAGGACGTGT CTGTTATTTC CTAATCAAAA AGGTCTCACC 
ACATACCACC G 

Seq ID NO: C153 DNA Sequence 
Nucleic Acid Accession #: NM_003469.2 
Coding sequence: 92.. 1945 

1 11 21 31 41 51 

| | | | | |

GAAACGGCCC GAGAAGCTCG CCCGGAGAAC GGGGAGGAAT ATGCTGTGGA GCTCCTCTGC 60 

CATATAAACA AAAAGAGGAA ATCTTTCAAA CATGGCTGAA GCAAAGACCC ACTGGCTTGG 120 

AGCAGCCCTG TCTCTTATCC CTTTAATTTT CCTCATCTCT GGGGCTGAAG CAGCTTCATT 180 

TCAGAGAAAC CAGCTGCTTC AGAAAGAACC AGACCTCAGG TTGGAAAATG TCCAAAAGTT 240 

TCCCAGTCCT GAAATGATCA GGGCTTTGGA GTACATAGAA AACCTCCGAC AACAAGCTCA 300 

TAAGGAAGAA AGCAGCCCAG ATTATAATCC CTACCAAGGT GTCTCTGTCC CCCTTCAGCA 360 

AAAAGAAAAT GGCGATGAAA GCCACTTGCC CGAGAGGGAT TCACTGAGTG AAGAAGACTG 420 

GATGAGAATA ATACTCGAAG CTTTGAGACA GGCTGAAAAT GAGCCTCAGT CTGCACCAAA 480 

AGAAAATAAG CCCTATGCCT TGAATTCAGA AAAGAACTTT CCAATGGACA TGAGTGATGA 540 

TTATGAGACA CAGCAGTGGC CAGAAAGAAA GCTTAAGCAC ATGCAATTCC CTCCTATGTA 600 

TGAAGAGAAT TCCAGGGATA ACCCCTTTAA ACGCACAAAT GAAATAGTGG AGGAACAATA 660 

TACTCCTCAA AGCCTTGCTA CATTGGAATC TGTCTTCCAA GAGCTGGGGA AACTGACAGG 720 

ACCAAACAAC CAGAAACGTG AGAGGATGGA TGAGGAGCAA AAACTTTATA CGGATGATGA 780 

AGATGATATC TACAAGGCTA ATAACATTGC CTATGAAGAT GTGGTCGGGG GAGAAGACTG 840 

GAACCCAGTA GAGGAGAAAA TAGAGAGTCA AACCCAGGAA GAGGTGAGAG ACAGCAAAGA 900 

GAATATAGGA AAAAATGAAC AAATCAACGA TGAGATGAAA CGCTCAGGGC AGCTTGGCAT 960 

CCAGGAAGAA GATCTTOGGA AAGAGAGTAA AGACCAACTC TCAGATGATG TCTCCAAAGT 1020 

AATTGCCTAT TTGAAAAGGT TAGTAAATGC TGCAGGAAGT GGGAGGTTAC AGAATGGGCA 1080 

AAATGGGGAA AGGGCCACCA GGCTTTTTGA GAAACCTCTT GATTCTCAGT CTATTTATCA 1140 

GCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCCCCAGAA GACTTAATTG AGATGCTCAA 1200 

AACTGGGGAG AAGCCGAATG GATCAGTGGA ACCGGAGCGG GAGCTTGACC TTCCTGTTGA 1260 

CCTAGATGAC ATCTCAGAGG CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 1320 

PCT/US02/36810 



CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 1380

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGGAG AGTGCAGCAA ATCAGAAAAC 1440

GTOGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500

TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1560

ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800

TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 1860

AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 1920

TAAGAGAGCA ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTG 2040

TAATATCTTT AAATGATGTA CAGGCAGATG AAACCAGGTC ACTGGGGAGT CTGCTTCATT 2100

TCCTCTGAGC TGTTATCTTG TGTATGGATA TGTGTAAATG TTATGACTCC TTGATAAAAA 2160

ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAG TATATCTAAT 2220

GGCTGTGGCA TTGTTGATGC TCACATATGA TAAAAAAGTG TCCTATAATT CTATTGAAAG 2280

TTTTTAATAT TTATTGAATT ATTTTGTTAC TGTCTGTAGC GTTTTGTGGA GTACTGGACC 2340

AAAAAAATAA AGCATTATAA ATATA 2365



Seq ID NO: C154 DNA Sequence
Nucleic Acid Accession #: NM_030955
Coding sequence: 327.. 5108

1 11 21 31 41 51 

| | | | | |

GAATTCCGGG AGCGGGCGGG CTGCGAGGCC GCGGGGCATG CGGGAGGCGG AGGGGTGGGA 60 

CCGGGTGGCT GCGCCCATTC CACACCCGCC GAAAGCGGAC ACTGTCAGCT GAATCACTCC 120 

CCTTTTAGGA GGAGGGAGGG GGAAAAGGTG TCTAGCTAAT TTCTGCTTAA AAAAGCACAG 180 

GAGATCGCGG GTCAGCTTTG CAGTCGCTGC CTTCTCGCGC CTGACCATGC ACCCCTGCAT 240 

CTTCCTGCTG GGCACAGGCG AGCGCTTTAT TTCTGGAGCT GAGGGCTAAA ACTTTTTTCA 300

CTTTTCTTCT CCTCAACATC TGAATCATGC CATGTGCCCA GAGGAGCTGG CTTGCAAACC 360 

TTTCCGTGGT GGCTCAGCTC CTTAACTTTG GGGCGCTTTG CTATGGGAGA CAGCCTCAGC 420 

CAGGCCCGGT TCGCTTCCCG GACAGGAGGC AAGAGCATTT TATCAAGGGC CTGCCAGAAT 480 

ACCACGTGGT GGGTCCAGTC CGAGTAGATG CCAGTGGGCA TTTTTTGTCA TATGGCTTGC 540 

ACTATCCCAT CACGAGCAGC AGGAGGAAGA GAGATTTGGA TGGCTCAGAG GACTGGGTGT 600 

ACTACAGAAT TTCTCACGAG GAGAAGGACC TGTTTTTTAA CTTGACGGTC AATCAAGGAT 660 

TTCTTTCCAA TAGCTACATC ATGGAGAAGA GATATGGGAA CCTCTCCCAT GTTAAGATGA 720 

TGGCTTCCTC TGCCCCCCTC TGCCATCTCA GTGGCACGGT TCTACAGCAG GGCACCAGAG 780 

TTGGGACGGC AGCCCTCAGT GCCTGCCATG GACTGACTGG ATTTTTCCAA CTACCACATG 840 

GAGACTTTTT CATTGAACCC GTGAAGAAGC ATCCACTGGT TGAGGGAGGG TACCACCCGC 900 

ACATCGTTTA CAGGAGGCAG AAAGTTCCAG AAACCAAGGA GCCAACCTGT GGATTAAAGG 960 

ACAGTGTTAA CATCTCCCAG AAGCAAGAGC TATGGCGGGA GAAGTGGGAG AGGCACAACT 1020 

TGCCAAGCAG AAGCCTCTCT CGGCGTTCCA TCAGCAAGGA GAGATGGGTG GAGACACTGG 1080

TGGTGGCCGA CACAAAGATG ATTGAATACC ATGGGAGTGA GAATGTGGAG TCCTACATCC 1140 

TCACCATCAT GAACATGGTC ACTGGGTTGT TCCATAACCC AAGCATTGGC AATGCAATTC 1200 

ACATTGTTGT GGTTCGGCTC ATTCTACTCG AAGAAGAAGA GCAAGGACTG AAAATAGTTC 1260 

ACCATGCAGA AAAGACACTG TCTAGCTTCT GCAAGTGGCA GAAGAGTATC AATCCCAAGA 1320 

GTGACCTCAA TCCTGTTCAT CACGACGTGG CTGTCCTTCT CACCAGAAAG GACATCTGTG 1380 

CTGGTTTCAA TCGCCCCTGC GAGACCCTGG GCCTGTCTCA CCTTTCAGGA ATGTGTCAGC 1440 

CTCACCGCAG TTGTAACATC AATGAAGATT CGGGACTCCC TCTGGCTTTC ACAATTGCCC 1500 

ATGAGCTAGG ACACAGCTTC GGCATCCAGC ATGATGGGAA AGAAAATGAC TGTGAGCCTG 1560 

TGGGCAGACA TCCGTACATC ATGTCCCGCC AGCTCCAGTA CGATCCCACT CCGCTGACAT 1620 

GGTCCAAGTG CAGCGAGGAG TACATCACCC GCTTCTTGGA CCGAGGCTGG GGGTTCTGTC 1680 

TTGATGACAT ACCTAAAAAG AAAGGCTTGA AGTCCAAGGT CATTGCCCCC GGAGTGATCT 1740 

ATGATGTTCA CCACCAGTGC CAGCTACAAT ATGGACCCAA TGCTACCTTC TGCCAGGAAG 1800 

TAGAAAACGT CTGCCAGACA CTGTGGTGCT CCGTGAAGGG CTTTTGTCGC TCTAAGCTGG 1860 

ACGCTGCTGC AGATGGAACT CAATGTGGTG AGAAGAAGTG GTGTATGGCA GGCAAGTGCA 1920 

TCACAGTGGG GAAGAAACCA GAGAGCATTC CTGGAGGCTG GGGCCGCTGG TCACCCTGGT 1980 

CCCACTGTTC CAGGACCTGT GGGGCTGGAG TCCAGAGCGC AGAGAGGCTC TGCAACAACC 2040 

CCGAGCCAAA GTTTGGAGGG AAATATTGCA CTGGAGAAAG AAAACGCTAT CGCTTGTGCA 2100 

ACGTCCACCC CTGTCGCTCA GAGGCACCAA CATTTCGGCA GATGCAGTGC AGTGAATTTG 2160 

ACACTGTTCC CTACAAGAAT GAACTCTACC ACTGGTTTCC CATTTTTAAC CCAGCACATC 2220 

CTTGTGAGCT CTACTGCCGA CCCATAGATG GCCAGTTTTC TGAGAAAATG CTGGATGCTG 2280 

TCATTGATGG TACCCCTTGC TTTGAAGGCG GCAACAGCAG AAATGTCTGT ATTAATGGCA 2340 

TATGTAAGAT GGTTGGCTGT GACTATGAGA TCGATTCCAA TGCCACCGAG GATCGCTGCG 2400 

GTGTGTGCCT GGGAGATGGC TCTTCCTGCC AGACTGTGAG AAAGATGTTT AAGCAGAAGG 2460 

AAGGATCTGG TTATGTTGAC ATTGGGCTCA TTCCAAAAGG AGCAAGGGAC ATAAGAGTGA 2520 
TGGAAATTGA GGGAGCTGGA AACTTCCTGG CCATCAGGAG TGAAGATCCT GAAAAATATT 2580 
ACCTGAATGG AGGGTTTATT ATCCAGTGGA ACGGGAACTA TAAGCTGGCA GGGACTGTCT 2640 
TTCAGTATGA CAGGAAAGGA GACCTGGAAA AGCTGATGGC CACAGGTCCC ACCAATGAGT 2700 
CTGTGTGGAT CCAGCTTCTA TTCCAGGTGA CTAACCCTGG CATCAAGTAT GAGTACACAA 2760 
TCCAGAAAGA TGGCCTTGAC AATGATGTTG AGCAGATGTA CTTCTGGCAG TACGGCCACT 2820 
GGACAGAGTG CAGTGTGACC TGCGGGACAG GTATCCGCCG CCAAACTGCC CATTGCATAA 2880
AGAAGGGCCG CGGGATGGTG AAAGCTACAT TCTGTGACCC AGAAACACAG CCCAATGGGA 2940 
GACAGAAGAA GTGCCATGAA AAGGCTTGTC CACCCAGGTG GTGGGCAGGG GAGTGGGAAG 3000 
CATGCTCGGC GACATGCGGG CCCCACGGGG AGAAGAAGCG AACCGTGCTG TGCATCCAGA 3060 
CCATGGTCTC TGACGAGCAG GCTCTCCCGC CCACAGACTG CCAGCACCTG CTGAAGCCCA 3120 
AGACCCTCCT TTCCTGCAAC AGAGACATCC TGTGCCCCTC GGACTGGACA GTGGGCAACT 3180 
GGAGTGAGTG TTCTGTTTCC TGTGGTGGTG GAGTGCGGAT TCGCAGTGTC ACATGTGCCA 3240 
AGAACCATGA TGAACCTTGC GATGTGACAA GGAAACCCAA CAGCCGAGCT CTGTGTGGCC 3300 
TCCAGCAATG CCCTTCTAGC CGGAGAGTTC TGAAACCAAA CAAAGGCACT ATTTCCAATG 3360 
GAAAAAACCC ACCAACACTA AAGCCCGTCC CTCCACCTAC ATCCAGGCCC AGAATGCTGA 3420 
CCACACCCAC AGGGCCTGAG TCTATGAGCA CAAGCACTCC AGCAATCAGC AGCCCTAGTC 3480 
CTACCACAGC CTCCAAAGAA GGAGACCTGG GTGGGAAACA GTGGCAAGAT AGCTCAACCC 3540 
AACCTGAGCT GAGCTCTCGC TATCTCATTT CCACTGGAAG CACTTCCCAG CCCATCCTCA
CTTCCCAATC CTTGAGCATT CAGCCAAGTG AGGAAAATGT TTCCAGTTCA GATACTGGTC 
CTACCTOGGA GGGAGGCCTT GTAGCTACAA CAACAAGTGG TTCTGGCTTG TCATCTTCCC 
GCAACCCTAT CACTTGGCCT GTGACTCCAT TTTACAATAC CTTGACCAAA GGTCCAGAAA 
TGGAGATTCA CAGTGGCTCA GGGGAAGAAA GAGAACAGCC TGAGGACAAA GATGAAAGCA 
ATCCTGTAAT ATGGACCAAG ATCAGAGTAC CTGGAAATGA CGCTCCAGTG GAAAGTACAG 
AAATGCCACT TGCACCTCCA CTAACACCAG ATCTCAGCAG GGAGTCCTGG TGGCCACCCT 
TCAGCACAGT AATGGAAGGA CTGCTCCCCA GCCAAAGGCC CACTACTTCC GAAACTGGGA 
CACCCAGAGT TGAGGGGATG GTTACTGAAA AGCCAGCCAA CACTCTGCTC CCTCTGGGAG 
GAGACCACCA GCCAGAACCC TCAGGAAAGA CGGCAAACCG TAACCACCTG AAACTTCCAA 
ACAACATGAA CCAAACAAAA AGTTCTGAAC CAGTCCTGAC TGAGGAGGAT GCAACAAGTC 
TGATTACTGA GGGCTTTTTG CTAAATGCCT CCAATTACAA GCAGCTCACA AACGGCCACG 
GCTCTGCACA CTGGATCGTC GGAAACTGGA GCGAGTGCTC CACCACATGT GGCCTGGGGG 
CCTACTGGAA AAGGGTGGAG TGCACCACCC AGATGGATTC TGACTGTGCG GCCATCCAGA 
GACCTGACCC TGCAAAAAGA TGCCACCTCC GTCCCTGTGC TGGCTGGAAA GTGGGAAACT 
GGAGCAAGTG CTCCAGAAAC TGCAGTGGGG GCTTCAAGAT ACGCGAGATT CAGTGCGTGG 
ACAGCCGGGA CCACCGGAAC CTGAGGCCAT TTCACTGCCA GTTCCTGGCC GGCATTCCTC 
CCCCATTGAG CATGAGCTGT AACCCGGAGC CCTGTGAGGC GTGGCAGGTG GAGCCTTGGA 
GCCAGTGCTC CAGGTCCTGT GGAGGTGGAG TTCAGGAGAG AGGAGTGTTC TGTCCAGGAG 
GCCTCTGTGA TTGGACAAAA AGACCCACAT CCACCATGTC TTGCAATGAG CACCTGTGCT 
GTCACTGGGC CACTGGGAAC TGGGACCTGT GTTCCACTTC CTGTGGAGGT GGCTTTCAGA 
AGAGGATTGT CCAATGTGTG CCCTCAGAGG GCAATAAAAC TGAAGACCAA GACCAATGTC 
TATGTGATCA CAAACCCAGA CCTCCAGAAT TCAAAAAATG CAACCAGCAG GCCTGCAAGA 
AAAGTGCCGA TTTACTTTGC ACTAAGGACA AACTGTCAGC CAGTTTCTGC CAGACACTGA 
AAGCCATGAA GAAATGTTCT GTGCCCACCG TGAGGGCTGA GTGCTGCTTC TCGTGTCCCC 
AGACACACAT CACACACACC CAAAGGCAAA GAAGGCAACG GTTGCTCCAA AAGTCAAAAG 
AACTCTAAGC CCAAA 

Seq ID NO: C155 DNA Sequence 
Nucleic Acid Accession #: NM_001062.1
Coding sequence: 76.. 1380 

1 11 21 31 41 51 

| | | | | |

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAGCCAG CCAATTCATC AACATTCTGG 60

TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 120 

TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 180 

CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 240 

AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 300 

ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 360 

GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 420 

TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 480 

CACAATGGCA CTCCCCTGAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTGT 540 

CTGTTCAATG GGAACTACTC AACCGCOGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 600 

AACTATTATT TTGGTAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 660 

ACCTGTGTGA AGAAGAGTCT AATAAATGGG CAGATCAAAG CAGATGAAGG CAGTTTAAAG 720 

AACATCAGTA TTTATACAAA GTCACTGGTA GAAAAGATTC TGTCTGAGAA AAAAGAAAAT 780 

GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC AGGCCCTCTT TGTATCATCA 840 

GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 900 

ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGCCC AGGTCTTACC TGCCCTGATG 960 

GGAAAGACCT TCTTGGATAT TAACAAAGAC TCTTCTTGCG TCTCTGCTTC AGGTAACTTC 1020 

AACATCTCCG CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC ATATATCTCC 1080 

GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATGTCACTGT GCTAAATGGT 1140 

TCTGTCTTCC TCAGTGTGAT GGAGAAAGCC CAGAAAATGA ATGATACTAT ATTTGGTTTC 1200 

ACAATGGAGG AGCGCTCATG GGGGCCCTAT ATCACCTGTA TTCAGGGCCT ATGTGCCAAC 1260 

AATAATGACA GAACCTACTG GGAACTTCTG AGTGGAGGCG AACCACTGAG CCAAGGAGCT 1320 

GGTAGTTACG TTGTCCGCAA TGGAGAAAAC TTGGAGGTTC GCTGGAGCAA ATACTAATAA 1380 

GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 1440 

TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TCCCGTTCTC 1500 

TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 1537 
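
The coding-sequence range given in each header implies a protein product. As a minimal sketch that is not part of the original listing, the following Python translates the first ten codons of Seq ID NO: C155, starting at position 76, under the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative, and the 30-base string was assembled by hand from the second row of the listing (positions 76 to 105).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; stop codons become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 76-105 of Seq ID NO: C155 (start of the stated coding sequence).
cds_start = "ATGAGACAGTCACACCAGCTGCCCCTAGTG"
print(translate(cds_start))  # MRQSHQLPLV
```

A codon containing a character other than A, C, G or T (for example a misread `O`) raises a `KeyError`, which is a convenient way to locate transcription errors in a row.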

Seq ID NO: C156 DNA Sequence 
Nucleic Acid Accession #: NM_004591
Coding sequence: 59.. 349 

1 11 21 31 41 51 

| | | | | |

CACTCCCAAA GAACTGGGTA CTCAACACTG AGCAGATCTG TTCTTTGAGC TAAAAACCAT 60 

GTGCTGTACC AAGAGTTTGC TCCTGGCTGC TTTGATGTCA GTGCTGCTAC TCCACCTCTG 120 

CGGCGAATCA GAAGCAGCAA GCAACTTTGA CTGCTGTCTT GGATACACAG ACCGTATTCT 180 

TCATCCTAAA TTTATTGTGG GCTTCACACG GCAGCTGGCC AATGAAGGCT GTGACATCAA 240 

TGCTATCATC TTTCACACAA AGAAAAAGTT GTCTGTGTGC GCAAATCCAA AACAGACTTG 300 

GGTGAAATAT ATTGTGCGTC TCCTCAGTAA AAAAGTCAAG AACATGTAAA AACTGTGGCT 360 

TTTCTGGAAT GGAATTGGAC ATAGCCCAAG AACAGAAAGA ACCTTGCTGG GGTTGGAGGT 420 

TTCACTTGCA CATCATGGAG GGTTTAGTGC TTATCTAATT TGTGCCTCAC TGGACTTGTC 480 

CAATTAATGA AGTTGATTCA TATTGCATCA TAGTTTGCTT TGTTTAAGCA TCACATTAAA 540 

GTTAAACTGT ATTTTATGTT ATTTATAGCT GTAGGTTTTC TGTGTTTAGC TATTTAATAC 600 

TAATTTTCCA TAAGCTATTT TGGTTTAGTG CAAAGTATAA AATTATATTT GGGGGGGAAT 660 

AAGATTATAT GGACTTTCTT GCAAGCAACA AGCTATTTTT TAAAAAAACT ATTTAACATT 720 

CTTTTGTTTA TATTGTTTTG TCTCCTAAAT TGTTGTAATT GCATTATAAA ATAAGAAAAA 780 

CATTAATAAG ACAAATATT 799 
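
Each row of the listing holds six groups of ten bases followed by the cumulative position of the last base, and the header gives the coding sequence as a position range. The sketch below is an illustration added here, not part of the original listing. It joins the first six rows of Seq ID NO: C156, checks each trailing counter against the running length, and extracts positions 59 to 349. It assumes the range is 1-based and inclusive; `parse_listing` and `extract_cds` are hypothetical helper names.

```python
# Rows 1-6 (positions 1-360) of Seq ID NO: C156, copied from the listing.
LISTING = """\
CACTCCCAAA GAACTGGGTA CTCAACACTG AGCAGATCTG TTCTTTGAGC TAAAAACCAT 60
GTGCTGTACC AAGAGTTTGC TCCTGGCTGC TTTGATGTCA GTGCTGCTAC TCCACCTCTG 120
CGGCGAATCA GAAGCAGCAA GCAACTTTGA CTGCTGTCTT GGATACACAG ACCGTATTCT 180
TCATCCTAAA TTTATTGTGG GCTTCACACG GCAGCTGGCC AATGAAGGCT GTGACATCAA 240
TGCTATCATC TTTCACACAA AGAAAAAGTT GTCTGTGTGC GCAAATCCAA AACAGACTTG 300
GGTGAAATAT ATTGTGCGTC TCCTCAGTAA AAAAGTCAAG AACATGTAAA AACTGTGGCT 360
"""


def parse_listing(text):
    """Concatenate the base groups of each row and verify the trailing counter."""
    sequence = ""
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank separator lines
        *groups, counter = tokens
        sequence += "".join(groups)
        if len(sequence) != int(counter):
            raise ValueError(
                f"row ending at {counter}: running length is {len(sequence)}"
            )
    return sequence


def extract_cds(sequence, start, end):
    """Return the coding sequence for a 1-based, inclusive position range."""
    return sequence[start - 1:end]


seq = parse_listing(LISTING)
cds = extract_cds(seq, 59, 349)
print(len(cds), cds[:3], cds[-3:])  # 291 ATG TAA
```

The extracted range begins with a start codon, ends with a stop codon and has a length divisible by three, which is consistent with the 1-based inclusive reading. The counter check also flags rows where a group was split or a counter was misread.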

Seq ID NO: C157 DNA Sequence 

Nucleic Acid Accession #: NM_013271.1
Coding sequence: 27.. 809



TCCGGAGCCA 
CCGGGGGCGT 
TCTGCGCGCG 
AGACTGGCGC 
TGCAGGAGCT 
GGGCCGAGGC 
TCTGGGGCGC 
CTGCAGCGCA 
CCCAGCTTGT 
ACGACGGCCC 
CCGAGCTGTT 
TGGCAGCCCC 
CTGAGGGCGT 
TGCCTGCACG 
CAGAAGTGCC 
TTACCCCGGC 
GATCTGAGC 




CCCCGCGCCC 
CGCGGGCCCG 
GAGGTACTTG 
GCGCCGCCTC 
GCTGGGGGCG 
CCGCCTCTTG 
CCCGCCATCC 
CAGCCAGCCC 



21 
I 

GGCAGCATGG 
GTGCTGCTGC 
GAACCCCGCG 
TTCCGGCGGT 
CTGGCGCATC 
GAGGATCAGC 
TCTGATCCGG 
GCTCTGCTCC 
GTCCCCGCCG 
GATGCTGAGG 
CTGGGACGGA 
CGCCGTGCCG 
CTGCTGCGTG 
CCACCCTGAG 
CGCCACCAGG 
TCTCACCCGA 



TGCTCGGCCT 
GCCTAAGCGC 
CAGTGCCCCG 
TGCTGGAGGC 
AGGCGCGCGT 
CTCTGGGCCT 
GCGCCCGCCT 
CGGCGCTCCG 
AGGCAGGCGA 
TTCTTGCGGG 
CCGACCACGA 
TGAAACGCCT 
CACTGCCCGG 
ACTTCTCCCC 
GGATCCCTAC 



41 
1 

GCTGCTCTGG 
GTTTCGGCCG 
AGCGTCTCCG 
AGGTGAGGCG 
CGAACGTCAG 
CCTGGCGCAG 
GGACGACGAC 
TGACCCTGCC 
ACCCCGGCCC 
CGAGACACCC 
AAGCGCGGAC 
TGTGGGCTCT 
AGAGACCCCG 
ATCCCGTGCA 
GCCAGCACGT 
CCCCTGGCCC 



Seq ID NO: C158 DNA Sequence
Nucleic Acid Accession #: NM_002245.2 
Coding sequence: 183.. 1193 



11 



21 



31 



41 



51 
I 

GGGCCGCGGG 
CCCCCCGCGC 
CCCTTGGCTG 
GCGGGGGCGG 
GAGCGGGCGC 
CTGCTGCGCG 
CCCGACGCGC 
GCCCTAGCAG 
CCGGTCTACG 
GACGTGGACC 
TCCGAGGGGG 
GAGCTGCCCC 
GCGCCCCAGG 
CCCTGGGACC 
CCAGAGCAAC 
ACAATAACAT 



51 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

969 



GGGCAGGAAG ACGGCGCTGC CCGGAGGAGC GGGGCGGGCG GGCGCGCGGG GGAGCGGGCG 
GCGGGCGGGA GCCAGGCCCG GGCGGGGGCG GGGGCGGCGG GGCCAGAAGA GGCGGCGGGC 
CGCGCTCCGG CCGGTCTGCG GCGTTGGCCT TGGCTTTGGC TTTGGCGGCG GCGGTGGAGA 
AGATGCTGCA GTCCCTGGCC GGCAGCTCGT GCGTGCGCCT GGTGGAGCGG CACCGCTCGG 
CCTGGTGCTT CGGCTTCCTG GTGCTGGGCT ACTTGCTCTA CCTGGTCTTC GGCGCAGTGG 
TCTTCTCCTC GGTGGAGCTG CCCTATGAGG ACCTGCTGCG CCAGGAGCTG CGCAAGCTGA 
AGCGACGCTT CTTGGAGGAG CACGAGTGCC TGTCTGAGCA GCAGCTGGAG CAGTTCCTGG 
GCCGGGTGCT GGAGGCCAGC AACTACGGCG TGTCGGTGCT CAGCAACGCC TCGGGCAACT 
GGAACTGGGA CTTCACCTCC GCGCTCTTCT TCGCCAGCAC CGTGCTCTCC ACCACAGGTT 
ATGGCCACAC CGTGCCCTTG TCAGATGGAG GTAAGGCCTT CTGCATCATC TACTCCGTCA 
TTGGCATTCC CTTCACCCTC CTGTTCCTGA CGGCTGTGGT CCAGCGCATC ACCGTGCACG 
TCACCCGCAG GCCGGTCCTC TACTTCCACA TOCGCTGGGG CTTCTCCAAG CAGGTGGTGG 
CCATCGTCCA TGCCGTGCTC CTTGGGTTTG TCACTGTGTC CTGCTTCTTC TTCATCCCGG 
CCGCTGTCTT CTCAGTCCTG GAGGATGACT GGAACTTCCT GGAATCCTTT TATTTTTGTT 
TTATTTCCCT GAGCACCATT GGCCTGGGGG ATTATGTGCC TGGGGAAGGC TACAATCAAA 
AATTCAGAGA GCTCTATAAG ATTGGGATCA CGTGTTACCT GCTACTTGGC CTTATTGCCA 
TGTTGGTAGT TCTGGAAACC TTCTGTGAAC TCCATGAGCT GAAAAAATTC AGAAAAATGT 
TCTATGTGAA GAAGGACAAG GACGAGGATC AGGTGCACAT CATAGAGCAT GACCAACTGT 
CCTTCTCCTC GATCACAGAC CAGGCAGCTG GCATGAAAGA GGACCAGAAG CAAAATGAGC 
CTTTTGTGGC CACCCAGTCA TCTGCCTGCG TGGATGGCCC TGCAAACCAT TGAGCGTAGG 
ATTTGTTGCA TTATGCTAGA GCACCAGGGT CAGGGTGCAA GGAAGAGGCT TAAGTATGTT 
CATTTTTATC AGAATGCAAA AGCGAAAATT ATGTCACTTT AAGAAATAGC TACTGTTTGC 
AATGTCTTAT TAAAAAACAA CAAAAAAAGA CACATGGAAC AAAGAAGCTG TGACCCCAGC 
AGGATGTCTA ATATGTGAGG AAATGAGATG TCCACCTAAA ATTCATATGT GACAAAATTA 
TCTCGACCTT ACATAGGAGG AGAATACTTG AAGCAGTATG CTGCTGTGGT TAGAAGCAGA 
TTTTATACTT TTAACTGGAA ACTTTGGGGT TTGCATTTAG ATCATTTAGC TGATGGCTAA 
ATAGCAAAAT TTATATTTAG AAGCAAAAAA AAAAAGCATA GAGATGTGTT TTATAAATAG 
GTTTATGTGT ACTGGTTTGC ATGTACCCAC CCAAAATGAT TATTTTTGGA GAATCTAAGT 
CAAACTCACT ATTTATAATG CATAGGTAAC CATTAACTAT GTACATATAA AGTATAAATA 
TGTTTATATT CTGTACATAT GGTTTAGGTC ACCAGATCCT AGTGTAGTTC TGAAACTAAG 
ACTATAGATA TTTTGTTTCT TTTGATTTCT CTTTATACTA AAGAATCCAG AGTTGCTACA 
ATAAAATAAG GGGAATAATA AACTTGAGAG TGAATAACCA T 

Seq ID NO: C159 DNA Sequence 

Nucleic Acid Accession #: NM_005472.1

Coding sequence: 93.. 404 



1 
I 

AAAGGGACTC 
CCGAGTCTTC 
GGTATGAGAG 

TACCTGGCCG 
TAACTGTGGG 
ACCCCTATCA 
CGGTGGAAGA 
TGCATCAGGT 



11 
I 

CTTGAAACTG 
CCCCACCTCA 
CCTGCATGCC 
AGGGCCAGGG 
TGATGACAAC 
CAGCCTCATC 
TGTGTATATC 
CCAAGACACC 
CT 



21 
I 

ATTGAGAGCC 
ATCCCTGTTG 
GTGCTGAAGG 
CTGGGGCCAG 
TCCTACATGT 
CTGGGATACA 
AAGAACCGTG 
TGGGGATTGC 



31 
I 

CAGTGGATTT 
CTATGGAGAC 
CTCTAAATGC 
ACAACCAGAC 
ACATTCTCTT 
CCCGCTCCCG 
TGTCTATGAT 
GTCTGGGGCC 



41 

I 

GCCAGCAGTT 
TACCAATGGA 
CACTCTTCAC 
TGAAGAGAGG 
TGTCATGTTT 
CAAAGTGGAC 
CTAACACGAG 
TCCAGAACTC 



Seq ID NO: C160 DNA Sequence

Nucleic Acid Accession #: NM_005245.1 

Coding sequence: 187.. 13959 



11 



21 



31 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1901 



51 

1 

TGAGCTTCTA 
ACGGAGACCT 
AGCAATTTGC 
CGGGCCAGCC 
CTATTTGCTG 
AAGCGTAGTG 
AGGGCTGGGA 
TGCTGTGGAC 



51 
I 



60 

120 

180 

240 

300 

360 

420 

480 

492 



CTGGGCGGCC GGGCGCGGGG AGAGGGCGCG GGAGCGGCTC GTGCGGCAGG TACCATGCGG 60 
ACGCGCGAGC COGGCGAGGC CCCGGCAGGC CCGTCCCTGC TCGGGGGCGC GCTGAGACGG 120
CGGGTGAGCT CCACGAGAGC GCCGTCGCCA CTTCGGGCCA ACTTTGCGAT TCCCGACAGT 180 
TAAGCAATGG GGAGACATTT GGCTTTGCTC CTGCTTCTGC TCCTTCTCTT CCAACATTTT 240 
GGAGACAGTG ATGGCAGCCA ACGACTTGAA CAGACTCCTC TGCAGTTTAC ACACCTCGAG 300 
TACAACGTCA CCGTGCAGGA GAACTCTGCA GCTAAGACTT ATGTGGGGCA TCCTGTCAAG 360 
ATGGGTGTTT ACATTACACA TCCAGCGTGG GAAGTAAGGT ACAAAATTGT TTCCGGAGAC 420 
AGTGAAAACC TGTTCAAAGC TGAAGAGTAC ATTCTCGGAG ACTTTTGCTT TCTAAGAATA 480 
AGGACCAAAG GAGGAAATAC AGCTATTCTT AATAGAGAAG TGAAGGATCA CTACACATTG 540 
ATAGTGAAAG CACTTGAAAA AAATACTAAT GTGGAGGCGC GAACAAAGGT CAGGGTGCAG 600
GTGCTGGATA CAAATGACTT GAGACCGTTA TTCTCACCCA CCTCATACAG CGTTTCTTTA 660 
CCTGAAAACA CAGCTATAAG GACCAGTATC GCAAGAGTCA GCGCCACGGA TGCAGACATA 720 
GGAACCAACG GGGAATTTTA CTACAGTTTT AAAGATCGAA CAGATATGTT TGCTATTCAC 780 
CCAACCAGTG GTGTGATAGT GTTAACTGGT AGACTTGATT ACCTAGAGAC CAAGCTCTAT 840 
GAGATGGAAA TCCTCGCTGC GGACCGTGGC ATGAAGTTGT ATGGGAGCAG TGGCATCAGC 900 
AGCATGGCCA AGCTAACGGT GCACATCGAA CAGGCCAATG AATGTGCTCC GGTGATAACA 960 
GCAGTGACAT TGTCACCATC AGAACTGGAC AGGGACCCAG CATATGCAAT TGTGACAGTG 1020 
GATGACTGCG ATCAGGGTGC CAATGGTGAC ATAGCATCTT TAAGCATCGT GGCAGGTGAC 1080 
CTTCTCCAGC AGTTTAGAAC AGTGAGGTCC TTTCCAGGGA GTAAGGAGTA TAAAGTCAAA 1140 
GCCATCGGTG ACATTGATTG GGACAGTCAT CCTTTCGGCT ACAATCTCAC ACTACAGGCT 1200 
AAAGATAAAG GAACTCCGCC CCAGTTCTCT TCTGTTAAAG TCATTCACGT GACTTCTCCA 1260 
CAGTTCAAAG CCGGGCCAGT CAAGTTTGAA AAGGATGTTT ACAGAGCAGA AATAAGTGAA 1320 
TTTGCTCCTC CCAACACACC TGTGGTCATG GTAAAGGCCA TTCCTGCTTA TTCCCATTTG 1380 
AGGTATGTTT TTAAAAGGAC ACCTGGAAAA GCTAAATTCA GTTTAAATTA CAACACTGGT 1440
CTCATTTCTA TTTTAGAACC AGTTAAAAGA CAGCAGGCAG CCCATTTTGA ACTTGAAGTA 1500 
ACAACAAGTG ACAGAAAAGC GTCCACCAAG GTCTTGGTGA AAGTCTTAGG TGCAAATAGC 1560 
AATCCCCCTG AATTTACCCA GACAGCGTAC AAAGCTGCTT TTGATGAGAA CGTGCCCATT 1620 
GGTACTACTA TCATGAGCCT GAGTGCCGTA GACCCTGATG AGGGTGAGAA TGGGTACGTG 1680 
ACATACAGTA TOGCAAATTT AAATCATGTG CCGTTTGCGA TTGACCATTT CACTGGTGCC 1740 
GTGAGTACGT CAGAAAACCT GGACTACGAA CTGATGCCTC GGGTTTATAC TCTGAGGATT 1800 
CGTGCATCAG ACTGGGGCTT GCCGTACCGC CGGGAAGTCG AAGTCCTTGC TACAATTACT 1860 
CTCAATAACT TGAATGACAA CACACCTTTG TTTGAGAAAA TAAATTGTGA AGGGACAATT 1920 
CCCAGAGATC TAGGCGTGGG AGAGCAAATA ACCACTGTTT CTGCTATTGA TGCAGATGAA 1980 
CTTCAGTTGG TACAGTATCA GATTGAAGCT GGAAATGAAC TGGATTTGTT TAGTTTAAAC 2040 
CCCAACTCGG GGGTATTGTC ATTAAAGCGA TCGCTAATGG ATGGCTTAGG TGCAAAGGTG 2100 
TCTTTCCACA GTCTGAGAAT CACAGCTACA GATGGAGAAA ATTTTGCCAC ACCATTATAT 2160 
ATCAACATAA CAGTGGCTGC CAGTCACAAG CTGGTAAACT TGCAGTGTGA AGAGACTGGT 2220 
GTTGCCAAAA TGCTGGCAGA GAAGCTCCTG CAGGCAAATA AATTACACAA CCAGGGAGAG 2280
GTGGAGGATA TTTTCTTCGA TTCTCACTCT GTCAATGCTC ACATACCGCA GTTTAGAAGC 2340
ACTCTTCCGA CTGGTATTCA GGTAAAGGAA AACCAGCCTG TGGGTTCCAG TGTAATTTTC 2400
ATGAACTCCA CTGACCTTGA CACTGGCTTC AATGGAAAAC TGGTCTATGC TGTTTCTGGA 2460 
GGAAATGAGG ATAGTTGCTT CATGATTGAT ATGGAAACAG GAATGCTGAA AATTTTATCT 2520 
CCTCTTGACC GTGAAACAAC AGACAAATAC ACCCTGAATA TTACCGTCTA TGACCTTGGG 2580 
ATACCCCAGA AGGCTGCGTG GCGTCTTCTA CATGTCGTGG TTGTCGATGC CAATGATAAT 2640 
CCACCCGAGT TTTTACAGGA GAGCTATTTT GTGGAAGTGA GTGAAGACAA GGAGGTACAT 2700 
AGTGAAATCA TCCAGGTTGA AGCCACAGAT AAAGACCTGG GGCCCAACGG ACAOGTGACG 2760 
TACTCAATTC TTACAGACAC AGACACATTT TCAATTGACA GCGTGACGGG TGTTGTTAAC 2820 
ATCGCACGCC CTCTGGATCG AGAGCTGCAG CATGAGCACT CCTTAAAGAT TGAGGCCAGG 2880 
GACCAAGCCA GAGAAGAGCC TCAGCTGTTC TCCACTGTCG TTGTGAAAGT ATCACTAGAA 2940 
GATGTTAATG ACAACCCACC TACATTTATT CCACCTAATT ATCGTGTGAA AGTCCGAGAG 3000 
GATCTTCCAG AAGGAACCGT CATCATGTGG TTAGAAGCCC ACGATCCTGA TTTAGGTCAG 3060 
TCTGGTCAGG TGAGATACAG CCTTCTGGAC CACGGAGAAG GAAACTTCGA TGTGGATAAA 3120 
CTCAGTGGAG CAGTTAGGAT CGTCCAGCAG TTGGACTTTG AGAAGAAGCA AGTGTATAAT 3180 
CTCACTGTGA GGGCCAAAGA CAAGGGAAAG CCAGTTTCTC TGTCTTCTAC TTGCTATGTT 3240 
GAAGTTGAGG TGGTTGATGT GAATGAGAAC CTGCACCCAC CCGTGTTTTC CAGCTTTGTG 3300 
GAAAAGGGGA CAGTGAAAGA AGATGCACCT GTTGGTTCAT TGGTAATGAC GGTGTCGGCT 3360 
CATGATGAGG AGGGCGGAAG AGATGGGGAG ATCCGATACT CCATTAGAGA TGGCTCTGGC 3420 
GTTGGTGTTT TCAAAATAGG TGAAGAGACA GGTGTCATAG AGACGTCAGA TCGACTGGAC 3480 
CGTGAATCGA CCTCCCATTA TTGGCTAACA GTCTTTGCAA CCGATCAGGG TGTCGTGCCT 3540 
CTTTCATCGT TCATAGAGAT CTACATAGAG GTTGAGGATG TCAATGACAA TGCACCACAG 3600 
ACATCAGAGC CTGTTTATTA CCCAGAAATC ATGGAAAATT CTCCTAAAGA TGTATCTGTG 3660 
GTCCAGATCG AGGCATTTGA TCCAGATTCG AGCTCTAATG ACAAGCTCAT GTACAAAATT 3720 
ACAAGTGGAA ATCCACAAGG ATTCTTTTCA ATACATCCTA AAACAGGTCT CATCACAACT 3780 
ACGTCAAGGA AGCTAGACCG AGAACAGCAA GATGAACACA TATTAGAGGT TACTGTGACA 3840 
GACAATGGTA GTCCCCCCAA ATCAACCATT GCAAGAGTCA TTGTGAAAAT CCTTGATGAA 3900 
AATGACAACA AACCTCAGTT TCTGCAAAAG TTCTACAAAA TCAGACTCCC TGAGCGGGAA 3960 
AAGCCAGACC GAGAAAGAAA TGCCAGACGG GAGCCGCTCT ATCGCGTCAT AGCCACCGAC 4020 
AAGGATGAGG GCCCCAATGC AGAAATCTCC TACAGCATCG AAGACGGGAA TGAGCATGGC 4080 
AAATTTTTCA TCGAACCGAA AACTGGAGTG GTTTCGTCCA AGAGGTTTTC AGCAGCTGOA 4140 
GAATATGATA TTCTTTCAAT TAAGGCAGTT GACAATGGTC GCCCTCAAAA GTCATCAACC 4200 
ACCAGACTCC ATATTGAATG GATCTCCAAG CCCAAACAGT CCCTGGAGCC CATTTCATTT 4260 
GAAGAATCAT TTTTTACCTT TACTGTGATG GAAAGTGACC CCGTTGCTCA CATGATTGGA 4320 
GTAATATCTG TGGAGCCTCC TGGCATACCC CTTTGGTTTG ACATCACTGG TGGCAACTAC 4380 
GACAGTCACT TCGATGTGGA CAAGGGAACT GGAACCATCA TTGTTGCCAA ACCTCTTGAT 4440 
GCAGAACAGA AGTCAAACTA CAACCTCACA GTCGAGGCTA CAGATGGAAC CACCACTATC 4500 
CTCACTCAGG TATTCATCAA AGTAATAGAC ACAAATGACC ATCGTCCTCA GTTTTCTACA 4560 
TCAAAGTATG AAGTTGTTAT TCCTGAAGAT ACAGCGCCAG AAACAGAAAT TTTGCAAATC 4620 
AGTGCTGTGG ATCAGGATGA GAAAAACAAA CTAATCTACA CTCTGCAGAG CAGTAGAGAT 4680 
CCACTGAGTC TCAAGAAATT TCGTCTTGAT CCTGCAACCG GCTCTCTCTA TACTTCTGAG 4740 
AAACTGGATC ATGAAGCTGT TTCACCAGCA CACCTCACGG TCATGGTACG AGATCAAGAT 4800 
GTGCCTGTAA AACGCAACTT TGCAAGGATT GTGGTCAATG TCAGCGACAC GAATGACCAC 4860 
GCCCCGTGGT TCACCGCTTC CTCCTACAAA GGGCGGGTTT ATGAATCGGC AGCCGTTGGC 4920 
TCAGTTGTGT TGCAGGTGAC GGCTCTGGAC AAGGACAAAG GGAAAAATGC TGAAGTGCTG 4980 
TACTCGATCG AGTCAGGAAA TATTGGAAAT ATTGGAAATT CTTTTATGAT TGATCCTGTC 5040 
TTGGGCTCTA TTAAAACTGC CAAAGAATTA GATCGAAGTA ACCAAGCGGA GTATGATTTA 5100 
ATGGTAAAAG CTACAGATAA GGGCAGTCCA CCAATGAGTG AAATAACTTC TGTGCGTATC 5160

TTTGTCACAA TTGCTGACAA CGCCTCTCCG AAGTTTACAT CAAAAGAATA TTCTGTTGAA 5220 

CTTAGTGAAA CTGTCAGCAT TGGGAGTTTC GTTGGGATGG TTACAGCCCA TAGTCAATCA 5280 

TCAGTGGTGT ATGAAATAAA AGATGGAAAT ACAGGTGATG CTTTTGATAT TAATCCACAT 5340

TCTGGAACTA TCATCACTCA GAAAGCCCTG GACTTTGAAA CTTTGCCCAT TTACACATTG 5400 

ATAATACAAG GAACTAACAT GGCTGGTTTG TCCACTAATA CAACGGTTCT AGTTCACTTG 5460 

CAGGATGAGA ATGACAACGC GCCAGTTTTT ATGCAGGCAG AATATACAGG ACTCATTAGT 5520 

GAATCAGCCT CAATTAACAG CGTGGTCCTA ACAGACAGGA ATGTCCCACT GGTGATTCGA 5580 

GCAGCTGATG CTGATAAAGA CTCAAATGCT TTGCTTGTAT ATCACATTGT TGAACCATCT 5640 

GTACACACAT ATTTTGCTAT TGATTCTAGC ACTGGTGCTA TTCATACAGT ACTAAGTCTG 5700 

GACTATGAAG AAACAAGTAT TTTTCACTTT ACCGTCCAAG TGCATGACAT GGGAACCCCA 5760 

OGTTTATTTG CTGAGTATGC AGCGAATGTA ACAGTACATG TAATTGACAT TAATGACTGC 5820 

CCCCCTGTGT TTGCCAAGCC ATTATATGAA GCATCTCTTT TGTTACCAAC ATACAAAGGA 5880 

GTAAAAGTCA TCACAGTAAA TGCTACAGAT GCTGATTCAA GTGCATTCTC ACAGTTGATT 5940 

TACTCCATCA CCGAAGGCAA CATCGGGGAG AAGTTTTCTA TGGACTACAA GACTGGTGCT 6000 

CTCACTGTCC AAAACACAAC TCAGTTAAGA AGCCGCTACG AGCTAACOGT TAGAGCTTCC 6060 

GATGGCAGAT TTGCOGGCCT TACCTCTGTC AAAATTAATG TGAAAGAAAG CAAAGAAAGT 6120 

CACCTAAAGT TTACCCAGGA TGTCTACTCT GCGGTAGTGA AAGAGAATTC CACCGAGGCC 6180 

GAAACATTAG CTGTCATTAC TGCTATTGGG AGTCCAATCA ATGAGCCTTT GTTTTATCAC 6240 

ATCCTCAACC CAGATCGCAG ATTTAAAATA AGCCGCACTT CAGGGGTTCT GTCAACCACT 6300 

GGCAGGCCCT TCGATCGTGA GCAGCAGGAG GCGTTTGATG TGGTTGTAGA AGTGATAGAG 6360 

GAACATAAGC CTTCTGCAGT GGCCCACGTT GTCGTGAAGG TCATTGTAGA AGACCAAAAT 6420 

GATAATGCGC CGGTGTTTGT CAACCTTCCC TACTACGCCG TTGTTAAAGT GGACACTGAG 6480 

GTGGGCCATG TCATTCGCTA TGTCACTGCT GTAGACAGAG ACAGTGGCAG AAACGGGGAA 6540 

GTGCATTACT ACCTCAAGGA ACATCATGAA CACTTTCAAA TTGGACCCTT GGGTGAAATT 6600 

TCACTGAAAA AGCAATTTGA GCTTGACACC TTAAATAAAG AATATCTTGT TACAGTGGTT 6660 

GCAAAAGATG GAGGGAACCC GGCCTTTTCA GCGGAAGTTA TCGTTCCGAT CACTGTCATG 6720 

AATAAAGCCA TGCCTGTGTT TGAAAAACCT TTCTACAGTG CAGAGATTGC AGAGAGCATC 6780 

CAGGTGCACA GCCCTGTGGT CCACGTGCAG GCTAACAGCC CGGAAGGCCT GAAAGTGTTC 6840 

TACAGCATCA CAGAOGGAGA CCCTTTCAGC CAGTTCACTA TTAACTTCAA TACTGGAGTT 6900 

ATCAATGTCA TAGCTCCTCT GGACTTTGAG GCCCACCCGG CATATAAGCT GAGCATACGC 6960 

GCAACTGACT CCTTGACGGG CGCTCATGCT GAAGTATTTG TGGACATCAT AGTAGACGAC 7020 

ATCAATGATA ACCCTCCTGT GTTTGCTCAG CAGTCTTATG CGGTGACCCT GTCTGAGGCA 7080 

TCTGTAATTG GAACGTCTGT TGTTCAAGTT AGAGCCACCG ATTCTGATTC AGAACCAAAT 7140 

AGAGGAATCT CATACCAGAT GTTTGGGAAT CACAGCAAGA GTCATGATCA TTTTCATGTA 7200 

GACAGCAGCA CTGGCCTCAT CTCACTACTC AGAACCCTGG ATTACGAGCA GTCCCGGCAG 7260 

CACACGATTT TTGTGAGGGC AGTTGATGGT GGTATGCCCA OGCTGAGCAG TGATGTGATT 7320 

GTCAGGGTGG AOGTTACCGA CCTCAATGGT AATCCACCAC TCTTTGAACA ACAGATTTAT 7380 

GAAGCCAGAA TTAGCGAGCA CGCCCCTCAT GGGCATTTOG TGACCTGTGT AAAAGCCTAT 7440 

GATGCAGACA GTTCAGACAT AGACAAGTTG CAGTATTCCA TTCTGTCTGG CAATGATCAT 7500 

AAACATTTTG TCATTGACAG TGCAACAGGG ATTATCACCC TCTCAAACCT GCACCGGCAC 7560 

GCCCTGAAGC CATTTTACAG TCTTAACCTG TCAGTGTCTG ATGGAGTTTT TAGAAGTTCC 7620 
ACCCAGGTTC ATGTAACTGT AATTGGAGGC AATTTGCACA GTCCTGCTTT CCTTCAGAAC 7680 
GAATATGAAG TGGAACTAGC TGAAAACGCT CCCCTACATA CCCTGGTGAT GGAGGTGAAA 7740 
ACTACGGATG GGGATTCTGG TATTTATGGT CACGTTACTT ACCATATTGT AAATGACTTT 7800 
GCCAAAGACA GATTTTACAT AAATGAGAGA GGACAGATAT TTACTTTGGA AAAACTTGAT 7860 
CGAGAAACCC CGGCGGAGAA AGTGATCTCA GTCCGTTTAA TGGCTAAGGA TGCTGGAGGA 7920 
AAAGTTGCTT TCTGCACCGT GAATGTCATC CTTACAGATG ACAATGACAA TGCACCACAA 7980 
TTTOGAGCAA CCAAATACGA AGTGAATATC GGGTCCAGTG CTGCTAAAGG GACTTCAGTC 8040 
GTAAAGTCTG CAAGTGATGC CGATGAGGGC TCCAATGCCG ACATCACCTA TGCCATTGAA 8100 
GCAGACTCTG AAAGTGTAAA AGAGAATTTG GAAATTAACA AACTGTCOGG CGTAATCACT 8160 
ACAAAGGAGA GCCTCATTGG CTTGGAAAAT GAATTCTTCA CTTTCTTTGT TAGAGCTGTG 8220 
GATAATGGGT CTCCATCAAA AGAATCTGTT GTTCTTGTCT ATGTTAAAAT CCTTCCACCG 8280 
GAAATGCAGC TTCCAAAATT TTCAGAACCT TTCTATACCT TTACAGTGTC AGAGGACGTG 8340 
CCTGTTGGAA CAGAGATAGA TCTCATCCGA GCAGAACATA GTGGGACTGT TCTTTACAGC 8400 
CTGGTGAAAG GGAATACTCC AGAAAGCAAT AGGGATGAGT CCTTTGTGAT TGACAGACAG 8460 
AGCGGGAGAC TGAAGTTGGA GAAGAGTCTT GATCATGAGA CAACTAAGTG GTATCAGTTT 8520 
TCCAXACTGG CCAGGTGCAC TCAAGATGAC CATGAGATGG TGGCTTCTGT AGATGTTAGT 8580 
ATCCAAGTGA AAGATGCAAA TGACAACAGC COGGTCTTTG AATCTAGTCC ATATGAGGCA 8640 
TTCATTGTTG AAAACCTGCC AGGGGGAAGT AGAGTAATTC AGATCAGGGC ATCTGATGCT 8700 
GACTCAGGAA CCAACGGCCA AGTTATGTAT AGCCTGGATC AGTCACAAAG TGTGGAAGTC 8760 
ATTGAATCCT TTGCCATTAA CATGGAAACA GGCTGGATTA CAACTTTAAA GGAACTTGAC 8820 
CATGAAAAGA GAGACAATTA CCAGATTAAA GTGGTTGCAT CAGATCATGG TGAAAAGATC 8880 
CAGCTATCCT CCACAGCCAT TGTGGATGTT ACGGTCACCG ATGTCAACGA TAGTCCACCA 8940 
CGATTCACGG CCGAGATCTA TAAAGGGACT GTGAGTGAGG ATGACCCCCA AGGTGGGGTG 9000 
ATTGCCATCT TAAGTACCAC GGATGCTGAT TCTGAAGAGA TCAACAGACA AGTTACATAT 9060 
TTCATAACAG GAGGGGATCC TTTAGGACAG TTTGCOGTTG AAACTATACA GAATGAATGG 9120 
AAGGTATATG TGAAGAAACC TCTAGACAGG GAAAAAAGGG ACAATTACCT TCTTACTATC 9180 
ACGGCAACTG ATGGCACCTT CTCATCAAAA GCGATAGTTG AAGTGAAAGT TCTGGATGCA 9240 
AATGACAACA GTCCAGTTTG TGAAAAGACT TTATATTCAG ACACTATTCC TGAAGAOGTC 9300 
CTTCCTGGAA AATTGATCAT GCAGATCTCT GCTACAGACG CAGACATCCG CTCTAACGCT 9360 
GAAATTACTT ACACGTTATT GGGTTCAGGT GCAGAAAAAT TCAAACTAAA TCCAGACACA 9420 
GGTGAACTGA AAACGTCAAC CCCCCTTGAT CGTGAGGAGC AAGCTGTTTA TCATCTTCTC 9480 
GTCAGGGCCA CAGATGGAGG AGGAAGATTC TGCCAAGCCA GTATTGTCGT CACGCTAGAA 9540 
GATGTGAACG ATAACGCCCC CGAATTCTCT GCOGATCCTT ATGCCATCAC CGTGTTTGAA 9600 
AACACAGAGC OGGGAACGCT GCTGACAAGA GTGCAGGCCA CAGATGCGGA CGCAGGATTA 9660 
AATCGGAAGA TTTTATACTC ACTGATTGAC TCTGCTGATG GGCAGTTCTC CATTAACGAA 9720 
TTATCTGGAA TTATTCAGTT AGAAAAACCT TTGGACAGAG AACTCCAGGC AGTATACACC 9780 
CTCTCTTTGA AAGCTGTGGA TCAAGGCTTG CCAAGGAGGC TGACTGCCAC TGGCACTGTG 9840 
ATTGTATCAG TTCTTGACAT AAATGACAAC CCCCCTGTGT TTGAGTACCG TGAATATGGT 9900 
GCCACCGTGT CTGAGGACAT TCTTGTTGGA ACTGAAGTTC TTCAAGTGTA TGCAGCAAGT 9960 
CGGGATATTG AAGCAAATGC AGAAATCACC TACTCAATAA TAAGTGGAAA TGAACATGGG 10020 
AAATTCAGCA TAGATTCTAA AACAGGGGCC GTATTTATCA TTGAGAATCT GGATTATGAG 10080 
AGCTCTCATG AGTATTACCT AACAGTAGAG GCCACTGATG GAGGCACGCC TTCACTGAGC 10140 
GACGTTGCCA CTGTGAACGT TAATGTAACA GATATCAACG ATAATACCCC TGTGTTCAGC 10200

CAAGACACCT ACACGACAGT CATCAGTGAA GATGCCGTTC TTGAGCAGTC TGTCATCACG 10260 

GTTATGGCCG ATGATGCCGA TGGACCTTCC AACAGCCACA TCCACTACTC AATTATAGAT 10320 

GGCAACCAAG GAAGCTOGTT CACAATTGAC CCCGTCAGGG GAGAAGTCAA AGTGACCAAA 10380 

CTTCTCGACC GAGAAAOGAT TTCAGGTTAC ACGCTCACGG TTCAAGCTTC TGATAATGGC 10440

AGTCCACCCA GAGTCAACAC GACGACCGTG AACATCGATG TGTCCGATGT CAATGACAAC 10500 

GCGCCCGTCT TCTCCAGGGG AAACTACAGT GTCATTATCC AGGAAAATAA GCCAGTGGGC 10560 

TTCAGCGTGC TGCAGCTGGT AGTAACAGAT GAGGATTCTT CCCATAAOGG TCCACCCTTC 10620 

TTCTTTACTA TTGTAACTGG AAATGATGAG AAGGCTTTTG AAGTTAACCC GCAAGGAGTC 10680 

CTCCTGACAT CATCTGCCAT CAAGAGGAAG GAGAAAGATC ATTACTTACT GCAGGTGAAG 10740

GTGGCAGATA ATGGAAAGCC TCAGTTGTCA TCTTTGACAT ACATTGACAT TAGGGTAATT 10800 

GAGGAGAGCA TCTATCCGCC TGCGATTTTG CCCCTGGAGA TTTTCATCAC CTCTTCTGGA 10860 

GAAGAATACT CAGGTGGCGT CATTGGGAAG ATCCATGCCA CAGACCAGGA CGTGTATGAT 10920

ACTCTAACCT ACAGTCTCGA CCCTCAGATG GACAACCTGT TCTCTGTTTC CAGCACAGGG 10980 

GGCAAGCTGA TAGCACACAA AAAGCTAGAC ATAGGGCAAT ACCTTCTCAA TGTCAGCGTA 11040

ACAGATGGGA AGTTCACGAC GGTGGCCGAC ATCACAGTGC ATATCAGACA AGTCACACAG 11100 

GAGATGTTGA ACCACACCAT CGCGATCCGC TTTGCCAACC TCACTCCGGA AGAATTCGTT 11160 

GGTGACTACT GGOGCAACTT CCAGCGAGCT TTACGGAACA TCCTGGGTGT GAGGAGGAAC 11220 

GACATACAGA TTGTTAGTTT GCAGTCCTCT GAACCTCACC CACATCTGGA CGTCTTACTT 11280 

TTTGTAGAGA AACCAGGTAG TGCTCAGATC TCAACAAAAC AACTTCTGCA CAAGATTAAC 11340

TCTTCCGTGA CTGACATTGA GGAAATCATT GGAGTTAGGA TACTGAATGT ATTCCAGAAA 11400 

CTCTGCGCGG GACTGGACTG CCCCTGGAAG TTCTGCGATG AAAAGGTGTC TGTGGATGAA 11460 

AGTGTGATGT CAACACACAG CACAGCCAGA CTGAGTTTTG TGACTCCCCG CCACCACAGG 11520 

GCAGCGGTGT GTCTCTGCAA AGAGGGAAGG TGCCCACCTG TCCACCATGG CTGTGAAGAT 11580 

GATCCGTGCC CTGAGGGATC CGAATGTGTG TCTGATCCCT GGGAGGAGAA ACACACCTGT 11640

GTCTGTCCCA GCGGCAGGTT TGGTCAGTGC CCAGGGAGTT CATCTATGAC ACTGACTGGA 11700 

AACAGCTACG TGAAATACCG TCTGACGGAA AATGAAAACA AATTAGAGAT GAAACTGACC 11760 

ATGAGGCTCA GAACATATTC CACGCATGCG GTTGTCATGT ATGCTCGAGG AACTGACTAT 11820 

AGCATCTTGG AGATTCATCA TGGAAGGCTG CAGTACAAGT TTGACTGTGG AAGTGGCCCT 11880

GGAATTGTCT CTGTTCAGAG CATTCAGGTC AATGATGGGC AGTGGCACGC AGTGGCCCTG 11940

GAAGTGAATG GAAACTATGC TCGCTTGGTT CTAGACCAAG TTCATACTGC ATCGGGCACA 12000 

GCCCCAGGGA CTCTGAAAAC CCTGAACCTG GATAACTATG TGTTTTTTGG TGGCCACATC 12060 

CGTCAGCAGG GAACAAGGCA TGGAAGAAGT CCTCAAGTTG GTAATGGTTT CAGGGGTTGT 12120 

ATGGACTCCA TTTATTTGAA TGGGCAGGAG CTCCCTTTAA ACAGCAAACC CAGAAGCTAT 12180 

GCACACATCG AAGAGTCGGT GGATGTATCT CCAGGCTGCT TCCTGACGGC CAOGGAAGAC 12240

TGCGCCAGCA ACCCTTGCCA GAATGGAGGC GTTTGCAATC CGTCACCTGC TGGAGGTTAT 12300 

TACTGCAAAT GCAGTGCCTT GTACATAGGG ACCCACTGTG AGATAAGCGT CAATCCGTGT 12360 

TCCTCCAACC CATGCCTCTA TGGGGGCACG TGTGTTGTOG ACAACGGAGG CTTTGTTTGC 12420 

CAGTGTAGAG GATTATATAC TGGTCAGAGG TGTCAGCTTA GTCCATACTG CAAAGATGAA 12480

CCCTGTAAGA ATGGCGGAAC ATGCTTTGAC AGTTTGGATG GCGCOGTTTG TCAGTGTGAT 12540

TCGGGTTTTA GGGGAGAAAG GTGTCAGAGT GATATGGACG AGTGCTCTGG AAACCCTTGC 12600 

CTGCACGGGG CCCTCTGTGA GAACACGCAC GGCTCCTATC ACTGCAACTG CAGCCACGAG 12660 

TACAGGGGAC GTCACTGCGA GGATGCTGCG CCCAACCAGT ATGTGTCCAC GCCGTGGAAC 12720 

ATTGGGTTGG CGGAAGGAAT TGGAATCGTT GTGTTTGTTG CAGGGATATT TTTACTGGTG 12780 

GTGGTGTTTG TTCTCTGOCG TAAGATGATT AGTCGGAAAA AGAAGCATCA GGCTGAACCT 12840

AAAGACAAGC ACCTGGGACC CGCTACGGCT TTCTTGCAAA GACCGTATTT TGATTCCAAG 12900 

CTAAATAAGA ACATTTACTC AGACATACCA CCCCAGGTGC CTGTCCGGCC TATTTCCTAC 12960 

ACCCCGAGTA TTCCAAGTGA CTCAAGAAAC AATCTGGACC GAAATTCCTT CGAAGGATCT 13020 

GCTATCCCAG AGCATCCCGA ATTCAGCACT TTTAACCCCG AGTCTGTGCA CGGGCACCGA 13080 

AAAGCAGTGG CGGTCTGCAG CGTGGCGCCA AACCTGCCTC CCCCACCCCC TTCAAACTCC 13140

CCTTCTGACA GOGACTCCAT CCAGAAGCCT AGCTGGGACT TTGACTATGA CACAAAAGTG 13200 

GTGGATCTTG ATCCCTGTCT TTCCAAGAAG CCTCTAGAGG AAAAGCCTTC CCAGCCATAC 13260 

AGTGCCCGGG AAAGCCTGTC TGAAGTGCAG TCCCTGAGCT CCTTCCAGTC CGAATCGTGC 13320 

GATGACAATG GGTATCACTG GGATACATCA GATTGGATGC CAAGCGTTCC TCTGCCGGAC 13380 

ATACAAGAGT TCCCCAACTA TGAGGTGATT GATGAGCAGA CACCCCTGTA CTCAGCAGAT 13440

CCAAACGCCA TCGATACGGA CTATTACCCT GGAGGCTACG ACATCGAAAG TGATTTTCCT 13500 

CCACCCCCAG AAGACTTCCC OGCAGCTGAT GAGCTACCAC CGTTACCGCC CGAATTCAGC 13560 

AATCAGTTTG AATCCATCCA CCCTCCTAGA GACATGCCTG COGOGGGTAG CTTGGGTTCT 13620 

TCATCAAGAA ACCGGCAGAG GTTCAACTTG AATCAGTATT TGCCCAATTT TTATCCCCTC 13680 

GATATGTCTG AACCTCAAAC AAAAGGCACT GGTGAGAATA GTACTTGTAG AGAACCCCAT 13740

GCCCCTTACC CGCCAGGGTA TCAAAGACAC TTCGAGGCGC CCGCTGTCGA GAGCATGCCC 13800 

ATGTCTGTGT ACGCCTCCAC CGCCTCCTGC TCTGACGTGT CAGCCTGCTG CGAAGTGGAG 13860 

TCCGAGGTCA TGATGAGTGA CTATGAGAGC GGGGACGACG GCCACTTCGA AGAGGTGACG 13920 

ATCCCGCCCC TGGATTCCCA GCAGCACACG GAAGTCTGAC TCTCAACTCC CCCCAAAGTG 13980

CCTGACTTTA GTGAACCTAG AGGTGATGTG AGTAATCCGC GCTGTTCTTT GCAGCAGTGC 14040

TTCCAAGCTT TTTTTGGTGA GCCGAATGGG CATGGCTGCG CTGGATCCTG CGCCTCTGGA 14100 

CGTGCTAGCC ATTTCCAGTG TCCCAACTAC TGTCATCGTG AGGTTTTCAT CGGCTGTGCC 14160 

ATTTCCCAAC GTCTTTTGGG ATTTACATCT GTCTGTGTTA AAATAATCAA ACGAAAAATC 14220 

AGTCCTGTGT TGTCAGCATG ATTCATGTAT TTATATAGAT TTGATTATTT TAATTTTCCT 14280 

GTCTCTTTTT TTTGTAAATT TTATGTACAG ATTTGATTTT TCATAGTTTT AACTAGATTT 14340

CCAAGATATT TTGTGCATTT GTTTCAACTG AATTTTGGTG GTGTCAGTGC CATTATCTAG 14400 

CACCCTGATT TTTTTTTTTT TACTATAACC AGGGTTTCAT TCTGTCTTTT TCCACTGAAG 14460 

TGTGACATTT TGTTAGTACA TTTCAGTGTA GTCATTCATT TCTAGCTGTA CATAGGATGA 14520 

AGGAGAGATC AGATACATGA ACATGTCTTA CATGGGTTGC TGTATTTAGA ATTATAAACA 14580

TTTTTCATTA TTGGAAAGTG TAACGGGGAC CTTCTGCATA CCTGTTTAGA ACCAAAACCA 14640

CCATGACACA GTTTTTATAG TGTCTGTATA TTTGTGATGC AATGGTCTTG TAAAGGTTTT 14700 

TAATGAAAAC TACCATTAGC CAGTCTTTCT TACTGACAAT AAATTATTAA TAAAAT 14756 
GTGGTGTTTG CTTTCTCCAC CAGAAGGGCA CACTTTCATC TAATTTGGGG TATCACTGAG 60

CTGAAGACAA AGAGAAGGGG GAGAAAACCT AGCAGACCAC CATGTGCTAT GGGAAGTGTG 120 

CACGATGCAT CGGACATTCT CTGGTGGGGC TCGCCCTCCT GTGCATCGCG GCTAATATTT 180 

TGCTTTACTT TCCCAATGGG GAAACAAAGT ATGCCTCCGA AAACCACCTC AGCCGCTTCG 240 

TGTGGTTCTT TTCTGGCATC GTAGGAGGTG GCCTGCTGAT GCTCCTGCCA GCATTTGTCT 300

TCATTGGGCT GGAACAGGAT GACTGCTGTG GCTGCTGTGG CCATGAAAAC TGTGGCAAAC 360 

GATGTGCGAT GCTTTCTTCT GTATTGGCTG CTCTCATTGG AATTGCAGGA TCTGGCTACT 420 

GTGTCATTGT GGCAGCCCTT GGCTTAGCAG AAGGACCACT ATGTCTTGAT TCCCTCGGCC 480 

AGTGGAACTA CACCTTTGCC AGCACCGAGG GCCAGTACCT TCTGGATACC TCCACATGGT 540

CCGAGTGCAC TGAACCCAAG CACATTGTGG AATGGAATGT ATCTCTGTTT TCTATCCTCT 600

TGGCTCTTGG TGGAATTGAA TTCATCTTGT GTCTTATTCA AGTAATAAAT GGAGTGCTTG 660 

GAGGCATATG TGGCTTTTGC TGCTCTCACC AACAGCAATA TGACTGCTAA AAGAACCAAC 720 

CCAGGACAGA GCCACAATCT TCCTCTATTT CATTGTAATT TATATATTTC ACTTGTATTC 780 

ATTTGTAAAA CTTTGTATTA GTGTAACATA CTCCCCACAG TCTACTTTTA CAAAOGCCTG 840 

TAAAGACTGG CATCTTCACA GGATGTCAGT GTTTAAATTT AGTAAACTTC TTTTTTGTTT 900

GTTTATTTGT TTTTGTTTTT TTTTAAGGAA TGAGGAAACA AACCACCCTC TGGGGGTAGT 960 

TTACAGACTG AGTGACAGTA CTCAGTATAT CTGAGATAAA CTCTATAATG TTTTGGATAA 1020 

AAATAACATT CCAATCACTA TTGTATATAT GTGCATGTAT TTTTTAAATT AAAGATGTCT 1080 

AGTTGCTTTT TATAAGACCA AGAAGGAGAA AATCCGACAA CCTGGAAAGA TTTTTGTTTT 1140 

CACTGCTTGT ATGATGTTTC CCATTCATAC ACCTATAAAT CTCTAACAAG AGGCCCTTTG 1200

AACTGCCTTG TGTTCTGTGA GAAACAAATA TTTACTTAGA GTGGAAGGAC TGATTGAGAA 1260 

TGTTCCAATC CAAATGAATG CATCACAACT TACAATGCTG CTCATTGTTG TGAGTACTAT 1320 

GAGATTCAAA TTTTTCTAAC ATATGGAAAG CCTTTTGTCC TCCAAAGATG AGTACTAGGG 1380 

ATCATGTGTT TAAAAAAAGA AAGGCTACGA TGACTGGGCA AGAAGAAAGA TGGGAAACTG 1440 

AATAAAGCAG TTGATCAGCA TCATTGGAAC ATGGGGACGA GTGACGGCAG GAGGACCACG 1500

AGGAAATACC CTCAAAACTA ACTTGTTTAC AACAAAATAA AGTATTCACT ACGAAAAAAA 1560 

AAAAAAAAAA AAAAAAAAAA AAA 1583 
GTTCTTTGTG ACACATCACA CAGAATTGGA GTGCTGTCCT TCTGGAGAGT GGTGGAGAAC 60

CAAGATACAG TTCAGAACCA AAGGAATAGA GAAGGGCTTT GATTTCTTTT TGGCTTTAGA 120 

TTGGGGATTT GGGAGGCTTA GCAGGAAAGA TGTCCACTGA AAATGTGGAA GGGAAGCCCA 180 

GTAACCTTGG GGAGAGAGGA AGAGCCCGGA GCTCCACTTT CCTCAGGGTT GTCCAGCCAA 240 

TGTTTAACCA CAGTATTTTC ACTTCTGCAG TCTCTCCTGC TGCAGAACGC ATCCGATTCA 300 

TCTTGGGAGA GGAGGATGAC AGCCCAGCTC CCCCTCAGCT CTTCACGGAA CTGGATGAGC 360 

TGCTGGCCGT GGATGGGCAG GAGATGGAGT GGAAGGAAAC AGCCAGGTGG ATCAAGTTTG 420 

AAGAAAAAGT GGAACAGGGT GGGGAAAGAT GGAGCAAGCC CCATGTGGCC ACATTGTCCC 480 

TTCATAGTTT ATTTGAGCTG AGGACATGTA TGGAGAAAGG ATCCATCATG CTTGATCGGG 540 

AGGCTTCTTC TCTCCCACAG TTGGTGGAGA TGATTGTTGA CCATCAGATT GAGACAGGCC 600 

TATTGAAACC TGAACTTAAG GATAAGGTGA CCTATACTTT GCTCCGGAAG CACCGGCATC 660 

AAACCAAGAA ATCCAACCTT CGGTCCCTGG CTGACATTGG GAAGACAGTC TCCAGTGCAA 720 

GTAGGATGTT TACCAACCCT GATAATGGTA GCCCAGCCAT GACCCATAGG AATCTGACTT 780 

CCTCCAGTCT GAATGACATT TCTGATAAAC CGGAGAAGGA CCAGCTGAAG AATAAGTTCA 840 

TGAAAAAATT GCCACGTGAT GCAGAAGCTT CCAACGTGCT TGTTGGGGAG GTTGACTTTT 900 

TGGATACTCC TTTCATTGCC TTTGTTAGGC TACAGCAGGC TGTCATGCTG GGTGCCCTGA 960 

CTGAAGTTCC TGTGCCCACA AGGTTCTTGT TCATTCTCTT AGGTCCTAAG GGGAAAGCCA 1020 

AGTCCTACCA CGAGATTGGC AGAGCCATTG CCACCCTGAT GTCTGATGAG GTGTTCCATG 1080 

ACATTGCTTA TAAAGCAAAA GACAGGCACG ACCTGATTGC TGGTATTGAT GAGTTCCTAG 1140 

ATGAAGTCAT CGTCCTTCCA CCTGGGGAAT GGGATCCAGC AATTAGGATA GAGCCTCCTA 1200 

AGAGTCTTCC ATCCTCTGAC AAAAGAAAGA ATATGTACTC AGGTGGAGAG AATGTTCAGA 1260 

TGAATGGGGA TACGCCCCAT GATGGAGGTC ACGGAGGAGG AGGACATGGG GATTGTGAAG 1320 

AATTGCAGCG AACTGGACGG TTCTGTGGTG GACTAATTAA AGACATAAAG AGGAAAGCGC 1380 

CATTTTTTGC CAGTGATTTT TATGATGCTT TAAATATTCA AGCTCTTTCG GCAATTCTCT 1440 

TCATTTATCT GGCAACTGTA ACTAATGCTA TCACTTTTGG AGGACTGCTT GGGGATGCCA 1500 

CTGACAACAT GCAGGGCGTG TTGGAGAGTT TCCTGGGCAC TGCTGTCTCT GGAGCCATCT 1560 

TTTGCCTTTT TGCTGGTCAA CCACTCACTA TTCTGAGCAG CACCGGACCT GTCCTAGTTT 1620 

TTGAGAGGCT TCTATTTAAT TTCAGCAAGG ACAATAATTT TGACTATTTG GAGTTTCGCC 1680 

TTTGGATTGG CCTGTGGTCC GCCTTCCTAT GTCTCATTTT GGTAGCCACT GATGCCAGCT 1740 

TCTTGGTTCA ATACTTCACA CGTTTCACGG AGGAGGGCTT TTCCTCTCTG ATTAGCTTCA 1800 

TCTTTATCTA TGATGCTTTC AAGAAGATGA TCAAGCTTGC AGATTACTAC CCCATCAACT 1860 

CCAACTTCAA AGTGGGCTAC AACACTCTCT TTTCCTGTAC CTGTGTGCCA CCTGACCCAG 1920 

CTAATATCTC AATATCTAAT GACACCACAC TGGCCCCAGA GTATTTGCCA ACTATGTCTT 1980 

CTACTGACAT GTACCATAAT ACTACCTTTG ACTGGGCATT TTTGTCGAAG AAGGAGTGTT 2040 

CAAAATACGG AGGAAACCTT GTCGGGAACA ACTGTAATTT TGTTCCTGAT ATCACACTCA 2100 

TGTCTTTTAT CCTCTTCTTG GGAACCTACA CCTCTTCCAT GGCTCTGAAA AAATTCAAAA 2160 

CTAGTCCTTA TTTTCCAACC ACAGCAAGAA AACTGATCAG TGATTTTGCC ATTATCTTGT 2220 

CCATTCTCAT CTTTTGTGTA ATAGATGCCC TAGTAGGCGT GGACACCCCA AAACTAATTG 2280 

TGCCAAGTGA GTTCAAGCCA ACAAGTCCAA ACCGAGGTTG GTTCGTTCCA CCGTTTGGAG 2340 

AAAACCCCTG GTGGGTGTGC CTTGCTGCTG CTATCCCGGC TTTGTTGGTC ACTATACTGA 2400 

TTTTCATGGA CCAACAAATT ACAGCTGTGA TTGTAAACAG GAAAGAACAT AAACTCAAGA 2460 

AAGGAGCAGG GTATCACTTG GATCTCTTTT GGGTGGCCAT CCTCATGGTT ATATGCTCCC 2520 

TCATGGCTCT TCCGTGGTAT GTAGCTGCTA CGGTCATCTC CATTGCTCAC ATCGACAGTT 2580 

TGAAGATGGA GACAGAGACT TCTGCACCTG GAGAACAACC AAAGTTTCTA GGAGTGAGGG 2640 

AACAAAGAGT CACTGGAACC CTTGTGTTTA TTCTGACTGG TCTGTCAGTC TTTATGGCTC 2700 

CCATCTTGAA GTTTATACCC ATGCCTGTAC TCTATGGTGT GTTCCTGTAT ATGGGAGTAG 2760 

CATCCCTTAA TGGTGTGCAG TTCATGGATC GTCTGAAGCT GCTTCTGATG CCTCTGAAGC 2820 

ATCAGCCTGA CTTCATCTAC CTGCGTCATG TTCCTCTGCG CAGAGTCCAC CTGTTCACTT 2880 

TCCTGCAGGT GTTGTGTCTG GCCCTGCTTT GGATCCTCAA GTCAACGGTG GCTGCTATCA 2940 

TTTTTCCAGT AATGATCTTG GCACTTGTAG CTGTCAGAAA AGGCATGGAC TACCTCTTCT 3000 
CCCAGCATGA CCTCAGCTTC CTGGATGATG TCATTCCAGA AAAGGACAAG AAAAAGAAGG 3060

AGGATGAGAA GAAAAAGAAA AAGAAGAAGG GAAGTCTGGA CAGTGACAAT GATGATTCTG 3120 

ACTGCCCATA CTCAGAAAAA GTTCCAAGTA TTAAAATTCC AATGGACATC ATGGAACAGC 3180

AACCTTTCCT AAGCGATAGC AAACCTTCTG ACAGAGAAAG ATCACCAACA TTCCTTGAAC 3240 

GCCACACATC ATGCTGATAA AATTCCTTTC CTTCAGTCAC TCGGTATGCC AAGTCCTCCT 3300

AGAACTCCAG TAAAAGTTGC CTCAAATTAG ACTAGAACTT GAACCTGAAG ACAATGATTA 3360 

TTTCTGGAGG AGCAAGGGAA CAGAAACTAC ATTGTAACCT GTTTGTCTTT CTTAAAACTG 3420 

ACATTTGTTG TTAATGTCAT TTGTTTTTGT TTGGCTGTTT GTTTATTTTT TAACTTTTAT 3480 

TTCGTCTCAG TTTTTGGTCA CAGGCCAAAT AATACAGCGC TCTCTCTGCT TCTCTCTTGC 3540 

ATAGATACAA TCAAGACAAT AGTGCACOGT TCCTTAAAAA CAGCATCTGA GGAATCCCCC 3600

TTTTGTTCTT AAACTTTCAG ATGTGTCCTT TGATAACCAA ATTCTGTCAC TCAAGACACA 3660 

GACACCCACA GACCCTGTCC TTTGCCTCTA TTAAGCAGAG GATGGAAGTA TTAAGGATTT 3720 

TGTAACACCT TTTATGAAAA TGTTGAAGGA ACTTAAAACT TTAGCTTTGG AGCTGTGCTT 3780 

ACTGGCTTGT CTTTGTCTGG TAGAACAAAC CTTGACCTCC AGACAGAGTC CCTTCTCACT 3840 

TATAGAGCTC TCCAGGACTG GAAAAAGTGC TGCTATTTTA ACTTGCTCTT GCTTGTAAAT 3900

CCTAATCTTA GAGTTATCAA AAGAAGAAAA AACTGAAGGT ACTTTACTCC CTATAGAGAA 3960

ACCATTGCCA TCATTGTAGC AAGTGCTGGA ATGTCCCTTT TTTCCTATGC AACTTTTTTA 4020 

TAACCCTTTA ATGAACTTAT CTGTGGAGTA CATTGAAGAA TATTTTTCTT CCTAGATTTT 4080 

GTTGTTTAAA TTATGGGGCC TAACCTGCCA CTTATTTTTT GTCAATTTTT AAAACTTTTT 4140 

TTTAATTACT GTAAAGAAAA TGAATTTTTT CCTGCAGCAG GAAACATAGT TTTCAGTAGT 4200

TCTACCTCTT ATTTGTAGCT GCCAGGCTTT CTGTAAAAAT TGTATTGTAT ATAATGTGAT 4260 

TTTTACACAT ACATACACAC ACAAATACAC AATCTCTAGG GTAAGCCAGA AGGCAAGATC 4320 

AGATTAAAAA CACCATGTTT CTAAGCATCC ATTTTTCCCT TTCTTTAAAA GAAACTTAAC 4380 

TGTTCTATGA AGGAGATTGA GGGAGAAGAG ACAAACTCCT ATGTCATGAG AATAACCGAT 4440 

GTTCTGATAA TAGTAGCATC TAGGTACAGA TGCTGGTTGT ATTACCACGT CAATGTCCTA 4500

TGCAGTATTG TTAGACATTT TCTCATTTTG AAATATTTGT GTGTTTGTGT ATGTGCTCTG 4560 

TGCCATGGCT GGTGTATATA TGTGCAATGT TAGAAGGCAA AAGAGTGATG GTAGGCAGAG 4620 

GGCAAAGTCA TTGAATCTCT TATGCCAGTT TTCATAAAAC CCAAACCACA TATGAAAAAA 4680 

TCCATTAAGG GTCCAAGAAG TCTGTCCATA TGAAAATGAG GGTAAATATA GTTTATTTCC 4740 

CAGGTATCAG TCATTATAAT TGATATAATA GCTCTAACAT GCAATATAAA ATTCATAGGA 4800

GTATTAATAG CCCATTTACA CATCTATAAA ATGTAATGGG ATTGCAGAGC TGCAGAGTAC 4860 

AGTGTAACAG TACTCTCATG CAATTTTTTT CAGGATGCAA AGGCAATTAT TCTTTGTAAG 4920 

CGGGACATTT AGATATATTT GTGTACATAT TATATGTATG TATATTTCAA AGTACCACAC 4980 

TGAAAATTAG ACATTTATTA ACCAAATTTA AOGTGGTATT TAAAGGTAAT ATTTTTAATA 5040 

TGATACATTA CATATTGTGA ATGTATACTA AAAAAACATT TTAAATGTTA AAATTATAAT 5100

TTCAGATTCA TATAACCACA ACTGTGATAT ATCCTAACTA TAACCAGTTG TTGAGGGGTA 5160 

TACTAGAAGC AGAATGAAAC CACATTTTTT GGTTTGATAA TATGCACTTA TTGACTCCCA 5220

CTCATTGTTA TGTTAATTAA GTTATTATTC TGTCTCCTTG TAATTTTGAT TACAAAAATT 5280 

TTATTATCCT GAGTTAGCTG TTACTTTTAC AGTACCTGAT ACTCCTAAAA CTTTTAACTT 5340 

ATACAAATTA GTCAATAATG ACCCCAATTT TTTCATTAAA ATAATAGTGG TGAATTATAT 5400

GTTATTGTGT TAAAACCTCA CTTGCCAAAT TCTGGCTTCA CATTTGTATT TAGGGCTATC 5460 

CTTAAAATGA TGAGTCTATA TTATCTAGCT TTCTATTACC CTAATATAAA CTGGTATAAG 5520 

AAGACTTTCC TTTTTTCTTT ATGCATGGAA GCATCAATAA ATTGTTTAAA AACCATGTAT 5580 

AGTAAATTCA GCTTAACCCG TGATCTTCTT AAGTTAAAGG TACTTTTGTT TTATAAAAGC 5640 

TCTAGATAAA ACTTTCTTTT CTGATCATGA ATCAAGTATC TGTGGTTTCA TGCCCCTCTC 5700

TATACCTTTC AAAGAACTCC TGAAGCAACT TAACTCATCA TTTCAGCCTC TGAGTAGAGG 5760 

TAAAACCTAT GTGTACTTCT GTTTATGATC CATATTGATA TTTATGACAT GAACACAGAA 5820 

TAGTACCTTA CATTTGCTAA ACAGACAGTT AATATCAAAT CCTTTCAATA TTCTGGGAAC 5880 

CCAGGGAAGT TTTTAAAAAT GTCATTACTT TCAAAGGAAC AGAAGTAGTT AACCAAACTA 5940 

ACAAGCAAAA CCTGAGGTTT ACCTAGTGAC ACCAAATTAT CGGTATTTTA ACTGAATTTA 6000

CCCATTGACT AAGAATGAAC CGGATTTGGT GGTGGTTTTG TTTCTATGCA AACTGGACAC 6060 

AAATTACAAC AGTAAATTTT TTTATAAGTG CTTCTCCCTT CTCCATGATG TGACTTCCGG 6120 

AGATAAAGGA TTCAAAAGAT AAAGACAAAG TACGCTCAGA GTTGTTAACC AGAAAGTCCT 6180 

GGCTGTGGTT GCAGAAACAC TGTTGGAAGA AAAGAGATGA CTAAGTCAAG TGTCTGCCTT 6240 

ATCAAAAGAG CAAAAATGCC TCTGGTTTTG TGTTTGGGAG AAAAATATCT TGGACGCACT 6300

GTTTTCCTTG ATAAAAGTCA TCTTCTCTAC TGTGTGAAAT GAATACTTGG AATTCTAATT 6360 

GTTTTGTGTG CCAGGGGCAG TAATGTCCCT GCCTCTTCTC CCAATCAAGG TTGAGGAGTG 6420 

GGGCTGGGGA GAGGACTTAA CTGACTTAAG AAGTAGGAAA ACAAAAACCT CTCTCCTCAG 6480 

CCTTCCACCT CCAAGAGAGG AGGAAAAACA GTTGTCTGCT GTCTGTAATT CAGTTTGCGT 6540 

GTATTTTATG CTCATGCACC AACCCATACA GAGTAAATCT TTTATCAACT ATATACTGGT 6600

GTTTAATAGA GAATGATTGT CTTCCGAGTT TTTTGGTTCC TTTTTTAACT GTGTTAAAGT 6660 

ACTTGAAATG TATTGACTGC TGACTATATT TTAAAAACAA AATGAAATAA TTTGAGTTGT 6720 

ATTACAGAGG TTGACATTGT TCAGGGATGG GACAAAGCCT TCTTCAATCC TTTTCATACT 6780 

ACTTAATGAT TTTGGTGCAG GAACCTGAGA TTTTCTGATT TATATTTCAT GATATTTCAC 6840

ATTTGCTCTT CACAGCATGA GCATGAAGCC CAGTGGCACC AAATGGCTGG GTACAATCAA 6900

GTGATATTTT GTAGCACCTC ACTATCTGAA AGGCCATGAG TTTTCAGATG ATTTCATTGA 6960 

GCTTCATTGC AGCCTGAAAT TTTAAAAAAG TTGTGTAATA CGCCAACCAG TCAAGTTGTG 7020 

TTTTGGCCAG AGATTTAGAT ATGTCCAATT TCCTGGCTCA TTTCATTGTG CTCTATGGGT 7080 

ACGTATAAAA AGCAAGAATT CTGTTTCCTA GGCAAACATT GCAACTCAGG GCTAAAGTCA 7140 

TCCAGTGAAA CTTTTAGAGC CAGAAGTAAC TTTGTCCCAG TCCTACAATG TGAAAAGAGT 7200

GAATAGTTGC CTCTTTTTAG CCATTTTCAT GGCTGGTACA TATTCGTACG CATTACTTTT 7260 

CAGAATCAAT ACGCACTTTC AGATATTCTT ATTTTTATTC TCTTAAGTCT TTATTAACTT 7320 

TGGAGAGAGA AATGATGCAT CTTTTTATTT TAAATGAAGT AGATCAACAT GGTGGAACAA 7380 

AATGATAAAG AACAGAAAAC ATTTCAATAT ATTACTAATA ACTTTTTCCA ATATAAATCC 7440

TAAAATTCCT ATAACATAGT ATTTTACAGT TTTATGAAGC TTTCTATTGT GACTTTTATG 7500

GAATTAAGAG ATGAAGAAGA TGAGATATTT TAGCATTTAT ATTTTTCAAA ATTATATGTA 7560 

TACTTAAAAA TAAAGTAACT TTATGC 7586
CGGCACAGCC TCACACCTGA ACGCTGTCCT CCCGCAGACG AGACCGGCGG GCACTGCAAA
GCTGGGACTC GTCTTTGAAG GAAAAAAAAT AGCGAGTAAG AAATCCAGCA CCATTCTTCA 
CTGACCCATC CCGCTGCACC TCTTGTTTCC CAAGTTTTTG AAAGCTGGCA ACTCTGACCT 
CGGTGTCCAA AAATCGACAG CCACTGAGAC CGGCTTTGAG AAGCCGAAGA TTTGGCAGTT 
TCCAGACTGA GCAGGACAAG GTGAAAGCAG GTTGGAGGCG GGTCCAGGAC ATCTGAGGGC 
TGACCCTGGG GGCTCGTGAG GCTGCCACCG CTGCTGCCGC TACAGACCCA GCCTTGCACT 
CCAAGGCTGC GCACCGCCAG CCACTATCAT GTCCACTCCC GGGGTCAATT CGTCCGCCTC 
CTTGAGCCCC GACCGGCTGA ACAGCCCAGT GACCATCCCG GCGGTGATGT TCATCTTCGG 
GGTGGTGGGC AACCTGGTGG CCATCGTGGT GCTGTGCAAG TCGCGCAAGG AGCAGAAGGA 
GACGACCTTC TACACGCTGG TATGTGGGCT GGCTGTCACC GACCTGTTGG GCACTTTGTT 
GGTGAGCCCG GTGACCATCG CCACGTACAT GAAGGGCCAA TGGCCCGGGG GCCAGGCGCT 
GTGCGAGTAC AGCACCTTCA TTCTGCTCTT CTTCAGCCTG TCCGGCCTCA GCATCATCTG 
CGCCATGAGT GTCGAGCGCT ACCTGGCCAT CAACCATGCC TATTTCTACA GCCACTACGT 
GGACAAGCGA TTGGCGGGCC TCACGCTCTT TGCAGTCTAT GCGTCCAACG TGCTCTTTTG 
CGCGCTGCCC AACATGGGTC TCGGTAGCTC GCGGCTGCAG TACCCAGACA CCTGGTGCTT 
CATCGACTGG ACCACCAACG TGACGGCGCA CGCOGCCTAC TCCTACATGT ACGOGGGCTT 
CAGCTCCTTC CTCATTCTCG CCACCGTCCT CTGCAACGTG CTTGTGTGCG GCGGGCTGCT 
CCGCATGCAC CGCCAGTTCA TGCGCCGCAC CTOGCTGGGC ACCGAGCAGC ACCACGCGGC 
CGCGGCCGCC TCGGTTGCCT CCCGGGGCCA CCCCGCTGCC TCCCCAGCCT TGCCGOGCCT 
CAGCGACTTT CGGCGCCGCC GGAGCTTCCG CCGCATCGCG GGCGCCGAGA TCCAGATGGT 
CATCTTACTC ATTGCCACCT CCCTGGTGGT GCTCATCTGC TCCATCCCGC TCGTGGTGCG 
AGTATTCGTC AACCAGTTAT ATCAGCCAAG TTTGGAGCGA GAAGTCAGTA AAAATCCAGA 
TTTGCAGGCC ATCCGAATTG CTTCTGTGAA CCCCATCCTA GACCCCTGGA TATATATCCT 
CCTGAGAAAG ACAGTGCTCA GTAAAGCAAT AGAGAAGATC AAATGCCTCT TCTGCCGCAT 
TGGCGGGTCC CGCAGGGAGC GCTCCGGACA GCACTGCTCA GACAGTCAAA GGACATCTTC 
TGCCATGTCA GGCCACTCTC GCTCCTTCAT CTCCCGGGAG CTGAAGGAGA TCAGCAGTAC 
ATCTCAGACC CTCCTGCCAG ACCTCTCACT GCCAGACCTC AGTGAAAATG GCCTTGGAGG 
CAGGAATTTG CTTCCAGGTG TGCCTGGCAT GGGCCTGGCC CAGGAAGACA CCACCTCACT 
GAGGACTTTG CGAATATCAG AGACCTCAGA CTCTTCACAG GGTCAGGACT CAGAGAGTGT 
CTTACTGGTG GATGAGGCTG GTGGGAGCGG CAGGGCTGGG CCTGCCCCTA AGGGGAGCTC 
CCTGCAAGTC ACATTTCCCA GTGAAACACT GAACTTATCA GAAAAATGTA TATAATAGGC 
AAGGAAAGAA ATACAGTACT GTTTCTGGAC CCTTATAAAA TCCTGTGCAA TAGACACATA 
CATGTCACAT TTAGCTGTGC TCAGAAGGGC TATCATCA 

Seq ID NO: C164 DNA Sequence

Nucleic Acid Accession #: NM_002659.1

Coding sequence: 427..1434
1 11 21 31 41 51

| | | | | |

CAGTATCCCT CCTGACAAAA CTAACAAAAA TCCTGTTAGC CAAATAATCA GCCACATTCA

TATTTACCGT CAAAGTTTTT ATCCTCATTT TACAGCAGTG GAGAGCGATT GCCCCGGGTC 
CCACGTTAGG AAGAGAGAGA ACTGGGATTT GCACCCAGGC AATCTGGGGA CAGAGCTGTG 
ATCACAACTC CATGAGTCAG GGCCGAGCCA GCCCCTTCAC CACCAGCCGG CCGCGCCCCG 
GGAAGGAAGT TTGTGGCGGA GGAGGTTCGT AOGGGAGGAG GGGGAGGCGC CCACGCATCT 
GGGGCTGACT CGCTCTTTCG CAAAACGTCT GGGAGGAGTC CCTGGGGCCA CAAAACTGCC 
TCCTTCCTGA GGCCAGAAGG AGAGAAGACG TGCAGGGACC CCGCGCACAG GAGCTGCCCT 
CGCGACATGG GTCACCCGCC GCTGCTGCCG CTGCTGCTGC TGCTCCACAC CTGCGTCCCA 
GCCTCTTGGG GCCTGCGGTG CATGCAGTGT AAGACCAACG GGGATTGCCG TGTGGAAGAG 
TGCGCCCTGG GACAGGACCT CTGCAGGACC ACGATCGTGC GCTTGTGGGA AGAAGGAGAA 
GAGCTGGAGC TGGTGGAGAA AAGCTGTACC CACTCAGAGA AGACCAACAG GACCCTGAGC 
TATCGGACTG GCTTGAAGAT CACCAGCCTT ACCGAGGTTG TGTGTGGGTT AGACTTGTGC 
AACCAGGGCA ACTCTGGCCG GGCTGTCACC TATTCCCGAA GCCGTTACCT CGAATGCATT 
TCCTGTGGCT CATCAGACAT GAGCTGTGAG AGGGGCCGGC ACCAGAGCCT GCAGTGCCGC 
AGCCCTGAAG AACAGTGCCT GGATGTGGTG ACCCACTGGA TCCAGGAAGG TGAAGAAGGG 
CGTCCAAAGG ATGACCGCCA CCTCCGTGGC TGTGGCTACC TTCCCGGCTG CCCGGGCTCC 
AATGGTTTCC ACAACAACGA CACCTTCCAC TTCCTGAAAT GCTGCAACAC CACCAAATGC 
AACGAGGGCC CAATCCTGGA GCTTGAAAAT CTGCCGCAGA ATGGCCGCCA GTGTTACAGC 
TGCAAGGGGA ACAGCACCCA TGGATGCTCC TCTGAAGAGA CTTTCCTCAT TGACTGCCGA 
GGCCCCATGA ATCAATGTCT GGTAGCCACC GGCACTCACG AACCGAAAAA CCAAAGCTAT 
ATGGTAAGAG GCTGTGCAAC CGCCTCAATG TGCCAACATG CCCACCTGGG TGACGCCTTC 
AGCATGAACC ACATTGATGT CTCCTGCTGT ACTAAAAGTG GCTGTAACCA CCCAGACCTG 
GATGTCCAGT ACCGCAGTGG GGCTGCTCCT CAGCCTGGCC CTGCCCATCT CAGCCTCACC 
ATCACCCTGC TAATGACTGC CAGACTGTGG GGAGGCACTC TCCTCTGGAC CTAAACCTGA 
AATCCCCCTC TCTGCCCTGG CTGGATCCGG GGGACCCCTT TGCCCTTCCC TCGGCTCCCA 
GCCCTACAGA CTTGCTGTGT GACCTCAGGC CAGTGTGCCG ACCTCTCTGG GCCTCAGTTT 
TCCCAGCTAT GAAAACAGCT ATCTCACAAA GTTGTGTGAA GCAGAAGAGA AAAGCTGGAG 
GAAGGCCGTG GGCAATGGGA GAGCTCTTGT TATTATTAAT ATTGTTGCCG CTGTTGTGTT 
GTTGTTATTA ATTAATATTC ATATTATTTA TTTTATACTT ACATAAAGAT TTTGTACCAG 
TGG 

Seq ID NO: C165 DNA Sequence
Nucleic Acid Accession #: AK027843.1
Coding sequence: 193..1731



1 11 21 31

| | | |

TTGCTTGAGT CATCTTCTGA AGCTTTAAAA ACAATTGATG

CTAAATAGCA CATCACATGT GAATATTACA ACTCGGAACT

CTGTTACCAG GGACAAATGC AATTTCAAAT TTTAGCATTG

TCGTATTTCC AGATGGATTT TGAGAGTGGA CAAGTGGATC

CCTCCAAACT TACTTGAGAA TTTAAGTCCA GAAGATTCTG

TTTACTTTCT TCAACAAAAC TGGACTTTTC CAGGATGTAG

GTGAGTTATG TGATGGCGTG CAGTATTGGA AACATTACTA

41
TGGCTCTCAG CGTATCATCC
GTCTTCCAAG CAATAATGAA 
CACTGGCATC TGTAATTTTG 
TATTAGTTAG AAGAGCACAG 
GACCCCAAAG AAAAACTTTA 
GTTCAAATAA AAATCAAACA TACAAGAACT CAGGAAGTGC ATCATCCCAT CTGTGCCTTC 480

TGGGATCTGA ACAAAAACAA AAGTTTTGGA GGATGGAACA CGTCAGGATG TGTTGCACAC 540 

AGAGATTCAG ATGCAAGTGA GACAGTCTGC CTGTGTAACC ACTTCACACA CTTTGGAGTT 600 

CTGATGGACC TTCCAAGAAG TGCCTCACAG TTAGATGCAA GAAACACTAA AGTCCTCACT 660 

TTCATCAGCT ATATTGGGTG TGGAATATCT GCTATTTTTT CAGCAGCAAC TCTCCTGACA 720

TATGTTGCTT TTGAGAAATT GOGAAGGGAT TATCCCTCCA AAATCTTGAT GAACCTGAGC 780 

ACAGCCCTGC TGTTCCTGAA TCTCCTCTTC CTCCTAGATG GCTGGATCAC CTCCTTCAAT 840 

GTGGATGGAC TTTGCATTGC TGTTGCAGTC CTGTTGCATT TCTTCCTTCT GGCAACCTTT 900 

ACCTGGATGG GGCTAGAAGC AATTCACATG TACATTGCTC TAGTTAAAGT ATTTAACACT 960 

TACATTCGCC GATACATTCT AAAATTCTGC ATCATTGGCT GGGGTTTGCC TGCCTTAGTG 1020

GTGTCAGTTG TTCTAGCGAG CAGAAACAAC AATGAAGTCT ATGGAAAAGA AAGTTATGGG 1080 

AAAGAAAAAG GTGATGAATT CTGTTGGATT CAAGATCCAG TCATATTTTA TGTGACCTGT 1140 

GCTGGGTATT TTGGAGTCAT GTU TTTTCT G AACATTGCCA TGTTCATTGT GGTAATGGTG 1200 

CAGATCTGTG GGAGGAATGG CAAGAGAAGC AACOGGACCC TGAGAGAAGA AGTGTTAAGG 1260 

AACCTGCGCA GTGTGGTTAG CTTGACCTTT CTGTTGGGCA TGACATGGGG TTTTGCATTC 1320

TTTGCCTGGG GACCCTTAAA TATCCCCTTC ATGTACCTCT TCTCCATCTT CAATTCATTA 1380 

CAAGGCTTAT TTATATTCAT CTTCCACTGT GCTATGAAGG AGAATGTTCA GAAACAGTGG 1440 

CGGCGGCATC TCTGCTGTGG TAGATTTCGG TTAGCAGATA ACTCAGATTG GAGTAAGACA 1500 

GCTACCAATA TCATCAAGAA AAGTTCTGAT AATCTAGGAA AATCTTTGTC TTCAAGCTCC 1560 

ATTGGTTCCA ACTCAACCTA TCTTACATCC AAATCTAAAT CCAGCTCTAC CACCTATTTC 1620

AAAAGGAATA GCCACACAGA TAATGTCTCC TATGAGCATT CCTTCAACAA AAGTGGATCA 1680 

CTCAGACAGT GCTTCCATGG ACAAGTCCTT GTCAAAACTG GCCCATGCTG ATGGAGATCA 1740 

AACATCAATC ATCCCTGTCC ATCAGGTCAT TGATAAGGTC AAGGGTTATT GCAATGCTCA 1800 

TTCAGACAAC TTCTATAAAA ATATTATCAT GTCAGACACC TTCAGCCACA GCACAAAGTT 1860

TTAATGTCTT TAAGAAAAAG AAATCAATCT GCAGAAATGT GAAGATTTGC AAGCAGTGTA 1920

AACTGCAACT AGTGATGTAA ATGTGCTATT ACCTAGGTAA CTGCATATAT ATAAGGAATG 1980 

TATTTTGTTA AGAAGGCTTT TGTGAAATTC AGAATTTTTC TTTTTAATAT ATTTCTTCCA 2040 

TGGAAGAGTT GTCATCACTA AAACTTCAGT ACTGAGAGTA ACATGACTCA GTAGCCACAG 2100 

AAGCTATGAT TTGTAAAATA TATAATTGAA TCAGAGTAAT CATAATGCAG GGGAGACATT 2160 

CAAATTAGAG ACAAGGGAGA AGCAATGCTG AGGAAGACCC TAGATAGAGC TCATTTTACT 2220

CCACCTAATC GTTATATCTG GATATACCCA TTTTCTGCAT CTTCTTTCTC AACAATAAAC 2280 

TGTCCTTGCT TTGGAGACTT TAAGACATTT CCTAAAGCAC AAATAAAAGC CTCGTATTTC 2340 

CCCATTGAGA GTTTTGTTCC AAGGAATATG AAGTGAGACA TATGGGTGAG TCATAATAAT 2400 

CAAAATAATT TATGAAGAGC TGGGTCTGCA ATAGCTAGTC TAAAAACTAC TTGTGTGTCA 2460 

GTCCTCTGGT TATAGTATAT AAGAGCCTGA GGAGGTCTGG CAAGATAGAT GGTGTATTAT 2520

TTATGGATCA GGCTGCTGCA TACAAACCTT GCATACTATT ATGCAGCTTA CCTAACTCTC 2580 

AGACTATTCT GAGTAATGCT TGCTTGCTAA TGAATGTATA GGAGACCACA TTGTAATTGT 2640 

TCTTAGATGA TGGAGTCCAT GCAGTTTCTT AGAAATCGGT CTCAGTGCAT GCTGTGCTTT 2700 

TTCACATTTG CTCTGGGTTA TCTGGGAAGT ATCAGGTTCT GGGAGGCAAC AGCATTAAGT 2760 

GATAAGAAAA GGAGACATTC TGGCAAAGCC AATCTGCTTA AAGGCAAAGT CCAGAACCTG 2820

GAACCTAGAG GCCTTTCTCT CTGCAOGAAA AACAGGTAGT TTGCAGTCTG AGATATGGGA 2880

GAGCTTTTAG GCTACACAGC AACCCAAGGG ACCTCTCACC TTTTGCTGAG CTTCAATCAG 2940 

GAAGCTATTT GCCTGGCTCC AGCAGATGAT GAGATAATGA GGTAGTGGGT TTTTTATTAC 3000 

TGTTCCATTT TGCAACATCC TGCAACACCA TCCTGGGAGA CAAGAGCATT ACCCAGCTTG 3060

GCTTTCACGG GGGAGGGTTG TATTCAGT 3088
Seq ID NO: C166 DNA Sequence

Nucleic Acid Accession #: NM_000574.1

Coding sequence: 66..1211

1 11 21 31 41 51 

| | | | | |

CCGCTGGGCG TAGCTGCGAC TCGGCGGAGT CCCGGCGGCG CX3TCCTTGTT CTAACCCGGC 60 

GCGCCATGAC CGTCGCGCGG CCGAGCGTGC CCGCGGCGCT GCCCCTCCTC GGGGAGCTGC 120 

CCCGGCTGCT GCTGCTGGTG CTGTTGTGCC TGCCGGCCGT GTGGGGTGAC TGTGGCCTTC 180 

CCCCAGATGT ACCTAATGCC CAGCCAGCTT TGGAAGGCCG TACAAGTTTT CCCGAGGATA 240 

CTGTAATAAC GTACAAATGT GAAGAAAGCT TTGTGAAAAT TCCTGGCGAG AAGGACTCAG 300 

TGATCTGCCT TAAGGGCAGT CAATGGTCAG ATATTGAAGA GTTCTGCAAT CGTAGCTGCG 360 

AGGTGCCAAC AAGGCTAAAT TCTGCATCCC TCAAACAGCC TTATATCACT CAGAATTATT 420 

TTCCAGTCGG TACTGTTGTG GAATATGAGT GCCGTCCAGG TTACAGAAGA GAACCTTCTC 480 

TATCACCAAA ACTAACTTGC CTTCAGAATT TAAAATGGTC CACAGCAGTC GAATTTTGTA 540 

AAAAGAAATC ATGCCCTAAT CCGGGAGAAA TACGAAATGG TCAGATTGAT GTACCAGGTG 600 

GCATATTATT TGGTGCAACC ATCTCCTTCT CATGTAACAC AGGGTACAAA TTATTTGGCT 660 

CGACTTCTAG TTTTTGTCTT ATTTCAGGCA GCTCTGTCCA GTGGAGTGAC CCGTTGCCAG 720 

AGTGCAGAGA AATTTATTGT CCAGCACCAC CACAAATTGA CAATGGAATA ATTCAAGGGG 780 

AAOGTGACCA TTATGGATAT AGACAGTCTG TAACGTATGC ATGTAATAAA GGATTCACCA 840 

TGATTGGAGA GCACTCTATT TATTGTACTG TGAATAATGA TGAAGGAGAG TGGAGTGGCC 900 

CACCACCTGA ATGCAGAGGA AAATCTCTAA CTTCCAAGGT CCCACCAACA GTTCAGAAAC 960 

CTACCACAGT AAATGTTCCA ACTACAGAAG TCTCACCAAC TTCTCAGAAA ACCACCACAA 1020 

AAACCACCAC ACCAAATGCT CAAGCAACAC GGAGTACACC TGTTTCCAGG ACAACCAAGC 1080 

ATTTTCATGA AACAACCCCA AATAAAGGAA GTGGAACCAC TTCAGGTACT ACCCGTCTTC 1140 

TATCTGGGCA CACGTGTTTC ACGTTGACAG GTTTGCTTGG GACGCTAGTA ACCATGGGCT 1200 

TGCTGACTTA GCCAAAGAAG AGTTAAGAAG AAAATACACA CAAGTATACA GACTGTTCCT 1260 

AGTTTCTTAG ACTTATCTGC ATATTGGATA AAATAAATGC AATTGTGCTC TTCATTTAGG 1320 

ATGCTTTCAT TGTCTTTAAG ATGTGTTAGG AATGTCAACA GAGCAAGGAG AAAAAAGGCA 1380 

GTCCTGGAAT CACATTCTTA GCACACCTAC ACCTCTTGAA AATAGAACAA CTTGCAGAAT 1440 

TGAGAGTGAT TCCTTTCCTA AAAGTGTAAG AAAGCATAGA GATTTGTTCG TATTTAGAAT 1500 

GGGATCACGA GGAAAAGAGA AGGAAAGTGA TTTTTTTCCA CAAGATCTGT AATGTTATTT 1560

CCACTTATAA AGGAAATAAA AAATGAAAAA CATTATTTGG ATATCAAAAG CAAATAAAAA 1620 

CCCAATTCAG TCTCTTCTAA GCAAAATTGC TAAAGAGAGA TGAACCACAT TATAAAGTAA 1680 

TCTTTGGCTG TAAGGCATTT TCATCTTTCC TTCGGGTTGG CAAAATATTT TAAAGGTAAA 1740 

ACATGCTGGT GAACCAGGGG TGTTGATGGT GATAAGGGAG GAATATAGAA TGAAAGACTG 1800 

AATCTTCCTT TGTTGGACAA ATAGAGTTTG GAAAAAGCCT GTGAAAGGTG TCTTCTTTGA 1860 

CTTAATGTCT TTAAAAGTAT CCAGAGATAC TACAATATTA ACATAAGAAA AGATTATATA 1920 
TTATTTCTGA ATCGAGATGT CCATAGTCAA ATTTGTAAAT CTTATTCTTT TGTAATATTT 1980

ATTTATATTT ATTTATGACA GTGAACATTC TGATTTTACA TGTAAAACAA GAAAAGTTGA 2040 

AGAAGATATG TGAAGAAAAA TGTATTTTTC CTAAATAGAA ATAAATGATC CCATTTTTTG 2100 

GT 2102 
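
The records in this listing share one layout: rows of up to six ten-base groups followed by a running position counter, with the coding sequence given in the header as a 1-based, inclusive range (for example 66..1211). The following is a minimal Python sketch, not part of the original listing, of how such a record might be read back into a single string and its coding region sliced out. The function names and the toy record are hypothetical and invented for illustration; the sketch assumes the caller passes only the sequence rows of one record (an OCR ruler made of letters such as "I" would otherwise be read as bases).

```python
def parse_listing(lines):
    """Concatenate the base groups of one record.

    Tokens without any letter (position counters such as "60", ruler bars,
    stray punctuation) are skipped; everything else is kept and upper-cased.
    """
    groups = []
    for line in lines:
        for token in line.split():
            if not any(ch.isalpha() for ch in token):
                continue  # counter, ruler bar, or punctuation debris
            groups.append(token.upper())
    return "".join(groups)


def suspect_positions(sequence):
    """Return 1-based positions whose character is not A, C, G, T or N.

    OCR damage (for example the letter O standing in for a base) shows up here.
    """
    return [i for i, base in enumerate(sequence, start=1) if base not in "ACGTN"]


def coding_region(sequence, start, end):
    """Slice a 1-based, inclusive range as written in a 'Coding sequence' header."""
    return sequence[start - 1:end]


def codons(cds):
    """Split a coding region into complete triplets, dropping any remainder."""
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


# Toy record, invented for illustration (not taken from the listing).
toy_lines = [
    "GCGCCATGAC CGTCGCGTAG AAAAAAAAAA 30",
    "",
    "GT 32",
]
toy = parse_listing(toy_lines)
toy_cds = coding_region(toy, 6, 20)
print(len(toy), toy_cds, codons(toy_cds))
```

Running suspect_positions on a parsed record before slicing is a cheap way to see whether extraction damage falls inside the coding range; a group with a missing or extra character shifts every later position, so the counters at the end of each row remain the reference.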

Seq ID NO: C167 DNA Sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..2651

1 11 21 31 41 51 

| | | | | |

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120 

GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180

CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300

GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420 

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660 

AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720 

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1260 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1320 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG 0GTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860

ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCOGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 

CCATGTCTTA A 2651 

Seq ID NO: C168 DNA Sequence 

Nucleic Acid Accession #: NM_003667.2 

Coding sequence: 49..2772

1 11 21 31 41 51 

| | | | | |

TGCTGCTCTC CGCCCGCGTC CGGCTCGTGG CCCCCTACTT CGGGCACCAT GGACACCTCC 60 

CGGCTCGGTG TGCTCCTGTC CTTGCCTGTG CTGCTGCAGC TGGCGACCGG GGGCAGCTCT 120 

CCCAGGTCTG GTGTGTTGCT GAGGGGCTGC CCCACACACT GTCATTGCGA GCCCGACGGC 180 

AGGATGTTGC TCAGGGTGGA CTGCTCCGAC CTGGGGCTCT CGGAGCTGCC TTCCAACCTC 240 

AGCGTCTTCA CCTCCTACCT AGACCTCAGT ATGAACAACA TCAGTCAGCT GCTCCCGAAT 300 
CCCCTGCCCA GTCTCCGCTT CCTGGAGGAG TTACGTCTTG CGGGAAACGC TCTGACATAC 360 

ATTCCCAAGG GAGCATTCAC TGGCCTTTAC AGTCTTAAAG TTCTTATGCT GCAGAATAAT 420 

CAGCTAAGAC ACGTACCCAC AGAAGCTCTG CAGAATTTGC GAAGCCTTCA ATCCCTGCGT 480 
CTGGATGCTA ACCACATCAG CTATGTGCCC CCAAGCTGTT TCAGTGGCCT GCATTCCCTG 540 
AGGCACCTGT GGCTGGATGA CAATGCGTTA ACAGAAATCC CCGTCCAGGC TTTTAGAAGT 600 
TTATCGGCAT TGCAAGCCAT GACCTTGGCC CTGAACAAAA TACACCACAT ACCAGACTAT 660 
GCCTTTGGAA ACCTCTCCAG CTTGGTAGTT CTACATCTCC ATAACAATAG AATCCACTCC 720 
CTGGGAAAGA AATGCTTTGA TGGGCTCCAC AGCCTAGAGA CTTTAGATTT AAATTACAAT 780 
AACCTTGATG AATTCCCCAC TGCAATTAGG ACACTCTCCA ACCTTAAAGA ACTAGGATTT 840 
CATAGCAACA ATATCAGGTC GATACCTGAG AAAGCATTTG TAGGCAACCC TTCTCTTATT 900 
ACAATACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 960 
CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 1020 
ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 1080 
CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1140 
GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1200 
GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1260 
TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1320 
ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1380 
TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1440 
TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1500 
TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1560 
GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1620 
TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1680 
CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1740 
TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1800 
.......... ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1860 
CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1920 
GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1980 
ATTTTTGCTT CAGAATCATC .......... CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 2040 
TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 2100 
ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2160 
AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2220 
TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2280 
ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2340 
GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2400 
TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2460 
CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2520 
CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2580 
AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2640 
ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2700 
GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2760 
CCATGTCTCT AATTAATATG TGAAGGAAAA TGTTTTCAAA GGTTGAGAAC CTGAAAATGT 2820 
GAGATTGAGT ATATCAGAGC AGTAATTAAT AAGAAGAGCT GAGGTGAAAC TCGGTTTAAA 2880 



Seq ID NO: C169 DNA Sequence 

Nucleic Acid Accession #: NM_003506.1 

Coding sequence: 259..2379 

1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 

ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 
TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 

AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 

TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 3342 

Seq ID NO: C170 DNA Sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 88..990 



1 11 21 31 41 51 
I I I I I I 
GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 
GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 
CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 
CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 
GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 
AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 
TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 
TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 
GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 
GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 
CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 
GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 
TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 
CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 
TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 
CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 
CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 
TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 
ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 
TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA 1524 



Seq ID NO: C171 DNA Sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: 150..3362 

1 11 21 31 41 51 

I I I I I I 

AACTCCCGCC TCGGGACGCC TCGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 60 

GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 120 

CCTCAGCTCC TTTTCCTGAG CCCGCCGCGA TGGGAGCTGC GCGGGGATCC CCGGCCAGAC 180 

CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 240 

CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420 

ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 

AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 540 

TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 

ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGACGCTC CGGCCAGCTG 720 

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 

AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 

GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 1620 

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 1740 

AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 1800 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 2100 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 
AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2520 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 

TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 4187 

Seq ID NO: C172 DNA Sequence 

Nucleic Acid Accession #: NM_002309.2 

Coding sequence: 65..673 

1 11 21 31 41 51 

I I I I I I 

ATGAACCTCT GAAAACTGCC GGCATCTGAG GTTTCCTCCA AGGCCCTCTG AAGTGCAGCC 60 

CATAATGAAG GTCTTGGCGG CAGGAGTTGT GCCCCTGCTG TTGGTTCTGC ACTGGAAACA 120 

TGGGGCGGGG AGCCCCCTCC CCATCACCCC TGTCAACGCC ACCTGTGCCA TACGCCACCC 180 

ATGTCACAAC AACCTCATGA ACCAGATCAG GAGCCAACTG GCACAGCTCA ATGGCAGTGC 240 

CAATGCCCTC TTTATTCTCT ATTACACAGC CCAGGGGGAG CCGTTCCCCA ACAACCTGGA 300 

CAAGCTATGT GGCCCCAACG TGACGGACTT CCCGCCCTTC CACGCCAACG GCACGGAGAA 360 

GGCCAAGCTG GTGGAGCTGT ACCGCATAGT CGTGTACCTT GGCACCTCCC TGGGCAACAT 420 

CACCCGGGAC CAGAAGATCC TCAACCCCAG TGCCCTCAGC CTCCACAGCA AGCTCAACGC 480 

CACCGCCGAC ATCCTGCGAG GCCTCCTTAG CAACGTGCTG TGCCGCCTGT GCAGCAAGTA 540 

CCACGTGGGC CATGTGGACG TGACCTACGG CCCTGACACC TCGGGTAAGG ATGTCTTCCA 600 

GAAGAAGAAG CTGGGCTGTC AACTCCTGGG GAAGTATAAG CAGATCATCG CCGTGTTGGC 660 

CCAGGCCTTC TAGCAGGAGG TCTTGAAGTG TGCTGTGAAC CGAGGGATCT CAGGAGTTGG 720 

GTCCAGATGT GGGGGCCTGT CCAAGGGTGG CTGGGGCCCA GGGCATCGCT AAACCCAAAT 780 

GGGGGCTGCT GGCAGACCCC GAGGGTGCCT GGCCAGTCCA CTCCACTCTG GGCTGGGCTG 840 

TGATGAAGCT GAGCAGAGTG GAAACTTCCA TAGGGAGGGA GCTAGAAGAA GGTGCCCCTT 900 

CCTCTGGGAG ATTGTGGACT GGGGAGCGTG GGCTGGACTT CTGCCTCTAC TTGTCCCTTT 960 

GGCCCCTTGC TCACTTTGTG CAGTGAACAA ACTACACAAG TCATCTACAA GAGCCCTGAC 1020 

CACAGGGTGA GACAGCAGGG CCCAGGGGAG TGGACCAGCC CCCAGCAAAT TATCACCATC 1080 

TGTGCCTTTG CTGCCCCTTA GGTTGGGACT TAGGTGGGCC AGAGGGGCTA GGATCCCAAA 1140 

GGACTCCTTG TCCCCTAGAA GTTTGATGAG TGGAAGATAG AGAGGGGCCT CTGGGATGGA 1200 

AGGCTGTCTT CTTTTGAGGA TGATCAGAGA ACTTGGGCAT AGGAACAATC TGGCAGAAGT 1260 

TTCCAGAAGG AGGTCACTTG GCATTCAGGC TCTTGGGGAG GCAGAGAAGC CACCTTCAGG 1320 

CCTGGGAAGG AAGACACTGG GAGGAGGAGA GGCCTGGAAA GCTTTGGTAG GTTCTTCGTT 1380 

CTCTTCCCCG TGATCTTCCC TGCAGCCTGG GATGGCCAGG GTCTGATGGC TGGACCTGCA 1440 

GCAGGGGTTT GTGGAGGTGG GTAGGGCAGG GGCAGGTTGC TAAGTCAGGT GCAGAGGTTC 1500 

TGAGGGACCC AGGCTCTTCC TCTGGGTAAA GGTCTGTAAG AAGGGGCTGG GGTAGCTCAG 1560 

AGTAGCAGCT CACATCTGAG GCCCTGGGAG GTCTTGTGAG GTCACACAGA GGTACTTGAG 1620 

GGGGACTGGA GGCCGTCTCT GGTCCCCAGG GCAAGGGAAC AGCAGAACTT AGGGTCAGGG 1680 

TCTCAGGGAA CCCTGAGCTC CAAGCGTGCT GTGCGTCTGA CCTGGCATGA TTTCTATTTA 1740 

TTATGATATC CTATTTATAT TAACTTATTG GTGCTTTCAG TGGCCAAGTT AATTCCCCTT 1800 

TCCCTGGTCC CTACTCAACA AAATATGATG ATGGCTCCCG ACACAAGCGC CAGGGCCAGG 1860 

GCTTAGCAGG GCCTGGTCTG GAAGTCGACA ATGTTACAAG TGGAATAAGC TTACGGGTGA 1920 

AGCTCAGAGA AGGGTCGGAT CTGAGAGAAT GGGGAGGCCT GAGTGGGAGT GGGGGGCCTT 1980 

GCTCCACCCC CATCCCCTAC TGTGACTTGC TTTAGCGTGT CAGGGTCCAG GCTGCAGGGG 2040 

CTGGGCCAAT TTGTGGAGAG GCCGGGTGCC TTTCTGTCTT GCTTCCAGGG GGCTGGTTCA 2100 

CACTGTTCTT GGGCGCCCCA GCATTGTGTT GTGAGGCGCA CTGTTCCTGG CAGATATTGT 2160 

GCCCCCTGGA GCAGTGGGCA AGACAGTCCT TGTGGCCCAC CCTGTCCTTG TTTCTGTGTC 2220 

CCCATGCTGC CTCTGAAATA GCGCCCTGGA ACAACCCTGC CCCTGCACCC AGCATGCTCC 2280 

GACACAGCAG GGAAGCTCCT CCTGTGGCCC GGACACCCAT AGACGGTGCG GGGGGCCTGG 2340 

CTGGGCCAGA CCCCAGGAAG GTGGGGTAGA CTGGGGGGAT CAGCTGCCCA TTGCTCCCAA 2400 

GAGGAGGAGA GGGAGGCTGC AGACGCCTGG GACTCAGACC AGGAAGCTGT GGGCCCTCCT 2460 

GCTCCACCCC CATCCCACTC CCACCCATGT CTGGGCTCCC AGGCAGGGAA CCCGATCTCT 2520 

TCCTTTGTGC TGGGGCCAGG CGAGTGGAGA AACGCCCTCC AGTCTGAGAG CAGGGGAGGG 2580 

AAGGAGGCAG CAGAGTTGGG GCAGCTGCTC AGAGCAGTGT TCTGGCTTCT TCTCAAACCC 2640 

TGAGCGGGCT GCCGGCCTCC AAGTTCCTCC GACAAGATGA TGGTACTAAT TATGGTACTT 2700 

TTCACTCACT TTGCACCTTT CCCTGTCGCT CTCTAAGCAC TTTACCTGGA TGGCGCGTGG 2760 






WO 03/042661 



PCT/US02/36810 






GCAGTGTGCA GGCAGGTCCT GAGGCCTGGG GTTGGGGTGG AGGGTGCGGC CCGGAGTTGT 2820 

CCATCTGTCC ATCCCAACAG CAAGACGAGG ATGTGGCTGT TGAGATGTGG GCCACACTCA 2880 

CCCTTGTCCA GGATGCAGGG ACTGCCTTCT CCTTCCTGCT TCATCCGGCT TAGCTTGGGG 2940 

CTGGCTGCAT TCCCCCAGGA TGGGCTTCGA GAAAGACAAA CTTGTCTGGA AACCAGAGTT 3000 

GCTGATTCCA CCCGGGGGGC CCGGCTGACT CGCCCATCAC CTCATCTCCC TGTGGACTTG 3060 

GGAGCTCTGT GCCAGGCCCA CCTTGCGGCC CTGGCTCTGA GTCGCTCTCC CACCCAGCCT 3120 

GGACTTGGCC CCATGGGACC CATCCTCAGT GCTCCCTCCA GATCCCGTCC GGCAGCTTGG 3180 

CGTCCACCCT GCACAGCATC ACTGAATCAC AGAGCCTTTG CGTGAAACAG CTCTGCCAGG 3240 

CCGGGAGCTG GGTTTCTCTT CCCTTTTTAT CTGCTGGTGT GGACCACACC TGGGCCTGGC 3300 

CGGAGGAAGA GAGAGTTTAC CAAGAGAGAT GTCTCCGGGC CCTTATTTAT TATTTAAACA 3360 

TTTTTTTAAA AAGCACTGCT AGTTTACTTG TCTCTCCTCC CCATCGTCCC CATCGTCCTC 3420 

CTTGTCCCTG ACTTGGGGCA CTTCCACCCT GACCCAGCCA GTCCAGCTCT GCCTTGCCGG 3480 

CTCTCCAGAG TAGACATAGT GTGTGGGGTT GGAGCTCTGG CACCCGGGGA GGTAGCATTT 3540 

CCCTGCAGAT GGTACAGATG .......... AGAGTCATCT CTAGTTCCCC ACCTCAATCC 3600 

CGGCATCCAG CCTTCAGTCC GGCCCACGTG CTAGCTCCGT GGGCCCACCG TGCGGCCTTA 3660 

GAGGTTTCCC TCCTTCCTTT CCACTGAAAA GCACATGGCC TTGGGTGACA AATTCCTCTT 3720 

TGATGAATGT ACCCTGTGGG GATGTTTCAT ACTGACAGAT TATTTTTATT TATTCAATGT 3780 

CATATTTAAA ATATTTATTT TTTATACCAA ATGAATCACT TTTTTTTTTA AGAAAAAAAA 3840 

GAGAAATGAA TAAAGAATCT ACTCTTCG 3868 



Seq ID NO: C173 DNA Sequence 
Nucleic Acid Accession #: XM_097508 
Coding sequence: 44..2788 

1 11 21 31 41 51 

I I I I I I 

TGAAAGGCGG TTGTGGTGCA AAGGAAAACC CACAGGCCAA GGAATGGGAA GACCAAGGTT 60 

GACACTTGTT TGTCACGTGT CAATAATCAT CTCTGCCCGG GACCTCAGCA TGAACAACCT 120 

CACAGAGCTT CAGCCTGGCC TCTTCCACCA CCTGCGCTTC TTGGAGGAGC TGCGTCTCTC 180 

TGGGAACCAT CTCTCACACA TCCCAGGACA AGCATTCTCT GGTCTCTACA GCCTGAAAAT 240 

CCTGATGCTG CAGAACAATC AGCTGGGAGG AATCCCCGCA GAGGCGCTGT GGGAGCTGCC 300 

GAGCCTGCAG TCGCTGCGCC TAGATGCCAA CCTCATCTCC CTGGTCCCGG AGAGGAGCTT 360 

TGAGGGGCTG TCCTCCCTCC GCCACCTCTG GCTGGACGAC AATGCACTCA CGGAGATCCC 420 

TGTCAGGGCC CTCAACAACC TCCCTGCCCT GCAGGCCATG ACCCTGGCCC TCAACCGCAT 480 

CAGCCACATC CCCGACTACG CGTTCCAGAA TCTCACCAGC CTTGTGGTGC TGCATTTGCA 540 

TAACAACCGC ATCCAGCATC TGGGGACCCA CAGCTTCGAG GGGCTGCACA ATCTGGAGAC 600 

ACTAGACCTG AATTATAACA AGCTGCAGGA GTTCCCTGTG GCCATCCGGA CCCTGGGCAG 660 

ACTGCAGGAA CTGGGGTTCC ATAACAACAA CATCAAGGCC ATCCCAGAAA AGGCCTTCAT 720 

GGGGAACCCT CTGCTACAGA CGATACACTT TTATGATAAC CCAATCCAGT TTGTGGGAAG 780 

ATCGGCATTC CAGTACCTGC CTAAACTCCA CACACTATCT CTGAATGGTG CCATGGACAT 840 

CCAGGAGTTT CCAGATCTCA AAGGCACCAC CAGCCTGGAG ATCCTGACCC TGACCCGCGC 900 

AGGCATCCGG CTGCTCCCAT CGGGGATGTG CCAACAGCTG CCCAGGCTCC GAGTCCTGGA 960 

ACTGTCTCAC AATCAAATTG AGGAGCTGCC CAGCCTGCAC AGGTGTCAGA AATTGGAGGA 1020 

AATCGGCCTC CAACACAACC GCATCTGGGA AATTGGAGCT GACACCTTCA GCCAGCTGAG 1080 

CTCCCTGCAA GCCCTGGATC TTAGCTGGAA CGCCATCCGG TCCATCCACC CCGAGGCCTT 1140 

CTCCACCCTG CACTCCCTGG TCAAGCTGGA CCTGACAGAC AACCAGCTGA CCACACTGCC 1200 

CCTGGCTGGA CTTGGGGGCT TGATGCATCT GAAGCTCAAA GGGAACCTTG CTCTCTCCCA 1260 

GGCCTTCTCC AAGGACAGTT TCCCAAAACT GAGGATCCTG GAGGTGCCTT ATGCCTACCA 1320 

GTGCTGTCCC TATGGGATGT GTGCCAGCTT CTTCAAGGCC TCTGGGCAGT GGGAGGCTGA 1380 

AGACCTTCAC CTTGATGATG AGGAGTCTTC AAAAAGGCCC CTGGGCCTCC TTGCCAGACA 1440 

AGCAGAGAAC CACTATGACC AGGACCTGGA TGAGCTCCAG CTGGAGATGG AGGACTCAAA 1500 

GCCACACCCC AGTGTCCAGT GTAGCCCTAC TCCAGGCCCC TTCAAGCCCT GTGAGTACCT 1560 

CTTTGAAAGC TGGGGCATCC GCCTGGCCGT GTGGGCCATC GTGTTGCTCT CCGTGCTCTG 1620 

CAATGGACTG GTGCTGCTGA CCGTGTTCGC TGGCGGGCCT GTCCCCCTGC CCCCGGTCAA 1680 

GTTTGTGGTA GGTGCGATTG CAGGCGCCAA CACCTTGACT GGCATTTCCT GTGGCCTTCT 1740 

AGCCTCAGTC GATGCCCTGA CCTTTGGTCA GTTCTCTGAG TACGGAGCCC GCTGGGAGAC 1800 

GGGGCTAGGC TGCCGGGCCA CTGGCTTCCT GGCAGTACTT GGGTCGGAGG CATCGGTGCT 1860 

GCTGCTCACT CTGGCCGCAG TGCAGTGCAG CGTCTCCGTC TCCTGTGTCC GGGCCTATGG 1920 

GAAGTCCCCC TCCCTGGGCA GCGTTCGAGC AGGGGTCCTA GGCTGCCTGG CACTGGCAGG 1980 

GCTGGCCGCC GCGCTGCCCC TGGCCTCAGT GGGAGAATAC GGGGCCTCCC CACTCTGCCT 2040 

GCCCTACGCG CCACCTGAGG GTCAGCCAGC AGCCCTGGGC TTCACCGTGG CCCTGGTGAT 2100 

GATGAACTCC TTCTGTTTCC TGGTCGTGGC CGGTGCCTAC ATCAAACTGT ACTGTGACCT 2160 

GCCGCGGGGC GACTTTGAGG CCGTGTGGGA CTGCGCCATG GTGAGGCACG TGGCCTGGCT 2220 

CATCTTCGCA GACGGGCTCC TCTACTGTCC CGTGGCCTTC CTCAGCTTTG CCTCCATGCT 2280 

GGGCCTCTTC CCTGTCACGC CCGAGGCCGT CAAGTCTGTC CTGCTGGTGG TGCTGCCCCT 2340 

GCCTGCCTGC CTCAACCCAC TGCTGTACCT GCTCTTCAAC CCCCACTTCC GGGATGACCT 2400 

TCGGCGGCTT CGGCCCCGCG CAGGGGACTC AGGGCCCCTA GCCTATGCTG CGGCCGGGGA 2460 

GCTGGAGAAG AGCTCCTGTG ATTCTACCCA GGCCCTGGTA GCCTTCTCTG ATGTGGATCT 2520 

CATTCTGGAA GCTTCTGAAG CTGGGCGGCC CCCTGGGCTG GAGACCTATG GCTTCCCCTC 2580 

AGTGACCCTC ATCTCCTGTC AGCAGCCAGG GGCCCCCAGG CTGGAGGGCA GCCATTGTGT 2640 

AGAGCCAGAG GGGAACCACT TTGGGAACCC CCAACCCTCC ATGGATGGAG AACTGCTGCT 2700 

GAGGGCAGAG GGATCTACGC CAGCAGGTGG AGGCTTGTCA GGGGGTGGCG GCTTTCAGCC 2760 

CTCTGGCTTG GCCTTTGCTT CACACGTGTA AATATCCCTC CCCATTCTTC TCTTCCCCTC 2820 

TCTTCCCTTT CCTCTCTCCC CCTCGGTGAA TGATGGCTGC TTCTAAAACA AATACAACCA 2880 

AAACTCAGCA GTGTGATCTA TAGCAGGATG GCCCAGTCCC TGGCTCCACT GATCACCTCT 2940 

CTCCTGTGAC CATCACCAAC GGGTGCCTCT TGGCCTGGCT TTCCCTTGGC CTTCCTCAGC 3000 

TTCACCTTGA TACTGGGCCT CTTCCTTGTC ATGTCTGAAG CTGTGGACCA GAGACCTGGA 3060 

CTTTTGTCTG CTTAAGGGAA ATGAGGGAAG TAAAGACAGT GAAGGGGTGG AGGGTTGATC 3120 

AGGGCACAGT GGACAGGGAG ACCTCACAGA GAAAGGCCTG GAAGGTGATT TCCCGTGTGA 3180 

CTCATGGATA GGATACAAAA TGTGTTCCAT GTACCATTAA TCTTGACATA TGCCATGCAT 3240 

AAAGACTTCC TATTAAAATA AGCTTTGGAA GAG 3273 
AGTCTGGCCC TGGACAACCC CAGCAAAGCC GCCCTCAGCC AGCCCAGAAG CACTGGGCCT 
TGGCCACAGC AACACCCACT GAGCACGCTG GGAGCTGAGT ATGGCGTCCC TGGTCTCGCT 
GGAGCTGGGG CTGCTTCTGG CTGTGCTGGT GGTGACGGCG ACGGCGTCCC CGCCTGCTGG 
TCTGCTGAGC CTGCTCACCT CTGGCCAGGG CGCTCTGGAT CAAGAGGCTC TGGGCGGCCT 
GTTAAATACG CTGGCGGACC GTGTGCACTG CACCAACGGG CCGTGTGGAA AGTGCCTGTC 
TGTGGAGGAC GCCCTGGGCC TGGGCGAGCC TGAGGGGTCA GGGCTGCCCC CGGGCCCGGT 
CCTGGAGGCC AGGTACGTCG CCCGCCTCAG TGCCGCCGCC GTCCTGTACC TCAGCAACCC 
CGAGGGCACC TGTGAGGACA CTCGGGCTGG CCTCTGGGCC TCTCATGCAG ACCACCTCCT 
GGCCCTGCTC GAGAGCCCCA AGGCCCTGAC CCCGGGCCTG AGCTGGCTGC TGCAGAGGAT 
GCAGGCCCGG GCTGCCGGCC AGACCCCCAA GACGGCCTGC GTAGATATCC CTCAGCTGCT 
GGAGGAGGCG GTGGGGGCGG GGGCTCCGGG CAGTGCTGGC GGCGTCCTGG CTGCCCTGCT 
GGACCATGTC AGGAGCGGGT CTTGCTTCCA CGCCTTGCCG AGCCCTCAGT ACTTCGTGGA 
CTTTGTGTTC CAGCAGCACA GCAGCGAGGT CCCTATGACG CTGGCCGAGC TGTCAGCCTT 
GATGCAGCGC CTGGGGGTGG GCAGGGAGGC CCACAGTGAC CACAGTCATC GGCACAGGGG 
AGCCAGCAGC CGGGACCCTG TGCCCCTCAT CAGCTCCAGC AACAGCTCCA GTGTGTGGGA 
CACGGTATGC CTGAGTGCCA GGGACGTGAT GGCTGCATAT GGACTGTCGG AACAGGCTGG 
GGTGACCCCG GAGGCCTGGG CCCAACTGAG CCCTGCCCTG CTCCAACAGC AGCTGAGTGG 
AGCCTGCACC TCCCAGTCCA GGCCCCCGGT CCAGGACCAG CTCAGCCAGT CAGAGAGGTA 
TCTGTACGGC TCCCTGGCCA CGCTGCTCAT CTGCCTCTGC GCGGTCTTTG GCCTCCTGCT 
GCTGACCTGC ACTGGCTGCA GGGGGGTCGC CCACTACATC CTGCAGACCT TCCTGAGCCT 
GGCAGTGGGT GCACTCACTG GGGACGCTGT CCTGCATCTG ACGCCCAAGG TGCTGGGGCT 
GCATACACAC AGCGAAGAGG GCCTCAGCCC ACAGCCCACC TGGCGCCTCC TGGCTATGCT 
GGCCGGGCTC TACGCCTTCT TCCTGTTTGA GAACCTCTTC AATCTCCTGC TGCCCAGGGA 
CCCGGAGGAC CTGGAGGACG GGCCCTGCGG CCACAGCAGC CATAGCCACG GGGGCCACAG 
CCACGGTGTG TCCCTGCAGC TGGCACCCAG CGAGCTCCGG CAGCCCAAGC CCCCCCACGA 
GGGCTCCCGC GCAGACCTGG TGGCGGAGGA GAGCCCGGAG CTGCTGAACC CTGAGCCCAG 
GAGACTGAGC CCAGAGTTGA GGCTACTGCC CTATATGATC ACTCTGGGCG ACGCCGTGCA 
CAACTTCGCC GACGGGCTGG CCGTGGGCGC CGCCTTCGCG TCCTCCTGGA AGACCGGGCT 
GGCCACCTCG CTGGCCGTGT TCTGCCACGA GTTGCCACAC GAGCTGGGGG ACTTCGCCGC 
CTTGCTGCAC GCGGGGCTGT CCGTGCGCCA AGCACTGCTG CTGAACCTGG CCTCCGCGCT 
CACGGCCTTC GCTGGTCTCT ACGTGGCACT CGCGGTTGGA GTCAGCGAGG AGAGCGAGGC 
CTGGATCCTG GCAGTGGCCA CCGGCCTGTT CCTCTACGTA GCACTCTGCG ACATGCTCCC 
GGCGATGTTG AAAGTACGGG ACCCGCGGCC CTGGCTCCTC TTCCTGCTGC ACAACGTGGG 
CCTGCTGGGC GGCTGGACCG TCCTGCTGCT GCTGTCCCTG TACGAGGATG ACATCACCTT 
CTGATACCCT GCCCTAGTCC CCCACCTTTG ACTTAAGATC CCACACCTCA CAAACCTACA 
GCCCAGAAAC CAGAAGCCCC TATAGAGGCC CCAGTCCCAA CTCCAGTAAA GACACTCTTG 
TCCTTGGAAA AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: C175 DNA Sequence 
Nucleic Acid Accession #: NM_018971 
Coding sequence: 1..1128 



1 
1 

ATGGCGAACG 
AAGCTGGCCA 
CTGCTGATCG 

CGTGCGGCGG 
CTGGCCGCGC 
TACCTGGCCA 
GCCATGCTGG 
GACGGCGGTG 
CCCGGOGCGC 
TACCTCCGCC 
CCCGCCGTCA 
AACTGGACGG 
GCAGGGCCGG 
AGGCTGTGCA 
GTCGTGGCCA 
ACGGCCTCCG 
TTCAACAGGG 
ACCACCCAGG 



11 
I 

CGAGCGAGCC 
CGCTCAGCCT 
TGCGGGAGCG 
ACGGGCTGCG 



TCGCGCACCA 
TGTGCGCCGC 
GCGACGACGA 

TGCTCTTCTT 
GCCACGACTG 
CGGGCTTCGG 
GCCGCGGCGC 
AGATGTTCTA 
GCTACCTGCG 
TGTGGCTGAC 
AGCTGAGGGA 
CGACCCATCC 




CCGTCCCCCA 
ACCCCGTCGT 
CCTGCTGCCA 
GTTTATGA 



Seq ID NO: C176 DNA Sequence 
Nucleic Acid Accession #: NM_005631 
Coding sequence: 290..2653 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
1440 

1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2192 



51 
I 

CCTGGGCCTC 
GCTGTTCGCG 
GCTCGACCTG 
GGCGGCGCGG 
GCTCGCCTTC 
CGTCACCCGC 
GCCGTGCGCC 
GCCAGTGCTG 
CGACGGCGCC 
GCACCTCGTC 

GGCGGCCGCC 
CATCCGGCCC 
GACGGAGAAG 
GGGGCCCTAC 
GGCCTACCTG 
GTGCTTCCTC 
GAGCCCCCGG 



GGCACGAGGG 
CCGTTCCGGG 
GCCGAGGGGC 
ACAGGTCGCC 
AGGAGCAGGC 
CCGCCCAGCG 
GGACCCGGGC 
GGGCGGGAGC 
CGGCCGGGCT 
CTACGGGGCC 



11 

1 

GGCTGAAGAC 
CCTCCGCAGC 
CGGGGCGCGC 
TGAGCCGCCT 
GGGGGCGCCG 
CGGGGGCCGG 
CGGGGGGCGG 
GCGAGGAGGA 
GCCCCCTGCG 
ACCTCCACAC 



21 
I 

AACTTGGATT 
CCAACATGGG 
GGAGCGTCCG 
CCGCGGCCGC 

AGCTCCCGCT 
CCTCGAGCGG 
GCGCGGCGGT 
AGCCGCTGCG 
TGCTGGCCGG 



31 
I 

GCGAGGCTAG 
CCCCGGGTTC 
GGGGGGCCCG 
CGAGGTCGTG 
TGAGTTGGCG 
CCTGGGGCTG 
GAACGCGACC 
GACTGGCCCT 
CTACAACGTG 
AGACTCGGAC 



41 

I 

GGCTTGGGGA 
CAAAGTTTGC 
GGCCCGGATT 
CGTGTGGCCG 
GGGTTGGCCA 
CTGCTGCTGC 
GGGCCTGGGC 
CCGCCGCCGC 
TGCCTGGGCT 
TCCCAGGAGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
900 

960 

1020 

1080 

1128 



51 
I 

GTCGTGCATC 
GAAGTTGGGC 
CTCTGGGCGC 
GGGGGCTCCG 
TGGCCGCTGC 
TGCTGCTGGG 
CTCGGAGCGC 
TGAGCCACTC 



AAGCGCACGG 
CAAGCTCGTG CTCTGGTCGG GCCTCCGGAA TGCCCCCCGC TGCTGGGCAG TGATCCAGCC 660 

CCTGCTGTGT GCCGTATACA TGCCCAAGTG TGAGAATGAC GGGGTGGAGC TGCCCAGCCG 720 

TACCCTCTGC CAGGCCACCC GAGGCCCCTG TGCCATCGTG GAGAGGGAGC GGGGCTGGCC 780 

TGACTTCCTG CGCTGCACTC CTGACCGCTT CCCTGAAGGC TGCACGAATG AGGTGCAGAA 840 

CATCAAGTTC AACAGTTCAG GCCAGTGCGA AGTGCCCTTG GTTCGGACAG ACAACCCCAA 900 

GAGCTGGTAC GAGGACGTGG AGGGCTGCGG CATCCAGTGC CAGAACCCGC TCTTCACAGA 960 

GGCTGAGCAC CAGGACATGC ACAGCTACAT CGCGGCCTTC GGGGCCGTCA CGGGCCTCTG 1020 

CACGCTCTTC ACCCTGGCCA CATTCGTGGC TGACTGGCGG AACTCGAATC GCTACCCTGC 1080 

TGTTATTCTC TTCTACGTCA ATGCGTGCTT CTTTGTGGGC AGCATTGGCT GGCTGGCCCA 1140 

GTTCATGGAT GGTGCCCGCC GAGAGATCGT CTGCCGTGCA GATGGCACCA TGAGGCTTGG 1200 

GGAGCCCACC TCCAATGAGA CTCTGTCCTG CGTCATCATC TTTGTCATCG TGTACTACGC 1260 

CCTGATGGCT GGTGTGGTTT GGTTTGTGGT CCTCACCTAT GCCTGGCACA CTTCCTTCAA 1320 

AGCCCTGGGC ACCACCTACC AGCCTCTCTC GGGCAAGACC TCCTACTTCC ACCTGCTCAC 1380 

CTGGTCACTC CCCTTTGTCC TCACTGTGGC AATCCTTGCT GTGGCGCAGG TGGATGGGGA 1440 

CTCTGTGAGT GGCATTTGTT TTGTGGGCTA CAAGAACTAC CGATACCGTG CGGGCTTCGT 1500 

GCTGGCCCCA ATCGGCCTGG TGCTCATCGT GGGAGGCTAC TTCCTCATCC GAGGAGTCAT 1560 

GACTCTGTTC TCCATCAAGA GCAACCACCC GGGGCTGCTG AGTGAGAAGG CTGCCAGCAA 1620 

GATCAACGAG ACCATGCTGC GCCTGGGCAT TTTTGGCTTC CTGGCCTTTG GCTTTGTGCT 1680 

CATTACCTTC AGCTGCCACT TCTACGACTT CTTCAACCAG GCTGAGTGGG AGCGCAGCTT 1740 

CCGGGACTAT GTGCTATGTC AGGCCAATGT GACCATCGGG CTGCCCACCA AGCAGCCCAT 1800 

CCCTGACTGT GAGATCAAGA ATCGCCCGAG CCTTCTGGTG GAGAAGATCA ACCTGTTTGC 1860 

CATGTTTGGA ACTGGCATCG CCATGAGCAC CTGGGTCTGG ACCAAGGCCA CGCTGCTCAT 1920 

CTGGAGGCGT ACCTGGTGCA GGTTGACTGG GCAGAGTGAC GATGAGCCAA AGCGGATCAA 1980 

GAAGAGCAAG ATGATTGCCA AGGCCTTCTC TAAGCGGCAC GAGCTCCTGC AGAACCCAGG 2040 

CCAGGAGCTG TCCTTCAGCA TGCACACTGT GTCCCACGAC GGGCCCGTGG CGGGCTTGGC 2100 

CTTTGACCTC AATGAGCCCT CAGCTGATGT CTCCTCTGCC TGGGCCCAGC ATGTCACCAA 2160 

GATGGTGGCT CGGAGAGGAG CCATACTGCC CCAGGATATT TCTGTCACCC CTGTGGCAAC 2220 

TCCAGTGCCC CCAGAGGAAC AAGCCAACCT GTGGCTGGTT GAGGCAGAGA TCTCCCCAGA 2280 

GCTGCAGAAG CGCCTGGGCC GGAAGAAGAA GAGGAGGAAG AGGAAGAAGG AGGTGTGCCC 2340 

GCTGGCGCCG CCCCCTGAGC TTCACCCCCC TGCCCCTGCC CCCAGTACCA TTCCTCGACT 2400 

GCCTCAGCTG CCCCGGCAGA AATGCCTGGT GGCTGCAGGT GCCTGGGGAG CTGGGGACTC 2460 

TTGCCGACAG GGAGCGTGGA CCCTGGTCTC CAACCCATTC TGCCCAGAGC CCAGTCCCCC 2520 

TCAGGATCCA TTTCTGCCCA GTGCACCGGC CCCCGTGGCA TGGGCTCATG GCCGCCGACA 2580 

GGGCCTGGGG CCTATTCACT CCCGCACCAA CCTGATGGAC ACAGAACTCA TGGATGCAGA 2640 

CTCGGACTTC TGAGCCTGCA GAGCAGGACC TGGGACAGGA AAGAGAGGAA CCAATACCTT 2700 

CAAGGCTCTT CTTCCTCACC GAGCATGCTT CCCTAGGATC CCGTCTTCCA GAGAACCTGT 2760 

GGGCTGACTG CCCTCCGAAG AGAGTTCTGG ATGTCTGGCT CAAAGCAGCA GGACTGTGGG 2820 

AAAGAGCCTA ACATCTCCAT GGGGAGGCCT CACCCCAGGG ACAGGGCCCT GGAGCTCAGG 2880 

GTCCTTGTTT CTGCCCTGCC AGCTGCAGCC TGGTTGGCAG CATCTGCTCC ATCGGGGCAG 2940 

GGGGTATGCA GAGCTTGTGG TGGGGCAGGA ACGGTGGAGG CAGAGGTGAC AGTTCCCAGA 3000 

GTGGGCTTTG GTGGCCAGGG AGGCAGCCTA GCCTATGTCT GGCAGATGAG GGCTGGCTGC 3060 

CGTTTTCTGG GCTGATGGGT GCCCTTTCCT GGCAGTCTCA GTCCAAAAGT GTTGACTGTG 3120 

TCATTAGTCC TTTGTCTAAG TAGGGCCAGG GCACCGTATT CCTCTCCCAG GTGTTTGTGG 3180 

GGCTGGAAGG ACCTGCTCCC ACAGGGGCCA TGTCCTCTCT TAATAGGTGG CACTACCCCA 3240 

AACCCATCTT TTGTTCTCCT ATATCCTCCT TCTCCTGTTC CATTTCAGTT CAGTTTCAGC 3300 

GGTGCCAACC TCTTTGCGTT TCCTTTTTGT TGATGAGGAC CCAGAGCTGC TGCACACACT 3360 

CACCTCTAAC CCCCTCCCCT CGCTGCTGGG CCCCATCTCC ACAGGAGAGA CTGGTTCGGC 3420 

TCTAGGGCCT CAGTCTGGAG TGGGATAGGA GCAGTGAGTG ACAAAGCCTC TGAAAGATGC 3480 

ATCATCTCTT CCTCACACCC ATTTAGTGGG GGATGGGTCC TCTAGACTTG AGGGGCTACC 3540 

CTGGGAAGCT GCCGTAGCTT CAGCCAGGCA AGAAAGCTTC CTTCAACCTG CATAGCCGGT 3600 

GGGTGAGGAG ATTCCCACCT TCCATAGCCT CCAAACATGT TCCCAAGGCC CCACTTTCAA 3660 

GAATCAGACA GCAGGAAGCC ATAGATGCTG GCTGGGTTCC AGGTTATGGG GAGAAGAAAT 3720 

ACAGTCAATA AAAGGTTTTT GTATAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3780 

A 3781 

Seq ID NO: C177 DNA Sequence
Nucleic Acid Accession #: AK094595
Coding sequence: 1..2853 



1 
I 

ATGGGGGCCC 
CCGAGGCTGA 
TCAGCGCCAG 
AAGAACAAGC 
TGCAACGGCG 
ACCCTGGGGG 
CAGGTGGAGG 
GCGGGCACCA 
GATCAGGAGC 
CCGCCGGAGG 
CCCACCCAGG 
CGCCTGTCGG 
AGCACCACTG 
TCACCCTGCT 
CCCGCTCCAC 
ACCACCATCT 
ACTGAGTGTG 
CGTGACTGCA 
AATAAGAAAA 
CTGTATGCGG 
GTGGTGGTGT 
GCCCTGACTG 
CTCCTACACC 



11 

I 

GGAGCGGAGC 
GCCAAGCAGG 
CAGAGCCGCT 
CTGTGGAGCT 
AGTGGGTCAG 
CGCGGGGCGG 
AGCTCTTTGG 
CCAAGAGTCG 
CTCTGGGCAA 
GGGTGOCTGT 
ACACCAACTT 
ACACTGCCAA 
CCACCGTCAT 
CCAACCGCTG 
TCAACGGAGG 
GCCCAGTCGA 
CCCACTGGCG 
GCGGGACGCT 
CTCTAAGCGA 
GGCTCGTGGT 
ACCGCCGCAA 
GTGGTTTCCA 
CCTCTGTGCC 




GCTGGAGGAT 
CCGAGCCTAC 
GGAGGTGCCC 
GGCCGAGGTG 
CCTGCTCACC 
CTATACCTGC 
CGTCTACGTG 
TGGCCGAGGC 
GGCCTTCTGC 
TGGGGCGTGG 
TAGCCGCGAG 
GCTCGACTCT 
CCCCAACAGC 
GGCCATCTTC 
CTGCCGTGAC 
CCCCGTCAAC 
TCCTGACCTG 



31 
I 

CTGCTGCTGG 
TOGGGTGAGG 
CTGCAGGAGC 
GCCTTCCCOG 
CACGTCACAC 
CGCGAGGTGC 
TACTGGTGCC 
GTCCGCATCG 
CTGGACCATG 
GAATGGCTCA 
ATCGACCACA 
GTGGCCAAGA 
AATGGCGGCT 
TGGCAGAAGC 
GAGGGCCAGG 
ACGGAGTGGA 
TGCATGGCGC 
AAGAACTGCA 
CACCTGCTGG 
GTGGTCGTGG 
TTCGACACAG 
TTTAAGACGG 
ACAGCCAGCG 



41 
I 

CACTGCTGCT 
TGCTCCCTGA 
CACAGGACGC 
CCACACAGAT 
AGGAAGGCCT 
AGATCGAGGT 
AGTGCGTGGC 
CCTACCTGCG 
AGGTTCTCCT 
AGAATGAGGA 
ACCTCATCAT 
ACATCGTGGC 
GGTCCAGCTG 
GCACCCGGAC 
CATTCCAGAA 
GCAAGTGGTC 
CCCCACCCCA 
CAGATGGGCT 
AGGCCTCAGG 
CAATCCTCAT 
ACATCACTGA 
CAAGGCCCAG 
CCGGCATCTA 



51 
I 

CTGCTGGGAC 
CTCCTTCCCG 
CTACATTGTG 
CTACTTCAAG 
GGATGAGGCC 
GTCGCGGCAG 
CTGGAGCTCC 
CAAGAACTTC 
GCAGTGCCGC 
TGTCATCGAC 
CCGCCAGGCC 
CAAACGCCGG 
GGCAGAGTGG 
CTGCACCAAC 
GACCGCCTGC 
AGCCTGCAGC 
GAACGGAGGC 
GTGCATGCAA 
GGATGCGGCG 
GGCGGTGGGG 
CTCATCTGCT 
CAACCCGCAG 
COGCGGACCC 
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GTGTATGCCC TGCAGGACTC CACCGACAAA ATCCCCATGA CCAACTCTCC TCTGCTGGAC 1440 

CCCTTACCCA GCCTTAAGGT CAAGGTCTAC AGCTCCAGCA CCACGGGCTC TGGGCCAGGC 1500 

CTGGCAGATG GGGCTGACCT GCTGGGGGTC TTGCCGCCTG GCACATACCC TAGCGATTTC 1560

GCCCGGGACA CCCACTTCCT GCACCTGCGC AGCGCCAGCC TCGGTTCCCA GCAGCTCTTG 1620 

GGCCTGCCCC GAGACCCAGG GAGCAGCGTC AGCGGCACCT TTGGCTGCCT GGGTGGGAGG 1680

CTCAGCATCC CCGGCACAGG GGTCAGCTTG CTGGTGCCCA ATGGAGCCAT TCCCCAGGGC 1740 

AAGTTCTACG AGATGTATCT ACTCATCAAC AAGGCAGAAA GTACCCTCCC GCTTTCAGAA 1800 

GGGACCCAGA CAGTATTGAG CCCCTCGGTG ACCTGTGGAC CCACAGGCCT CCTGCTGTGC 1860 

CGCCCCGTCA TCCTCACCAT GCCCCACTGT GCCGAAGTCA GTGCCCGTGA CTGGATCTTT 1920 

CAGCTCAAGA CCCAGGCCCA CCAGGGCCAC TGGGAGGAGG TGGTGACCCT GGATGAGGAG 1980

ACCCTGAACA CACCCTGCTA CTGCCAGCTG GAGCCCAGGG CCTGTCACAT CCTGCTGGAC 2040 

CAGCTGGGCA CCTACGTGTT CACGGGCGAG TCCTATTCCC GCTCAGCAGT CAAGCGGCTC 2100 

CAGCTGGCCG TCTTCGCCCC CGCCCTCTGC ACCTCCCTGG AGTACAGCCT CCGGGTCTAC 2160 

TGCCTGGAGG ACACGCCTGT AGCACTGAAG GAGGTGCTGG AGCTGGAGCG GACTCTGGGC 2220 

GGATACTTGG TGGAGGAGCC GAAACCGCTA ATGTTCAAGG ACAGTTACCA CAACCTGCGC 2280

CTCTCCCTCC ATGACCTCCC CCATGCCCAT TGGAGGAGCA AGCTGCTGGC CAAATACCAG 2340 

GAGATCCCCT TCTATCACAT TTGGAGTGGC AGCCAGAAGG CCCTCCACTG CACTTTCACC 2400 

CTGGAGAGGC ACAGCTTGGC CTCCACAGAG CTCACCTGCA AGATCTGCGT GCGGCAAGTG 2460 

GAAGGGGAGG GCCAGATATT CCAGCTGCAT ACCACTCTGG CAGAGACACC TGCTGGCTCC 2520 

CTGGACACTC TCTGCTCTGC CCCTGGCAGC ACTGTCACCA CCCAGCTGGG ACCTTATGCC 2580

TTCAAGATCC CACTGTCCAT CCGCCAGAAG ATATGCAACA GCCTAGATGC CCCCAACTCA 2640 

CGGGGCAATG ACTGGCGGAT GTTAGCACAG AAGCTCTCTA TGGACCGGTA CCTGAATTAC 2700 

TTTGCCACCA AAGCGAGCCC CACGGGTGTG ATCCTGGACC TCTGGGAAGC TCTGCAGCAG 2760 

GACGATGGGG ACCTCAACAG CCTGGCGAGT GCCTTGGAGG AGATGGGCAA GAGTGAGATG 2820 

CTGGTGGCTG TGGCCACCGA CGGGGACTGC TGAGCCTCCT GGGACAGCGG GCTGGCAGGG 2880

ACTGGCAGGA GGCAGGTGCA GGGAGGCCTG GGGCAGCCTC CTGATGGGGA TGTTTGGCCT 2940 

CTGCTTCCTC CCAGTTCACA GCCAGAGTTG CCTCTCCTCC TCCTCTTCCC CAACCCCCAG 3000 

ACCATGACCA GCCTTAGAAA ATCCATGTAC TCTGTTGTTA GAGGGCCCAG AGTTCCTTCT 3060 

CCACCCCOGC TCTCTCTCTC TTGGCCTGAG ATCTCTGTGC AGGAACCAAG ATGGGGCTGA 3120 

AGCCTCTGGA GGCAGTTGGT TGGGGGCGGG CAGGCAGGAG GCCCTCCCTC CACCCCCCCA 3180

CCCTCAGCCC GGCAACTTCT GGGTTCCATG GGTTTTAGTT CCGTTCTCGT TTTCTTCCTC 3240 

CGTTATTGAT TTCTCCTTTC TCCCTAAGCC CCCTTCTGCT TCCACGCCCT TTTCCTCTTT 3300 

GAAGAGTCAA GTACAATTCA GACAAACTGC TTTCTCCTGT CCAAAAGCAA AAAGGCAAAG 3360 

GAAAGAAAGA AAGCTTCAGA CCGCTAGTAA GGCTCAAAGA AGAAGAAAAA CACCAAAACC 3420 

ACAAGGGAAA AGAAAAACCC AGTTTCTTAG GAAACGCAAA CGATTTATTA TCCAGATTAT 3480

TTGGATAAGT CCTTTTTAAA A 3501 

Seq ID NO: C178 DNA Sequence 
Nucleic Acid Accession #: NM_004625 
Coding sequence: 310..1359
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i ii 
I I 

GAGGGAGGGG 
GCACCGGTCG CTCGCGCGCA 
CCTCCCGCTC CGTGGCTCCC 
CGCACCGACA GACGGCCCCG 
TTGCCCCGCC GGGGCCGGCC 
ATCGGGACTA TGAACCGGAA 
ATGGTCTACC TCCGGATCGG 
TGTAACAAGA TCCCAGGCCT 
GCCATCATCG TCATAGGAGA 
CGCAATGGCC GCTGGAACTG 
AAAGTGGGGA GCCGGGAGGC 
GCCATCACAG CTGCCTGTAC 
CAAGGCCAGT ACCACCGGGA 
TACGGCATGG GCTTCGCCAA 
ACTCTCATGA ACTTGCACAA 
CTGGAATGTA AGTGCCACGG 
CTGCCACAGT TTCGGGAGCT 
GTGGAGCCTG TGCGTGCCAG 
CTGTCGTACC GCAAGCCCAT 
TGGGAGGAGG ACCCGGTGAC 
GCTCCCCAGG CCAGCGGCTG 
TACGCCCGCG TGTGGCAGTG 
ACGTGCAGCG AGCGCACGGA 
CCCGCTGCAA GTCAGATTGC 
AGGATGCTGA GCTTGTCTTT 
TCCGTGGGGG AAAAAAAATC 
CTCCACCCTC CCCCAGACAC 
CAGGAACTCT GGACCCCTGG 
ATAATATTAA TTTATTTAAT 



21 31 41 

1 i I 

AGGCAGCAGC GCCCCCGCAC 
GCCCGGCGTC GCCCCACGCC GCGCTCGCTC 
GTGCTCCTGG CGAGGCTCAG GCGCGGAGCG 
GGGACGCCTC GGCTCGCGCC TCCCGGGCGG 
CGCGGGATCA GCACAGCCCG GCCCGCGGCC 
AGCGCGGCGC TGCCTGGGCC ACCTCTTTCT 
TGGCTTCTCC TCAGTGGTAG CTCTGGGCGC 
GGCTCCCAGA CAGCGGGCGA TCTGCCAGAG 
AGGCTCACAA ATGGGCCTGG ACGAGTGTCA 
CTCTGCACTG GGAGAGCGCA CCGTCTTCGG 
TGCGTTCACC TACGCCATCA TTGCCGCCGG 
CCAGGGCAAC CTGAGCGACT GTGGCTGCGA 
CGAGGGCTGG AAGTGGGGTG GCTGCTCTGC 
GGTCTTTGTG GATGCCCGGG AGATCAAGCA 
CAACGAGGCA GGCCGAAAGA TCCTGGAGGA 
CGTGTCAGGC TCGTGCACCA CCAAGACGTG 
GGGCTACGTG CTCAAGGACA AGTACAACGA 
CCGCAACAAG CGGCCCACCT TCCTGAAGAT 
GGACACGGAC CTGGTGTACA TCGAGAAGTC 
CGGCAGTGTG GGCACCCAGG GCOGCGCCTG 
TGACCTCATG TGCTGTGGGC GTGGCTACAA 
CAACTGTAAG TTOCACTGGT GCTGCTATGT 
GATGTACACG TGCAAGTGAG CCCCGTGTGC 
TGGGAGGACT GGACCGTTTC CAAGCTGCGG 
TCTGCTGAGG AGGGTACTTT TCCTGGGTTT 
TCTCAGAGCC CTCAACTATT CTGTTCCACA 
AGCCCAGGTC CCTCCGOGGC TGGAGCGAAG 
GCCTCATCAC AGCAATATTT AACAATTTAT 
TAAAAAGAAT TCTTCCACAA AAAAAAAAAA 




CGCGGACGGG 
GCTATGTTGA 
CCGGOGGCCA 
CAGCCTGGGC 
AAGCATCATC 
OCGGCCCGAC 
GTTTCAGTTC 
GAAGGAGCTC 
CGTGGCCCAC 
CAAAGAGAAG 
CGACATCCGC 
GAATGCCCGG 
GAACATGAAG 
CTGGACCACA 
GGCCGTTCAC 
CAAGAAGCCA 
GCCCAACTAC 
CAACAAGACG 
CACCCACCAG 
CAAGTGCAAC 
ACACCACCCT 
GCTCCCTGGC 
CCTGCAGGCA 
CCCAATGCTG 
CCTTCTGCAG 
TCTGATAAAA 
AAAAAA 



51 
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Seq ID NO: C179 DNA Sequence OBR3 
Nucleic Acid Accession #: NM_003786
Coding sequence: 71..4654

1 11 21 31 41 

1 I I I I I 

CTCCGGCGCC CGCTCTGCCC GCCGCTGGGT CCGACCGCGC TCGCCTTCCT TGCAGCCGCG 
CCTCGGCCCC ATGGACGCCC TGTGCGGTTC CGGGGAGCTC GGCTCCAAGT TCTGGGACTC 
CAACCTGTCT GTGCACACAG AAAACCCGGA CCTCACTCCC TGCTTCCAGA ACTCCCTGCT 
GGCCTGGGTG CCCTGCATCT ACCTGTGGGT CGCCCTGCCC TGCTACTTGC TCTACCTGCG 
GCACCATTGT CGTGGCTACA TCATCCTCTC CCACCTGTCC AAGCTCAAGA TGGTCCTGGG 
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TGTCCTGCTG TGGTGCGTCT CCTGGGCGGA CCTTTTTTAC TCCTTCCATG GCCTQGTCCA 3 GO 

TGGCCGGGCC CCTGCCCCTG TTTTCTTTGT CACCCCCTTG GTGGTGGGGG TCACCATGCT 420 

GCTGGCCACC CTGCTGATAC AGTATGAGCG GCTGCAGGGC GTACAGTCTT CGGGGGTCCT 480

CATTATCTTC TGGTTCCTGT GTGTGGTCTG CGCCATCGTC CCATTCCGCT CCAAGATCCT 540 

TTTAGCCAAG GCAGAGGGTG AGATCTCAGA CCCCTTCCGC TTCACCACCT TCTACATCCA 600 

CTTTGCCCTG GTACTCTCTG CCCTCATCTT GGCCTGCTTC AGGGAGAAAC CTCCATTTTT 660 

CTCCGCAAAG AATGTCGACC CTAACCCCTA CCCTGAGACC AGCGCTGGCT TTCTCTCCCG 720 

CCTGTTTTTC TGGTGGTTCA CAAAGATGGC CATCTATGGC TACCGGCATC CCCTGGAGGA 780 

GAAGGACCTC TGGTCCCTAA AGGAAGAGGA CAGATCCCAG ATGGTGGTGC AGCAGCTGCT 840 

GGAGGCATGG AGGAAGCAGG AAAAGCAGAC GGCACGACAC AAGGCTTCAG CAGCACCTGG 900 

GAAAAATGCC TCCGGCGAGG ACGAGGTGCT GCTGGGTGCC CGGCCCAGGC CCCGGAAGCC 960 

CTCCTTCCTG AAGGCCCTGC TGGCCACCTT CGGCTCCAGC TTCCTCATCA GTGCCTGCTT 1020 

CAAGCTTATC CAGGAOCTGC TCTCCTTCAT CAATCCACAG CTGCTCAGCA TCCTGATCAG 1080 

GTTTATCTCC AACCCCATGG CCCCCTCCTG GTGGGGCTTC CTGGTGGCTG GGCTGATGTT 1140 

CCTGTGCTCC ATGATGCAGT CGCTGATCTT ACAACACTAT TACCACTACA TCTTTGTGAC 1200

TGGGGTGAAG TTTCGTACTG GGATCATGGG TGTCATCTAC AGGAAGGCTC TGGTTATCAC 1260 

CAACTCAGTC AAACGTGCGT CCACTGTGGG GGAAATTGTC AACCTCATGT CAGTGGATGC 1320 

CCAGCGCTTC ATGGACCTTG CCCCCTTCCT CAATCTGCTG TGGTCAGCAC CCCTGCAGAT 1380 

CATCCTGGCG ATCTACTTCC TCTGGCAGAA CCTAGGTCCC TCTGTCCTGG CTGGAGTCGC 1440 

TTTCATGGTC TTGCTGATTC CACTCAACGG AGCTGTGGCC GTGAAGATGC GCGCCTTCCA 1500 

GGTAAAGCAA ATGAAATTGA AGGACTCGCG CATCAAGCTG ATGAGTGAGA TCCTGAAOGG 1560 

CATCAAGGTG CTGAAGCTGT ACGCCTGGGA GCCCAGCTTC CTGAAGCAGG TGGAGGGCAT 1620 

CAGGCAGGGT GAGCTCCAGC TGCTGCGCAC GGCGGCCTAC CTCCACACCA CAACCACCTT 1680 

CACCTGGATG TGCAGCCCCT TCCTGGTGAC CCTGATCACC CTCTGGGTGT ACGTGTAOGT 1740 

GGACCCAAAC AATGTGCTGG ACGCCGAGAA GGCCTTTGTG TCTGTGTCCT TGTTTAATAT 1800 

CTTAAGACTT CCCCTCAACA TGCTGCCCCA GTTAATCAGC AACCTGACTC AGGCCAGTGT 1860 

GTCTCTGAAA CGGATCCAGC AATTCCTGAG CCAAGAGGAA CTTGACCCCC AGAGTGTGGA 1920 

AAGAAAGACC ATCTCCCCAG GCTATGCCAT CACCATACAC AGTGGCACCT TCACCTGGGC 1980 

CCAGGACCTG CCCCCCACTC TGCACAGCCT AGACATCCAG GTCCCGAAAG GGGCACTGGT 2040 

GGCCGTGGTG GGGCCTGTGG GCTGTGGGAA GTCCTCCCTG GTGTCTGCCC TGCTGGGAGA 2100 

GATGGAGAAG CTAGAAGGCA AAGTGCACAT GAAGGGCTCC GTGGCCTATG TGCCCCAGCA 2160 

GGCATGGATC CAGAACTGCA CTCTTCAGGA AAACGTGCTT TTCGGCAAAG CCCTGAACCC 2220 

CAAGCGCTAC CAGCAGACTC TGGAGGCCTG TGCCTTGCTA GCTGACCTGG AGATGCTGCC 2280 

TGGTGGGGAT CAGACAGAGA TTGGAGAGAA GGGCATTAAC CTGTCTGGGG GCCAGCGGCA 2340 

GCGGGTCAGT CTGGCTCGAG CTGTTTACAG TGATGCCGAT ATTTTCTTGC TGGATGACCC 2400 

ACTGTCCGCG GTGGACTCTC ATGTGGCCAA GCACATCTTT GACCACGTCA TCGGGCCAGA 2460 

AGGGGTGCTG GCAGGCAAGA CGCGAGTGCT GGTGACGCAC GGCATTAGCT TCCTGCCCCA 2520 

GACAGACTTC ATCATTGTGC TAGCTGATGG ACAGGTGTCT GAGATGGGCC CGTACCCAGC 2580 

CCTGCTGCAG CGCAACGGCT CCTTTGCCAA CTTTCTCTGC AACTATGCCC CCGATGAGGA 2640 

CCAAGGGCAC CTGGAGGACA GCTGGACCGC GTTGGAAGGT GCAGAGGATA AGGAGGCACT 2700 
GCTGATTGAA GACACACTCA GCAACCACAC GGATCTGACA GACAATGATC CAGTCACCTA 2760 
TGTGGTCCAG AAGCAGTTTA TGAGACAGCT GAGTGCCCTG TCCTCAGATG GGGAGGGACA 2820 
GGGTCGGCCT GTACCCCGGA GGCACCTGGG TCCATCAGAG AAGGTGCAGG TGACAGAGGC 2880 
GAAGGCAGAT GGGGCACTGA CCCAGGAGGA GAAAGCAGCC ATTGGCACTG TGGAGCTCAG 2940 
TGTGTTCTGG GATTATGCCA AGGCCGTGGG GCTCTGTACC ACGCTGGCCA TCTGTCTCCT 3000 
GTATGTGGGT CAAAGTGCGG CTGCCATTGG AGCCAATGTG TGGCTCAGTG CCTGGACAAA 3060
TGATGCCATG GCAGACAGTA GACAGAACAA CACTTCCCTG AGGCTGGGCG TCTATGCTGC 3120 
TTTAGGAATT CTGCAAGGGT TCTTGGTGAT GCTGGCAGCC ATGGCCATGG CAGCGGGTGG 3180 
CATCCAGGCT GCCCGTGTGT TGCACCAGGC ACTGCTGCAC AACAAGATAC GCTCGCCACA 3240 
GTCCTTCTTT GACACCACAC CATCAGGCCG CATCCTGAAC TGCTTCTCCA AGGACATCTA 3300 
TGTOGTTGAT GAGGTTCTGG CCCCTGTCAT CCTCATGCTG CTCAATTCCT TCTTCAACGC 3360 
CATCTCCACT CTTGTGGTCA TCATGGCCAG CACGCCGCTC TTCACTGTGG TCATCCTGCC 3420 
CCTGGCTGTG CTCTACACCT TAGTGCAGCG CTTCTATGCA GCCACATCAC GGCAACTGAA 3480 
GCGGCTGGAA TCAGTCAGCC GCTCACCTAT CTACTCCCAC TTTTCGGAGA CAGTGACTGG 3540 
TGCCAGTGTC ATCCGGGCCT ACAACCGCAG CCGGGATTTT GAGATCATCA GTGATACTAA 3600 
GGTGGATGCC AACCAGAGAA GCTGCTACCC CTACATCATC TCCAACCGGT GGCTGAGCAT 3660 
CGGAGTGGAG TTCGTGGGGA ACTGOGTGGT GCTCTTTGCT GCACTATTTG CCGTCATCGG 3720 
GAGGAGCAGC CTGAACCCGG GGCTGGTGGG CCTTTCTGTG TCCTACTCCT TGCAGGTGAC 3780 
ATTTGCTCTG AACTGGATGA TAOGAATGAT GTCAGATTTG GAATCTAACA TCGTGGCTGT 3840 
GGAGAGGGTC AAGGAGTACT CCAAGACAGA GACAGAGGCG CCCTGGGTGG TGGAAGGCAG 3900 
CCGCCCTCCC GAAGGTTGGC CCCCACGTGG GGAGGTGGAG TTCOGGAATT ATTCTGTGCG 3960 
CTACCGGCCG GGCCTAGACC TGGTGCTGAG AGACCTGAGT CTGCATGTGC ACGGTGGCGA 4020 
GAAGGTGGGG ATCGTGGGCC GCACTGGGGC TGGCAAGTCT TCCATGACCC TTTGCCTGTT 4080 
CCGCATCCTG GAGGCGGCAA AGGGTGAAAT CCGCATTGAT GGCCTCAATG TGGCAGACAT 4140
CGGCCTCCAT GACCTGCGCT CTCAGCTGAC CATCATCCCG CAGGACCCCA TOCTGTTCTC 4200 
GGGGACCCTG CGCATGAACC TGGACCCCTT OGGCAGCTAC TCAGAGGAGG ACATTTGGTG 4260 
GGCTTTGGAG CTGTCCCACC TGCACACGTT TGTGAGCTCC CAGCCGGCAG GCCTGGACTT 4320 
CCAGTGCTCA GAGGGCGGGG AGAATCTCAG CGTGGGCCAG AGGCAGCTCG TGTGCCTGGC 4380 
CCGAGCCCTG CTCCGCAAGA GCCGCATCCT GGTTTTAGAC GAGGCCACAG CTGCCATCGA 4440 
CCTGGAGACT GACAACCTCA TCCAGGCTAC CATCCGCACC CAGTTTGATA CCTGCACTGT 4500 
CCTGACCATC GCACACCGGC TTAACACTAT CATGGACTAC ACCAGGGTCC TGGTCCTGGA 4560 
CAAAGGAGTA GTAGCTGAAT TTGATTCTCC AGCCAACCTC ATTGCAGCTA GAGGCATCTT 4620 
CTACGGGATG GCCAGAGATG CTGGACTTGC CTAAAATATA TTCCTGAGAT TTCCTCCTGG 4680 
CCTTTCCTGG TTTTCATCAG GAAGGAAATG ACACCAAATA TGTCCGCAGA ATGGACTTGA 4740 
TAGCAAACAC TGGGGGCACC TTAAGATTTT GCACCTGTAA AGTGCCTTAC AGGGTAACTG 4800 
TGCTGAATGC TTTAGATGAG GAAATGATCC CCAAGTGGTG AATGACACGC CTAAGGTCAC 4860 
AGCTAGTTTG AGCCAGTTAG ACTAGTCCCC CGGTCTCCCG ATTCCCAACT GAGTGTTATT 4920 
TGCACACTGC ACTGTTTTCA AATAACGATT TTATGAAATG ACCTCTGTCC TCCCTCTGAT 4980 
TTTTCATATT TTCCTAAAGT TTCGTTTCTG TTTTTTAATA AAAAGCTTTT TCCTCCTGGA 5040 
ACAGAAGACA GCTGCTGGGT CAGGCCACCC CTAGGAACTC AGTCCTGTAC TCTGGGGTGC 5100 
TGCCTGAATC CATTAAAAAT GGGAGTACTG ATGAAATAAA ACTACATGGT CAACAGTAAA 5160 
AAAAAAAAAA AAAAAA 5176 

Seq ID NO: C180 DNA Sequence
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Nucleic Acid Accession NM_004626 
Coding sequence: 124..1188
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1 
I 

TAACCCGCCG 
CAGGCGGAGG 
ACGATGAGGG 
GGCGTGTGCT 
AACCAGACGC 
CGCAGCAACC 
TGTCGCCGGG 
TATTTGCTTG 
GCCGCCATCA 
TGCGGCCCCQ 
AACCTCAGCT 
AAAACAGGAT 
GCTCTGCGCG 
ATCCGCACCT 
CGATACCTGT 
CCCAAGGACC 
CCTGACTTCT 
AACAAGACAT 
CCCTACACAG 
ACCTGCCGCA 
CGCCCCACGC 
GGAGACACTC 

CAGGACATCA 
GCCCACAGGC 
TTTCAGGGCC 
TGCGGGACCC 
TAGCTTGGGT 
GCTACAGAAA 
TTATATATTT 
AGCTTTTTAA 
CCCGGCCTOG 
CCAGGCA 



11 
I 

CCTCCGCTCT 
ACTTCGGCGC 
CGCGGCCGCA 
ATGGCATCAA 
AACACTGCAA 
TGGAGCTCAT 
CCTTTGCCGA 
ACCTGGAGAG 
GCCACGCCAT 
TCCCAGGTGA 
ACGGGCTCCT 
CCCAAGCCAA 
CCTCTCTGGA 
GCTGGAAGGG 
CGGCCACCAA 
TGGATATCCG 
GCATGAAGAA 
CCAACGGAAG 
ACCGCGTGGT 
GGTGTGAGCG 
AGGAGCGAGG 
CATGGAGCTC 
TCACTTGGAA 
AAGGAAACCG 
TGGTGTAAGG 
ACAGAGACCA 
CCTGGCCCAC 
TGTAAATGTT 
TATTTATAAA 
TATATATATA 
GCAGCTGTAT 
TCAAGTGAAC 



ACATAGCTTT 
AATATATATG 
GAAATAAATG 
TCGGCAGACC 



31 
I 

AGGCGGCGTG 
GGGTGTGACC 
GCGCTGCTCT 
CTGTCCAAGA 
GGTCTGGTGT 
GTGCACGCCG 
AACTGCTCCT 
GAGTCGGCCT 
TGCACCTCCG 
CCCGGGAACC 
AAGTTTTCCG 
CGTCTACACA 
AAGTGCCATG 
CTGCAGGATG 
CGACCCATGG 
GACTCGGAAC 
GGCTCCCACG 
GACCTTATGT 
CACTGTAAGT 
CGCTATGTCT 
AGGACCCTCA 
TTCCAGATGC 
ACAGAAGGTC 
AAATAACTTG 
GGGATCGGTG 
AGGGGTCTGC 
GAGCCTGCTG 
TTTTGCTTTT 
TTCTTTGGGG 
TATATATATA 
CTGAGTGAGC 
CTGGGGCTGG 



41 
I 

CAGGACCAGC 
CCGGGCGCGC 
TCGCCCTGGC 
CACCATCGGC 
CTGCACAGGT 
CCCGCGAGGT 
CCATTGAGCT 
TCGTGTATGC 
GCGACCTGCC 
GCTGGGGAGG 
ATGCTCCTAT 
ACAGTGAAGT 
GGGTGTCTGG 
TGGCTGCTGA 
GCACCCGCAA 
TCGTCTATCT 
GGACACAAGA 
GCTGCGGGCG 
ACCACTGGTG 
GCAAGTGAGG 
GCAACTGGGG 
CAGGCATGGG 
TGGCCACCCT 
GCAGCCTGAG 
AGACTGATAC 
CCGCCTTCTT 
GGCCCACCAC 



TGGCACTTCT 
ATGATCTCTA 
CCCAGCCCGC 
CAGAGGGAGC 




CCTCAAGACC 
GCACCTGGTG 
GCAGAGCTCA 
CAGGCAGTGC 
TGGCTACAAC 
CTGCTACGTC 
CCCTGCCCTC 
CCAGGGGCCT 
AGGCGGCTTG 
GGAAGGAGGG 
GCTCTGGAGT 
AGACTTGACC 
CAGAATGTTC 
ATGGAATCAC 
GGATGTGGAA 
CAATTCCTCT 
TTTTAAAACT 
CCCTGCAGTT
TCTCCAGTTT 



60 

120 

180 

240 

300 

360 

420 

480

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1927 



Seq ID NO: C181 DNA Sequence 
Nucleic Acid Accession #: NM_031866
Coding sequence: 6..2090

1 11 21 31 41 51

I I I I I I 

ACAGCATGGA GTGGGGTTAC CTGTTGGAAG TGACCTCGCT GCTGGCCGCC TTGGCGCTGC 60 

TGCAGCGCTC TAGCGGCGCT GCGGCCGCCT CGGCCAAGGA GCTGGCATGC CAAGAGATCA 120 

CCGTGCCGCT GTGTAAGGGC ATCGGCTACA ACTACACCTA CATGCCCAAT CAGTTCAACC 180 

ACGACACGCA AGACGAGGCG GGCCTGGAGG TGCACCAGTT CTGGCCGCTG GTGGAGATCC 240 

AGTGCTCGCC CGATCTCAAG TTCTTCCTGT GCAGCATGTA CACGCCCATC TGCCTAGAGG 300 

ACTACAAGAA GCCGCTGCCG CCCTGCCGCT CGGTGTGCGA GCGCGCCAAG GCCGGCTGCG 360 

CGCCGCTCAT GCGCCAGTAC GGCTTCGCCT GGCCCGACCG CATGCGCTGC GACCGGCTGC 420 

CCGAGCAAGG CAACCCTGAC ACGCTGTGCA TGGACTACAA CCGCACCGAC CTAACCACCG 480

CCGCGCCCAG CCCGCCGCGC CGCCTGCCGC CGCCGCCGCC CGGCGAGCAG CCGCCTTCGG 540 

GCAGCGGOCA CGGCCGCCCG CCGGGGGCCA GGCCCCCGCA CCGCGGAGGC GGCAGGGGCG 600 

GTGGCGGCGG GGACGCGGCG GCGCCCCCAG CTCGCGGCGG CGGCGGTGGC GGGAAGGCGC 660

GGCCCCCTGG CGGCGGCGCG GCTCCCTGCG AGCCCGGGTG CCAGTGCCGC GCGCCTATGG 720 

TGAGCGTGTC CAGCGAGCGC CACCCGCTCT ACAACCGCGT CAAGACAGGC CAGATCGCTA 780 

ACTGCGCGCT GCCCTGCCAC AACCCCTTTT TCAGCCAGGA CGAGCGCGCC TTCACCGTCT 840 

TCTGGATCGG CCTGTGGTCG GTGCTCTGCX TCGTGTCCAC CTTCGCCACC GTCTCCACCT 900 

TCCTTATCGA CATGGAGCGC TTCAAGTACC CGGAGCGGCC CATTATCTTC CTCTGGGCCT 960 

GCTACCTCTT CGTGTCGGTG GGCTACCTAG TGCGCCTGGT GGCGGGCCAC GAGAAGGTGG 1020 

CGTGCAGCGG TGGCGCGCCG GGOGCGGGGG GGGCTGGGGG CGCGGGCGGC GCGGCGGCGG 1080 

GCGCGGGCGC GGCGGGCGCG GGCGCGGGCG GCCOGGGCGG GCGCGGOGAG TACGAGGAGC 1140 

TGGGCGCGGT GGAGCAGCAC GTGCGCTACG AGACCACCGG CCCCGCGCTG TGCACCGTGG 1200 

TCTTCTTGCT GGTCTACTTC TTCGGCATGG CCAGCTCCAT CTGGTGGGTG ATCTTGTCGC 1260 

TCACATGGTT CCTGGCGGCC GGTATGAAGT GGGGCAACGA AGCCATCGCC GGCTACTCGC 1320 

AGTACTTCCA CCTGGCCGCG TGGCTTGTGC CCAGCGTCAA GTCCATCGCG GTGCTGGCGC 1380 

TCAGCTCGGT GGACGGCGAC CCGGTGGCGG GCATCTGCTA CGTGGGCAAC CAGAGCCTGG 1440 

ACAACCTGCG CGGCTTCGTG CTGGCGCCGC TGGTCATCTA CCTCTTCATC GGCACCATGT 1500

TCCTGCTGGC CGGCTTCGTG TCCCTGTTCC GCATCCGCTC GGTCATCAAG CAACAGGACG 1560 

GCCCCACCAA GACGCACAAG CTGGAGAAGC TGATGATCCG CCTGGGCCTG TTCACCGTGC 1620 

TCTACACCGT GCCCGCCGCG GTGGTGGTCG CCTGCCTCTT CTACGAGCAG CACAACCGCC 1680 
CGCGCTGGGA GGCCACGCAC AACTGCCCGT GCCTGCGGGA CCTGCAGCCC GACCAGGCAC 1740 
GCAGGCCCGA CTACGCCGTC TTCATGCTCA AGTACTTCAT GTGCCTAGTG GTGGGCATCA 1800 
CCTCGGGCGT GTGGGTCTGG TCCGGCAAGA CGCTGGAGTC CTGGCGCTCC CTGTGCACCC 1860 
GCTGCTGCTG GGCCAGCAAG GGCGCCGCGG TGGGCGGGGG CGCGGGCGCC ACGGCCGCGG 1920 
GGGGTGGCGG CGGGCCGGGG GGCGGCGGCG GCGGGGGACC CGGCGGCGGC GGGGGGCCGG 1980 
GCGGCGGCGG GGGCTCCCTC TACAGCGACG TCAGCACTGG CCTGACGTGG CGGTCGGGCA 2040 
CGGCGAGCTC CGTGTCTTAT CCAAAGCAGA TGCCATTGTC CCAGGTCTGA GCGGAGGGGA 2100 
GGGGGCGCCC AGGAGGGGTG GGGAGGGGGG CGAGGAGACC CAAGTGCAGC GAAGGGACAC 2160 
TTGATGGGCT GAGGTTCCCA CCCCTTCACA GTGTTGATTG CTATTAGCAT GATAATGAAC 2220 
TCTTAATGGT ATCCATTAGC TGGGACTTAA ATGACTCACT TAGAACAAAG TACCTGGCAT 2280 
TGAAGCCTCC CAGACCCAGC CCCTTTTCCT CCATTGATGT GCGGGGAGCT CCTCCCGCCA 2340 






WO 03/042661 



PCT/US02/36810 



OGCGTTAATT TCTGTTGGCT GAGGAGGGTG GACTCTGCGG CGTTTCCAGA ACCCGAGATT 2400

TGGAGCCCTC CCTGGCTGCA CTTGGCTGGG TTTGCAGTCA GATACACAGA TTTCACCTGG 2460 

GAGAACCTCT TTTTCTCCCT CGACTCTTCC TACGTAAACT CCCACCCCTG ACTTACCCTG 2520 

GAGGAGGGGT GACOGCCACC TGATGGGATT GCACGGTTTG GGTATTCTTA ATGACCAGGC 2580 

AAATGCCTTA AGTAAACAAA CAAGAAATGT CTTAATTATA CACCCCACGT AAATACGGGT 2640

TTCTTACATT AGAGGATGTA TTTATATAAT TATTTGTTAA ATTGTAAAAA AAAAAAGTGT 2700 

AAAATATGTA TATATCCAAA GATATAGTGT GTACATTTTT TTGTAAAAAG TTTAGAGGCT 2760 

TACCCCTGTA AGAACAGATA TAAGTATTCT ATTTTGTCAA TAAAATGACT TTTGATAAAT 2820 

GATTTAACCA TTGCCCTCTC CCCCGCCTCT TCTGAGCTGT CACCTTTAAA GTGCTTGCTA 2880

AGGACGCATG GGGAAAATGG ACATTTTCTG GCTTGTCATT CTGTACACTG ACCTTAGGCA 2940

TGGAGAAAAT TACTTGTTAA ACTCTAGTTC TTAAGTTGTT AGCCAAGTAA ATATCATTGT 3000

TGAACTGAAA TCAAAATTGA GTTTTTGCAC CTTCCCCAAA GACGGTGTTT TTCATGGGAG 3060 

CTCTTTTCTG ATCCATGGAT AACAACTCTC ACTTTAGTGG ATGTAAATGG AACTTCTGCA 3120 

AGGCAGTAAT TCCCCTTAGG CCTTGTTATT TATCCTGCAT GGTATCACTA AAGGTTTCAA 3180

AACCCTGAAA AAAAA 3195
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Seq ID NO: C182 DNA Sequence 
Nucleic Acid Accession #: XM_050625
Coding sequence: 222..1109



1 11 21 31 41 51 

I I I I | | 

CCGGGTCGGA GCCCCCCGGA GCTGCGCGCG GGCTTGCAGC GCCTCGCCCG CGCTGTCCTC 60 

CCGGTGTCCC GCTTCTCCGC GCCCCAGCCG CCGGCTGCCA GCTTTTCGGG GCCCCGAGTC 120

GCACCCAGCG AAGAGAGCGG GCCCGGGACA AGCTCGAACT COGGCCGCCT CGCCCTTCCC 180 

CGGCTCCGCT CCCTCTGCCC CCTCGGGGTC GCGCGCCCAC GATGCTGCAG GGCCCTGGCT 240 

CGCTGCTGCT GCTCTTCCTC GCCTCGCACT GCTGCCTGGG CTCGGCGCGC GGGCTCTTCC 300 

TCTTTGGCCA GCCCGACTTC TCCTACAAGC GCAGCAATTG CAAGCCCATC CCTGCCAACC 360 

TGCAGCTGTG CCACGGCATC GAATACCAGA ACATGCGGCT GCCCAACCTG CTGGGCCACG 420

AGACCATGAA GGAGGTGCTG GAGCAGGCCG GCGCTTGGAT CCCGCTGGTC ATGAAGCAGT 480

GCCACCCGGA CACCAAGAAG TTCCTGTGCT CGCTCTTCGC CCCCGTCTGC CTCGATGACC 540 

TAGACGAGAC CATCCAGCCA TGCCACTCGC TCTGCGTGCA GGTGAAGGAC CGCTGCGCCC 600 

CGGTCATGTC CGCCTTCGGC TTCCCCTGGC CCGACATGCT TGAGTGCGAC CGTTTCCCCC 660 

AGGACAACGA CCTTTGCATC CCCCTCGCTA GCAGCGACCA CCTCCTGCCA GCCACCGAGG 720 

AAGCTCCAAA GGTATGTGAA GCCTGCAAAA ATAAAAATGA TGATGACAAC GACATAATGG 780 

AAACGCTTTG TAAAAATGAT TTTGCACTGA AAATAAAAGT GAAGGAGATA ACCTACATCA 840 

ACCGAGATAC CAAAATCATC CTGGAGACCA AGAGCAAGAC CATTTACAAG CTGAACGGTG 900 

TGTCCGAAAG GGACCTGAAG AAATCGGTGC TGTGGCTCAA AGACAGCTTG CAGTGCACCT 960 

GTGAGGAGAT GAACGACATC AACGCGCCCT ATCTGGTCAT GGGACAGAAA CAGGGTGGGG 1020

AGCTGGTGAT CACCTCGGTG AAGCGGTGGC AGAAGGGGCA GAGAGAGTTC AAGCGCATCT 1080

CCCGCAGCAT CCGCAAGCTG CAGTGCTAGT CCOGGCATCC TGATGGCTCC GACAGGCCTG 1140 

CTCCAGAGCA CGGCTGACCA TTTCTGCTCC GGGATCTCAG CTCCCGTTCC CCAAGCACAC 1200 

TCCTAGCTGC TCCAGTCTCA GCCTGGGCAG CTTCCCCCTG CCTTTTGCAC GTTTGCATCC 1260 

CCAGCATTTC CTGAGTTATA AGGCCACAGG AGTGGATAGC TGTTTTCACC TAAAGGAAAA 1320 

GCCCACCCGA ATCTTGTAGA AATATTCAAA CTAATAAAAT CATGAATATT TTTATGAAGT 1380 
TT 1382

Seq ID NO: C183 DNA Sequence 

Nucleic Acid Accession #: NM_001306.1 

Coding sequence: 199..861

1 11 21 31 41 51 

I I I I I | 

AATTCGGCAC GAGGGCAGGT GCAGGCGCAC GOGGCGAGAG CGTATGGAGC CGAGCCGTTA 60 

GCGCGCGCCG TCGGTGAGTC AGTCCGTCCG TCCGTCCGTC CGTCGGGGCG CCGCAGCTCC 120 

CGCCAGGCCC AGCGGCCCCG GCCCCTCGTC TCCCCGCACC CGGAGCCACC CGGTGGAGCG 180 

GGCCTTGCCG CGGCAGCCAT GTCCATGGGC CTGGAGATCA CGGGCACCGC GCTGGCCGTG 240 

CTGGGCTGGC TGGGCACCAT CGTGTGCTGC GCGTTGCCCA TGTGGCGCGT GTCGGCCTTC 300 

ATCGGCAGCA ACATCATCAC GTCGCAGAAC ATCTGGGAGG GCCTGTGGAT GAACTGCGTG 360 

GTGCAGAGCA CCGGCCAGAT GCAGTGCAAG GTGTACGACT CGCTGCTGGC ACTGCCACAG 420 

GACCTTCAGG CGGCCCGCGC CCTCATCGTG GTGGCCATCC TGCTGGCCGC CTTCGGGCTG 480 

CTAGTGGCGC TGGTGGGCGC CCAGTGCACC AACTGCGTGC AGGACGACAC GGCCAAGGCC 540 

AAGATCACCA TCGTGGCAGG CGTGCTGTTC CTTCTCGCCG CCCTGCTCAC CCTCGTGCCG 600 

GTGTCCTGGT CGGCCAACAC CATTATCCGG GACTTCTACA ACCCCGTGGT GCCOGAGGOG 660 

CAGAAGCGCG AGATGGGCGC GGGCCTGTAC GTGGGCTGGG CGGCCGCGGC GCTGCAGCTG 720 

CTGGGGGGCG CGCTGCTCTG CTGCTCGTGT CCCCCACGCG AGAAGAAGTA CACGGCCACC 780 

AAGGTCGTCT ACTCCGCGCC GCGCTCCACC GGCCCGGGAG CCAGCCTGGG CACAGGCTAC 840 

GACCGCAAGG ACTACGTCTA AGGGACAGAC GCAGGGAGAC CCCACCACCA CCACCACCAC 900 

CAACACCACC ACCACCACCG CGAGCTGGAG CGCGCACCAG GCCATCCAGC GTGCAGCCTT 960 

GCCTCGGAGG CCAGCCCACC CCCAGAAGCC AGGAAGCCCC CGCGCTGGAC TGGGGCAGCT 1020

TCCCCAGCAG CCACGGCTTT GCGGGCCGGG CAGTCGACTT CGGGGCCCAG GGACCAACCT 1080 

GCATGGACTG TGAAACCTCA CCCTTCTGGA GCACGGGGCC TGGGTGACCG CCAATACTTG 1140 

ACCACCCCGT CGAGCCCCAT CGGGCCGCTG CCCCCATGTC GCGCTGGGCA GGGACCGGCA 1200 

GCCCTGGAAG GGGCACTTGA TATTTTTCAA TAAAAGCCTC TCGTTTTAGC 1250 

Seq ID NO: C184 DNA Sequence

Nucleic Acid Accession #: NM_012449.1

Coding sequence: 66..1085

1 11 21 31 41 51

| | | | | |

CCGAGACTCA CGGTCAAGCT AAGGCGAAGA GTGGGTGGCT GAAGCCATAC TATTTTATAG 60 

AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 120 

GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 180 

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AAAGACCTGT 
CAGAACTTCA 
CTATTATAGC 
CAACTTCCCA 
CAATGGTTTC 
TCCAACTTCA 
TAACAAGAAA 
GTCTGTCTTA 
AGGTCCAACA 
ATGTGTCTCT 
CATCTGTGAG 
TTGTTTCCCT 
ATATAAAACA 
TTGTTGTCCT 
AGATTAGACA 
TGTAGAATTA 
TCAAGTTTGT 



GCTTTTGCAT 
GCACACACAG 
ATCTCTGACT 
TCAACAATAT 
CATCACTCTC 
TAATGGAACC 
GCAGTTTGGG 
CCCAATGAGG 
AAATAAAGAA 
GGGAATTGTG 
TGACTCTTTG 
TCTACTGGGC 
ATTTGTATGG 
GATATTTAAA 
TGGTTGGGAA 
CTGTTTACAC 
ATTTGTTAAT 



TTGCACCAAA 
GAACTCTTTC 
TTTCTTTACA 
TTTTATAAAA 
TTGGCATTGG 
AAGTATAAGA 
CTTCTCAGTT 
CGATCCTACA 
GATGCCTGGA 
GGATTGGCAA 
ACATGGAGAG 
ACAATACACG 
TATACACCTC 
AGCATACTAT 
GACGTCACCA 
ACATTTTTGT 
AAAATGATTA 



CAGCCCATGC 
CACAGTGGCA 
CTCTTCTGAG 
TTCCAATCCT 
TTTACCTGCC 
AGTTTCCACA 
TCTTTTTTGC 
GATACAAGTT 
TTGAGCATGA 
TACTGGCTCT 
AATTTCACTA 
CATTGATTTT 
CAACTTTTAT 
TCCTGCCATG 
AAATTAACAA 
TCAATATTGA 
TTCAAGGAAA 



TGATGAATTT 
CTTGCCAATT 
GGAAGTAATT 
GGTCATCAAC 
AGGTGTGATA 
TTGGTTGGAT 
TGTACTGCAT 
GCTAAACTGG 
TGTTTGGAGA 
GTTGGCTGTG 
TATTCAGAGC 
TGCCTGGAAT 
GATAGCTGTT 
CTTGAGGAAG 
AACTGAGATA 
TATATTTTAT 
AAAAAAAAAA 



GACTGCCCTT 
AAAATAGCTG 
CACCCTTTAG 
AAAGTCTTGC 
GCAGCAATTG 
AAGTGGATGT 
GCAATTTATA 
GCATATCAAC 
ATGGAGATTT 
ACATCTATTC 
AAGCTAGGAA 
AAGTGGATAG 
TTCCTTCCAA 
AAGATACTGA 
TGTTCCCAGT 
CACCAACATT 



Seq ID NO: C185 DNA Sequence 

Nucleic Acid Accession #: NM_001775.1

Coding sequence: 70..972



1 
I 

CTAAAGCTCT 
TGGAGCCCTA 
CTCTCTAGGA 
GTGCTCGCGG 



AGACATGTAG 
CCTTGCAACA 
CCTTGCAACA 
GTCCAGCGGG 
ACATGGTGTG 
AAGGACTGCA 
GAAGCTGCCT 
AAAAACAGCA 
CTAGAGGCCT 
ACCATAAAAG 
ATCTACAGAC 
TCTGAGATCT 
CATCATACAT 
CAATAAGGTC 
CATCAGCATA 
AATGAAAATT 



11 
I 

CTTGCTGCCT 
TGGCCAACTG 
GAGCCCAACT 
TGGTOGTCCC 
AGACCGTCCT 
ACTGCCAAAG 
TTACTGAAGA 
AGATTCTTCT 
ACATGTTCAC 
GTGAATTCAA 
GCAACAACCC 
GTGATGTGGT 
CTTTTGGGAG 
GGGTGATACA 
AGCTGGAATC 
CTGACAAGTT 
GAGCCAGTCG 
GACTCAGCAT 
AATGCCAGAG 
CCTTTATTGT 
GTATGTTAAG 



21 
I 

AGCCTCCTGC 
CGAGTTCAGC 
CTGTCTTGGC 
GAGGTGGCGC 
GGCGCGATGC 
TGTATGGGAT 
AGACTATCAG 
TTGGAGCAGA 
CCTGGAGGAC 
CACTTCCAAA 
TGTTTCAGTA 
CCATGTGATG 
TGTGGAAGTC 
TGGTGGAAGA 
GATTATAAGC 
TCTTCAGTGT 
CTGTGGTTGT 
ACCTGCTGGT 
ACGGAAGCCT 
GATCTATCAA 
TTACTTCCTT 



31 
I 

CGGCCTCATC 
CCGGTGTCCG 
GTCAGTATCC 
CAGACGTGGA 
GTCAAGTACA 
GCTTTCAAGG 
CCACTAATGA 
ATAAAAGATC 
ACGCTGCTAG 
ATAAACTATC 
TTCTGGAAAA 
CTCAATGGAT 
CATAATTTGC 
GAAGATTCCA 
AAAAGGAATA 
GTGAAAAATC 
TTTAGCTCCT 
GCAGAGCTGA 
TTTTCCCCAA 
TAGTCAAGAA 
TAG 



41 

I 

TTCGCCCAGC 
GGGACAAACC 
TGGTCCTGAT 
GCGGTCCGGG 
CTGAAATTCA 
GTGCATTTAT 
AGTTGGGAAC 
TGGCCCATCA 
GCTACCTTGC 
AATCTTGCCC 
CGGTTTCCCG 
CCCGCAGTAA 
AACCAGAGAA 
GAGACTTATG 
TTCAATTTTC 
CTGAGGATTC 
TGACTCCTTG 
AGATTTTGGA 
AGTCTTAAAA 
AAATTATTGT 




CACCACCAAG 
TCCTGAGATG 
TTCAAAACAT 
TCAGACCGTA 
GTTCACACAG 
TGATGACCTC 
AGACTGGAGA 
CAGGTTTGCA 
AATCTTTGAC 
GGTTCAGACA 
CCAGGATCCC 
CTGCAAGAAT 
ATCTTGCACA 
TGGTTTATGT 
GGGTCCTCCA 
TAACTTATAT 
ATAAGATTAG 



Seq ID NO: C186 DNA Sequence
Nucleic Acid Accession #: XM_120513. 
Coding sequence: 1..2208 



1 
I 

ATGGTGTCAT 
TACGCCTTGC 
AGGACCCCAA 
TTTTCAGAAG 
TACTCTGACG 
GTCATGCAGG 
GCCTCGAGTC 
GCCCCCCGCG 
ACGGCACAGA 
CGGAGGCACC 
CCGAGGCGCC 
ATCTACAAGC 
CTGGCCAAGC 
CCGGCGCAGC 
GTCAGACTCG 
CCGCCCCTCC 
TCCCCGCCAT 
CATACACATA 
TTTGGATTGA 
ATTCTTGGAT
GTGGCGGTCC 
TTGGATGCCT 
CTCCTGGACT 
TACAACGAAA 
AATGTAGACA 
GTTTTCACTG 
TCGTCAGCCA 
CATACATTAG 
TCCAGAGAGC 
GGTGCTTCAA 
CTTGAGGATG 
ACCTCCTTGG 
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60 
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TAGGCATCTA 
CTTGGCTGGG
GAACTCTGAA 
AATGGATTTG 
GAGCAAACAG 
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TGGGATGAGG CTGTAAATTC TTCAAAGAAA GATGGGAGAC GGCTCCTTCG ATACCTCATC 1980 

AGATTTGTTT TCACAACCGA TGAGCTTAAG TACTCATGCG GCCTTGGGAA AAGGAAAAGG 2040 

TCAGTGCAGT CAGGAGAGAC AGGTCCCGAA AGACGCCCTC TGGATCCAGT TAAAGTAACA 2100 

TGCCTCCGAG GTACTGCATC CTTCCGCTCA GTGTCACCAT CTGTGATCTC ATTTCACCGC 2160 

ATTGGCTGTG GCTCTCCCCG TACAAGTGTT CAGCCTTCTG TATTTTGA 2208 

Seq ID NO: C187 DNA Sequence
Nucleic Acid Accession #: AB037745.1
Coding sequence: 26..1744

1 11 21 31 41 51 

I I I I I I 

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTGGAGACG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 360 

CACTACCACG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420 

GTACACCAAT GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCCGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACCGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTGACC TCCGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA CAGCCTGTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380 

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560 

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 1620 

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860 

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCCGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCGGG CAGATCATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2580 

GCGTGGGCCT GGAAGGCGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 

CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 
TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 
CTCAAGTGAT CTGTCCGCCT CGGCCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 
CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 
ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 
TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 
GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 
GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 
GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 
TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 
TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA GGCTGGCCAA 3480 
AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACCAGAA TCAAACCAGT CTTCAAGGAA 3540 
GGAAGGCTAG GAGAGTTTAA CAAGATTTTC ACTGGGCCCA GCATGGTGGC TCACACCTGT 3600
AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 
TGAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGGATACTT 3720 
CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 
AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 
GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 
TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 3960 
CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 
CCATGATTTG TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT GGTAGATATA 4080 
GAAAGGGGAT GGCATTTGCA AGTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 
TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 
GGCCATCAAC AGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260
TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320
TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAGCTCCA GCCTAAAGTC 4380
TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440
CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 4500
TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560
ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620
GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680
ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740
ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800
TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860
CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATCCT TCTGCTTCAG 4920
CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980
TAGTAGAGAT GGGGTTTCAC [missing] AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040
ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100
CTACGTGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160
TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220
TTGAGTCCTG GAACACGCAG TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280
ATACAAATGA AGATTGTGGT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340
GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400
AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT CGAATTTTTT TTCATCAGGG 5460
CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGCGTTAA TAGTTTGTTA GAAGATGGGT 5520
TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567



Seq ID NO: C188 DNA Sequence 

Nucleic Acid Accession #: NM_014324.1

Coding sequence: 89..1237



1 11 21 31 41 51

| | | | | |

GGCGCCGGGA TTGGGAGGGC TTCTTGCAGG CTGCTGGGCT GGGGCTAAGG GCTGCTCAGT 60
TTCCTTCAGC GGGGCACTGG GAAGCGCCAT GGCACTGCAG GGCATCTCGG TCGTGGAGCT 120
GTCCGGCCTG GCCCCGGGCC GTNTCTGTGC TATGGTCCTG GCTGACTTCG GGGCGCGTGT 180
GGTACGCGTG GACCGGCCCG GCTCCCGCTA CGACGTGAGC CGCTTGGGCC GGGGCAAGCG 240
CTCGCTAGTG CTGGACCTGA AGCAGCCGCG GGAGCCGCGT GCTGCGGCGT CTGTGCAAGC 300
GGTCGGATGT GCTGCTGGAG CCCTTCCGCC GCGGTGTCAT GGAGAAACTC CAGCTGGGCC 360
CAGAGATTCT GCAGCGGGAA AATCCAAGGC TTATTTATGC CAGGCTGAGT GGATTTGGCC 420
AGTTCAGGAA AGCTTCTGCC GGTTAGCTGG CCACGATATC AACTATTTGG CTTTGTCAGG 480
TGTTCTCTCA AAAATTGGCA GAAGTGGTGA GAATCCGTAT GCCCCGCTGA ATCTCGTGGC 540
TGACTTTGCT GGTGGTGGCC TTATGTGTGC ACTGGGCATT ATAATGGCTC TTTTTGACCG 600
CACACGCACT GACAAGGGTC AGGTCATTGA TGCAAATATG GTGGAAGGAA CAGCATATTT 660
AAGTTCTTTT CTGTGGAAAA CTCAGAAATC GAGTCTGTGG GAAGCACCTC GAGGACAGAA 720
CATGTTGGAT GGTGGAGCAC CTTTCTATAC GACTTACAGG ACAGCAGATG GGGAATTCAT 780
GGCTGTTGGA GCAATAGAAC CCCAGTTCTA CGAGCTGCTG ATCAAAGGAC TTGGACTAAA 840
GTCTGATGAA CTTCCCAATC AGATGAGCAC GGATGATTGG CCAGAAATGA AGAAGAAGTT 900
TGCAGATGTA TTTGCAAAGA AGACGAAGGC AGAGTGGTGT CAAATCTTTG ACGGCACAGA 960
TGCCTGTGTG ACTCCGGTTC TGACTTTTGA GGAGGTTGTT CATCATGATC ACAACAAGGA 1020
ACGGGGCTCG TTTATCACCA GTGAGGAGCA GGACGTGAGC CCCCGCCTTG CACCTCTGCT 1080
GTTAAACACC CCAGCCATCC CTTCTTCCAA AGGGGATCCT TTCATAGGAG AACACACTGA 1140
GGAGATACTT GAAGAATTTG GATTCAGCCG AGAAGAGATT TATCAGCTTA ACTCAGATAA 1200
AATCATTGAA AGTAATAAGG TAAAAGCTAG TCTCTAACTT CCAGGCCCAC GGCTCAAGTG 1260
AATTTGAATA CTGCATTTAC AGTGTAGAGT AACACATAAC ATTGTATGCA TGGAAACATG 1320
GAGGAACAGT ATTACAGTGT CCTACCACTC TAATCAAGAA AAGAATTACA GACTCTGATT 1380
CTACAGTGAT GATTGAATTC TAAAAATGGT TATCATTAGG GCTTTTGATT TATAAAACTT 1440
TGGGTACTTA TACTAAATTA TGGTAGTTAT TCTGCCTTCC AGTTTGCTTG ATATATTTGT 1500
TGATATTAAG ATTCTTGACT TATATTTTGA ATGGGTTCTA GTGAAAAAGG AATGATATAT 1560
TCTTGAAGAC ATCGATATAC ATTTATTTAC ACTCTTGATT CTACAATGTA GAAAATGAGG 1620
AAATGCCACA AATTGTATGG TGATAAAAGT CACGTGAAAC AGAGTGATTG GTTGCATCCA 1680
GGCCTTTTGT CTTGGTGTTC ATGATCTCCC TCTAAGCACA TTCCAAACTT TAGCAACAGT 1740
TATCACACTT TGTAATTTGC AAAGAAAAGT TTCACCTGTA TTGAATCAGA ATGCCTTCAA 1800
CTGAAAAAAA CATATCCAAA ATAATGAGGA AATGTGTTGG CTCACTACGT AGAGTCCAGA 1860
GGGACAGTCA GTTTTAGGGT TGOCTGTATC CAGTAACTCG GGGCCTGTTT CCCCGTGGGT 1920
CTCTOGGCTG TCAGCTTTCC TTTCTCCATG TGTTTGATTT CTCCTCAGGC TGGTAGCAAG 1980
TTCTGGATCT TATACCCAAC ACACAGCAAC ATCCAGAAAT AAAGATCTCA GGACCCCCCA 2040
AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2068



Seq ID NO: C189 DNA Sequence

Nucleic Acid Accession #: XM_091332.1 

Coding sequence: 1..1401 



1 11 21 31 41 51

| | | | | |

ATGCAAAGGT GGACACTGTG GGCTGCAGCC TTCCTGACCC TCCACTCTGC ACAGGCCTTT 60
CCACAAACAG ACATCAGTAT CAGTCCAGCC CTGCCAGAGC TGCCCCTGCC TTCCCTGTGC 120
CCCCTGTTCT GGATGGAGTT CAAAGGCCAC TGCTATCGAT TCTTCCCTCT CAATAAGACC 180
TGGGCTGAGG CCGACCTCTA CTGTTCTGAG TTCTCTGTGG GCAGGAAGTC CGCCAAGCTG 240
GCCTCCATCC ACAGCTGGGA GGAGAATGTC TTTGTATATG ACCTCGTGAA CAGCTGTGTT 300
CCCGGCATCC CAGCTGACGT CTGGACAGGC CTTCATQATC ACAGACAGGA AGGGCAGTTT 360
GAATGGACTG ATGGCTCATC CTATGACTAC AGCTACTGGG ATGGCAGCCA GCCAGATGAT 420
GGCGTCCACG CGGACCCAGA AGAAGAGGAC TGCGTGCAGA TATGGTACAG GCCTACCAGT 480
GAGCAGCTAC AGGCCCCAGA GCCCCAGTTA CCCTTATCAA TCTCAGAGGC CACAGATGTC 540
TATCTCCCTG AGGATTTCCC AGCTGAGCCC AAGCTCATGG ACCAGTCCTG GGTGTCCAGG 600
AAGAGCCTGA AACCATCCAA GAGTCATCTT ATGGAGCCAC CCACTCCAGT GGCCAAGCAC 660
CAAAAGGCAA AGACCCGACA TAGGAGCCTG CGOGGCGTCT GGTGGCCATC AGGTAAGGCT 720
GGGTCATGGA AAGAAAGAAT GAATGCAGAC TACGGGCGAA GAAAGCGATC GGCCCCGAGG 780
CAGGAAGGCC GGCTCCGGTG CAGGGAGCGC CGCCTGCGGG CTGCTTCGGG CCAGGGTCGA 840
CCCGAGGGCC AGCGCAAGCA GCGGCAACAG GAGCGCCAGG AGAGAGGCTG GGAAGAACTG 900
GGAGGGGTGT CCCCAATGCG GGGCGCCCAA GCGTGGCAGC ACGGGCTGGG AGCGGGGAGC 960
CAGCGGGGTG CGGCGCCGGA GTGCGGGGAG AACCACCAGG CGCCGGAATT GGGGAGCACG 1020
TGGAGGGGGC AGCGGCTCCA GCCCCAGACC GCCGCGCTCT GTCACTTTGC ATTAAGAAAG 1080
CTTCCGGGGA ATGCACACGG CCTGGCCGCC GCCTTCGTGC AGCCCGCCCT GCAGGTGCAG 1140
GAAGAAAAGA ATAATOGCAC CCGTTTCTCA GGTGCTTACT TCACCATGTC CGATCCGACG 1200
TGTGACCAAG ATAGCAAGGA GCAGTCTTTA AGGCGACAOG GCAGAGAGGC AGAAAAAGAT 1260
GGGCCTTACC GGTTAGTTAA GAAAAAAAGA GGACCTGTTG CCTGTCCCTC TAGCTTTGAA 1320
CTACAAAGTG GAGGGGAAGT TTGTCTGGAT TTTCCTGTAG AACTGAGGGC AGGGACCTGG 1380
ATTGCTCGAG AACCTCCATA A 1401

Seq ID NO: C190 DNA Sequence
Nucleic Acid Accession #: XM_054869.2
Coding sequence: 26..2902

1 11 21 31 41 51

| | | | | |

TAGACGCGGA GCCCAAGGAG GTAAAATGCA CACTTGCTGC CCCCCAGTAA CTTTGGAACA 60 

GGACCTTCAC AGAAAAATGC ATAGCTGGAT GCTGCAGACT CTAGCGTTTG CTGTAACATC 120 

TCTCGTCCTT TCGTGTGCAG AAACCATCGA TTATTATGGG GAAATCTGTG ACAATGCATG 180 

TCCTTGTGAG GAAAAGGACG GCATTTTAAC TGTGAGCTGT GAAAACCGGG GGATCATCAG 240 

TCTCTCTGAA ATTAGCCCTC CCCGTTTCCC AATCTACCAC CTCTTGTTGT CCGGAAACCT 300 

TTTGAACCGT CTCTATCCCA ATGAGTTTGT CAATTACACT GGGGCTTCAA TTTTGCATCT 360 

AGGTAGCAAT GTTATCCAGG ACATTGAGAC CGGGGCTTTC CATGGGCTAC GGGGTTTGAG 420 

GAGATTGCAT CTAAACAATA ATAAACTGGA ACTTCTGCGA GATGATACCT TCCTTGGCTT 480 

GGAGAACCTG GAGTACCTAC AGGTCGATTA CAACTACATC AGCGTCATTG AACCCAATGC 540 

TTTTGGGAAA CTGCATTTGT TGCAGGTGCT TATCCTCAAT GACAATCTTT TGTCCAGTTT 600 

ACCCAACAAT CTTTTCCGTT TTGTGCCCTT AACGCACTTG GACCTCCGGG GGAACCGGCT 660 

GAAACTTCTG CCCTACGTGG GGCTCTTGCA GCACATGGAT AAAGTTGTGG AGCTACAGCT 720 

GGAGGAAAAC CCTTGGAATT GTTCTTGTGA GCTGATCTCT CTAAAGGATT GGTTGGACAG 780 

CATCTCCTAT TCAGCCCTGG TGGGGGATGT AGTTTGTGAG ACCCCCTTCC GCTTACACGG 840 

AAGGGACTTG GACGAGGTAT CCAAGCAGGA ACTTTGCCCA AGGAGACTTA TTTCTGACTA 900 

CGAGATGAGG CCGCAGACGC CTTTGAGCAC CACGGGGTAT TTACACACCA CCCCGGOGTC 960 

AGTGAATTCT GTGGCCACTT CTTCCTCTGC TGTTTACAAA CCCCCTTTGA AGCCCCCTAA 1020 

GGGGACTCGC CAACCCAACA AGCCCAGGGT GCGCCCCACC TCTCGGCAGC CCTCTAAGGA 1080 

CTTGGGCTAC AGCAACTATG GCCCCAGCAT CGCCTATCAG ACCAAATCCC CGGTGCCTTT 1140 

GGAGTGTCCC ACCGCGTGCT CTTGCAACCT GCAGATCTCT GATCTGGGCC TCAACGTAAA 1200 

CTGCCAGGAG CGAAAGATCG AGAGCATCGC TGAACTGCAG CCCAAGCCCT ACAATCCCAA 1260 

GAAAATGTAT CTGACAGAGA ACTACATCGC TGTCGTGCGC AGGACAGACT TCCTGGAGGC 1320 

CACGGGGCTG GACCTCCTGC ACCTGGGGAA TAACCGCATC TCGATGATCC AGGACCGCGC 1380 

TTTCGGGGAT CTCACCAACC TGAGGCGCCT CTACCTGAAT GGCAACAGGA TCGAGAGGCT 1440 

GAGCCCGGAG TTATTCTATG GCCTGCAGAG CCTGCAGTAT CTCTTCCTCC AGTACAATCT 1500 

CATCCGCGAG ATTCAGTCTG GAACTTTTGA CCCGGTCCCA AACCTCCAGC TGCTATTCTT 1560

GAATAACAAC CTCCTGCAGG CCATGCCCTC AGGCGTCTTC TCTGGCTTGA CCCTCCTCAG 1620 

GCTAAACCTG AGGAGTAACC ACTTCACCTC CTTGCCAGTG AGTGGAGTTT TGGACCAGCT 1680 

GAAGTCACTC ATCCAAATCG ACCTGCATGA CAATCCTTGG GATTGTACCT GTGACATTGT 1740 

GGGCATGAAG CTGTGGGTGG AGCAGCTCAA AGTGGGCGTC CTAGTGGACG AGGTGATCTG 1800 

TAAGGCGCCC AAAAAATTCG CTGAGACCGA CATGCGCTCC ATTAAGTCGG AGCTGCTGTG 1860 

CCCTGACTAT TCAGATGTAG TAGTTTCCAC GCCCACACCC TCCTCTATCC AGGTCCCTGC 1920 

GAGGACCAGC GCCGTGACTC CTGCGGTCCG GTTGAATAGC ACCGGGGCCC COGCGAGCTT 1980 

GGGCGCAGGC GGAGGGGCGT CGTCGGTGCC CTTGTCTGTG TTAATTCTCA GCCTCCTGCT 2040 

GGTTTTCATC ATGTCCGTCT TCGTGGCCGC CGGGCTCTTC GTGCTGGTCA TGAAGCGCAG 2100 

GAAGAAGAAC CAGAGCGACC ACACCAGCAC CAACAACTCC GACGTGAGCT CCTTTAACAT 2160 

GCAGTACAGC GTGTACGGCG GCGGCGGCGG CACGGGCGGC CACCCACACG CGCACGTGCA 2220 

TCACCGCGGG CCCGCGCTGC CCAAGGTGAA GACGCCCGCG GGCCACGTGT ATGAATACAT 2280 

CCCCCACCCA CTGGGCCACA TGTGCAAAAA CCCCATCTAC CGCTCCCGAG AGGGCAACTC 2340 

CGTAGAGGAT TACAAAGACC TGCACGAGCT CAAGGTCACC TACAGCAGCA ACCACCACCT 2400 

GCAGCAGCAG CAGCAGCCGC CGCCGCCACC GCAGCAGCCA CAGCAGCAGC CCCCGCCGCA 2460 

GCTGCAGCTG CAGCCCGGGG AGGAGGAGAG GOGGGAAAGC CACCACTTGC GGAGCCCCGC 2520 

CTACAGCGTC AGCACCATCG AGCCCCGGGA GGACCTGCTG TCGCCGGTGC AGGACGCCGA 2580 

CCGCTTTTAC AGGGGCATTT TAGAACCAGA CAAACACTGC TCCACCACCC CCGCCGGCAA 2640 

TAGCCTCCCG GAATATCCCA AATTCCCGTG CAGCCCCGCT GCTTACACTT TCTCCCCCAA 2700 

CTATGACCTG AGACGCCCCC ATCAGTATTT GCACCCGGGG GCAGGGGACA GCAGGCTACG 2760 

GGAACCGGTG CTCTACAGCC CCCCGAGTGC TGTCTTTGTA GAACCCAACC GGAACGAATA 2820 

TCTGGAGTTA AAAGCAAAAC TAAACGTTGA GCCGGACTAC CTCGAAGTGC TGGAAAAACA 2880 

GACCACGTTT AGCCAGTTCT AAAAGCAAAG AAACTCTCTT GGAGCTTTTG CATTTAAAAC 2940 

AAACAAGCAA GCAGACACAC ACAGTGAACA CATTTGATTA ATTGTGTTGT TTCAACGTTT 3000 

AGGGTGAAGT GCCTTGGCAC GGGATTTCTC AGCTTCGGTG GAAGATACGA AAAGGGTGTG 3060 

CAATTTCCTT TAAAATTTAC ACGTGGGAAA CATTTGTGTA AACTGGGCAC ATCACTTTCT 3120 

CTTCTTGCGT GTGGGGCAGG TGTGGAGAAG GGCTTTAAGG AGGCCAATTT GCTGCGCGGG 3180 

TGACCTGTGA AAGGTCACAG TCATTTTTGT AGTGGTTGGA AGTGCTAAGA ATGGTGGATG 3240 

ATGGCAGAGC ATAGATTCTA CTCTTCCTCT TTTGCTTCCT CCCCCTCCCC CGCCCCTGCC 3300 

CCACCTCTCT TTCTCCCCTT TTAAGCCATG GGTGGGTCTA ACTGGCTTTT GTGGAGAAAT 3360 

TAGCACACCC CAACTTTAAT AGGAAATTTG TTCTCTTTTT CCGCCCCTCT CCTTCTCTCC 3420 

TCCCCTCCCC TCCCTTCTCA TTCCTTTTCT TTGTTTTTAA AGGATGTGTT TGTATGCATT 3480 

CTGGACATTT GAATTAAAAA AAAAGTATTG TGATCCTGTA AAGGATCACC ATAGATGTGG 3540 

ACAAATCATT AAAATTACAG AGCTATATGA TCCATAATTG ATTAGTCAAA ATAACTTATT 3600 

GATGAAATAT ACAAATATTT TATTGTAGCA CCTATTTTTA TATGCACATT TAGCATTCCT 3660 

CTTTCCTTCA CTATTTAGCC TATGATTTTG CAGAGGTGTC ACACTGTATT AGGATCTGCA 3720 

TTTCTAAAAC TGACGTGGTA TCAGGAAGGC ATTTTCAATC ATTCAAAATG TGGAGAATTT 3780 

AATGGCTAAA TCTTTAAAAG CCAATGCAAC CCACCCAATT GAATCTGCAT TTTCTTTTAA 3840 

GAAAACAGAG CTGATTGTAT CCCAATGTAT TTTAAAAAAT AGGGCAATTG ATTGGGCCAT 3900 

TCOGAGAGAA TTGTTTGCAA GTTTTGGGTT TTATTAGAAA ATATTTGAAA GTATTTTTAT 3960 

TAATGAACCA AAATGACATG TTCATTTGAC TACTATTGTA GCCGATTTTC GATTGTTTAA 4020

CCAAACCCAG TTGCATTTGT ACAGATCCAC GTGTACTGGC ACCTCAGAAG ACCAAATCAT 4080

GGACTGTACA AGTCTCTATA CAATGTCTTT ATCCCTGTGG GCAGCAAGCA ATGATGATAA 4140 

TGACAAACAG GATATCTGTA AGATGGGGCT ACTGTTGTTA CAGTCTCATA TGTATCCCAG 4200 

CACATGTAAT TTTTTAAATA GTTTCTGAAT AAACACTTGA TAACTATGTC 4250

Seq ID NO: C191 DNA Sequence 

Nucleic Acid Accession #: NM_000793.2

Coding sequence: 401..1222

1 11 21 31 41 51

| | | | | |

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60 

GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120 

AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 180 

GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300 

TTGGAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 360 

ACCAGAGCGC ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

CTATGACTCG GTCATTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCCGCT CCAAGTCCAC 540 

TCGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGCGTCT GGAAGAGCTT 600 

CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 660 

GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGGAGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGCGCCCAC TAGTGGTCAA 780 

CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 

GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG CTCATCCATC 900 

AGATGGCTGG GCGATACCGG GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACCAGAA 960 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA 1020 

GTGCCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 1080 

TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 1140 

CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCTTT 1380 

CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560 

ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680 

TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 1800 

GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 

AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 2280 

GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 

TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 2460 

ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 2520

TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580 

ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT 2640 

TCCATTGACA TTTGAATTTG GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 2700

AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 

ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 

AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 2880 

ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 

CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 

AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060 

CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTOGTG TCACCAGCTG 3120 

GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 

CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 

ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC 3300 

TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 

ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 

CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCOGG 3480

TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 3540 
AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTOGT TTATGTGTGC AGACAATTCT 3600 
CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 
TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 3720 
GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 3780 
TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATCGCAA CATGAATTTT 3840 
TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 3900 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 3960 
CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 
GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 4080
AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 4140 
GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 4200 
AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 4260 
TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTGC TTAGTGAAAA TACTAGTGGG 4320 
ATCAACAATG ATTTTCTTGA ATGGGCATGA ATGGAGATGC CCGCACAGTA ATGTAGAAAT 4380
GTTTCATACA GCTATTAAAA TGTAACTGAC CTCCTTAGAG GCAGATTAGT AACTGTTCCT
ACTTTGTATA GCTAAGTGAC AGTCACTTAA CTTACATGAC TTTCTTTTTT CACATTGGGT
CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCACGTCTCC TTGATTTTTG GTAGTATCAA
CTTCCCAGTG ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA CCAGGAAGTG CTTATTCTCT
CATCACAGTG GGAAGAATAG CCTATTGTCT TTCATTTTGC CTGAGTGTAT TTTACTATTT
GGGCTCTGAA ATAAAAATTA TGAAATATGG TGAGGTCACA TGTTGGTGCT GCCTTGCTGC
ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GGCCTTTCGG GAAAATTCAA
AGGGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCTATG TGGAAGCAAG
AAGAATTGAC TGATTTACAG GACTTCTCTT TATGTCAATC TTAAGAGGAT GGATGAATCT
GGACATTTGT TCCACCCGAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAGGATCA
TATGACCATT GAAAAAGGAA AAATGTAGAC TCTGACTTCC GTCCCACTGA AGGATTAATG
AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG
TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT TTTTTTTTTT TTTTTTCCCC
TAAATGGTAT GGCCAAAAGT CAGAGTTAAA ATATATATAG TTAGATTCCA ACTTCCTCCT
TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT GGGGCTTAAG
TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA
AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG
GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT
GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA
ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC
CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA
TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT
CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC
AAACCCTCCA [missing] TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA
AACCCACAGA AGGGGATGGG AAATAAAGAA AATTCTCTCA AGACTTCTCC AGGCCCATGT
CACTGGTCAG [missing] ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC
AGTACTTTAT AACCAAAGCA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT
TTAGTTTTGC CATCACATTG TCACCCAGAC CTCACCTAGC CCCAAGTAAT CGGGCGCCCC
GAAGAGGGAG ACAGAGATGT GCCAGAGTTG ACCCAGTGTG CGGATGATAA CTACTGACGA
AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC
TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG
GTGAGAATGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT
GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC
TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA
AAAAAAAAA



Seq ID NO: C192 DNA Sequence

Nucleic Acid Accession #: NM_006549.2

Coding sequence: 824..2590

1 11 21 31 41 51

| | | | | |

GAGCCTGGGG AGGTCGAGGG TGCAGCGAGC CGTGATCGTG CTACTGCACT CCAGCCTGGG 60 

CAACACAGAG AGACCCTGTC TCAAAACAAA CAAACAAACA AACAAACAAA CAAAACAAAG 120 

AAAAAAAAAT GGGAGTGGGC CGGGCGCGGT GACTCACACC TGTAATCCCA GCACTTTCGG 180 

AGGCCAAGGC GGGTGGATCA CGAGGTCAGG AATTCAAGAT TAGCCTGGAC AACATGGTGA 240 

AACCCCATCT CTACGAAAAA TACAAAAATT AGCCAAGTAT GGTGGCCGGC GCCTGTAATC 300 

CCAGCTACTC GGGAGACTGA GGCAGAGAAC TGCTTGAACC TGGGAGGCAG AGGTTGCAGT 360 

GATCCGAGAT CGCGTCACTG CACTCCAGCG TGGGCGACAG AGCGAGACTC CGTTTCAGAA 420 

AAGAAAAAAA AAAAAAAAAA AAAAAGGGAG TCGGGGTGGA GCTCTCATTG GCTCGTTGCA 480 

TGTGAGTGTC CCTACGGCCT AGAAATACCA GAGAAGCACA TCGGAACGGG CTGGAAATCC 540 

ACCCAGTTAA CTAGAGGGCT TTGAACCTTT TATTAACTTG GAGGTTGACT CTCCTGTCAA 600 

CTCGATTCCC TTTTGGCTGT TTGGCAGGGT CAGTGAGACA TCCCCTGGGT CGCTCGACCC 660 

CGTAGGACGG TTCAGGGAGC CCTCCAGGTC TTCGTTTCTC CTCTTCCCCG CACAGTGCTG 720

TTATCCAGCT GGGGGATCCA ACGCACACTT AAGGCTCCAG CAAAGTGGCT CCGCTGCCGG 780 

ATGGGAGTGC CCCAGTGTGC TGGATGAAGC TGGCGCATGC ACCATGTCAT CATGTGTCTC 840 

TAGCCAGCCC AGCAGCAACC GGGCCGCCCC CCAGGATGAG CTGGGGGGCA GGGGCAGCAG 900 

CAGCAGCGAA AGCCAGAAGC CCTGTGAGGC CCTGCGGGGC CTCTCATCCT TGAGCATCCA 960 

CCTGGGCATG GAGTCCTTCA TTGTGGTCAC CGAGTGTGAG CCGGGCTGTG CTGTGGACCT 1020 

CGGCTTGGCG CGGGACCGGC CCCTGGAGGC CGATGGCCAA GAGGTCCCCC TTGACTCCTC 1080 

CGGGTCCCAG GCCCGGCCCC ACCTCTCCGG TCGCAAGCTG TCTCTGCAAG AGCGGTCCCA 1140 

GGGTGGGCTG GCAGCCGGTG GCAGCCTGGA CATGAAOGGA CGCTGCATCT GCCCGTCCCT 1200 

GCCCTACTCA CCCGTCAGCT CCCCGCAGTC CTCGCCTCGG CTGCCCCGGC GGCCGACAGT 1260 

GGAGTCTCAC CACGTCTCCA TCACGGGTAT GCAGGACTGT GTGCAGCTGA ATCAGTATAC 1320 

CCTGAAGGAT GAAATTGGAA AGGGCTCCTA TGGTGTCGTC AAGTTGGCCT ACAATGAAAA 1380 

TGACAATACC TACTATGCAA TGAAGGTGCT GTCCAAAAAG AAGCTGATCC GGCAGGCCGG 1440

CTTTCCACGT CGCCCTCCAC CCCGAGGCAC CCGGCCAGCT CCTGGAGGCT GCATCCAGCC 1500 

CAGGGGCCCC ATTGAGCAGG TGTACCAGGA AATTGCCATC CTCAAGAAGC TGGACCACCC 1560 

CAATGTGGTG AAGCTGGTGG AGGTCCTGGA TGACCCCAAT GAGGACCATC TGTACATGGT 1620 

GTTCGAACTG GTCAACCAAG GGCCCGTGAT GGAAGTGCCC ACCCTCAAAC CACTCTCTGA 1680 

AGACCAGGCC CGTTTCTACT TCCAGGATCT GATCAAAGGC ATCGAGTACT TACACTACCA 1740 

GAAGATCATC CACCGTGACA TCAAACCTTC CAACCTCCTG GTCGGAGAAG ATGGGCACAT 1800 

CAAGATCGCT GACTTTGGTG TGAGCAATGA ATTCAAGGGC AGTGACGCGC TCCTCTCCAA 1860

CACCGTGGGC ACGCCCGCCT TCATGGCACC CGAGTCGCTC TCTGAGACCC GCAAGATCTT 1920 

CTCTGGGAAG GCCTTGGATG TTTGGGCCAT GGGTGTGACA CTATACTGCT TTGTCTTTGG 1980 

CCAGTGCCCA TTCATGGACG AGCGGATCAT GTGTTTACAC AGTAAGATCA AGAGTCAGGC 2040 

CCTGGAATTT CCAGACCAGC CCGACATAGC TGAGGACTTG AAGGACCTGA TCACCCGTAT 2100 

GCTGGACAAG AACCCCGAGT CGAGGATCGT GGTGCCGGAA ATCAAGCTGC ACCCCTGGGT 2160 
CACGAGGCAT GGGGCGGAGC CGTTGCCGTC GGAGGATGAG AACTGCACGC TGGTCGAAGT 2220 
GACTGAAGAG GAGGTCGAGA ACTCAGTCAA ACACATTCCC AGCTTGGCAA CCGTGATCCT 2280 
GGTGAAGACC ATGATACGTA AACGCTCCTT TGGGAACCCA TTCGAGGGCA GCCGGCGGGA 2340 
GGAACGCTCA CTGTCAGCGC CTGGAAACTT GCTCACCAAA AAACCAACCA GGGAATGTGA 2400 
GTCCCTGTCT GAGCTCAAGG AAGCAAGGCA GCGAAGACAA CCTCCAGGGC ACCGACCCGC 2460 
CCCCCGTGGG GGAGGAGGAA GTGCTCTTGT GAGAGGCAGT CCCTGCGTGG AAAGTTGCTG 2520
GGCCCCCGCC CCCGGCTCCC CCGCACGCAT GCATCCACTG CGGCCGGAGG AGGCCATGGA
GCCCGAGTAG CTGCCTGGAT CGCTCGACCT GGCATGCGCG CCGCGTCGCC TCTGGGGGGC 
TGCTGCACCG CGTTTCCATA GCAGCATGTC CTACGGAAAC CCAGCACGTG TGTAGAGCCT 
CGATCGTCAT CTCTGGTTAT TTGTTTTTTC CTTTGTTGTT TTAAAGGGGA CAAAAAAAAA 
AAAAAAAGGA CTTGACTCCA TGACGTCGAC CGTGGCCGCT GGCTGGCTGG ACAGGCGGGT 
GTGAGGAGTT GCAGACCCAA ACCCACGTGC ATTTTGGGAA ATTGCTTTTT AAAAGATTTT 
TATGCCAAAA ATCCTTCATT GTGATTTTCA GAACCACGTC AGATATACCA AGTGACTGTG 
TGTGGGGTTT GACAACTGTG GAAAGGCGAG CAGAAAACTC CGGGGGTCTG AGGCCATGGA 
GGTGGTTGCT GCATTTGAGA GGGAGTAGGG GGCTAGATGT GGCTCCTAGT GCAAACCGGA 
AACCATGGCA CCTTCCAGAG CCGTGGTCTC AAGGAGTCAG AGCAGGGCTG GCCCTCAGTA 
GCTGCAGGGA GCTTTGATGC AACTTATTTG TAAGAAGGAT TTTTAAATTT TTTATGGGTA 
GAATTGTAGT CAGGAAAACA GAAAGGGCTT GAAATTTAAT AAGTGCTGCT GGAAGGGGAT 
TTTCCAAGCC TGGAAGGGTA TTCAGCAGCT GTGGTGGGGA AACATTTCTC CTGAAAGACT 
GAACGTGTTT CTTCATGACA GCTGCTCAAA GCAGGTTTCT GAGATAGCTG ACCGAGCTCT 
GGTAAATCTC TTTGTCAAAT TACGAAAACT TCAGGGTGAA ATCCTATGCT TTCATGTACA 
TTACATGGCT TAAGATTAAA CAAAAACATT TTTCAAGTCT CTAACTAGAG TGAACTCTAG 
AGCACAGTAG TTCAGAAACT ATTTAGAGCT TCCAGGATAT ATTTCACAGC TTCAGGCATG 
TGATCAGTTA GAGCCGATGA AACCTATGCC CGCCTGTATA TATATTAGCA GCTTAGCTAG 
TTCATAACCT GTATATTCTA AAGACTGCTA AGGTTTTGTT TTCATTTTAA ATCCTAGCTG 
ATTGTTGTGG TCAATGAAAT ACCCAGTTTC TGAAGGGCCA GGTGGGAAAT GCTTTCACTG 
GACCAACACA CAAATGATCA TCCTGAGGAT CTGAGCTTCC CTAGACTCCA CACAATAACC 
TTGGGGCACC CTTTTAGAGA AGACTGTTGA AACCCACAGC ACTOGTTGGG GTATGAGGAA 
ACCAGGGCTT GGCACAGGAA GTTCCCCTTT GTAGCTAAAA GTCCAGAAAG AAAGGGTTCA 
TCTTTTTGAC TTCCAACTGA TATTGGGAAG TTTGGTTGAG GTTCAAGTGT GACTCCTTCC 
AGAGCCACAG GTAGGGGAGT GTGAAGTTGA GGGGGAGGAA AGCTGGAAGG ACTCTGCCTT 
GGGAGATTCC CAGCTCTGCT TTCCAGCGCT TGGTGGAATC TGGGCTGGGG AAAGACGGCA 
CCGGGAAACT CTGCTTCCCC ATTGTTTCCA TCTGATCAGC TGTGGTGTGA GGACTTCTCA 
GACAAAGGCA AGGCCTCGTG CCCCTGCCCA GCCCATTCAT GGAGCCCTGG GCCTTCTTGG 
CTTCCATAGA TCCTAAGCTC TTGACTGTAG TTTAGCCAGA CTTGTTTTGC TATCTTATAA 
GCAGTTCAGA ATTAGGGAAT GCTGGTTTTG AAGAGCAAAG GACAGGTAGT CTAGAGAGGG 
TCGTCTGGCC TGCTTGCTGG GTCTTTGTAA CCCAGCACTT CCTCTTGCCC TCCTGGCTTT 
ATGTTTATGG GAGAGGACTC AATAGCTCCA CCCCTTCTGG CACCAGATGG GGCTTGGTTA 
GTTTGCAATA AGCACCTTGC AGAGGTTAAA GCCAGCGGGT CCCTAGTCTT AGGCCCAGCC 
TGCTTGTGTG GGCTCTGGCC TGGCCTGGTG GCTGGCCCAG GGGGCAGCAG TGCTTAGAGC 
TTCTGCAGGG CTTCTCTTGT TTACACAGCT GCATCAGACA ATGCCATTTC TCCCCACCAC 
GGAACCTTCC ATCTAAGATT TCTTCCAGGG AATGCCAGCA ATCAGGCAGC ACCCAGCTGT 
GGGGGCAGTG GGGTGGGGGA GACCCACATT GATGACTTTT TTTTTTTCTT TTAATGAAGA 
AACACCAAAG AAAGCTGTGG AAAGGACCTG CCCCACATGA AAAGGATAAG CCAAGATGGC 
TGTAAACACA GAGCATTTGA GCTGCCACTC TTGGAGCACA TTGATTTTTC AAAAGCCAGC 
TCTGTCAGGA AAGGAGGTGC TGTTATGACA GCTCTTCCAG TGGGCAAAGA GGACGCCCAT 
AATTTCTTCC ATTGCTAGCT CATCTGTGGG ACCAATTTGG TGTAAGCAAC CTGTGGCCTG 
CACTTGTGGC CTCGAAGGAA GCACAAACCC TCCATCCACT TCCCATTTCC TCTGCCCTTT 
TCCACCTCCC CCTTCCATCC CACCAGCTGC CAGTGGCTCC CAGAAAGCCT TATTGAGCCC 
CTTGTTGACA CTTGGGGCTG CGGAGGCCTT TCCCTACTGG TCTGGCCTTT CCTGAGAGGC 
AGGTCTTCCG TCCTCAGAGC CTTTCTGGAA CAAGGAGAAT GCCTGTGCAG GTGGACACAC 
AGGCCTGGCC TGTCGCTCTC ACTTGTCTTC CAGCGGGGAG CTTCACGTTG CCGAGTGGAA 
GAACCATGAC CTCCACTTGC TTCCAAGGTG CTAGGGAAGT TTCAGGGTAC GCTGGTTCCC 
CTCTCCAGCT GGAGGCCGAG TTTCTGGGGA CTGCAGATTT TTCTACTCTG TGATCGATTC 
AATGCCCGAT GCTTCTGTTT CATTCCCGAC CCTTTCTACT ATGCATTTTC CTTTTATCAG 
GTGTATAAAG TTAAATACTG TGTATTTATC ACTAAAAAGT ACATGAACTT AAGAGACAAC 
TAAGCCTTTC GTGTTTTTCC ACAGGTGTTT AAGCTTCTCT GTACAGTTGA AATAAACAGA 
CAGCAAAATG GTAAAAAAAA AAAAAAAAAA A 

Seq ID NO: C193 DNA Sequence 
Nucleic Acid Accession #: NM_018646 
Coding sequence: 217..2394

1 11 21 31 41 51

| | | | | |

GCTCTGCCAA GTGTAACAAA CTCACAGCCC TCTCCAAACT GGCTGGGGCT GCTGGGAGAC 60 

TCCCAAGGAA CTCGTCAGGA AGGCAGGAGA CAGGAGACGG GACCTCTACA GGGAGACGGT 120 

GGGCCGGCCC TTGGGGGGGC TGATGTGGCC CCAAGGCTGA GTCCCGTCAG GGTCTGGCCT 180 

CGGCCTCAGG CCCCCAAGGA GCCGGCCCTA CACCCCATGG GTTTGTCACT GCCCAAGGAG 240 

AAAGGGCTAA TTCTCTGCCT ATGGAGCAAG TTCTGCAGAT GGTTCCAGAG ACGGGAGTCC 300 

TGGGCCCAGA GCCGAGATGA GCAGAACCTG CTGCAGCAGA AGAGGATCTG GGAGTCTCCT 360 

CTCCTTCTAG CTGCCAAAGA TAATGATGTC CAGGCCCTGA ACAAGTTGCT CAAGTATGAG 420 

GATTGCAAGG TGCACCAGAG AGGAGCCATG GGGGAAACAG CGCTACACAT AGCAGCCCTC 480 

TATGACAACC TGGAGGCCGC CATGGTGCTG ATGGAGGCTG CCCCGGAGCT GGTCTTTGAG 540 

CCCATGACAT CTGAGCTCTA TGAGGGTCAG ACTGCACTGC ACATCGCTGT TGTGAACCAG 600 

AACATGAACC TGGTGCGAGC CCTGCTTGCC CGCAGGGCCA GTGTCTCTGC CAGAGCCACA 660 

GGCACTGCCT TCCGCCGTAG TCCCTGCAAC CTCATCTACT TTGGGGAGCA CCCTTTGTCC 720 

TTTGCTGCCT GTGTGAACAG TGAGGAGATC GTGCGGCTGC TCATTGAGCA TGGAGCTGAC 780 

ATCCGGGCCC AGGACTCCCT GGGAAACACA GTGTTACACA TCCTCATCCT CCAGCCCAAC 840 

AAAACCTTTG CCTGCCAGAT GTACAACCTG TTGCTGTCCT ACGACAGACA TGGGGACCAC 900 

CTGCAGCCCC TGGACCTCGT GCCCAATCAC CAGGGTCTCA CCCCTTTCAA GCTGGCTGGA 960 

GTGGAGGGTA ACACTGTGAT GTTTCAGCAC CTGATGCAGA AGCGGAAGCA CACCCAGTGG 1020 

ACGTATGGAC CACTGACCTC GACTCTCTAT GACCTCACAG AGATCGACTC CTCAGGGGAT 1080 

GAGCAGTCCC TGCTGGAACT TATCATCACC ACCAAGAAGC GGGAGGCTCG CCAGATCCTG 1140 

GACCAGACGC CGGTGAAGGA GCTGGTGAGC CTCAAGTGGA AGCGGTACGG GCGGCCGTAC 1200 

TTCTGCATGC TGGGTGCCAT ATATCTGCTG TACATCATCT GCTTCACCAT GTGCTGCATC 1260 

TACCGCCCCC TCAAGCCCAG GACCAATAAC CGCACGAGCC CCCGGGACAA CACCCTCTTA 1320 

CAGCAGAAGC TACTTCAGGA AGCCTACATG ACCCCTAAGG ACGATATCCG GCTGGTCGGG 1380 

GAGCTGGTGA CTGTCATTGG GGCTATCATC ATCCTGCTGG TAGAGGTTCC AGACATCTTC 1440 

AGAATGGGGG TCACTCGCTT CTTTGGACAG ACCATCCTTG GGGGCCCATT CCATGTCCTC 1500

ATCATCACCT ATGCCTTCAT GGTGCTGGTG ACCATGGTGA TGCGGCTCAT CAGTGCCAGC 1560
GGGGAGGTGG TACCCATGTC CTTTGCACTC GTGCTGGGCT GGTGCAACGT CATGTACTTC 1620
GCCCGAGGAT TCCAGATGCT AGGCCCCTTC ACCATCATGA TTCAGAAGAT GATTTTTGGC 1680
GACCTGATGC GATTCTGCTG GCTGATGGCT GTGGTCATCC TGGGCTTTGC TTCAGCCTTC 1740
TATATCATCT TCCAGACAGA GGACCCCGAG GAGCTAGGCC ACTTCTACGA CTACCCCATG 1800
GCCCTGTTCA GCACCTTCGA GCTGTTCCTT ACCATCATCG ATGGCCCAGC CAACTACAAC 1860
GTGGACCTGC CCTTCATGTA CAGCATCACC TATGCTGCCT TTGCCATCAT CGCCACACTG 1920
CTCATGCTCA ACCTCCTCAT TGCCATGATG GGCGACACTC ACTGGCGAGT GGCCCATGAG 1980
CGGGATGAGC TGTGGAGGGC CCAGATTGTG GCCACCACGG TGATGCTGGA GCGGAAGCTG 2040
CCTCGCTGCC TGTGGCCTCG CTCCGGGATC TGCGGACGGG AGTATGGCCT GGGAGACCGC 2100
TGGTTCCTGC GGGTGGAAGA CAGGCAAGAT CTCAACCGGC AGCGGATCCA ACGCTACGCA 2160
CAGGCCTTCC ACACCCGGGG CTCTGAGGAT TTGGACAAAG ACTCAGTGGA AAAACTAGAG 2220
CTGGGCTGTC CCTTCAGCCC CCACCTGTCC CTTCCTATGC CCTCAGTGTC TCGAAGTACC 2280
TCCOGCAGCA GTGCCAATTG GGAAAGGCTT CGGCAAGGGA CCCTGAGGAG AGACCTGCGT 2340
GGGATAATCA ACAGGGGTCT GGAGGACGGG GAGAGCTGGG AATATCAGAT CTGACTGCGT 2400
GTTCTCACTT CGCTTCCTGG AACTTGCTCT CATTTTCCTG GGTGCATCAA ACAAAACAAA 2460
AACCAAACAC CGAGAGGTCT CATCTCCCAG GCCCCAGGGG AGAAAGAGGA GTAGCATGAA 2520
CGCCAAGGAA TGTACGTTGA GAATCACTGC TCCAGGCCTG CATTACTCCT TCAGCTCTGG 2580
GGCAGAGGAA GCCCAGCCCA AGCACGGGGC TGGCAGGGCG TGAGGAACTC TCCTGTGGCC 2640
TGCTCATCAC CCTTCCGACA GGAGCACTGC ATGTCAGAGC ACTTTAAAAA CAGGCCAGCC 2700
TGCTTGGGCG CTCGGTCTCC ACCCCAGGGT CATAAGTGGG GAGAGAGCCC TTCCCAGGGC 2760
ACCCAGGCAG GTGCAGGGAA GTGCAGAGCT TGTGGAAAGC GTGTGAGTGA GGGAGACAGG 2820
AACGGCTCTG GGGGTGGGAA GTGGGGCTAG GTCTTGCCAA CTCCATCTTC AATAAAGTCG 2880
TTTTCGGATC CCTGAAAAAA AAAAAAAAAA 2918



Seq ID NO: C194 DNA Sequence 

Nucleic Acid Accession #: NM_021910.1

Coding sequence: 260..601



1 
I 

GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCCG 
CCCACTCCAC 
CCCGGGCCAG 
CTTTCCTGTC 
GCACAGCCTC 
CATCGTCATG 
TCCCCTGACC 
GTCCGGTCAC 
ATGAGGACAG 
TGCACAGAAA 
GGCAGGGCCT 
AATGATTGTG 
TTGGGGCACC 
CTGCTTGTTG 
CGCCCTCTGC 
AAGGAATGAA 
CATTCCAGGA 
ATAATAGTAA 
TTTCCAAATA 
AACATACAAT 
CCCAATACCT 
CTCAAAACAC 
ACAATAAACT 



11 
I 

ACTGCCAGCA 
CTTAGGGCGT 
GAACCTCTGC 
GGTGCAGCTG 
CGCTCTGACA 
CTGGACGCCA 
CAGGTTGGCG 
AGTGAGTGGA 
ACTGAGCTCT 
CATCCAGGGG 
ACCAGCTGAA 
CTTGAACTCC 
CATCTCACCT 
CCTCTGCCCA 
ATGAGAAGGT 
GGAAAAGCCC 
CTCCTCAGCC 
AGAGAGCTCT 
AAACTGGGAC 
GCCCCTGTGA 
CCCAAATATC 
GACCACATAG 
AATAAGGCAT 
CCAAAAATAT 
CGTGACTTGT 



21 
|

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
CGGCTTATCT 
TGCAGAAGGT 
ATGACCTAGA 
GGCTCATCTG 
GGAGCTCGGG 
CCCCAACAGG 
AGACTCCACC 
ATTGGGTGGA 
AGGATGGAAT 
CTCGCAAGAG 
AGCAGCCTGG 
TGGCGTGCCC 
ACAGGCCTGT 
ATGGGACCTC 
AACCAGATGG 
ATAGGATCGT 
CCCTCTCACT 
CCTGCAAGCC 
TGTGAGAACT 
GCGAAATGTT 
CTCCTCCAAT 
CCCCTC 



31 
|

CAGGCAAAAC 
GATAAACGCA 
GAACCACACA 
CTCAGCCCAG 
GACCCTGGGC 
AGATAAAAAC 



GGAGGAGGCG 
TGCAAAATGC 
TCTCATCACC 
GGACCGTTCT 
TCTTCCTCCT 
GGTCTCTTTG 
AGACTTCCTA 
TGGAGGCTGA 
TCCCTTGTGG 
ATATGCAATT 
AACACTGGAA 
CCCGCTATGA 
TACCCCGAGA 
CGTTAAATAA 
TCCAACAAGC 
CTCATGAACT 
GTCCTGAGAC 



41 
|

ACATCATCTA 
GGACTCCGCC 
GGCCCGAGTT 
CGAGATGCCA 
CTGCTTGTGT 
AGTCCTTTCT 
CTGTGCGCCA
GGCCGGGGCT 
AAATGCAAGT 
CCCGGCTCAG 
CTGTCCCCAG 
CTGCTGGGAC 
TTCAATTTTT 
TGTGTGCATT 
CACAGAGGCT 
CTTGGGACAT 
TGGGATTTAC 
CATTCCAGTG 
TGGAAGTGTT 
CCTCACTTTA 
TTCCCTATGC 
CTCAAAGTCC 
ACCCCACAAC 
ATGAACCCAA 



51 
|

AGGAAAAGAA 
TGGCAGCCCG 
TCACCCAGTC 
GCCTTCCTGT 
TCCTGGCAGG 
ACTATGACTG 
TGGGCATCAT 
GGGGCTCCCC 
TTGGCCAGAA 
CCCAAAGCTG 
GTCCTGTCTC 
TCCTTTGCAT 
TTTAATCTAA 



GGCACTGAGC 
GGCACAGGCC 
TAGTAGCCAA 
GACCCTGGAC 
CAGACAGTTT 
TTACAAGATC 
TACCCTTAAT 
CTTGAGACTC 
ACGCCTAAAA 
AAAGAGACCC 



Seq ID NO: C195 DNA Sequence

Nucleic Acid Accession #: NM_005971.2

Coding sequence: 176..439



1560 
1620 
1680
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2918 



1 
|

GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCCG 
GAAGGTGACC 
CCTAGAAGAT 
CATCTGCGCT 
ATGCAAGTTT 
AGGCTCAGCC 
GTCCCCAGGT 
GCTGGGACTC 
CAATTTTTTT 
TGTGCATTGG 
CAGAGGCTGG 
TGGGACATGG 
GGATTTACTA 
TTCCAGTGGA 
GAAGTGTTCA 
TCACTTTATT 
CCCTATGCTA 
CAAAGTCCCT 
CCCACAACAC 



11 
|

ACTGCCAGCA 
CTTAGGGCGT 
GAACCTCTGC 
CTGGGCCTGC 
AAAAACAGTC 
GGGGTTCTGT 
GGCCAGAAGT 
CAAAGCTGAT 
CCTGTCTCTG 
CTTTGCATGG 
TAATCTAAAA 
GGTGGGGCTT 
CACTGAGCCT 
CACAGGCCCG 
GTAGCCAAAA 
CCCTGGACCA 
GACAGTTTAT 
ACAAGATCTT 
CCCTTAATAA 
TGAGACTCCC 
GCCTAAAACT 



21 
|

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
TTGTGTTCCT 
CTTTCTACTA 
GCGCCATGGG 
CCGGTCACCA 
GAGGACAGAC 
CACAGAAACT 
CAGGGCCTCA 
TGATTGTGCC 
GGGGCACCAT 
GCTTGTTGGG 
CCCTCTGCCT 
GGAATGAAAG 
TTCCAGGAAA 
AATAGTAAGC 
TCCAAATACC 
CATACAATGA 
CAATACCTAA 
CAAAACACCC 



31 
|

CAGGCAAAAC 
GATAAACGCA 
GAACCACACA 
GGCAGGCTTT 
TGACTGGCAC 
CATCATCATC 
TCCAGGGGAG 
CAGCTGAAAT 
TGAACTCCAG 
TCTCACCTCT 
TCTGCCCAAG 
GAGAAGGTTG 
AAAAGCCCAC 
CCTCAGCCAT 
AGAGCTCTAA 
ACTGGGACAT 
CCCTGTGACC 
CAAATATCCC 
CCACATAGTG 
TAAGGCATGC 
AAAAATATCT 



41
|

ACATCATCTA 
GGACTCCGCC 
GGCCAGCGCT 
CCTGTCCTGG 
AGCCTCCAGG 
GTCATGAGTG 
ACTCCACCTC 
TGGGTGGAGG 
GATGGAATTC 
CGCAAGAGGG 
CAGCCTGGAG 
GCGTGCCCTG 
AGGCCTGTTC 
GGGACCTCAT 
CCAGATGGAA 
AGGATCGTCC 
CTCTCACTTA 
TGCAAGCCCG 
TGAGAACTTC 
GAAATGTTCT 
CCTCCAATGT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1466 



51 
|

AGGAAAAGAA 
TGGCAGCCCG 
CTGACATGCA 
ACGCCAATGA 
TTGGCGGGCT 
CAAAATGCAA 
TCATCACCCC 
ACCGTTCTCT 
TTCCTCCTCT 
TCTCTTTGTT 
ACTTCCTATG 
GAGGCTGACA 
CCTTGTGGCT 
ATGCAATTTG 
CACTGGAACA 
CGCTATGATG 
CCCCGAGACC 
TTAAATAATT 
CAACAAGCCT 
CATGAACTAC 
CCTGAGACAT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260
Seq ID NO: C196 DNA Sequence

Nucleic Acid Accession #: NM_004961.2

Coding sequence: 55..1575



1 11 21 31 41 51

| | | | | |

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 120 

CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCGTGATG TTGTCTATGG CCCCCAGCCC 180 

CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 240 

AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 300 

GACCACAAAC TGCGCCCTGG CATTGGAGAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 360 

GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 420 

TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 480 

AATGGCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT TTTTTAGGAA TTCTAAGAGG 540 

ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 600 

GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 660

CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720 

ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 780 

TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 840

GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 900 

CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960

ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 1020 

TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 1080 

TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 1140

TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 1200 

AATAGCCGTG CCCATGCCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCCG CCAACATCAG 1260 

GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 1320 

TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 1380 

AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT 1440 

TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 1500 

TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 1560 

TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 1620 

GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCGACTAG 1680 

AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 1740 

CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 1800 

TGGGCCACCT CCCTCTTCTT CAAGGAGCAT CCGTGATGCT CAGTGTTCAA AACCACAGCC 1860

ACTTAGTGAT CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 1920 

AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 1980 

CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 2040 

GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 2100 

TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAGATAGA 2160 

CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 2220 

TTCCTCTCTC TCTGCTGCTG TGACATCTCC CTCTCCTTGC TGGCTCCATC TTTCGTCTGC 2280 

ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 2340 

TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCTTTCTGTA 2400 

ACTTTCCCAG TGACTTCCCC TAGCCCTGAC CCAGGCACTA GGCCTTGGTG ACTTCCTGGG 2460
GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 2520 
ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 2580 
CACTGTTATA CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 
GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 2700 
GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 2760 
GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 2820 
ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 2880 
CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 2940 
GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 3000 
GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 3060 
TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 3120 
GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 3168 

Seq ID NO: C197 DNA Sequence 

Nucleic Acid Accession #: NM_021984.1

Coding sequence: 572..1753



GCCAGAGCGT 
TCCAAAGTTC 
CAGAGAAGTG 
GTGTAAAGAA 
CACTGCCTCC 
TCAGACTGAA 
GCCTCTGGAA 
CAGAGTTGGC 
CCACAAACTG 
CAACAGCCTT 
CCAGACCTGG 
TGGCAATGTG 
CCACGAGCAT 
GTTGTACACA 
AATGGATTCT 
GATCTACAAG 



11 
|

GAGCCGCGAC 
TTCCAGTCCT 
CTCAAATCAT 
AGCCAAATCA 
CAGCAAAGGC 
TCAAAGAATG 
AATCAGCTCC 
AAACTGCCAG 
CGCCCTGGCA 
GGTCCTCTCT 
TACGACGAAC 
GTGAGCCAGC 
GAGATCACCA 
ATTAGGATGA 
CACTCTTGCC 
TGGGAAAATT 



21 
|

CTCCGCGCAG 
CCTAGGCATC 
AAGTGTACAG 
AGGACCCGAA 
AGCACTATCC 
AAGCCTCTTC 
TCTCTGAGGA 
AAGCCTCTCG 
TTGGAGAGAA 
CTATCCTAGA 
GCCTCTGTTA 
TATGGATCCC 
TGCCCAACCA 
CCATTGATGC 
CTCTATCTTT 
TCAAGCTTGA 



31 
|

GTGGTCGCGC 
TTATTGATCC 
CTGATGAGTT 
TGTGAGCAGG 
GGACTTCTAA 
CCGTGATGTT 
AACAAAGTCA 
CATCCTGAAC 
GCCCACTGTG 
CATGGAATAC 
CAACGACACC 
GGACACCTTT 
GATGGTCCGC 
CGGATGCTCA 
CTCTAGCTTT 
AATCAATGAG 



41 
|

CGGTCTCCGC 
TCCAGTCGAG 
GTCAAAAAAT 
ACCTCAGAAG 
CACCATCGGG 
GTCTATGGCC 
ACTGAGACTG 
ACTATCCTGA 
GTCACTGTTG 
ACCATTGACA 
TTTGAGTCTC 
TTTAGGAATT 
ATCTACAAGG 
CTCCACATGC 
TCCTATCCTG 
AAGAACTCCT 



51 
|

GGAAATGTTG 
AACATGTATA 
GACCACAGCG 
CCCCCTTTGT 
TCGAGGGACC 
CCCAGCCCCA
AGACTGGGAG 
GTAATTATGA 
AGATCTCCGT 
TCATCTTCTC 
TTGTTCTGAA 
CTAAGAGGAC 
ATGGCAAGGT 
TCAGATTTCC 
AGAATGAGAT 
GGAAGCTCTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840

900 

960 
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CCAGTTGGAT 
CTTCATGGTC 
AAACTATGTC 
AGAGTCTGCT 
GGGCACCTTT 
TATCGCCATC
CCTGATCTAC 
TAGCCGTGCC 
AGCTTTTGTG 
CTCAGCCCAG 
GCTGGCCTGC 
TGAGGGCAGT 
CTCGAGAGTT 
CCTTAACTTG 
TCCAAGCCCC 
TTTTTCCTGC 
CCCCTACCTG 
GGCCACCTCC 
TTAGTGATCA 
TGCTGACCAC 
TTCGGCCCAG 
CACCTCATTA 
AGATTATTAT 
CTGGCATTAT 
CCTCTCTCTC 
TACCAATTCA 
CTCCCTGCTT 
TTTCCCAGTG 
CAAGAAACTA 
CCAGGGCACA 
CTGTTATACC 
ATGGCACTGG 
TAGCCTTGTG 
GTCACAGATT 
CTTCTAGACC 
CTGCTGGCAC 
CCTGAGGTGC 
ATGAATTTGG 
TTGGGGGGTG 
AAATATGTAA 



TTTACAGGAG 
ATGACGATTT 
CCTTCTTCCG 
CCAGCCCGGA 
TCTCGTAAGA 
TGCTTCGTCT 
AACCAGACAA 
CATGCCCGTA 
TGCCAGATTG 
CAGCCCCCTA 
TGTGAGTGGT 
ACCTGGCAGC 
GTTTTCCCAG 
TAGGTACCAG 
TTGCCAAGGG 
CCCATTCCCC 
GCCCATTCAC 
CTCTTCTTCA 
GCTCCCTAAA 
CAGACAATTA 
TTCTGGCCTC 
AGATGCTGGG 
GTTCTCAGTT 
CCCTTTAGGA 
TGCTGCTGTG 
ATGCCCTTCA 
TATATGCCAC 
ACTTCCCCTA 
AGGAAACTCG 
CTGTCGGAGT 
CGGGGCACTC 
AACTTTGGCA 
ACATCTTTAG 
TCTGTGGGAC 
ACATGATAGG 
ACCAGTGGCA 
TCAGACTGCC 
ACATGCCCCA 
GATAGGGTGG 
ATAAATATAT 



TGAGCAACAA 
TCTTCAATGT 
TGACCACGAT 
CCTCTCTAGG 
ATTTCCCGCG 
TCTGCTTCTG 
AAGCCCATGC 
CCCGTGCACG 
TCACCACTGA 
GCCCAGGTAG 
GCAAGCGTTT 
AGGCCCGCCT 
TGACTTTCTT 
CTGGTACCCT 
AGTTGGGGGA 
AAACAGAAGC 
TGAGTTTTCT 
AGGAGCATCC 
ACCATGCCTA 
CTGCATTTTT 
AGCCTCAAAG 
CAGCAGTATA 
CTCTCTCCCT 
AGAGGGGGGG 
ACATCTCCCT 
TCCAATGGGT 
CCTCTTCCTT 
GCCCTGACCC 
GCTTTGCAAC 
TCTATCACTT 
TAACCATCAC 
AAGCACTTTT 
GGCAGGATTC 
TGTGGATCTC 
GCTAGACAGC 
AGGCCCAGAA 
CCCAAGATCA 
ATGCTTCTAT 
GGTCTCCATC 
CAGCAAAGC 



AACTGAAATA 
GAGCAGGCGG 
GCTCTCCTGG 
GATCACCTCT 
TGTCTCCTAT 
CGCTCTGTTG 
TTCTCCTAAA 
TTCCCGAGCC 
GGGAAGTGAT 
CCCTGAGGGT 
TAAGAAGTAC 
CTGCATCCAT 
CTTCTTCAAT 
GTGGGGCAAC 
AAGCAGCAGC 
TTGCAGAGGG 
CAGCAGACCA 
GTGATGCTCA 
AGTACAGGCG 
CCAGAAGCCC 
TGCACCGACT 
ACAGGAGGAA 
GCTACCCCTT 
GCAGCAAGAG 
CTCCTTGCTG 
ATCTATTTTT 
CTCTTTGACC 
AGGCACTAGG 
AGGCATTACT 
GCTTGACCCC 
AATCAATCAA 
GACAAGTTGT 
TTATCCCCAT 
ACTGGAAGCT 
TCAGTTCACC 
TGGCGACCTC
AATCTCTCCT 
ATGCTAAGTG 
TACTTTTTGT 



ATCACAACCC 
TTTGGCTATG 
GTTTCCTTTT 
GTTCTGACCA 
ATCACAGCCT 
GAGTTTGCTG 
CTCCGCCATC 
TGTGCCCGCC 
GGAGAGGAGC 
CCCCGCAGCC 
TTCTGCATGG 
GTCTACCGCC 
GTGCTCTACT 
CTCTCCAGTT
AGCAGCAGGA 
TTTGTCTTTG 
TTTCAAATTA 
GTGTTCAAAA 
GATTAGCTAT 
ACTATTGCCT 
AGTTGCTTGC 
GAGATCCCTC 
TCTCTGCAGA 
AGCCTATTTG 
GCTCCATCTT 
GTGTGTGATT 
CCTGTGACTC 
CCTTGGTGAC 
CGCCATTGAT 
TGGACCCATA 
TCAAATTCCC 
GTCTGATTGG 
TTTGCAGATG 
ATCCAAGAGC 
ATGATTCTCT 
TCTTTAGCTC 
GGCTGTAGTA 
AAATCTGTGT 
CACCATCATC 



Seq ID NO: C198 DNA Sequence

Nucleic Acid Accession #: NM_021987.1

Coding sequence: 572..1657



CAGTTGGTGA 
TTGCCTTTCA 
GGATCAAGAC 
TGACCACGTT 
TGGATTTCTA 
TGCTCAACTT 
CTCGTATCAA 
AACATCAGGA 
GCCCGTCTTG 
TCTGCTCCAA 
TCCCCGATTG 
TGGATAACTA 
GGCTTGTTTG 
CCCCAGGAGG 
GCGACTAGAG 
CTGCCCCTCT 
TTAATAAATG 
CCACAGCCAC 
CTTCCAACAA 
TTGCAGTGCT 
CTATACCTGG 
TCCTTTGGTC 
TAGATAGACA 
GGACAGCATT 
TGGTCTGCAC 
ATAGTAACTA 
TTTCTGTAAC 
TTCCTGGGGC 
TGGTGCCCAC 
AACCAGTCCA 
TTAAATTTGT 
AGCTTCATGA 
AAAACCCTGA 
CCACTGTCAC 
TCTGTCACCT 
AATTTCTGGG 
ACCCAGTGGA 
CTGTAATTTG 
TGAAATGGGG 



51 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3329 



1 11 21 31 41 51

| | | | | |

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 660 

CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCCGGATG 720 

CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT TCCGTGACCA CGATGCTCTC 1020 

CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 1200 

GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 1320 

AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 1380 

TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG GTAGCCCTGA 1440 

GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC GTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 
CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 
CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 1740
CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 
AGGGTTTGTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 
ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ATCCGTGATG 1920 
CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG CCTAAGTACA 1980 
GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 
GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 
GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG TATAACAGGA 2160 
GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTGCTACC 2220 




CCTTTCTCTG 
AGAGAGCCTA 
GCTGGCTCCA 
TTTTGTGTGT 
GACCCCTGTG 
AGGCCTTGGT 
ACTCGCCATT 
CCCTGGACCC 
CAATCAAATT 
TGTGTCTGAT 
CATTTTGCAG 
GCTATCCAAG 
ACCATGATTC 
CTCTCTTTAG 
CCTGGCTGTA 
GTGAAATCTG 
TGTCACCATC 



CAGATAGATA 
TTTGGGACAG 
TCTTTCGTCT 
GATTATAGTA 
ACTCTTTCTG 
GACTTCCTGG 
GATTGGTGCC 
ATAAACCAGT 
CCCTTAAATT 
TGGAGCTTCA 
ATGAAAACCC 
AGCCCACTGT 
TCTTCTGTCA 
CTCAATTTCT 
GTAACCCAGT 
TGTCTGTAAT 
ATCTGAAATG 



GACACTGGCA 
CATTCCTCTC 
GCACTACCAA 
ACTACTCCCT 
TAACTTTCCC 
GGCCAAGAAA 
CACCCAGGGC 
CCACTGTTAT 
TGTATGGCAC 
TGATAGCCTT 
TGAGTCACAG 
CACCTTCTAG 
CCTCTGCTGG 
GGGCCTGAGG 
GGAATGAATT 
TTGTTGGGGG 
GGGAAATATG 



TTATCCCTTT 
TCTCTGCTGC 
TTCAATGCCC 
GCTTTATATG 
AGTGACTTCC 
CTAAGGAAAC 
ACACTGTCGG 
ACCCGGGGCA 
TGGAACTTTG 
GTGACATCTT 
ATTTCTGTGG 
ACCACATGAT 
CACACCAGTG 
TGCTCAGACT 
TGGACATGCC 
GTGGATAGGG 
TAAATAAATA 



AGGAAGAGGG 
TGTGACATCT 
TTCATCCAAT 
CCACCCTCTT 
CCTAGCCCTG 
TCGGCTTTGC 
AGTTCTATCA 
CTCTAACCAT 
GCAAAGCACT 
TAGGGCAGGA 
GACTGTGGAT 
AGGGCTAGAC 
GCAAGGCCCA 
GCCCCCAAGA 
CCAATGCTTC 
TGGGGTCTCC 
TATCAGCAAA 



GGGGGCAGCA 
CCCTCTCCTT 
GGGTATCTAT 
CCTTCTCTTT 
ACCAGGCACT 
AACAGGCATT 
CTTGCTTGAC 
CACAATCAAT 
TTTGACAAGT 
TTCTTATCCC 
CTCACTGGAA 
AGCTCAGTTC 
GAATGGCGAC 
TCAAATCTCT 
TATATGCTAA 
ATCTACTTTT 
GC 



2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3232 



Seq ID NO: C199 DNA Sequence 
Nucleic Acid Accession #: NM_021990.
Coding sequence: 1309..2490



1 11 21 31 41 51 

| | | | | |

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 300 

CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 600 

CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 660 

TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 780 

TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 900 

CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAGAGC TTCATTTTTC GTTTGAATGT 1020 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG TGATGTTGTC 1080 

TATGGCCCCC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 1140 

GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 1200 

ATCCTGAGTA ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA GACCTGGTAC GACGAACGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTGAATGG CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCCGCATC 1500 

TACAAGGATG GCAAGGTGTT GTACACAATT AGGATGACCA TTGATGCCGG ATGCTCACTC 1560 

CACATGCTCA GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

TATCCTGAGA ATGAGATGAT CTACAAGTGG GAAAATTTCA AGCTTGAAAT CAATGAGAAG 1680 

AACTCCTGGA AGCTCTTCCA GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

ACAACCCCAG TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 1800 

GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCCGTGA CCACGATGCT CTCCTGGGTT 1860 

TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCCCGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 

ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 

CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160

GCCCGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 

CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 

TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 

TACCGCCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 

TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 
ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 
TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000 
ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 
CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 
CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 
CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 
TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 
GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 
GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 
ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 
CCCATAAACC AGTCCACTGT TATACCCGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 
ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 
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GATTGGAGCT 
CAGATGAAAA 
AAGAGCCCAC 

TAGCTCAATT 
GTAGTAACCC 



ATCATCTGAA 
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TCATGATAGC 
CCCTGAGTCA 
TGTCACCTTC 
TCACCTCTGC 
TCTGGGCCTG 
AGTGGAATGA 
AATTTGTTGG 
ATGGGGAAAT 



CTTGTGACAT 
CAGATTTCTG 
TAGACCACAT 
TGGCACACCA 
AGGTGCTCAG 
ATTTGGACAT 
GGGGTGGATA 
ATGTAAATAA 



CTTTAGGGCA 
TGGGACTGTG 
GATAGGGCTA 
GTGGCAAGGC 
ACTGCCCCCA 
GCCCCAATGC 
GGGTGGGGTC 
ATATATCAGC 



GGATTCTTAT 
GATCTCACTG 
GACAGCTCAG 
CCAGAATGGC 
AGATCAAATC 
TTCTATATGC 
TCCATCTACT 
AAAGC 



Seq ID NO: C200 DNA Sequence 

Nucleic Acid Accession #: NM_021819.1

Coding sequence: 39..1619



41 



CCCCATTTTG 
GAAGCTATCC 
TTCACCATGA 
GACCTCTCTT 
TCTCCTGGCT 
TAAGTGAAAT 
TTTTGTCACC 



51 



3660
3720 
3780 
3840 
3900 
3960 
4020 
4065 



1 11 21 31 

| | | |

TAGGGCGGTC GCGTAATTCG GCACGAGGGC GCTCACGTAT GCCGGCGGTC AGTGGTCCAG 
GTCCCTTATT CTGCCTTCTC CTCCTGCTCC TGGACCCCCA CAGCCCTGAG ACGGGGTGTC 
CTCCTCTACG CAGGTTTGAG TACAAGCTCA GCTTCAAAGG CCCAAGGCTG GCATTGCCTG 
GGGCTGGAAT ACCCTTCTGG AGCCATCATG GAGACGCCAT CCTGGGCCTG GAGGAAGTGC 
GGCTGACGCC ATCCATGAGG AACCGGAGTG GCGCCGTGTG GAGCAGGGCC TCTGTCCCCT 
TCTCTGCCTG GGAAGTAGAG GTGCAAATGA GGGTGACGGG ACTGGGGCGC CGGGGAGCCC 
ACGGCATGGC CGTGTGGTAC ACCCGGGGCA GGGGCCATGT AGGCTCTGTC CTTGGGGGGC 
TGGCTTCGTG GGACGGCATC GGGATCTTCT TTGACTCTCC GGCAGAGGAT ACTCAGGACA 
GTCCTGCCAT CCGTGTGCTG GCCAGCGACG GGCACATCCC CTCTGAGCAG CCTGGGGATG 
GAGCTAGCCA AGGGCTGGGC TCCTGTCATT GGGACTTCCG GAACCGGCCA CACCCCTTCA 
GAGCACGGAT CACCTACTGG GGGCAGAGGC TGCGCATGTC CTTGAACAGT GGCCTCACTC 
CCAGTGATCC AGGTGAGTTC TGTGTGGATG TGGGGCCCCT GCTTTTGGTC CCTGGAGGTT 
TCTTTGGGGT CTCAGCAGCC ACCGGCACCC TGGCAGATGA TCATGATGTC CTGTCCTTCC 
TGACCTTCAG CCTGAGTGAG CCCAGCCCAG AGGTTCCCCC TCAGCCCTTC CTGGAGATGC 
AGCAGCTCCG CCTGGCGAGG CAGCTGGAAG GGCTGTGGGC AAGGCTGGGC TTGGGCACCA 
GGGAGGATGT AACTCCAAAA TCAGACTCTG AAGCTCAAGG AGAAGGGGAA AGGCTCTTTG 
ACCTGGAGGA GACGCTGGGC AGACACCGCC GGATCCTGCA GGCTCTGCGG GGTCTCTCCA 
AGCAGCTGGC CCAGGCTGAG AGACAATGGA AGAAGCAGCT GGGGCCCCCA GGCCAAGCCA 
GGCCTGACGG AGGCTGGGCC CTGGATGCTT CCTGCCAGAT TCCATCCACC CCAGGGAGGG 
GTGGCCACCT CTCCATGTCA CTCAATAAGG ACTCTGCCAA GGTCGGTGCC CTGCTCCATG 
GACAGTGGAC TCTGCTCCAG GCCCTGCAAG AGATGAGGGA TGCAGCTGTC CGCATGGCTG 
CAGAAGCCCA GGTCTCCTAC CTGCCTGTGG GCATTGAGCA TCATTTCTTA GAGCTGGACC 
ACATCCTGGG CCTCCTGCAG GAGGAGCTTC GGGGCCCGGC GAAGGCAGCA GCCAAGGCCC 
CCCGCCCACC TGGCCAGCCC CCAAGGGCCT CCTCGTGCCT GCAGCCTGGC ATCTTCCTGT 
TCTACCTCCT CATTCAGACT GTAGGCTTCT TCGGCTACGT GCACTTCAGG CAGGAGCTGA 
ACAAGAGCCT TCAGGAGTGT CTGTCCACAG GCAGCCTTCC TCTGGGTCCT GCACCACACA 
CCCCCAGGGC CCTGGGGATT CTGAGGAGGC AGCCTCTCCC TGCCAGCATG CCTGCCTGAC 
CCACCTCAGA GCCTGCTTTG CATCACTGGG AAGCAGGCAG TGTCTTGGGT GGGGGCTTGG 
TCAGTATCCT CTCCGTCTGG GTGCCCAGCT CCCACGCACA CCTGAGCTTT CGGCATGCTC 
CCACCTCGTT AAAGGTGATT TCCCTCTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
A 

Seq ID NO: C201 DNA Sequence 

Nucleic Acid Accession #: XM_117036.1 

Coding sequence: 25..495



1
|

AGCCAAGAGA 
CGGCCCCCAC 
ATGGCCTGGC 
GGCCCGTGGG 
GGACCCAGCC 
GATGAGCGGG 
TGTCACCGCC
GCAGCCCCCG 
TGCGGATGGG 
CGGAACTGTG 
TTACACAGTG 
CGGCTGCTCA 



11 
|

GGGGGGACAG 
CCCCACTTCC 
CGGGCACGCT 
GGCTGTTCCG 
CCGAAACGCA 
AACCGGGGGG 
ACAGCGGCCG 
AGACAGGGAG 
GCTGAGGTTC 
CACGTTCACN 
AACAAGAGCA 
GAGCCGGGAG 



21 
|

ACAGATGGAA 
AGCTCTGAGA 
CTGGAGACAC 
ACCACACAGG 
GGGTGTCGCC 
CGGCGCCTAC 
CACCGTGCGT 
GGGCCCAGCG 
TTCCTGAAGG 
GGTGAACCGT 
CGANGGGTGG 
ATGGCCACTC 




GTCGCCAGGG 
ATGAATGCTC 
GTGGGGAGTG 
AGGCCNCTTG 
AGCTCCTTTA 



41 
|

GGGGAGCTCT 
CAGGCCTGCA 
GCAGGGCCNG 
AGGCAGGGCA 
GCCGCGCTCG 
TTACGCCGCG 
CCTGGAAACG 
GAAGTGGGGA 
TGAAACTGTG 
AATCCATCTC 
TCTGCTCCTG 
ACTG 



Seq ID NO: C202 DNA Sequence 

Nucleic Acid Accession #: XM_167803.2

Coding sequence: 1162..1488



1 
|

AACATCATAC 
AGTCACCCAC 
GTCGGCCCCA 
GAGGGCAGCG 
GGAGGAGGGG 
CCACCACCCG 
CTTCGCCCTG 
TCCCGAGCCC 
GGTCTCTTCC 
CGCCCCACCA 
TGGGCCGAGG 
CGGTCTAGGG 



11
|

ATAGTAGGTG 
GTTCCTTGTA 
ACTGCATCCT 
ACCCCGAGGC 
ATCTGCCCTT 
CCTGCCCCGC 
GGGATCCAAC 
CGCCCACAGA 
CAACCAGACC 
CCACCACGTG 
GCCGGGTAGG 
CAATAGGAGA 



21 
|

AATCGTTTTG 
AATCCAAATG 
CCTCTTTGGC 
CTCGTGGACG 
CTCCACTCCT 
GGAAGACCGC 
CAACTTGTAT 
GCTCTTAGCC 
CCACCCAGGC 
CAGGTGGAGC 
CTAGCACGGG 
GCAGAGAATG 



31
|

TAGAGTGAAG 
TTTCTATATT 
GGGCTGGGGA 
TGGGAGAGAG 
CTCTTGGATC 
CCAGTGAGCC 
CGAGTGGGCG 
AATCCTATGC 
ACATTAGCGA 
TCTGGGATGC 
AGGTAAGGGT 
GGGGAACTTG 



41 
|

AATGCTAATG 
GTAGCTTTGC 
GCGGCCCCCA 
TGTGGTGGGA 
CGCCTCGGTT 
AGCCCCCACC 
GGGCAACGGC 
AGAGAGCATC 
CCAGGCTTGG 
TATGTTGGGG 
GGTATGGGAT 
AGGGTGGGGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1801 



51 
|

GGGGAGCAGG 
GGCGGCTGGG 
GGCAGCCGAG 
GGCCCCGGTG 
CGTGTCGGTG 
GTGGAAGGGG 
TCCAGAACAG 
TGGGAATGAG 
AATAGACACA 
AACAGAGCTG 
CAGATCGAGC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

704 



51 
|

TAAAGCAAAT 
TTAAAATGGG 
GCCGGGACGG 
AGTCTTGAGC 
TCCTGTCCCC 
TTCCAGGCGC 
TCCCCATTTT 
TCCTGGCAGG 
GCTTCCCCAG 
CGGCAAGCGG 



GAGGGCACCG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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GAGCCTTGCC 
CCCCATCTGT 
CACCATCACC 
AGTGTAATTA 
AGCTAGCTGA 
GGCTTCGTCT 
TTTCTTGTGA 
GCACCTGCCC 
CCAGGGGGGC 
GAGTGCAAGT 
GCAGCGAAGA 
ACAAAATTTT 
TATTTTTGTG 
CGTGAAAAAG 



ACCATCCCAG 
AAAATGATGG 
TGACCTGGGG 
CCGAATCCTG 
GGCCAACGCA 
GCCATCTTTG 
CCATCTCCCC 
TCTGGCACCA 
TCCCCGGAGC 
GTTTTATAAA 
GAAATGAAGA 
TCCACCACAC 
GGAATGGGCC 
T 



GACTTTGGGC 
TAATAATACT 
GTCAAGGCAG 
ACTGCAAGGC 
GGCCCATCCG 
TGGGTGCCCT 
ATGAAAGTGC 
GATGCCAGGG 
CCCAGGGGTG 
AAATAAAACA 
AAAACTGAAG 
ATACCCTCTA 
CTGCTTTTTG 



AAGTCACCCG 
TCACCTACCT 
GAGGACTCCG 
CCACCTGCCC 
TCTCTTCACT 
AGACTTAGTC 
TGTACAAATT 
AAGGGACAGA 
GGGATTGGTG 
AAAACCCACC 
AAAAAAAAAA 
AGCCAGCAAG 
TGGCAAGGCC 



Seq ID NO: C203 DNA Sequence 

Nucleic Acid Accession #: NM_024780.1

Coding sequence: 31..1023



31 



CACTCCCTGG 
CATAGGGGAG 
AAGGTGCTAC 
CTCCCCCACA 
CTGTCGCAGG 
CTTATCTTGT 
CCACCCGCCC 
GGAAAACAGC 
GCCACTGTTT 
ATCACAAAAA 
AACAGGAAAA 
ATTTCCTCTT 
TGTTCTGATT 



41 



GCCTCGGTTT 
GTTGTGAGGC 
CCGTGAGCAA 
GAGCCTCCAG 
CCCTTTCATG 
CCTGGTTTCC 
CAGGACCCCC 
CACAAACAAG 
GTATGTTCTT 
AAAAAATTTT 
AAAGAACCAT 
TGCAAAATCA 
AATAAAGGAT 



51 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1511 



1 11 21 31 41 51

| | | | | |

ATTAATCTGG CCGTGCCATG CATCTACTCC ATGTTCAGGC TTGTGGAGAG GTACGAGATG 60 

CCACGGCACG AAGTCTACGT TCTCCTGATC CGAAACATCT TTTTGAAAAT ATCAATCATT 120 

GGCATTCTTT GTTACTATTG GCTCAACACC GTGGCCCTGT CTGGTGAAGA GTGTTGGGAA 180 

ACCCTCATTG GCCAGGACAT CTACCGGCTC CTTCTGATGG ATTTTGTGTT CTCTTTAGTC 240 

AATTCCTTCC TGGGGGAGTT TCTGAGGAGA ATCATTGGGA TGCAACTGAT CACAAGTCTT 300 

GGCCTTCAGG AGTTTGACAT TGCCAGGAAC GTTCTAGAAC TGATCTATGC ACAAACTCTG 360 

GTGTGGATTG GCATCTTCTT CTGCCCCCTG CTGCCCTTTA TCCAAATGAT TATGCTTTTC 420 

ATCATGTTCT ACTCCAAAAA TATCAGCCTG ATGATGAATT TCCAGCCTCC GAGCAAAGCC 480 

TGGCGGGCCT CACAGATGAT GACTTTCTTC ATCTTCTTGC TCTTTTTCCC ATCCTTCACC 540 

GGGGTCTTGT GCACCCTGGC CATCACCATC TGGAGATTGA AGCCTTCAGC TGACTGTGGC 600 

CCTTTTCGAG GTCTGCCTCT CTTCATTCAC TCCATCTACA GCTGGATCGA CACCCTAAGT 660 

ACACGGCCTG GCTACCTGTG GGTTGTTTGG ATCTATCGGA ACCTCATTGG AAGTGTGCAC 720 

TTCTTTTTCA TCCTCACCCT CATTGTGCTA ATCATCACCT ATCTTTACTG GCAGATCACA 780 

GAGGGAAGGA AGATTATGAT AAGGCTGCTC CATGAGCAGA TCATTAATGA GGGCAAAGAT 840 

AAAATGTTCC TGATAGAAAA ATTGATCAAG CTGCAGGATA TGGAGAAGAA AGCAAACCCC 900 

AGCTCACTTG TTCTGGAAAG GAGAGAGGTG GAGCAACAAG GCTTTTTGCA TTTGGGGGAA 960 

CATGATGGCA GTCTTGACTT GCGATCTAGA AGATCAGTTC AAGAAGGTAA TCCAAGGGCC 1020 

TGATGACTCT TTTGGTAACC AGACACCAAT CAAATAAGGG GAGGAGACGA AAATGGAATG 1080 

ATTTCTTCCA TGCCACCTGT GCCTTTAGGA ACTGCCCAGA AGAAAATCCA AGGCTTTAGC 1140 

CAGGAGCGGA AACTGACTAC CATGTAATTA TCAAAGTAAA ATTGGGCATT CCATGCTATT 1200 

TTTAATACCT GGATTGCTGA TTTTTCAAGA CAAAATACTT GGGGTTTTCC AATAAAGATT 1260 

GTTGTAATAT TGAAATGAGC CTACAAAAAC CTAGGAAGAG ATAACTAGGG AATAATGTAT 1320 

ATTATCTTCA AGAAGTGTGT GCAGGAATGA TTGGTTCTTA GAAATCTCTC CTGCCAGACT 1380 

TCCCAGACCT GGCAAAGGTT TAGAAACTGT TGCTAAGAAA AGTGGTCCAT CCTGAATAAA 1440 

CATGTAATAC TCCAGCAGGG ATATGAAGCC TCTGAATTGT AGAACCTGCA TTTATTTGTG 1500 

ACTTTGAACT AAAGACATCC CCCATGTCCC AAAGGTGGAA TACAACCAGA GGTCTCATCT 1560 

CTGAACTTTC TTGCGTACTG ATTACATGAG TCTTTGGAGT CGGGGATGGA GGAGGTTCTG 1620

CCCCTGTGAG GTGTTATACA TGACCATCAA AGTCCTACGT CAAGCTAGCT TTGCAGTGGC 1680 

AGTACCGTAG CCAATGAGAT TTATCCGAGA CGCGATTATT GCTAATTGGA AATTTTCCCA 1740 

ATACCCCACC GTGATGACTT GAAATATAAT CAGCGCTGGC AATTTTTGAC AGTCTCTACG 1800 

GAGACTGAAT AAG 1813 

Seq ID NO: C204 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | |

AGCGGGGTTA GATGCAGGGA GTTATGTTGT GATTTGTGTA GGGGTGGCTT TGGTGAGCTC 60
ATCCTGGCAT TTTAAAAATA ATGTGAATGG TGTATCCCTT TGTGATCATC ATCACTTTTT 120 
CCCTGCGAGT CCCAATCAAT GCGGGTGTGA CTGCTGTATG AAGTCTCAGG AAGCCCACCC 180 
CAGTGGAGAA GAAGGCTTGC AGGAGGCAGG AGATGCTGTC CGATGACCAC GTGAATGAAA 240 
TCATCATACA GGTTGAGAAT GTTTCCTCTG GGGTCCAAAG CCACCCATCC TCAAATCAGA 300
TTTTTCAAGA AAAGGTGCTG CTAGACTCAA GCATCAACAT GGTTTTGTCA ATATCTGACA 360 
TTGATGTGAT AGACTCTCAG ACAGTCAGCA AAAGGAATGA CCAAAAGGGT AACCAGGTGC 420 
TGCGGTTTTC AACATCTTTG AATGAGTCGA TGTCTCAGAC CCTTCATAGC CTAGAATGCA 480 
TGGGCATAGA CACTCCTGGT TCTTCACATG AAACTGTTCA AGGACAGAAG TTAATCGCAT 540 
CCCTTATACC CATGACATCC AGAGACAGAA TTAAAGCCAT CAGGAACCAG CCAAGGACCA 600 
TGGAAGAGAA AAGGAACCTT AGGAAAATAG TTGACAAAGA AAAAAGCAAA CAGACCCATC 660 
GTATCCTTCA GCTCAATTGC TGTATTCAGT GTCTGAACTC CATTTCCCGG GCTTATCGGA 720 
GATCCAAGAA CAGCCTGTCG GAAATTCTGA ATTCCATCAG CCTGTGGCAG AAGACGCTGA 780 
AGATCATTGG AGGCAAGTTT GGAACCAGCG TCCTCTCCTA TTTCAACTTT CTGAGATGGC 840 
TTTTGAAGTT CAACATTTTC TCATTCATCC TGAACTTCAG CTTCATCATA ATCCCTCAGT 900 
TTACCGTGGC CAAAAAGAAC ACCCTCCAGT TCACTGGGCT GGAGTTTTTC ACTGGGGTGG 960 
GTTATTTTAG GGACACAGTG ATGTACTATG GCTTTTACAC CAATTCCACC ATCCAGCACG 1020 
GGAACAGCGG GGCATCCTAC AACATGCAGC TGGCCTACAT CTTCACAATC GGAGCATGCT 1080 
TGACCACCTG CTTCTTCAGT TTGCTGTTCA GCATGGCCAA GTATTTCCGG AACAACTTCA 1140 
TTAATCCCCA CATTTACTCC GGAGGGATCA CCAAGCTGAT CTTTTGCTGG GACTTCACTG 1200 
TCACTCATGA AAAAGCTGTG AAGCTAAAAC AGAAGAATCT TAGCACTGAG ATAAGGGAGA 1260 
ACCTGTCAGA GCTCCGTCAG GAGAATTCCA AGTTGACGTT CAATCAGCTG CTGACCCGCT 1320 
TCTCTGCCTA CATGGTAGCC TGGGTTGTCT CTACAGGAGT GGCCATAGCC TGCTGTGCAG 1380 
CCGTTTATTA CCTGGCTGAG TACAACTTAG AGTTCCTGAA GACACACAGT AACCCTGGGG 1440 
CGGTGCTGTT ACTGCCTTTC GTTGTGTCCT GCATTAATCT GGCCGTGCCA TGCATCTACT 1500 
CCATGTTCAG GCTTGTGGAG AGGTACGAGA TGCCACGGCA CGAAGTCTAC GTTCTCCTGA 1560 
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TCCGAAACAT CTTTTTGAAA ATATCAATCA TTGGCATTCT TTGTTACTAT TGGCTCAACA 
CCGTGGCCCT GTCTGGTGAA GAGTGTTGGG AAACCCTCAT TGGCCAGGAC ATCTACCGGC
TCCTTCTGAT GGATTTTGTG TTCTCTTTAG TCAATTCCTT CCTGGGGGAG TTTCTGAGGA 
GAATCATTGG GATGCAACTG ATCACAAGTC TTGGCCTTCA GGAGTTTGAC ATTGCCAGGA 
ACGTTCTAGA ACTGATCTAT GCACAAACTC TGGTGTGGAT TGGCATCTTC TTCTGCCCCC 
TGCTGCCCTT TATCCAAATG ATTATGCTTT TCATCATGTT CTACTCCAAA AATATCAGCC 
TGATGATGAA TTTCCAGCCT COGAGCAAAG CCTGGCGGGC CTCACAGATG ATGACTTTCT 
TCATCTTCTT GCTCTTTTTC CCATCCTTCA CCGGGGTCTT GTGCACCCTG GCCATCACCA 
TCTGGAGATT GAAGCCTTCA GCTGACTGTG GCCCTTTTCG AGGTCTGCCT CTCTTCATTC 
ACTCCATCTA CAGCTGGATC GACACCCTAA GTACACGGCC TGGCTACCTG TGGGTTGTTT 
GGATCTATCG GAACCTCATT GGAAGTGTGC ACTTCTTTTT CATCCTCACC CTCATTGTGC 
TAATCATCAC CTATCTTTAC TGGCAGATCA CAGAGGGAAG GAAGATTATG ATAAGGCTGC 
TCCATGAGCA GATCATTAAT GAGGGCAAAG ATAAAATGTT CCTGATAGAA AAATTGATCA 
AGCTGCAGGA TATGGAGAAG AAAGCAAACC CCAGCTCACT TGTTCTGGAA AGGAGAGAGG 
TGGAGCAACA AGGCTTTTTG CATTTGGGGG AACATGATGG CAGTCTTGAC TTGCGATCTA 
GAAGATCAGT TCAAGAAGGT AATCCAAGGG CCTGATGACT CTTTTGGTAA CCAGACACCA 
ATCAAATAAG GGGAGGAGAY GAAAATGGAA TGATTTCTTC CATGCCACCT GTGCCTTTAG 
GAACTGCCCA GAAGAAAATC CAAGGCTTTA GCCAGGAGCG GAAACTGACT ACCATGTAAT 
TATCAAAGTA AAATTGGGCA TTCCATGCTA TTTTTAATAC CTGGATTGCT GATTTTTCAA 
GACAAAATAC TTGGGGTTTT CCAATAAAGA TTGTTGTAAT ATTGAAAWRA RMMWAMAAAA 
ACCTAGGAAG AGATAACTAG GGAATAATGT ATATTATCTT CAAGAAGTGT GTGCAGGAAT 
GATTGGTTCT TAGAAATCTC TCCTGCCAGA CTTCCCAGAC CTGGCAAAGG TTTAGAAACT 
GTTGCTAAGA AAAGTGGTCC ATCCTGAATA AACATGTAAT ACTCCAGCAG GGATATGAAG 
CCTCTGAATT GTAGAACCTG CATTTATTTG TGACTTTGAA CTAAAGACAT CCCCCATGTC 
CCAAAGGTGG AATACAACCA GAGGTCTCAT CTCTGAACTT TCTTGCGTAC TGATTACATG 
AGTCTTTGGA GTCGGGGATG GAGGAGGTTC TGCCCCTGTG AGGTGTTATA CATGACCATC 
AAAGTCCTAC GTCAAGCTAG CTTTGCAGTG GCAGTACCGT AGCCAATGAG ATTTATCCGA 
GACGCGATTA TTGCTAATTG GAAATTTTCC CAATACCCCA CCGTGATGAC TTGAAATATA 
ATCAGCGCTG GCAATTTTTG ACAGTCTCTA CGGAGACTGA ATAAG 

Seq ID NO: C205 DNA Sequence

Nucleic Acid Accession #: NM_002250.1

Coding sequence: 397..1680

1 11 21 31 41 51

| | | | | |

GTCCTTCGGT GTCTGGGTGT GGTGAGTAGA GGTGTGTGTC ACAAAGTACA GACCATTGTG 60 

TGTGACAAAG CCCATCGTGT GTCTGTGTGT GTCTTTATCC ACGTGGATGG ACGTCTCTTT 120 

CTTGCTCTGC CCCAAGACAC ACCCTAGCCC CTCCTTATTC TCAAAAGGGG GAGCTGGGGA 180 

GOCTCCCCCT ACCCTGGGGC CTCCCCTGCC CCTCCCCGCC CTGCCTGGCC GTCACCACTC 240 

CCCAGAGGGC ACAGGGCTCT GCTGTGCCTC AGAGCAAAAG TCCCAGAGCC AGCAGAGCAG 300 

GCTGACGACC TGCAAGCCAC AGTGGCTGCC CTGTGCGTGC TGCGAGGTGG GGGACCCTGG 360 

GCAGGAAGCT GGCTGAGCCC CAAGACCCCG GGGGCCATGG GCGGGGATCT GGTGCTTGGC 420 

CTGGGGGCCT TGAGACGCCG AAAGCGCTTG CTGGAGCAGG AGAAGTCTCT GGCCGGCTGG 480 

GCACTGGTGC TGGCAGGAAC TGGCATTGGA CTCATGGTGC TGCATGCAGA GATGCTGTGG 540 

TTCGGGGGGT GCTCGTGGGC GCTCTACCTG TTCCTGGTTA AATGCACGAT CAGCATTTCC 600 

ACCTTCTTAC TCCTCTGCCT CATCGTGGCC TTTCATGCCA AAGAGGTCCA GCTGTTCATG 660 

ACCGACAACG GGCTGCGGGA CTGGCGOGTG GCGCTGACCG GGCGGCAGGC GGCGCAGATC 720 

GTGCTGGAGC TGGTGGTGTG TGGGCTGCAC CCGGCGCCCG TGCGGGGCCC GCCGTGCGTG 780 

CAGGATTTAG GGGCGCCGCT GACCTCCCOG CAGCCCTGGC CGGGATTCCT GGGCCAAGGG 840 

GAAGCGCTGC TGTCCCTGGC CATGCTGCTG CGTCTCTACC TGGTGCCCCG CGCCGTGCTC 900 

CTGCGCAGCG GCGTCCTGCT CAACGCTTCC TACCGCAGCA TCGGCGCTCT CAATCAAGTC 960 

CGCTTCCGCC ACTGGTTCGT GGCCAAGCTT TACATGAACA CGCACCCTGG CCGCCTGCTG 1020 

CTCGGCCTCA CGCTTGGCCT CTGGCTGACC ACCGCCTGGG TGCTGTCCGT GGCCGAGAGG 1080 

CAGGCTGTTA ATGCCACTGG GCACCTTTCA GACACACTTT GGCTGATCCC CATCACATTC 1140 

CTGACCATCG GCTATGGTGA CGTGGTGCCG GGCACCATGT GGGGCAAGAT CGTCTGCCTG 1200 

TGCACTGGAG TCATGGGTGT CTGCTGCACA GCCCTGCTGG TGGCCGTGGT GGCCOGGAAG 1260 

CTGGAGTTTA ACAAGGCAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAGTATACC 1320 

AAAGAGATGA AGGAGTCCGC TGCCCGAGTG CTACAAGAAG CCTGGATGTT CTACAAACAT 1380

ACTCGCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAAGCTGCT GGCCGCCATC 1440 

AACGCGTTCC GCCAGGTGCG GCTGAAAGAC CGGAAGCTCC GGGAACAAGT GAACTCCATG 1500 

GTGGACATCT CCAAGATGCA CATGATCCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 1560 

CACCGGGCCC TGGAGAAACA GATTGACACG CTGGCGGGGA AGCTGGATGC CCTGACTGAG 1620 

CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 1680 

CTGGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTGGTG GACATCGTCT 1740 

CTGCCACTCC TGACCCAGCC CTGAACAAAG CACCTCAAGT GCAAGGACCA AAGGGGGCCC 1800 

TGGCTTGGAG TGGGTTGGCT TGCTGATGGC TGCTGGAGGG GACGCTGGCT AAAGTGGGTA 1860 

GGCCTTGGCC CACCTGAGGC CCCAGGTGGG AACATGGTCA CCCCCACTCT GCATACCCTC 1920 

ATCAAAAACA CTCTCACTAT GCTGCTATGG ACGACCTCCA GCTCTCAGTT ACAAGTGCAG 1980 

GCGACTGGAG GCAGGACTCC TGGGTCCCTG GGAAAGAGGG TACTAGGGGC CCGGATCCAG 2040 

GATTCTGGGA GGCTTCAGTT ACCGCTGGCC GAGCTGAAGA ACTGGGTATG AGGCTGGGGC 2100 

GGGGCTGGAG GTGGCGCCCC CTGGTGGGAC AACAAAGAGG ACACCATTTT TCCAGAGCTG 2160 

CAGAGAGCAC CTGGTGGGGA GGAAGAAGTG TAACTCACCA GCCTCTGCTC TTATCTTTGT 2220 
AATAAATGTT AAAGCCAG 2238 

Seq ID NO: C206 DNA Sequence 

Nucleic Acid Accession #: NM_025257.1

Coding sequence: 1..2139

1 11 21 31 41 51

| | | | | |

ATGGGGGGAA AGCAGCGGGA CGAGGATGAC GAGGCCTACG GGAAGCCAGT CAAATACGAC 60 
CCCTCCTTTC GAGGCCCCAT CAAGAACAGA AGCTGCACAG ATGTCATCTG CTGCGTCCTC 120 
TTCCTGCTCT TCATTCTAGG TTACATCGTG GTGGGGATTG TGGCCTGGTT GTATGGAGAC 180 
CCCCGGCAAG
AAAGATAAGC 
ATCTCAGTTG 
CCGGAGGACC 
TATACAAAAA 
AGCCTGCAAC 
TGCTTTCCAT 
ATACAGCAGG 
ATCTTTGAAG 
GTCTTGAGCC 
CTGATCCTGG 
CGAGTGCTGC 
GCCTACCAGA 
GCCATCCTGC 
CTGAAGGAGG 
ACCTTTGTCC 
CCCACGCAGC 
TGTGAGAAAG 
TGCCCAGGGC 
GTCTTCAATC 
GCCCTGGGCC 
CCCCAGGACA 
ACTGGGTCAT 
TTGGAGTATA 
TGCTGTTTCA 
GCATACATCA 
ATGCTACTCA 
CTGTTCTTTG 
TCCGGTCGCA 
CTGCCCATCA 
TTCGGCATGT 
GGCTCCCTGG 
AACGAGGCGC 
ACTGCACCCC 
TTGTGGTAAA 
GAGAGGCTGA 
GTGAAAC 



TGCTCTACCC 
CGTATCTCCT 
CTGAGAACGG 
CATGGACTGT 
ACAGGAACTT 
AGGAACTCTG 
GGACCAACAT 
GGATCAGCGG 
ATTTTGCCCA 
TACTGTTTAT 
GAGTGCTGGG 
GGGACAAGGG 
GCGTGCAGGA 
TGCTGGTGCT 
CCAGCAAGGC 
TCCTCCTCAT 
CAGCCACTCT 
TGCCAATAAA 
TGATGTGCGT 
TGCAAATCTA 
AATGCGTCCT 
TCCCTACCTT 
TGGCATTTGG 
TTGACCACAA 
AGTGCTGCCT 
TGATCGCCAT 
TGCGAAACAT 
GGAAGCTGCT 



TGACCTCCAT 
GTGTGGACAC 
ACCGGCCCTA 
CCCCGGACAA 
ACCCCCACCG 
AAAAGGTTTT 
GGCGGGCGGA 



CAGGAACTCT 
GTACTTCAAC 
CCTACAGTGC 
GGGAAAAAAC 
TTGTCTGCCA 
CCCCAGTTTC 
TACTCCACCG 
TCTTATTGAC 
GTCCTGGTAT 
CTTGCTTCTG 
CGTGCTGGCA 
CGCCTCCATC 
GACCTGGCTG 
CATCTTCCTG 
TGTGGGACAG 
CTGCATTGCC 
TGGATATGTG 
TACATCATGC 
CTTCCAGGGC 
TGGGGTCCTG 
CGCTGGAGCC 
CCCCTTAATC 
AGCCCTCATC 
GCTCAGAGGA 
CTGGTGTCTG 
CTACGGGAAG 
TGTCAGGGTG 
GGTGGTCGGA 
GGGTAAAGAC 
CCTGGGGGCC 
GCTCTTCCTC 
CTACATGTCC 
CAAGAAGAGG 
TCCAGCCATC 
AGGCCAGGCG 
TCACCTGAGT 



ACTGGGGCCT 
ATCTTCAGCT 
CCCACACCCC 
GAGTTCTCAC 
GGGGTACCCT 
CTCCTCCCCT 
GCGCTCCCAG 
AGCCTCAATG 
TGGATTCTTG 
CGCCTGGTGG 
TATGGCATCT 
TCCCAGCTGG 
GeCGOCCTGA 
CGGCAGCGGA 
ATGATGTCTA 
TACTGGGCCA 
CTCTGGGCAT 
AACCCCACGG 
TACTCATCCA 
GGGCTCTTCT 
TTTGCCTCCT 
TCTGCCTTCA 
CTGACCCTTG 
GTGCAGAACC 
GAAAAATTTA 
AATTTCTGTG 
GTCGTCCTGG 



TTTAAGAGCC 
TATGTCATCG 
TGCTTCCTGG 
AAGAGCCTTC 
AAGAAGTGAC 
CAACCTCACT 
CCGTGGCTCA 
CAGGAGTTCG 



ACTGTGGCAT 
GCATCCTGTC 
AGGTGTGTGT 
AGACTGTTGG 
GGAATATGAC 
CTGCTCCAGC 
GGATCACCAA 
CCCGAGACAT 
TTGCCCTGGG 
CTGGGCCCCT 
ACTACTGCTG 
GTTTCACCAC 
TCGTGTTGGC 
TTCGTATTGC 
CCATGTTCTA 
TGACTGCTCT 
CCAACATCAG 
CCCACCTTGT 
AAGGCCTAAT 
GGACCCTTAA 
TCTACTGGGC 
TCCGCACACT 
TGCAGATAGC 
CTGTAGCCCG 
TCAAGTTCCT 
TCTCAGCCAA 
ACAAAGTCAC 
TCCTGTCCTT 
CCCACCTCAA 
CCAGCGGCTT 
AAGACCTGGA 
TAAAGATTCT 
AGCTCCGGCC 
TCGCCTTACA 
CGCCTGTAAT 
AGACCAGCCT 



GGGGGAGAAC 
CAGCAACATC 
GTCCTCCTGC 
GGAAGTCTTC 
GGTGATCACA 
TCTGGGACGC 
TGACACCACC 
CAGTGTTAAG 
GGTGGCTCTG 
GGTGCTGGTG 
GGAGGAGTAC 
CAACCTCAGT 
GGTGCTTGAA 
CATOGCCCTC 
CCCACTGGTC 
GTATCCTCTG 
CTCCCCCGGC 
GAACTCCTCG 
CCAACGTTCT 
CTGGGTACTG 
CTTCCACAAG 
CCGTTACCAC 
CCGGGTCATC 
CTGCATCATG 
AAACCGCAAT 
AAATGCGTTC 
AGACCTGCTG 
CTTTTTTTTC 
CTATTACTGG 
CTTCAGCGTT 
GCGGAACAAC 
GGGCAAGAAG
CTGATCCAGG 
GGTCTCCATT 
CCAACACTTT 
GGCCAACATG 

Seq ID NO: C207 DNA Sequence
Nucleic Acid Accession #: NM_016180.
Coding sequence: 26..1618



CAGGAAGGTT 
CTATAAATCC 
CAGCAGACTC 



TGTGTGGTTC 
CGACCACTGC 
GATGCTCGTG 
TAACCCAAGG 
TGATTTTGCT 
CCATCAGGAC 
CCTGGGTTAC 
TACAGAATTC 
TCATCTGTGC 
GCAAACCCCT 
GAAAGTTAAA 
TCATGCTGAA 
CATGCCTCCT 
GTCCAACATG 
TAGTGCACAC 
GGGCTTCTGC 
ATCCTACATT 
GGGATTTATT 
TGTAATGTCC 
GGAAGAAAAG 
GGGCATGGAC 
TGGCCTGGGC 
TGCGGTGGCA 
CAATAAAGAG 



11 
I 

CCTCTCCCAG 
CTAGCTGATG 
ATCATGCACA 
GTGACCCCAG 
CTCAGCCCCA 
CGGTCCAGGT 
GGCATGGCTC 
AGGAAGCTGG 
GCCGACTTCA 
AAGGAGAAGG 
CTTTTGGGTG 
CAGGTCATGT 
AGTATCTCTG 
CAGGACCCTC 
AATGGTTACG 
CAGACTCGCA 
CACTACCGCT 
CTGTTCTTCA 
AACTCCACAG 
ATCAACTCCG 
GGATTAAAGG 
GGGCTCTTCC 
AGCACCCTGT 
GAGAGGCAGC 
TGCGCCACCC 
TTTCTGGTCA 
CTGATAGGCT 
ACAATGACCC 



21 

i 

TGGCCATGGG 
ATGGCCCCTT 
GCATGGCCAT 
TCCTGCTCAG 
TCCTGGGATT 
GGGGCCGCCG 
TGTACCTCAA 
TTTGGGCCAT 
TTGATGGGCC 
GCCTCCACTA 
CTATAGACTG 
TCTTCTTCTC 
AAGCCCCACT 
CATTGTCATC 
TAAATCCAGA 
GGGCAATGAC 
ACCTTTGCAT 
CAGATTTCAT 
AGTTTCTCAT 
TGTTTTCCTC 
GTCTTTACTT 
CGAATGTCTA 
ACACTGTGCC 
AGGCCCCAGG 
TCACATGCAT 
ACACAGCCGG 
GTTGCTTTGT 
TAAAAAAAAA 



31 
I 

TAGCAACAGT 
TGACTCTGTG 
GTTCGGAAGA 
CGTAGGTCTG 
CCTGCTGCAG 
GAGACCCTAC 
TGGGGCTACT 
AAGTGTCACC 
CATCAAAGCC 
CCATGCCCTC 
GGCCCATCTG 
TGCATTGGTG 
TACAGAGGTT 
AGATGGAATG 
GCTGGCAATG 
ATTAAAGTCA 
CAGCCACCTC 
GGGCCAGATT 
CTAOGAAAGA 
ACTTTATTCT 
CACGGGATAT 
CTCCAOCCTG 
CTTTAACCTC 
AGGGGACCCA 
GGTGCAGCTG 
GACCGTTGTC 
CGCTCTCTTT 



41 
I 

GGGCAGGCTG 
GAGCCGCCTA 
GAGTTCTGCT 
CCCAGCAGCC 




Seq ID NO: C208 DNA Sequence 

Nucleic Acid Accession #: NM_003273.1 

Coding sequence: 255..2024

1 11 21 31 

I 1 I I 

CGCCGCGGGG CCGGATCCTC CGCGCGGCCG AGTCCATCTC 
TGTTTCCTTG ACTGACTATT GTGAGCGCCC TCTCTCTCCG 
CCCACTCAGG CCCCCGGGCC COGCTGGAAT TCGGAGGGCC 
GATGGGACCT GGGGCAAAGG CTAAGCGAAG GAGAGCTGGA 
GGCGAGATCT GAGGATGGAA GGCTTTGGGG GTGTCGGAGG 



51 
I 

GCCGCCACAT 

AAAGACCCAC 

AOGCGGTGGA 

TGTACAGCAT 

GATCGGCCAG 

TGGGAGTCAT 

CTTTGATTGC 

TCGTTCTCTT 

ATGTCTGCTC 

TTGGAGGTGC 

GACTGTTGGG 

GTTTTACTGT 

TTCCCCCACA 

GTTCTATCGA 

AAAACAAAAA 

CACTGGTGAA 

CGGCCTTCCT 

GGGATCCCTA 

TTGGATGTTG 

AAGTTTTGGT 

GCCTGGGGAC 

GCCTGTTTGG 

ACCACCGCGA 

TGAGAGGGAA 

TGGTCGGAGG 

TCACAGCGTC 

TGGATTAGGT 



51 
41

I ! 

CTGGGAAATG GGGCGGACAG 
GCGGAGGGGA GACCATGGCC 
CCTGGGTAAT GGGGCAGAGA 
GCGGGTGAAC TAAGAGCGGG 
CAGAGGGACC CGGGGGTTTG 



60 
180
CAGCGAAGGG
GGGGATTTAT 
GACTAAGATG 
CTGAGGACGG 
GGGCGCCGTG
CTTCCCGCAG 
CTGGCGGCCC 
GAGGTGCTGT 
GCGCTCTACC 
GAAGCGAATG 
CCGCAGTGGC 
GCTTCCAGGC 
CTCTGGGGGC 
TCATCTTCAG 
CTGGGGGGAA 
GTATCTGTTT 
TCCTCATCAA 
CCATGTGGCT 
AGGCCGTCCT 
GGGACATGGC 
CGCAGCCCCT 
ACATCTTCCG 
GAGTGGCTGG 
GGTGGGGTAT 
CCTTGCCCTG 
TGCTGGTGCA 
GCAGGAGTAC 
CACCACCCCA 
CGACACACTT 
AGAAGAGGTG 



TGTCTGGAGA 
GGTGTCGACT 
GACGCCTGGG 
GGTGCGGAGG 
CGGGCCGGCG 
GCGCCGCGGC 
GTTCGGGCCC 
GGAGCCCACG 
TACTGCCGGC 
GGCTCGGCGA 
CGAGGGGCAG 




GGGAGAGCTG 
GGGAGCAGGA 
AAGGGAACTG 
CGCACTCTGG 
GGGAGCTGGG 
TCTGCTACTG 
CGCGOGCCTG 
GGCGCTGCTG 
GCGCAAGGTG 
GGGAAAGGAC 
GAATTGAAGG 
ACAGCCCTGT 
ATGCTCCTGC 
TACATGAAGG 
CCGATTTACG 
AAATATTTCT 
TTGATGAAGG 
TTCCAGTTGC 
GATATCACAC 
TTCACCTACA 
ATGGCCTCTG 
TCCCAGAAAA 
ATCTCTACAG 
CCCAACTATC 
CACCTGCTGC 
GGGGATGAGC 
TGCCTTACCG 
GTGCCCACTC 
GGCTTGCACC 
AGGAAAAAAA 



AGGAGGGGCC 
GGAGGGTCTT 
GGAGGCAGCG 
GAATGCCGAG 
GGGCTAGGGG 
CTGCTGCCCG 
CTGGGTCCAC 
CTGTGGCTCG 
CGGGCCCCGC 
GCCCCGGGCC 
ACAAGAGTCG 
TGGTGGGGCT 
CCTTGGOGTT 
CGCAGGTAGC 
ACTTTTTTCT 
GTGAACTGCG 
AGGCAGAGCT 
TCTACGTGGG 
ATGACGGGTT 
GCCTGCAGGC 
TCATCTGCCT 
ACACTTTCCG 
CCACAGGGCG 
TTGGAGACCT 
CCTACTTCTA 
GGAGTGCCTG 
CATCATGCCC 
ATCCACCAGC
CCACCCAGCC 
TGAAACCAGT 



GGTTCTGGGG 
CGAGGGGCCT 
GGGTGCCTGG 
AGGGTCCCGC 
CGGACGCCGA 
CCACCATGTT 
CCGCGTCCCT 
CCTGGCTCGG 
TCGCGGACGC 
TTATCAGAGC 
CCTGCGCTAT 
GGGGATGTCA 
TGTCGCCACC 
CCCAGTTTCG 
GGGACGAGAG 
ACCCGGCCTC 
TCGAGGCAGT 
TGATGCCCTC 
TGGCTTCATG 
CCAGTTCCTG 
CATCAATGCT 
AAAGAATCCT 
GAAACTGCTG 
CATCATGGCT 
CCTCCTCTAC 
CAGAAGTACG 
TACATCTACT 
ACACCCAGGA 
CTGAGGATGA 
GACCAAAAAA 



GCTGCAGAAC 
GGGGGCGGGG 
GGGCCGAGGG 
AGAGACGTCA 
CGTGATGGCC 
CCACCTGCTC 
GCCCGGGCTG 
CCTGCAGGCG 
TCGGGGGAGG 
CCCCTTGGAC 
CCTATTAAOG 
GCGGGGCTGC 
CTCACCGCTT 
GCCCTGGCAC 
CTCAACCCTC 
ATCGGCTGGG 
CCCTCACTGG 
TGGCACGAGG 
CTGGCGTTTG 
CTGCACCACC 
ACTGGTTACT 
TCTGACCCCA 
GTGTCTGGGT 
CTGGCTTGGT 
TTCACCGCGC 
GCCTGGCCTG 
GAAGCGGCTC 
CCAGGAGCCT 
ACAACCTCAG 
AAAAAAAAAA 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
960 

1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



Seq ID NO: C209 DNA Sequence 

Nucleic Acid Accession #: NM_015720.1

Coding sequence: 21..1838



1 
I 

CCAGTTCCGG 



CTGATGAGCC 
CTGGCTTGGA 
TGGGAGCCCC 
CACCACAGTA 
CCACTGCAGA 
GCCAGGCTCA 
CTCCCTGGCA 
AGAAGGAAGA 
GATCCCAAGA 
CCCCAGGGGC 
AGGTGGAGAG 
CTCCGGGGGA 
GGCTTGGGGT 
CTGGTTTGTC 
CTCCCAGTGG 
CCCCACTGGC 
ATCTCAACCA 
ATTCTACGCA 
TGAACATGAC 
TGGCCCTGGT 
TCTCTCTGAG 
AGGGGGTGGT 
AGGAGATTGG 
TGCGCAGCGA 
TCATCATTGC 
ACGTGTCGCA 
TGGACGTGGC 
GCGGGGCCCT 
AGGACTCGGA 
TGGGCCGCCA 
CCCGCGCCTA 
CCGACTTCAC 
AAGCCCGCCT 
CTTCCTGCGC 
AAAAAAAAAA 
AAAAAAAAAA 



11 
I 

CACGAGGACC 
GCTGCTTCTG 
TGGCCCAGAG 
GCCACTGGAC 
TGGCTCAGGC 
CTTCTGGGAA 
TTATGTTTTT 
AGAGCTGCCA 
TATGCCTGCC 
GGTAGAGAAA 
AGAAGCCAAG 
CACCAAAAGC 
CAGCATGGGG 
CCAGGACTCC 
AGAGTTCGAG 
TGGCCAGCAC 
GGCCGAGCAC 
CCCTGGAGAC 
GCAGCTCCTA 
GGTGATCTGC 
AGAGAACATA 
GGAAGAGGTG 
CAAGCCCAGC 
GCCCACTCAA 
CATCCAGAAC 
CTACGGCACG 
GCTTGGCCTG 
CGGCGAGGAG 
CAGCGACAGC 
CAACGGCCCG 
CGTGTTCGAG 
GGACCAAGCG 
CCCGGCCGCC 
ACGGCGGCTT 
TGGCCCCGCT 
CCCGGGACTC 
AAAAAAAAAA 
AAAAAAAAAA 



21 
I 

ATGGGCCGGC 
CTGGTTGGGG 
GGCCTCACCT 
TCAGAGGAGC 
TTCCCCAGCG 
GAGGAGGAAG 
CCTGACTTAA 
AACCTCCCCT 
AGAGAGGAGG 
CAAGAGGAGG 
CCTCAGGTCC 
AGGCATGAAG 
CCCAGCTTGC 
ACCAGCCAAG 
GCTCCTCAGG 
GAGGAGGTGC 
CCAGATGAAG 
ATGGAACTGA 
GAAGGGCAGG 
AAGGACTGGA 
GACTGTGAGG 
CTGCCGCGCC 
GAGAAGGAGC 
GATGTCCTTT 
TATTCCACAA 
CTCTTCGTGG 
CTCTACAACT 
CTGCGCTTCG 
CAGTCGGAGA 
GGGAGCTGGG 
GAGGACACGC 
AGGTGGACCC 
CCCGCGCCTG 
CGGACCAACT 
TTCCCGCCCC 
AATTAAACCC 
AAAAAAAAAA 
AAAAAAAAAA 



GAGCGTTCCT 
CCACCTCCCT 
CTAGTGAGAC 
AAGAGAATGA 
AGCTGAATGA 
CTGAGAAGGC 
CTCCCTTGCC 
AAGAAGAGGA 
AGGAAGAGGA 
GTGACTTTTC 
ACTCCGGGGA 
TGCTGCCTTC 
AGGCAGAGGC 
AAGCAAGCGA 
CGGCCTTGCC 
ATCCCCTTGG 
CACCTTCCTC 
CAGCTGAAGC 
GCAATCTGGC 
TGTTCCGGCA 
ATGGCAGTGG 
AGCACCTTCT 
CCATGCTGGG 
CCAGCAGCTG 
TGCTGGTGGT 
GCTGGCAGCG 
TGGAGAACGG 
TGCAGGAGAA 
GGGCGCTCAT 
ACCTGTGAGC 
CGAAACGGAC 
GCCCTCGGCG 
CCCTCACTCC 
TGAACCCCGG 
GCCCGGAGAC 
AAAAAAAAAA 
AAAAAAAAAA 



41 
I 

CGCCCGGCTG 
GGGTGCCTGT 
GCTAGACCTC 
CATGGGCCTG 
AGAGTCTCGG 
CTCAAGTCTG 
AGGTTCCATT 
CAAGATGAAT 
AGAGGAAGAG 
GGAGCTGCTC 
TCTCACCAGC 
CCAGGCCTCA 
AGTCACCCCA 
CACAGTGCTG 
GGAAGCCACT 
TTCATTCCCT 
CTCTAGAACC 
TGCTACCTTG 
TCAATCCAGG 
TGGGAAAAAC 
GCACCGGGGG 
CCACCATGGG 
CATGACACTG 
TGACATCCGC 
CCAGGCGCGG 
CATTGGGGCC 
CCGGCTGCCC 
CTGCCACGAC 
GCACCCCAGC 
GGGGGGCAAG 
GCAGCGAGGC 
GGCCCGGAGC 
CGGGCTCCTT 
CGCCCGAGGG 
CCCCGCGGGC 
CACGCGGGCC 
AAAAAAAAAA 
AAAAAAAAA 



51 
I 

CCGCCGCTGC 
GTGGCTGGGT 
CTGCTGCCCA 
GGAGCTGGGC 
ATTCTGCAGC 
GACCTGGGAC 
GAAGACACTA 
CTGGTTGAGC 
GAGGAGAGGG 
CCTGTGAATG 
AGCAGCCAGA 
TCAGGTGTGG 
ACTACAGTGA 
CCAGCTGCAG 
GCAGGAGCAG 
CAAACCACAG 
TCAGCCTCTT 
GGACAAGAAG 
ATACCCTGGG 
TACATCATTC 
CCACAGCTCC 
GCCTGGCACA 
GTGGGCGAGC 
AGGAGCCTGG 
GCCAGCCAGG 
ATCTGCATCA 
AAGCTCAAGC 
AACCCCACGC 
CTGAACGGCG 
CGGGACCCCG 
GCAGGCCGAG 
CCGCACCAGC 
CCCGCTTCCC 
GCAGGCCTCA 
GGCGGGCGCG 
CAGCGAAAAA 
AAAAAAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2269 



Seq ID NO: C210 DNA Sequence 

Nucleic Acid Accession #: NM_001197.3

Coding sequence: 61..543

1 11 21 31 41 51

| | | | | |

GACACGAAGC CTCCCGGGTG GCTTACAGAC GCTGCCAGCA TCGCCGCCGC CAGAGGAGAA 60 
ATGTCTGAAG TAAGACCCCT CTCCAGAGAC ATCTTGATGG AGACCCTCCT GTATGAGCAG 120 

CTCCTGGAAC CCCCGACCAT GGAGGTTCTT GGCATGACTG ACTCTGAAGA GGACCTGGAC 
CCTATGGAGG ACTTCGATTC TTTGGAATGC ATGGAGGGCA GTGAGGCATT GGCCCTGCGG 
CTGGCCTGCA TCGGGGACGA GATGGACGTG AGCCTCAGGG CCCCGCGCCT GGCCCAGCTC 
TCCGAGGTGG CCATGCACAG CCTGGGTCTG GCTTTCATCT ACGACCAGAC TGAGGACATC 
AGGGATGTTC TTAGAAGTTT CATGGACGGT TTCACCACAC TTAAGGAGAA CATAATGAGG 
TTCTGGAGAT CCCCGAACCC CGGGTCCTGG GTGTCCTGCG AACAGGTGCT GCTGGCGCTG 
CTGCTGCTGC TGGCGCTGCT GCTGCCGCTG CTCAGCGGGG GCCTGCACCT GCTGCTCAAG 
TGAGGCCCCG GCGGCTCAGG GCGGGGCTGG CCCCACCCCC ATGACCACTG CCCTGGAGGT 
GGCGGCCTGC TGCTGTTATC TTTTTAACTG TTTTCTCATG ATGCCTTTTT ATATTTAAAC 
CCCGAGATAG TGCTGGAACA CTGCTGAGGT TTTATACTCA GGTTTTTTGT TTTTTTTTTA 
TTCCAGTTTT CGTTTTTTCT AAAAGATGAA TTCCTATGGC TCTGCAATTG TCACCGGTTA 
ACTGTGGCCT GTGCCCAGGA AGAGCCATTC ACTCCTGCCC CTGCCCACAC GGCAGGTAGC 
AGGGGGAGTG CTGGTCACAC CCCTGTGTGA TATGTGATGC CCTCGGCAAA GAATCTACTG 
GAATAGATTC CGAGGAGCAG GAGTGCTCAA TAAAATGTTG GTTTCCAGCA AAAAAAAAAA 
AAA 

Seq ID NO: C211 DNA Sequence 
Nucleic Acid Accession #: AF272357
Coding sequence: 83..1060

1 11 21 31 41 51

| | | | | |

GCTGCTCCCG ACGCGGAGCC CGGAGCCCGC GCCGAGCCCC TGGCCTCGCG GTGCCATGCT 60 

GCCCCGGCGG CGGCGCTGAA GGATGGCGAC GCCGCTGCCT CCGCCCTCCC CGCGGCACCT 120 

GCGGCTGCTG CGGCTGCTGC TCTCCGGCCT CGTCCTCGGC GCCGCCCTGC GTGGAGCCGC 180 

CGCCGGCCAC CCGGATGTAG CCGCCTGTCC CGGGAGCCTG GACTGTGCCC TGAAGAGGCG 240 

GGCAAGGTGT CCTCCTGGTG CACATGCCTG TGGGCCCTGC CTTCAGCCCT TCCAGGAGGA 300 

CCAGCAAGGG CTCTGTGTGC CCAGGATGCG CCGGCCTCCA GGCGGGGGCC GGCCCCAGCC 360 

CAGACTGGAA GATGAGATTG ACTTCCTGGC CCAGGAGCTT GCCCGGAAGG AGTCTGGACA 420 

ATCAACTCCG CCCCTACCCA AGGACCGACA GCGGCTCCCG GAGCCTGCCA CCCTGGGCTT 480 

CTCGGCACGG GGGCAGGGGC TGGAGCTGGG CCTCCCCTCC ACTCCAGGAA CCCCCACGCC 540

CACGCCCCAC ACCTCCCTGG GCTCCCCTGT GTCATCCGAC CCGGTGCACA TGTCGCCCCT 600 

GGAGCCCCGG GGAGGGCAAG GCGACGGCCT CGCCCTTGTG CTGATCCTGG CGTTCTGTGT 660 

GGCCGGTGCA GCCGCCCTCT CCGTAGCCTC CCTCTGCTGG TGCAGGCTGC AGCGTGAGAT 720 

CCGCCTGACT CAGAAGGCCG ACTACGCCAC TGCGAAGGCC CCTGGCTCAC CTGCAGCTCC 780 

CCGGATCTCG CCTGGGGACC AGCGGCTGGC ACAGAGCGCG GAGATGTACC ACTACCAGCA 840 

CCAACGGCAA CAGATGCTGT GCCTGGAGCG GCATAAAGAG CCACCCAAGG AGCTGGACAC 900 

GGCCTCCTCG GATGAGGAGA ATGAGGACGG AGACTTCACG GTGTACGAGT GCCCGGGCCT 960 

GGCCCCGACC GGGGAAATGG AGGTGCGCAA CCCTCTGTTC GACCACGCCG CACTGTCCGC 1020 

GCCCCTGCCG GCCCCCAGCT CACCGCCTGC ACTGCCATGA CCTGGAGGCA GACAGACGCC 1080 

CACCTGCTCC CCGACCTCGA GGCCCCCGGG GAGGGGCAGG GCCTGGAGCT TCCCACTAAA 1140 

AACATGTTTT GATGCTGTGT GCTTTTGGCT GGGCCTCGGG CTCCAGGCCC TGGGACCCCT 1200 

TGCCAGGGAG ACCCCCGAAC CTTTGTGCCA GGACACCTCC TGGTCCCCTG CACCTCTCCT 1260 

GTTCGGTTTA GACCCCCAAA CTGGAGGGGG CATGGAGAAC CGTAGAGCGC AGGAACGGGT 1320 

GGGTAATTCT AGAGACAAAA GCCAATTAAA GTCCATTTCA GAAAAAAAAA A 1371 

Seq ID NO: C212 DNA Sequence 

Nucleic Acid Accession #: NM_004445.1 

Coding sequence: 799..3819

1 11 21 31 41 51

| | | | | |

CGGAGGGGGC GGGCCGGGCT GCGTTCGCTC CAGCCGCGGC TCTACAGCAG CGGGCGGCGG 60 

GACCCGGGAC CCAGCTTGGC GACGGCGATT CTCGACGCGG GCCCCCAGGA TTCTCCCGGC 120 

GCCCCACCTC TGGAGCAGCC CCTGCCGCCA GCGTCAGGTC CACCCCGGAA TCCCAGGGAC 180 

TCTCGGCGCC GAACGGACCC GGGCCGGTGC AACGGGGTCC CCGGACTGGA GAAGACGCGG 240 

GTGGCACCGT GCGAGCTCCA GGAGCCCCGG GTCCACTGCG AGGCCTCGGG GGGCGCAGAC 300 

CTGCAGAGAC TGCGGCCAAC GGGAAGAAAT AAAGGGATTA TAGTCCACCC AATTCACAGA 360 

CTTCTGAGAC TCAGACACGA GGAGAGATAG AGAACCGCCA ATCTCTAGAT CAACAAGCAA 420 

AGGAGGTGCC AAGCCTGTTT GTCTTCATTG TGACACTGGA GTCTAGATGC TGGGAAGTCC 480 

AAGATCAGGG TGCCGGCATG GTCAGTTCCT GGCGAAGCCT CTCTTCTAGG TTTCAGACTG 540 

CCCTCTTCTT TGTTGTGTCC TCGAATGGCA GAAAAAGGGG TGGCTGTTGG AGGAAGGGAG 600 

GAGAGTAAAT GAAGAGAAAG AACTGGAATA ACCCCTTGCA GAAAAAAAAA AAAAGGGAAG 660 

CTTAGCTGTA CACCCTGAGT CTTGCAAAAG CTGCAGCCCC ACCCAGGAGC AGGGTGGTGG 720 

CTGGGGCGAT GGTGGACGCC CTGAAGATGT CCCATGGCTA CTGAAGGGGC TGCCCAGTTA 780 

GGGAACAGAG TGGCGGGCAT GGTGTGTAGC CTATGGGTGC TGCTCCTGGT GTCTTCAGTT 840 

CTGGCTCTGG AAGAGGTATT GCTGGACACC ACCGGAGAGA CATCTGAGAT TGGCTGGCTC 900 

ACCTACCCAC CAGGGGGGTG GGACGAGGTG AGTGTTCTGG ACGACCAGCG ACGCCTGACT 960 

CGGACCTTTG AGGCATGTCA TGTGGCAGGG GCCCCTCCAG GCACCGGGCA GGACAATTGG 1020 

TTGCAGACAC ACTTTGTGGA GCGGCGCGGG GCCCAGAGGG CGCACATTCG ACTCCACTTC 1080 

TCTGTGCGGG CATGCTCCAG CCTGGGTGTG AGCGGCGGCA CCTGCCGGGA GACCTTCACC 1140 

CTTTACTACC GTCAGGCTGA GGAGCCCGAC AGCCCTGACA GCGTTTCCTC CTGGCACCTC 1200 

AAACGCTGGA CCAAGGTGGA CACAATTGCA GCAGACGAGA GCTTTCCCTC CTCCTCCTCC 1260 

TCCTCCTCCT CTTCTTCCTC TGCAGCGTGG GCTGTGGGAC CCCACGGGGC TGGGCAGCGG 1320 

GCTGGACTGC AACTGAACGT CAAAGAGCGG AGCTTTGGGC CTCTCACCCA ACGCGGCTTC 1380 

TACGTGGCCT TCCAGGACAC GGGGGCCTGC CTGGCCCTGG TCGCTGTCAG GCTCTTCTCC 1440 

TACACCTGCC CTGCCGTGCT CCGATCCTTT GCTTCCTTTC CAGAGACGCA GGCCAGTGGG 1500 

GCTGGGGGGG CCTCCCTGGT GGCAGCTGTG GGCACCTGTG TGGCTCATGC AGAGCCAGAG 1560 

GAGGATGGAG TAGGGGGCCA GGCAGGAGGC AGCCCCCCCA GGCTGCACTG CAACGGGGAG 1620 

GGCAAGTGGA TGGTAGCTGT CGGGGGCTGC CGCTGCCAGC CTGGATACCA ACCAGCACGA 1680 

GGAGACAAGG CCTGCCAAGC CTGCCCACGG GGGCTCTATA AGTCTTCTGC TGGGAATGCT 1740 

CCCTGCTCAC CATGCCCTGC CCGCAGTCAC GCTCCCAACC CAGCAGCCCC CGTTTGCCCC 1800 

TGCCTGGAGG GCTTCTACCG GGCCAGTTCC GACCCACCAG AGGCCCCCTG CACTGGTCCT 1860 

CCATCGGCTC CCCAGGAGCT TTGGTTTGAG GTGCAAGGCT CAGCACTCAT GCTACACTGG 1920 



CGCCTGCCTC GGGAGCTGGG GGGTCGAGGG GACCTGCTCT TCAATGTCGT GTGCAAGGAG 1980 

TGTGAAGGCC GCCAGGAACC TGCCAGCGGT GGTGGGGGCA CTTGTCACCG CTGCAGGGAT 2040 

GAGGTCCACT TCGACCCTCG CCAGAGAGGC CTGACTGAGA GCCGAGTGTT AGTGGGGGGA 2100 

CTCCGGGCAC ACGTACCCTA CATCTTAGAG GTGCAGGCTG TTAATGGGGT GTCTGAGCTC 2160 

AGCCCTGACC CTCCTCAGGC TGCAGCCATC AATGTCAGCA CCAGCCATGA AGTGCCCTCT 2220 

GCTGTCCCTG TGGTGCACCA GGTGAGCCGG GCATCCAACA GCATCACGGT GTCCTGGCCG 2280 

CAGCCCGACC AGACCAATGG GAACATCCTG GACTATCAGC TCCGCTACTA TGACCAGGCA 2340 

GAAGACGAAT CCCACTCCTT CACCCTGACC AGCGAGACCA ACACTGCCAC CGTGACACAG 2400 

CTGAGCCCTG GCCACATCTA TGGTTTCCAG GTGOGGGCCC GGACTGCTGC CGGCCAOGGC 2460 

CCCTACGGGG GCAAAGTCTA TTTCCAGACA CTTCCTCAAG GGGAGCTGTC TTCCCAGCTT 2520 

CCGGAAAGAC TCTCCTTGGT GATCGGCTCC ATCCTGGGGG CTTTGGCCTT CCTCCTGCTG 2580 

GCAGCCATCA CCGTGCTGGC GGTCGTCTTC CAGCGGAAGC GGCGTGGGAC TGGCTACACG 2640 

GAGCAGCTGC AGCAATACAG CAGCCCAGGA CTCGGGGTGA AGTATTACAT CGACCCCTCC 2700 

ACCTACGAGG ACCCCTGTCA GGCCATCCGA GAACTTGCCC GGGAAGTCGA TCCTGCTTAT 2760 

ATCAAGATTG AGGAGGTCAT TGGGACAGGC TCTTTTGGAG AAGTGCGCCA GGGCCGCCTG 2820 

CAGCCACGGG GACGGAGGGA GCAGACTGTG GCCATCCAGG CCCTGTGGGC CGGGGGCGCC 2880 

GAAAGCCTGC AGATGACCTT CCTGGGCCGG GCCGCAGTGC TGGGTCAGTT CCAGCACCCC 2940 

AACATCCTGC GGCTGGAGGG CGTGGTCACC AAGAGCCGAC CCCTCATGGT GCTGACGGAG 3000 

TTCATGGAGC TTGGCCCCCT GGACAGCTTC CTCAGGCAGC GGGAGGGCCA GTTCAGCAGC 3060 

CTGCAGCTGG TGGCCATGCA GCGGGGAGTG GCTGCTGCCA TGCAGTACCT GTCCAGCTTT 3120 

GCCTTCGTCC ATCGCTCGCT GTCTGCCCAC AGCGTGCTGG TGAATAGCCA CTTGGTGTGC 3180 

AAGGTGGCCC GTCTTGGCCA CAGTCCTCAG GGCCCAAGTT GTTTGCTTCG CTGGGCAGCC 3240 

CCAGAGGTCA TTGCACATGG AAAGCATACA ACATCCAGTG ATGTCTGGAG CTTTGGGATA 3300 

CTCATGTGGG AAGTGATGAG TTATGGAGAA CGGCCTTACT GGGACATGAG TGAGCAGGAG 3360 

GTACTAAATG CAATAGAGCA GGAGTTCCGG CTGCCCCCGC CTCCAGGCTG TCCTCCTGGA 3420 

TTACATCTAC TTATGTTGGA CACTTGGCAG AAGGACCGTG CCCGGCGGCC TCATTTTGAC 3480 

CAGCTGGTGG CTGCATTTGA CAAGATGATC CGCAAGCCAG ATACCCTGCA GGCTGGCGGG 3540 

GACCCAGGGG AAAGGCCTTC CCAGGCCCTT CTGACCCCTG TGGCCCTGGA CTTTCCTTGT 3600 

CTGGACTCAC CCCAGGCCTG GCTTTCAGCC ATTGGACTGG AGTGCTACCA GGACAACTTC 3660 

TCCAAGTTTG GCCTCTGTAC CTTCAGTGAT GTGGCTCAGC TCAGCCTAGA AGACCTGCCT 3720 

GCCCTGGGCA TCACCCTGGC TGGCCACCAG AAGAAGCTGC TGCACCACAT CCAGCTCCTT 3780 

CAGCAACACC TGAGGCAGCA GGGCTCAGTG GAGGTCTGAG AATGACGATA CCCGTGACTC 3840 

AGCCCTGGAC ACTGGTCCGA GAAGGGACAT GTGGGACGTG AGCCGGGCTC CAACAGCCTC 3900 

TGTGAGAGAT GCCCCACACC AAACCCAACC CTCCCGATGG CTGCATTCCC TGGTCCTCCG 3960 

CCTCTCCACC AGCCCCCTCC TCATTAAAGG GAAAGAAGGG AATTTGCAAA 4010 



Seq ID NO: C213 DNA Sequence 

Nucleic Acid Accession #: XM_043340.4 

Coding sequence: 195..1067

1 11 21 31 41 51

| | | | | |

GGGCGGCGCC CAATGGGCTG CGCGGAGCGT CACTTCCCGG CAGCGGGAGG CGAGTGGCGA 60 

GTGGCGAGTG GCGAGTGTCA GGGGGGCGGC CGGCGGGGGC GGGGCGGCCG GAGGAGGCGT 120 

TGGCAGCGGG CTCGGACCCA CGCGGCGCCG CGGCCCGCCT GGCCTGCAGC GCTCCCACCC 180 

CCGGCGGCGG CACGATGCCC TTTGACTTCA GGAGGTTTGA CATCTACAGG AAGGTGCCCA 240 

AGGACCTTAC GGAGCCAACG TACACCGGGG CCATTATCTC CATCTGCTGC TGCCTCTTCA 300 

TCCTCTTCCT CTTCCTCTCG GAGCTCACCG GATTTATAAC GACAGAAGTT GTGAACGAGC 360 

TCTATGTCGA TGACCCAGAC AAGGACAGCG GTGGCAAGAT CGACGTCAGT CTGAACATCA 420 

GTTTACCCAA TCTGCACTGC GAGTTGGTTG GGCTTGACAT TCAGGATGAG ATGGGCAGGC 480 

ACGAAGTGGG CCACATCGAC AACTCCATGA AGATCCCGCT GAACAATGGG GCAGGCTGCC 540 

GCTTCGAGGG GCAGTTCAGC ATCAACAAGG TCCCCGGCAA CTTCCACGTG TCCACACACA 600 

GTGCCACAGC CCAGCCACAG AACCCAGACA TGACGCATGT CATCCACAAG CTCTCCTTTG 660 

GGGACACGCT ACAGGTCCAG AACATCCACG GAGCTTTCAA TGCTCTCGGG GGAGCAGACA 720 

GACTCACCTC CAACCCCCTG GCCTCCCACG ACTACATCCT GAAGATTGTG CCCACGGTTT 780 

ATGAGGACAA GAGTGGCAAG CAGCGGTACT CCTACCAGTA CACGGTGGCC AACAAGGAAT 840 

ACGTCGCCTA CAGCCACACG GGCCGCATCA TCCCTGCAAT CTGGTTCCGC TACGACCTCA 900 

GCCCCATCAC GGTCAAGTAC ACAGAGAGAC GGCAGCCGCT GTACAGATTC ATCACCACGA 960 

TCTGTGCCAT CATTGGCGGG ACCTTCACCG TCGCCGGCAT CCTGGACTCA TGCATCTTCA 1020 

CAGCCTCTGA GGCCTGGAAG AAGATCCAGC TGGGCAAGAT GCATTGACGC CACACCCAGC 1080 

CTAATGGCCG AGGACCCTGG GCATCGCCAG CCTTGCCTCC AGTGCCCTGT CTCCTTTGGC 1140 

CCTCAATCTG GTCCCAAATC TGGCTGTGTC CCAAAGGGTG TGTGGGAAGT GGGGGGAAAG 1200 

TAGAGGATGG CTCGATGTTT TGCAGCTACC TCTTTTCCCC GTGTTTCTTT TTAGACAAAT 1260 

TACACTGCCT GAAGTTGCAG TTCCCCTTTC CCTGGGGAGC CCCAAGAACA GAGTCAGGCA 1320 

AGGGGTGGGG AGTCCAGGGG AACATCCCAG AATGCATATC GATCAGCTCT CAGCCAGGCT 1380 

TCGACAATCT CGCAGCCCCC ACTAGGTGGA CACATTAATG ATTTGGTTTC TCCCCTGGGC 1440 

AGCCAACCTG CCCCAGAGGC ACCAGACCTG GGCTTTCAGC TTTGGGACCA GGCTGCCCAA 1500 

AGGTACTCCT TTATACACCC GGCACCTTCC ACGAAAGATG GTACTTCCCA AGCAAGCCCC 1560 

TATGATTTGT CACTATAGAT GGAAATGTGT GGCATGCACA TGAGTTGAAA TTCTTTTATG 1620 

CATTTTTTTG AAGAAAAAAA AAAAAACAAC TCTGAGGACA TAGGGGATGT CAGTTTCCTA 1680 

TGGAAGAGAC ACCTCTGACC CGTTATTCTT ATAATCAAAA TCTGAAGGGA AAAAAATGTT 1740 

TTAGTTCTTT CCCCACTCGT TGGGTTCAAC TAGATTAAAA GGCTGATTTT CAG 1793 

Seq ID NO: C214 DNA Sequence 

Nucleic Acid Accession #: NM_002151.1

Coding sequence: 246..1499

1 11 21 31 41 51

| | | | | |

TCGAGCCCGC TTTCCAGGGA CCCTACCTGA GGGCCCACAG GTGAGGCAGC CTGGCCTAGC 60 
AGGCCCCACG CCACOGCCTC TGCCTCCAGG COGCCCGCTG CTGCGGGGCC ACCATGCTCC 120 
TGCCCAGGCC TGGAGACTGA CCCGACCCCG GCACTACCTC GAGGCTCCGC CCCCACCTGC 180 
TGGACCCCAG GGTCCCACCC TGGCCCAGGA GGTCAGCCAG GGAATCATTA ACAAGAGGCA 240 
GTGACATGGC GCAGAAGGAG GGTGGCCGGA CTGTGCCATG CTGCTCCAGA CCCAAGGTGG 300 






CAGCTCTCAC 
TTGTGGCTGT 
CGGACGCTCG 
CGCGCTCCAA 
TGACCCACTC 
GTGTGGACGA 
ATTGCCCCAG 
CCGTGGACCG 
GCCTTCGCTA 
TGACAGCCGC 
CCGGTGCCGT 
ACCACGGGGG 
CCCTGGTCCA 
CAGCTGCCGG 
CGCAGTACTA 
ATGATGTCTG 
CTGGCTACCC 
GTGAGGACAG 
CTGGCTGTGC 
GGATCTTCCA 
CGGTGGCTTC 
CTGGGCCGAG 
CCCTCCAGGG 
CACCCTCCTG 
CTGATGATGG 



TGCGGGGACC 
TCTCCTCAGG 
GCTCATGGTC 
CGCCAGGGTA 
CGAGCTGGAC 
GGGGAGGCTG 
AGGCOGTTTC 
CATCGTGGGA 
TGATGGAGCA 
CCACTGCTTC 
GGCCCAGGCC 
CTATCTTCCC 
CCTCTCCAGT 
CCAGGCCCTG 
TGGCCAACAG 
CAATGGCGCT 
CGAGGGTGGC 
CATCTCTOGG 
CCTGGCCCAG 
GGCCATAAAG 
TCGCTGCGCA 
GATGGGACGT 
TCCTCTCTTC 
ACCCCCATGT 
GATGCTCTTT 



CTGCTACTTC 
AGTGACCAGG 
TTTGACAAGA 
GCCGGACTCA 
GTGCGAACGG 
CCCCACACCC 
TTGGCCGCCA 
GGCCGGGACA 
CACCTCTGTG 
CCGGAGCGGA 
TCTCCCCACG 
TTTCGGGACC 
CCCCTGCCCC 
GTGGATGGCA 
GCCGGGGTAC 
GACTTCTATG 
ATTGATGCCT 
ACGCCACGTT 
AAGCCAGGCG 
ACTCACTCCG 
GCCTCCAGGG 
TTTTCTTCTT 
CACAGTGGCG 
AAATATTGTT 
AAATAATAAA 



TGACAGCCAT 
AGCCGCTGTA 
CGGAAGGGAC 
GCTGCGAGGA 
CGGGCGCCAA 
AGAGGCTGCT 
TCTGCCAAGA 
CCAGCTTGGG 
GGGGATCCCT 
ACCGGGTCCT 
GTCTGCAGCT 
CCAACAGCGA 
TCACAGAATA 
AGATCTGTAC 
TCCAGGAGGC 
GAAACCAGAT 
GCCAGGGCGA 
GGCGGCTGTG 
TCTACACCAA 
AAGCCAGCGG 
CCCGAGGTGA 
GGGCCCGGTC 
GGCCCACTCA 
CTGCTGTCTG 
GATGGTTTTG 



CGGGGCGGCA 
CCCAGTGCAG 
GTGGCGGCTG 
GATGGGCTTC 
TGGCACGTCG 
GGAGGTCATC 
CTGTGGCCGC 
CCGGTGGCCG 
GCTCTCCGGG 
GTCCCGATGG 
GGGGGTGCAG 
GGAGAACAGC 
CATCCAGCCT 
CGTGACGGGC 
TCGAGTCCCC 
CAAGCCCAAG 
CAGCGGTGGT 
TGGCATTGTG 
AGTCAGTGAC 
CATGGTGACC 
TCCCGGTGGT 
CACAGGTCCA 
GCCCCGAGAC 
GGACTCCTGT 
ATT 



TCCTGGGCCA 
GTCAGCTCTG 
CTGTGCTCCT 
CTCAGGGCAC 
GGCTTCTTCT 
TCCGTGTGTG 
AGGAAGCTGC 
TGGCAAGTCA 
GACTGGGTGC 
CGAGTGTTTG 

AACGATATTG 
GTGTGCCTCC 
TGGGGCAACA 
ATAATCAGCA 
ATGTTCTGTG 
CCCTTTGTGT 
AGTTGGGGCA 
TTCCGGGAGT 
CAGCTCTGAC 
GGGATCCACG 
AGGACACCCT 
CACCCAACCT 
CTAGGTGCCC 



360
420
480
540
600
660
720
780
840
900
960
1020
1080
1140
1200
1260
1320
1380
1440
1500
1560
1620
1680
1740
1783

Seq ID NO: C215 DNA Sequence
Nucleic Acid Accession #: AB037745.1
Coding sequence: 26..1744

1 11 21 31 41 51

| | | | | |

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTGGAGACG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 360 

CACTACCACG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420 

GTACACCAAT GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCOGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACCGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTGACC TCCGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA CAGCCTGTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380 

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560 

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATGAT 1620 

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCGGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGGGGG CAGATCATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2580 

GCGTGGGCCT GGAAGGCGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 

CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 

CTCAAGTGAT CTGTCCGCCT CGGCCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 

CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCOC AAAGCCCTGG 3120 
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TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 

GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 

GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 

GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA GGCTGGCCAA 3480 

AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACCAGAA TCAAACCAGT CTTCAAGGAA 3540 

GGAAGGCTAG GAGAGTTTAA CAAGATTTTC ACTGGGCCCA GCATGGTGGC TCACACCTGT 3600 

AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 

TGAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGGATACTT 3720 

CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 

AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 

GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 

TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 3960 

CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 

CCATGATTTG TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT GGTAGATATA 4080 

GAAAGGGGAT GGCATTTGCA AGTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 

TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 

GGCCATCAAC AGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260 

TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320 

TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAGCTCCA GCCTAAAGTC 4380 

TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 4500 

TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560 

ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620 

GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680 

ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860 

CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATCCT TCTGCTTCAG 4920 

CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAGAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040 

ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100 

CTACGTGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160 

TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

TTGAGTCCTG GAACACGCAG TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280 

ATACAAATGA AGATTGTGGT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT CGAATTTTTT TTCATCAGGG 5460 

CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGCGTTAA TAGTTTGTTA GAAGATGGGT 5520 

TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567 



Seq ID NO: C216 DNA Sequence 
Nucleic Acid Accession #: NM_004864.1 
Coding sequence: 26..952 

1 11 21 31 41 51 

| | | | | | 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCOGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTOGCCGCC GCCGTGGCAG TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGOGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCAOCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 

AAAA 1204 

Seq ID NO: C432 DNA Sequence 

Nucleic Acid Accession #: NM_052858.1 

Coding sequence: 54.. 1259 

1 11 21 31 41 51 

| | | | | | 

GGCACGAGGT GTTGCCCTCA GGTCGCTCCC GGGCGCGGAC ACGGAACCCG GCCATGGAAG 60 

ATCCGTCGGG GGCTCGCGAG CCCCGGGCCC GGCCGAGAGA GCGGGACCCG GGACGGCGCC 120 

CCCACCCAGA CCAAGGCCGC ACCCACGATC GACCGCGGGA CCGACCCGGG GACCCGCGCA 180 

GGAAGCGAAG CAGCGACGGG AACCGGCGAA GGGACGGGGA CCGGGACCCG AAGAGAGACC 240 

AGGAGAGGGA CGGGAACCGC GACCGGAACC GGGACCGGGA GAGGGAGAGA GAGAGGGAAA 300 

GAGACCCGGA CCGAGGCCCC CGCCGGGACA CACACAGGGA CGCGGGCCCT CGCGCAGGTG 360 

AACACGGAGT TTGGGAAAAA CCGCGCCAAA GCCGGACGCG GGACGGAGCC CGGGGACTGA 420 

CCTGGGACGC AGCCGCGCCT CCTGGGCCCG CGCCCTGGGA AGCCCCGGAG CCGCCGCAGC 480 
CGCAGAGGAA GGGAGACCCC GGGCGCCGCA GACCCGAAAG TGAACCCCCT TCGGAGAGAT 540 

ATCTGCCCTC GACCCCCAGG CCTGGACGAG AGGAGGTGGA ATATTACCAG TCAGAGGCGG 600 

AAGGACTCCT GGAATGCCAC AAATGCAAAT ACTTGTGCAC TGGGAGAGCC TGCTGCCAAA 660 

TGCTGGAGGT TCTCCTGAAC TTGCTGATCC TGGCCTGCAG CTCTGTGTCT TACAGTTCCA 720 

CAGGGGGCTA CACGGGCATC ACCAGCTTGG GGGGCATTTA CTACTATCAG TTCGGAGGGG 780 

CTTACAGTGG CTTTGATGGT GCTGACGGGG AGAAGGCCCA GCAACTGGAT GTCCAGTTCT 840 

ACCAGCTAAA GCTGCCCATG GTCACTGTGG CAATGGCCTG TAGTGGAGCC CTCACAGCCC 900 

TCTGCTGCCT CTTOGTTGCC ATGGGTGTCC TGCGGGTCCC GTGGCATTGT CCACTGTTGC 960 

TGGTGACCGA AGGCTTGTTG GACATGCTCA TCGCGGGGGG GTACATCCCG GCCTTGTACT 1020 

TCTACTTCCA CTACCTCTCT GCTGCCTATG GCTCTCCTGT GTGTAAAGAG AGGCAGGCGC 1080 

TGTACCAAAG CAAAGGCTAC AGCGGTTTCG GCTGCAGTTT CCACGGAGCA GATATAGGAG 1140 

CTGGAATCTT TGCTGCCCTG GGCATTGTGG TCTTTGCCCT GGGGGCGGTC CTGGCCATAA 1200 

AGGGCTACCG AAAAGTTAGG AAGCTAAAAG AGAAGCCAGC AGAAATGTTT GAATTTTAAG 1260 

GGTTTCTAAA ACGCTCTGAC AGATGCAAGT GGTGGTGGAA GGTAGTCTGA GCCACTGCCT 1320 

TTCCCAAGAA TCCCTTGTTG TGGAAGTTTC CAATGCTGGA AAAGCAGCGA GCCAGOGTTG 1380 

GTGTGGTGGG CGGAGCTCCC AGTCGCATGG AGCGGTGTTC ATGGATGCAA CAGACCCTGG 1440 

CTTCTGGAGT CCTCTGTGAG TGAGGGACCA ATCAAAATTA TTTTTCAAAA AGCAAAAAAA 1500 

TGGCCGGCCT CGGCGGCTCA CACCTGTAAC CCCAGCACTT TGGGAGGCTG AGGTGGGTGG 1560 

ATCACTTGAG GTCAGGAGCT CGAGACCAGC TTGGCCAACA TGGTGAGCCC CCGTCTCTAC 1620 

TAAAATACAA AAAAATTAGC CAGGCGTGGT GGCGGGOGCC TGTAATCCCA GCTACTTGGG 1680 

AGGCTGAGGC AGGAGAATCG CTTGAATCTG GGAGGCGGAG ATTGCAGTGA GCCGAGATCC 1740 

CGCCACTGCA CTCCAGCCCA GGTGACAGAG CGAGACTCCA TCTCAAAAAA AAAAAAAAAA 1800 



Seq ID NO: C434 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 261.. 2861 



1 11 21 31 41 51 

| | | | | | 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC GGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACG7GG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 
AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 
ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 
GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 
GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 
GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 
GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 
GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 
AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 
CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 
AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 
CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 
CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 
GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 
CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 
TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 
CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 
TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 
TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 
GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300 
AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360 
AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420 
AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480 
TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540 
CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 
TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 
TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720 
GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 
AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840 
TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 
GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960 
GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 
TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 
GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 
TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 4200 
AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 
TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320 
TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380 
TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 
TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 
ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 
GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 
TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA AA 4702 



Seq ID NO: C217 Protein Sequence 
Protein Accession #: NP_005805.1 

1 11 21 31 41 51 

| | | | | | 

MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK 60 

LLLTHTERW IWPFSNKNYI HGELYKNRVS ISNNAEQSDA SITIDQLTMA DNGTYECSVS 120 

LMSDLEGNTK SRVRLLVLVP PSKPECGIEG ETIIGNNIQL TCQSKEGSPT PQYSWKRYNI 180 

LNQEQPLAQP ASGQPVSLKN ISTDTSGYYI CTSSNEEGTQ FCNITVAVRS PSMNVALYVG 240 

IAVGWAALI IIGIIIYCCC CRGKDDNTED KEDARPNREA YEEPPEQLRE LSREREEEDD 300 

YRQEEQRSTG RESPDHLDQ 319 



Seq ID NO: C218 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | | 

MGSRTPBSPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GFSVEPYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP            120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

DNPTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVPPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSPG 480 

VDKAWYRGR PrVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QKQKGGVRRA LPLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNP SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SOGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 

YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 1049 



Seq ID NO: C219 Protein Sequence 
Protein Accession #: NP_002412.1 

1 11 21 31 41 51 

| | | | | | 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAPQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240 

TFSGDVQLAQ DDIDGIQAIY GRSQNFVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNPISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 

GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 469 



Seq ID NO: C220 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | | 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 240 

WO 03/042661 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHTDAA LSEENTGKTY FFVAKKYWRY DEYKRSMDPG YPKMIAHDFP 420 

GIGHKVDAVP MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 469 



Seq ID NO: C221 Protein Sequence 
Protein Accession #: NP_055146.1 

1 11 21 31 41 51 

| | | | | | 

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 

GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITWMVLN 180 

SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240 

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CTSMAITIGV YVLTNVAYFT TINAEELLLS 300 

NAVAVTFSER LU3NFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSIiLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 

SEKITRTLQI ILEWPEEDKL 501 



Seq ID NO: C222 Protein Sequence 
Protein Accession #: NP_003237.1 

1 11 21 31 41 51 

| | | | | | 

MGIAWGLGVL FLMHVCGTNR IPESGGDNSV FDIFELTGAA RKGSGRRLVK GPDPSSPAFR 60 

IEDANLIPPV PDDKFQDLVD AVRAEKGFLL LASLRQMKKT RGTLLALERK DHSGQVFSW 120 

SNGKAGTLDIi SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENAELD 180 

VPIQSVFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVFG TTPEDILRNK GCSSSTSVLL 240 

TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDELSSM VLELRGLRTI VTTLQDSIRK 300 

VTEENKELAN ELRRPPLCYH NGVQYRNNEE WTVDSCTECH CQNSVTICKK VSCPIMPCSN 360 

ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGEARE 480 

TKACKKDACP INGGWGPWSP WDICSVTCGG GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

LPNSGQEDYD KDGIGDACDD DDDNDKIPDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN 780 

HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISETDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKELVQTV NCDPGLAVGY DEFNAVDFSG TFFINTERDD 1020 

DYAGFVFGYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHLRNAL 1080 

WHTGNTPGQV RTLWHDPRHI GWKDFTAYRW RLSHRPKTGF IRWMYEGKK IMADSGPIYD 1140 

KTYAGGRLGL FVFSQEMVFF SDLKYECRDP 1170 



Seq ID NO: C223 Protein Sequence 
Protein Accession #: NP_002183.1 

1 11 21 31 41 51 

| | | | | | 

MPLLWLRGFL LASCWIIVRS SPTPGSEGHS AAPDCPSCAL AALPKDVPNS QPEMVEAVKK 60 

HILNMLHLKK RPDVTQPVPK AALLNAIRKL HVGKVGENGY VEIEDDIGRR AEMNELMEQT 120 

SEIITFAESG TARKTLHFEI SKEGSDLSW ERAEVWLFLK VPKANRTRTK VTIRLFQQQK 180 

HPQGSLDTGE EAEEVGLKGE RSELLLSEKV VDARKSTWHV FPVSSSIQRI LDQGKSSLDV 240 

RIACEQCQES GASLVLLiGKK KKKEEEGEGK KKGGGEGGAG ADEEKEQSHR PFLMLQARQS 300 

EDHPHRRRRR GLECDGKVNI CCKKQFFVSF KDIGWNDWII APSGYHANYC EGECPSHIAG 360 

TSGSSLSFHS TVINHYRMRG HSPFANLKSC CVPTKLRPMS MLYYDDGQNI IKKDIQNMIV 420 

EECGCS 426 



Seq ID NO: C224 Protein Sequence 
Protein Accession #: NP_000086.1 

1 11 21 31 41 51 

| | | | | | 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 

FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGIAF AKANKQVCTD 180 

INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 

EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 

RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWIiD 540 

PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 

FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQX* RNALWHTGDT 660 

ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIKVRFY EGPELVADSN WLDTTMRGG 720 

RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 757 

Seq ID NO: C225 Protein Sequence 
Protein Accession #: NP_612464 

1 11 21 31 41 51 

| | | | | | 






PCT/US02/36810 



MRPQGPAASP QRLRGLIjLLIj LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 60 

GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 

GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180 

GSPEMKSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 

LPK 243 

Seq ID NO: C226 Protein Sequence 
Protein Accession #: NP_003216.1 
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1 11 21 31 41 51 

I I I I I I 

MATMENKVIC ALVLVSMLAL GTLAEAQTET CTVAPRERQN CGFPGVTPSQ CANKGCCFDD 
TVRGVFWCFY PNTIDVPPEE ECEF 



Seq ID NO: C227 Protein Sequence 
Protein Accession #: NP_056234.1 



11 



MPKRAHWGAL 
NLGFNSIOAL 
QTLQGLSNLM 
RLSTIRHLYL 
CKKDKAYEGG 
DGGSQLILEK 
VAIiDFECPMT 
VRAQILAEPE 
PSGAVQRDQT 
RIKSMEPSDS 
ALAIPEAHLS 
HFTVGITVTK 
EVFLKTKDDA 
ERWADILAKV 
SSADVPLLGE 
TEGDLKGTAA 
EPPLDAVSLA 
STIGEPGVPG 
RSSESEGQES 
TMSTHPSRRR 
VPTAWVDNTV 
PENKHRNIVT 
SDGKEIKDDV 
NPSRTAQPGR 
TTLSSIKVEV 
MSLGQTTTTK 
PSSDRDAFNL 
LPEMSTQSAS 
PSIPSKFTDR 
PQLGVTRRPQ 
TNLQNIPMVS 
QTVSVTAETD 
QDRGQYMCTA 
QISWIFPDRR 
HVAALPPVIH 
VFPNGTLYIR 
GGTLKLDCSA 
CVARNKVGDD 
DGSLVNSFMQ 
WTAPATIRN 
GTLLIQKAQR 
RKLIDCKAEG 
RNEGGEARLI 
GTDLQSGQQL 
KQYHNLVSII 
ASVFDRGTYV 
PKADITWELP 
KTTYIHVF 



SWLILLWGH 
SETSFAGLTK 
RLHIDHNKIE 
AENMVRTLPA 
QLCAMCFSPK 
FQLPQWSISL 
RENYEKLWKli 
WVMQPSIDIQ 
VLEGGPCQLS 
GLYQCIAQVR 
WILPNRRIIN 
KGSGLPSKRG 
INGDKKAKKG 
RGKNLPKGTE 
EEHVLGTISS 
PTLISEPYEP 
ESEPMQYFDP 
QSBLQGLTDN 
KSITIiPDSTL 
PNGRRRLRPN 
NTPKQLEMEK 
PSSETILLPR 
ATNVDKHKSD 
LQTDIPVTTS 
ASSQAETTTL 
PALPSPRISQ 
STKLELEKQV 
RYFVTSQSPR 
RTDQFNGYSK 
IPTSPAFVMR 
STQSSISFIT 
TVFPCEATGK 
SNLHGLDRMV 
VWQTVSPVES 
QEKLENISLP 
NLAPKDSGRY 
SGDPWPRILH 
YWLKVDWM 
SDDSGGRTKR 
KTYLAVQVPY 
SDSGNYTCLV 
IPTPRVLWAF 
VQLTVLEPME 
QRFYHKADGM 
NGETLKLPCT 
CRMETEYGPS 
DKSHLKAGVQ 



21 31 

1 I 
PRVALACPHP CACYVPSEVH 
LELLMIHGNE IPSIPDGALR 
FIHPQAFNGfc TSLRLLHLEG 
SMLRNMPLLE NLYLQGNPWT 
KLYKHEIHKL KDMTCLKPSI 
NMTDEHGNMV NLVCDIKKPM 
IAYYSEVPVK LHRELMLSKD 
LNRRQSTAKK VLLSYYTQYS 
CNVKASESPS IFWVLPDGSI 
DEMDRMVYRV LVQSPSTQPA 
DLANTSHVYM LPNGTLSIPK 
RRPGAKALSR VREDIVEDEG 
RRKLKLWKHS EKEPETNVAE 
VPPLIKTTSP PSLSLEVTPP 
ASMGLEHNHN GVILVEPEVT 
SPTLHTLDTV YEKPTHEETA 
DLETKSQPDE DKMKEDTFAH 
IHLVKSSLST QDTLLIKKGM 
GIMSSMSPVK KPAETTVGTL 
KFRHRHKQTP PTTFAPSETF 
NAEPTSKGTP RRKHGKRPNK 
TVSLKTEGPY DSLDYMTTTR 
ILVTGESITN AIPTSRSLVS 
GENLTDPPLIi KELEDVDFTS 
DQDHLETTVA ILLSETRPQN 
ASRDSKENVF LNYVGNPETE 
FGSRSLPRGP DSQRQDGRVH 
HWTNKPEITT YPSGALPENK 
VFGNNNIPEA RNPVGKPPSP 
ERKVIPGSYN RIHSHSTFHL 
SSVQSSGSFH QSSSKFFAGG 
PKPFVTWTKV STGALMTPNT 
VLLSVTVQQP QILASHYQDV 
RITLHENRTL SIKEASFSDR 
PGLSIHIHCT AKAAPLPSVR 
ECVAANLVGS ARRTVQLNVQ 
RLPSKRMIDA LFSFDSRIKV 
KPAK1EHKEE NDHKVFYGGD 
YWFNNGTLY FNEVGMREEG 
GDWTVACEA KGEPMPKVTW 
RNSAGEDRKT VWIHVNVQPP 
PEGWLPAPY YGNRITVHGN 
KPIFHDPISE KITAMAGHTI 
LHISGLSSVD AGAYRCVARN 
PPGAGOGRFS WTLPNGMHLE 
VTSIPVIVIA YPPRITSEPT 
ARLYGNRFLH PQGSLTIQHA 



41 51 

I I 
CTFRSLASVP AGIARHVERI 
DLSSLQVFKF SYNKLRVITG 
NLLHQLHPST FSTFTFLDYF 
CDCEMRWFLE WDAKSRGILK 



DVYKIHLNQT 
PRVSYQYRQD 
QTISTKDTRQ 
LKAPMDDPDS 
EKDTVTIGKN 
VQVSDSGYYR 
GSGMGDEENT 
GRRVFESRRR 
FPAVSPPSAS 
STPLEEWDD 
TEGWSAADVG 
LTPTPTIWVN 
KEMSQTLQGG 
LDKDTTTVTT 
STQPTQAPDI 
HRYTPSTVSS 
KIYSSYPKVQ 



EFLSSLTVST 
HTPTAARMKE 
ATFVNNEGTQ 
ASHQLTRVPA 
QFTTPRIiSST 
RIPHYSNGRL 
DFGPPAPPLL 
PPASKFWSI/3 
RIQRFEVLKN 
TVYLGDTIAM 
GVYKCVASNA 
WVLGDGTQIR 
RAAANARITG 
FANGTLWKS 
LKVDCVATGI* 
DYTCFAENQV 
LSPTNKVIPT 
KINGNPNPIT 
GSLDIRSLRK 
SLNCSAAGTP 
AAGETERLVS 
GPQTLGRVSIt 
PVTYTRPGNT 
TQRDAGFYKC 



DPPDIDINAT 
ADEEALYYTG 
ARGRSWVMIE 
KFSILSSGWL 
PGESVTLPCN 
CVAVNQQGAD 
SRRLLHPKDQ 
INMANKQINP 
PVQTVTSAEE 
LSEKTEEITS 
SSPEPTSSEY 
DSSTSQLFED 
NMLEGDPTHS 
TPRQKVAPSS 
KISSQVESSL 
RASGSKPSPS 
ETLPVTYKPT 
PVGFPGTPTW 
PFHQEEAGSS 
PASSSPSTIL 
HMSGPNELST 
KPILPTATVR 
TIPLPLHMSK 
PFFTNKTLSF 
HTPQTTGSPS 
EKPQILTKSP 
GTLVIRKVQV 
ECLAKGTPAP 
AGADSLAIRli 
PSQFLHGNLF 
TSPRRTDVRY 
VTDKDAGDYL 
PNPEISWSLP 
GKDEMRVRVK 
SSEKYQIYQD 
TVREIAAGGS 
SDSVQLVCMA 
TPSLVWVLPN 
LKVGLKPEAN 
LDNGTLTVRE 
VKLNCMAMGI 
KAKNILGSDS 



60 
84 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2828 



Seq ID NO: C228 Protein Sequence 
Protein Accession #: Eos sequence 



1 
I 

MPGTKLTRTG 
SRQYTVRYRE 
TPESAPTTAP 
FQNTFFHTPR 
SLCQQECSCT 
MCYEDP7VSS 
PQGRNAKDLL 
SDTQDQKRTL 
SASPABHAST 



SSPLSKGGKD 



11 

I 

APADYRVILK 
KGELARWDYK 
ENLNVWFVNG 
LSNHLEQSPS 
QKDPSCLAYL 
LTGNSLKSVA 
LDLKNKILAN 
RPPSRHGHSV 
QGTSHRPStiP 
VHPGAKPASP 
GEDAPATNSN 



21 

1 

TSQEDELDVP 
QIANRRVLIE 
KPTWAASWD 
PILETLLLPW 
IDIQTKQVNK 
ASKADVQQNT 
GGAPRKPQLR 
VAPGRTAVRA 
ASLNDNDLVD 
ARRTPHSGAA 



31 
I 

DDISVRVMSS 
NLIPDTVYEF 
ALPETEGKVK 
WMVCSLGNAI 
DPQLEGSVFG 



AKKAEELDLQ 
RMPALPRREG 
SDEDERAVGS 
EEDSSASAPP 



41 
I 

QSVLVSWVDP 
AVRXSQGERD 
VCLLDTGLFS 
FSKSGPQTGE 
PCFLFYFLTF 
PSSPSPRAPA 
STEITGEEEL 
VDKPGFSLAT 
LHPKGAFAQP 
SRLSPPHGGS 
VSEGAEASDG 



51 

1 

VLEKQKKWA 
GKWSTSVFQR 
VSSFQPSAKS 
AWDLTPKPSL 
MLDIGGFSFI 
SSQHPSVPAS 
GSREDSPMSP 
QPRPGAPPSA 
RPALSPSRQS 
SRLLPTQPHL 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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GGRQAEATAQ TLRARPASGH FHLLRHKPFA ANGRSPSRFS IGRGPRLQPS SSPQSTVPSR 720 

AHPRVPSHSD SHPKLSSGIH GDEEDEKPLP ATWNDHVPS SSRQPISRGW EDLRRSPQRG 780 

ASLHRKEPIP ENPKSTGADT HPOGKYSSLA SKAQDVQQST DADTEGHSPK AQPGSTDRHA 840 

SPARPPAARS QQHPSVPRRM TPGRAPEQQP PPPVATSQHH PGPQSRDAGR SPSQPRLSLT 900 

QAGRPRPTSQ GRSHSSSDPY TASSRGMLPT ALQNQDEDAQ GSYDDDSTEV EAQDVRAPAH 960 

AARAKEAAAS LPKHQQVESP TGAGAGGDHR SQRGHAASPA RPSRPGGPQS RARVPSRAAP 1020 

GRSEPPSKRP LSSKSQQSVS AEDEEEEDAG FFKGGKEDLL SSSVPKWPSS STPRGGKDAD 1080 

GSLAKEEREP AIALAPRGGS LAPVKRPLPP PPGSSPRASH VPSRPPPRSA ATVSPVAGTH 1140 

PWPRYTTRAP PGHFSTTPML SLRQRMMHAR FRNPLSRQPA RPSYRQGYNG RPNVEGKVLP 1200 

GSNGKPNGQR IINGPQGTKW WDLDRGLVL NAEGRYLQDS HGNPLRIKLG GDGRTIVDLE 1260 

GTPWSPDGL PLFGQGRHGT PLANAQDKPI LSLGGKPLVG LEVIKKTTHP PTTTMQPTTT 1320 

TTPLPTTTTP RPTTATTMQP TTTTTPLPTT TPRPTTATTR RTTTRRPTTT VRTTTRTTTT 1380 

TTPKPTTPIP TCPPGTLERH DDDGNLIMSS NGIPECYAEE DEFSGLETDT AVPTEEAYVI 1440 

YDEDYEFETS RPPTTTEPST TATTPRVIPE EGAISSFPEE EFDIAGRKRF VAPYVTYLNK 1500 

DPSAPCSLTD ALDHFQVDSL DEIIPNDLKK SDLPPQHAPR NITWAVEGC HSFVIVDWDK 1560 

ATPGDLVTGY LVYSASYEDF IRNKFSTQAS SVTHLPIENL KPNTRYYFKV QAQNPHGYGP 1620 

ISPSVSFVTE fiDNPLLWRP PGGELSGSHS LSNMIPATRT AMDGNM 1666 



Seq ID NO: C229 Protein Sequence 
Protein Accession #: NP_003005.1 

1 11 21 31 41 51 

| | | | | | 

MFLSILVALC LWLH1ALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAIIAIEQ 60 
YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP 120 
ESLACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRLSPDRCKC 180 
KKVKPTLATY LSKKYSYVIH AKIKAVQRSG CNEVTTWDV KEIFKSSSPI PRTQVPLITN 240 
SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD 300 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 346 



Seq ID NO: C230 Protein Sequence 
Protein Accession #: NP_005931.1 

1 11 21 31 41 51 

| | | | | | 

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60 

PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120 

LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180 

FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240 

YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPIiEPDAP PDACEASFDA 300 

VSTIRGELFF FKAGFVWRLR GGQLQPGYPA IASRHWQGLP SPVDAAFEDA QGHIWFFOGA 360 

QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 420 

PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480 

AEPANTFI* 488 



Seq ID NO: C231 Protein Sequence 
Protein Accession #: NP_076927 

1 11 21 31 41 51 

I I I I I I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 

VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 

GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 453 



Seq ID NO: C232 Protein Sequence 
Protein Accession #: NP_003211 

1 11 21 31 41 51 

| | | | | | 

MLWKLTDNIK YEDCEDRHDG TSNGTARLPQ LGTVGQSPYT SAPPLSHTPN ADFQPPYFPP 60 

PYQPIYPQSQ DPYSHVNDPY SLNPLHAQPQ PQHPGWPGQR QSQESGLLHT HRGTLPHQLSG 120 

LDPRRDYRRH EDLLHGPHAL SSGLGDLSIH SLPHAIEEVP HVEDPGINIP DQTVIKKGPV 180 

SLSKSNSNAV SAIPINKDNL FGGWNPNEV FCSVPGRLSL LSSTSKYKVT VAEVQRRLSP 240 

PECLNASLLG GVLRRAKSKN GGRSLREKIJ5 KIGLNLPAGR RKAANVTLLT SLVEGEAVHL 300 

ARDFGYVCET EFPAKAVAEF LNRQHSDPNE QVTRKNMLLA TKQICKEFTD LLAQDRSPLG 360 

NSRPNPILEP GIQSCLTHFN LISHGFGSPA VCAAVTALQN YLTEALKAMD KMYLSNNPNS 420 

HTDNNAKSSD KEEKHRK 437 



Seq ID NO: C233 Protein Sequence 
Protein Accession #: NP_002979.1 

1 11 21 31 41 51 

| | | | | | 

MKGLAAAIiLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDUOJJA 89 



Seq ID NO: C234 Protein Sequence 
Protein Accession #: NP_004054.1 
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1 11 21 31 41 51 

I I I I I I 

MILQAHLHSL CLLMLYLATG YGQEGKFSGP LKPMTFSIYE GQEPSQIIFQ FKANPPAVTF 60 

ELTGETDNIF VIEREGLLYY NRALDRETRS THNLQVAALD ANGIIVEGPV PITIEVKDIN 120 

DNRPTFLQSK YEGSVRQNSR PGKPFLYVNA TDLDDPATPN GQLYYQIVIQ LPMINNVMYF 180 

QINNKTGAIS LTREGSQELN PAKNPSYNLV ISVKDMGGQS ENSFSDTTSV DIIVTENIWK 240 

APKPVEMVEN STDPHPIKIT QVRWNDPGAQ YSLVDKEKLP RFPFSIDQEG DIYVTQPLDR 300 

EEKDAYVFYA VAKDEYGKPL SYPLEIHVKV KDINDNPPTC PSPVTVFEVQ ENERLGNSIG 360 

TLTAHDRDEE NTANSFLNYR IVEQTPKLPM DGLFLIQTYA GMLQLAKQSL KXQDTPQYNL 420 

TIEVSDKDFK TLCFVQINVI DINDQIPIFE KSDYGNLTLA EDTNIGSTIL TIQATDADEP 480 

FTGSSKILYH IIKGDSEGRL GVDTDPHTNT GYVIIKKPLD FETAAVSNXV FKAENPEPLV 540 

FGVKYNASSF AKFTLIVTDV NEAPQFSQHV FQAKVSEDVA IGTKVGNVTA KDPEGLDISY 600 

SLRGDTRGWL KIDHVTGEIF SVAPLDREAG SPYRVQWAT EVGGSSI*SSV SEFHLILMDV 660 

NDNPPRLAKD YTGLFFCHPL SAPGSLIFEA TDDDQHLFRG PHFTFSLGSG SLQNDWBVSK 720 

INGTHARLST RHTEFEEREY WLIRINDGG RPPLEGIVSL PVTFCSCVEG SCFRPAGHQT 780 

GIPTVGMAVG ILLTTLLVIG IILAWFIRI KKDKGKDNVE SAQASEVKPL RS 832 



Seq ID NO: C235 Protein Sequence 
Protein Accession #: NP_004434.1 



1 11 21 31 41 51 

I I I I I I 

MARARPPPPP SPPPGLLPLL PPLLLLPLLL LPAGCRALEE TLMDTKWVTS ELAWTSHPES 60 

GWEEVSGYDE AMNPIRTYQV OJVRESSQNN WLRTGFIWRR DVQRVYVELK FTVRDCNSIP 120 

NIPGSCKETF NLFYYEADSD VASASSPFWM ENPYVKVDTI APDESFSRLD AGRVNTKVRS 180 

FGPLSKAGFY LAFQDQGACM SLISVRAFYK KCASTTAGFA LFPETLTGAE PTSLVIAPGT 240 

CIPNAVEVSV PLKLYCNGDG EWMVPVGACT CATGHEPAAK ESQCRPCPPG SYKAKQGEGP 300 

CLPCPPNSRT TSPAASICTC HNNFYRADSD SADSACTTVP SPPRGVISNV NETSLILEWS 360 

EPRDLGGRDD LLYNVICKKC HGAGGASACS RCDDNVEFVP RQLGLTERRV HISHLLAHTR 420 

YTFEVQAVNG VSGKSPLPPR YAAVNITTNQ AAPSEVPTLR LHSSSGSSLT LSWAPPERPN 480 

GVILDYEMKY FEKSEGIAST VTSQMNSVQL DGLRPDARYV VQVRARTVAG YGQYSRPAEF 540 

ETTSERGSGA QQLQEQLPLI VGSATAGLVF WAVWIAIV CLRKQRHGSD SEYTEKLQQY 600 

IAPGMKVYID PFTYEDPNEA VREFAKEIDV SCVKIEEVIG AGEFGEVCRG RLKQPGRREV 660 

FVAIKTLKVG YTERQRRDFL SEASIMGQFD HPNIIRLEGV VTKSRPVMIIi TEFMENCALD 720 

SFLRLNDGQF TVIQLVGMLR GIAAGMKYLS EMNYVHRDLA ARNILVNSNL VCKVSDFGLS 780 

RFLEDDPSDP TYTSSLGGKI PIRWTAPEAI AYRKFTSASD VWSYGIVMWE VMSYGERPYW 840 

DMSNQDVINA VEQDYRliPPP MDCPTALHQL MLDCWVRDRN LRPKFSQIVN TLDKLIRNAA 900 

SLKVIASAQS GMSQPLLDRT VPDYTTFTTV GDWLDAIKMG RYKESFVSAG FASFDLVAQM 960 

TAEDLLRIGV TLAGHQKKIL SSIQDMRLQM NQTLPVQV 998 



Seq ID NO: C236 Protein Sequence 
Protein Accession #: NP_001795.1 

1 11 21 31 41 51 

I I I I I I 

MYVGYVLDKD SPVYPGPARP ASLGLGPANY GPPAPPPAPP QYPDFSSYSH VEPAPAPPTA 60 
WGAPFPAPKD DWAAAYGPGP AAPAASPASIi AFGPPPDFSP VPAPPGPGPG LIiAQPLGGPG 120 
TPSSPGAQRP TPYEWMRRSV AAGGGGGSGK TRTKDKYRW YTDHQRLELE KEFHYSRYIT 180 
IRRKSELAAN LGLTERQVKI WFQNRRAKER KVNKKKQQQQ QPPQPPMAHD ITATPAGPSL 240 
GGLCPSNTSL LATSSPMPVK EEFLP 265 



Seq ID NO: C237 Protein Sequence 
Protein Accession #: NP_068813.1 



1 11 21 31 41 51 

I I I I I I 

MGSDRARKGG GGPKDFGAGL KYNSRHEKVN GLEEGVEFLP VNNVKKVEKH GPGRWWLAA 60 

VLIGLLLVLL GIGFLVWHLQ YRDVRVQKVF NGYMRITNEN FVDAYENSNS TEFVSLASKV 120 

KDALKLLYSG VPFLGPYHKE SAVTAFSEGS VIAYYWSEFS IPQHLVEEAE RVMAEERWM 180 

LPPRARSLKS FWTSWAFP TDSKTVQRTQ DNSCSFGLHA RGVELMRFTT PGFPDSPYPA 240 

HARCQWALRG DADSVLSLTF RSFDIASCDE RGSDLVTVYN TLSPMEPHAL VQLOGTYPPS 300 

YNLTPHSSQN VLLITLITNT ERRHPGFEAT FFOLPRMSSC GGRLRKAQGT FNSPYYPGHY 360 

PPNIDCTWNI EVPNNQHVKV RFKFFYLLEP GVPAGTCPKD YVEINGEKYC GERSQFWTS 420 

NSNKITVRFH SDQSYTDTGF LAEYLSYDSS DPCPGQFTCR TGRCIRKELR CDGWADCTDH 480 

SDELNCSCDA GHQFTCKNKF CKPLFWVCDS VNDCGDNSDE QGCSCPAQTF RCSNGKCLSK 540 

SQQCNGKDDC GDGSDEASCP KVNWTCTKH TYRCLNGLCL SKGNPECDGK EDCSDGSDEK 600 

DCDCGLRSFT RQARWGGTD ADEGEWPWQV SLHALGQGHI OSASLISPNW LVSAAHCYID 660 

DRGFRYSDPT QWTAHjGLHD QSQRSAPGVQ ERRLKRIISH PFFNDFTFDY DIALLELEKP 720 

AEYSSMVRPI CLPDASHVFP AGKAIWVTGW GHTQYGGTGA LILQKGEIRV INQTTCENLL 780 

PQQITPRMMC VGFLSGGVDS CQGDSGGPLS SVEADGRIFQ AGWSWGDGC AQRNKPGVYT 840 

RLPLFRDWIK ENTGV 855 

Seq ID NO: C238 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLBC RVDLLFLLDS SAGTTLDGFL 360 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENPAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALliHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK RPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 755 

Seq ID NO: C239 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETXGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

PAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRPV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENPAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 

RTPPSNYREG LGTEMVPTFW NVCAPGP 807 

Seq ID NO: C240 Protein Sequence 
Protein Accession #: XP_097386.1 

1 11 21 31 41 51 

I I I I I I 

MPKSEPLGCL SPASRAPGSA AATGAWLPAA SGGPGPLGPP CTCPPRSLGR GRAGSRAGSS 60 

PSGCVCVSGI LRWSVGDPA SRRWVDLDSN SEDLSLLLTP MIVGTGGVGG GWARGWVPAQ 120 

EKEVAEGSGH AGRGNGRRLQ RVYGARSWIL GRKPCLQRLL PASGGPVQPQ PCPSPATACR 180 

WGFKFGVAFW GAAQHPPLCR LGGGRAVPSA TRTLDGF 217 

Seq ID NO: C241 Protein Sequence 
Protein Accession #: CAC03433 

1 11 21 31 41 51 

I I I I I I 

MLSSTDFTFA SWELWRVDH PNEEQQKDVT LRVSGDLHVG GVMLKLVEQI NISQDWSDFA 60 

LWWEQKHCWL LKTHWTLDKY GVQADAKLLF TPQHKMLRLR LPNLKMVRLR VSFSAWFKA 120 

VSDICKILNI RRSEELSLLK PSGDYFKKKK KKDKNNKEPI IEDILNLESS PTASGSSVSP 180 

GLYSKTMTPI YDPINGTPAS STMTWFSDSP LTEQNCSILA FSQPPQSPEA LADMYQPRSL 240 

VDKAKLNAGW LDSSRSLMEQ GIQEDEQLLL RFKYYSFFDL NPKYDAVRIN QLYEQARWAI 300 

LLEEIDCTEE EMLIFAALQY HISKLSLSAE TQDFAGESEV DEIEAALSNL EVTLEGGKAD 360 

SLLEDITDIP KLADNLKLFR PKKLLPKAFK QYWFIFKDTS IAYFKNKELE QGEPLEKLNL 420 

RGCEWPDVN VAGRKFGIKL LIPVADGMNE MYLRCDHENQ YAQWMAACML ASKGKTMADS 480 
SYQPEVLNIL SFLRMKNRNS ASQVASSLEN MDMNPECFVS PRCAKKHKSK QLAARILEAH 540 
QNVAQMPLVE AKLRFIQAWQ SLPEFGLTYY LVRFKGSKKD DILGVSYNRL IKIDAATGIP 600 
VTTWRFTNIK QWNVNWFJTRQ WIEFDQNVF TAFTCLSADC KIVHEYIGGY IFLSTRSKDQ 660 
NETLDEDLFH KLTGGQD 677 



Seq ID NO: C242 DNA Sequence 
Nucleic Acid Accession #: NM_005170 
Coding sequence: 337..918 

1 11 21 31 41 51 

I I I I I I 

GGGCGTGAGA AAGGCOACGG CGGCGGCGCG GAGGAGGGTT ATCTATACAT TTAAAAACCA 60 

GCCGCCTGCG CCGCGCCTGC GGAGACCTGG GAGAGTCCGG CCGCACGCGC GGGACACGAG 120 

CGTCCCACGC TCCCTGGCGC GTACGGCCTG CCACCACTAG GCCTCCTATC CCCGGGCTCC 180 

AGACGACCTA GGACGCGTGC CCTGGGGAGT TGCCTGGCGG CGCCGTGCCA GAAGCCCCCT 240 

TGGGGCGCCA CAGTTTTCCC CGTCGCCTCC GGTTCCTCTG CCTGCACCTT CCTGCGGCGC 300 

GCCGGGACCT GGAGCGGGCG GGTGGATGCA GGCGCGATGG ACGGCGGCAC ACTGCCCAGG 360 

TCCGCGCCCC CTGCGCCCCC CGTCCCTGTC GGCTGCGCTG CCCGGCGGAG ACCCGCGTCC 420 

CCGGAACTGT TGCGCTGCAG CCGGCGGCGG CGACCGGCCA CCGCAGAGAC CGGAGGCGGC 480 

GCAGCGGCCG TAGCGCGGCG CAATGAGOGC GAGCGCAACC GCGTGAAGCT GGTGAACTTG 540 

GGCTTCCAGG CGCTGCGGCA GCACGTGCCG CACGGCGGCG CCAGCAAGAA GCTGAGCAAG 600 

GTGGAGACGC TGCGCTCAGC CGTGGAGTAC ATCOGCGCGC TGCAGCGCCT GCTGGCCGAG 660 

CACGACGCCG TGCGCAACGC GCTGGCGGGA GGGCTGAGGC CGCAGGCCGT GCGGCCGTCT 720 

GCGCCCCGCG GGCCGCCAGG GACCACCCCG GTCGCCGCCT CGCCCTCCCG CGCTTCTTCG 780 

TCCCCGGGCC GCGGGGGCAG CTCGGAGCCC GGCTCCCCGC GTTCCGCCTA CTCGTCGGAC 840 

GACAGCGGCT GCGAAGGCGC GCTGAGTCCT GCGGAGCGCG AGCTACTCGA CTTCTCCAGC 900 

TGGTTAGGGG GCTACTGAGC GCCCTCGACC TA 932 

Seq ID NO: C243 Protein Sequence 
Protein Accession #: NP_060233.1 



1 11 21 31 41 51 
I 

MSGGHQLQLA 
LNLTLEGVFA 
SLASKARMAG 
KAHVRIELKE 
SQLATRRYQA 
PWLHQHRTCP 
PSRSAVARPP 
LQSTSQHPAA 
NCTDISLQGV 
ETQVSSHVHY 
SNSAAPSGRL 
SEPTPGSRPQ 
NSQPVWLCLT 
QAV 



ALWPWLLMAT 
GVAEITPAEG 
ERGASAVLFD 
PPAWPDYDVW 
SCRQARGEWP 
LCVFNITEGD 
RPGPFLPSQE 
CPVPLRRARP 
HGSSSTFCSS 
HRHRHHHYKK 
SNPQCPRALP 
DATVHPACQI 
PRQPLEPHPP 



I 

LQAGFGRTGL 
KLMQSHPLYL 
ITEDRAAAEQ 
ILMTWGTIF 
DSGSSCSSAP 
SFSQSLGPSR 
PGMGPRHHRF 



LSSDFDPLVY 
RFQWHGRKPG 
EPAPGPVDAS 
FPHYTPSVAY 



VLAAAVESER 
CNASDDDNLE 
LQQPLGLTWP 
VIILASVLRI 
VCAICLEEFS 
SYQEPGRRLH 
PRAAHPRAPG 
CTERSGYLAD 
CSPKGDPQRV 
PETGVPQSRP 
SICPSTSSLF 
PHSPEAHPLI 
TAEGRPCPYP 



SAEQKAVIRV 
PGFISIVKLE 
WLIWGNDAE 
RCRPRHSRPD 
EGQELRVISC 
LIRQHPGHAH 
EQQRLAGAQH 
GPASDSSSGP 
DMQPSVTSRP 
PIPRTQPQPE 
NLQKSSLSAR 
CGPPGLDKRL 
HCQVLSAQPG 



IPLKMDPTGK 
SPRRAPRPCL 
KLMEFVYKNQ 
PLQQRTAWAI 
LHEFHRNCVD 
YHLPAAYLLG 
PYAQGWGMSH 



RSLDSWPTG 
PPSPDQQVTG 
HPQRKRRGGP 
LPETPGPCYS 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

783 



Seq ID NO: C244 DNA Sequence 
Nucleic Acid Accession #: NM_004289 
Coding sequence: 493..1695 

1 11 21 31 41 51 

I I I I I I 

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 
GAGAAGGCAC CCGCGGAACC GACGGCTCAG GTGCCGGACG CTGGCGGATG TGCGAGCGAG 
GAGAATGGGG TACTAAGAGA AAAGCACGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 
AATGAAGAAA GGGTGTCAGC CCAGAAGGAG AACTCACTTC AGCAGAATGA TGATGATGAA 
AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC TAGAAATGAG 
AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 
TCACAGCCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 
ATCAGTGATG GCATGAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 
CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAGAAGAGAT 
CCAACAGCAA GGACTTCACA GTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 
AATCCTGAGC AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 
CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 
GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 
CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA CAATAATACC 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 
ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTACCCA 
GAACCGAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 
TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 
TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 
ACAGATAGAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 
GTAGATGAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 
CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 
GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 
ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 
GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 
AGTATCTTGA TAGTACCCAA AGAACTGGTG GCCTCAGGCC ACAAAAAGGA AACCCAAAAG 
GGAAAGAGAA AGTGAGAAGA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 
GAAACTGATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 
CTGCTACTTG AATAACTCAG TTAACGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 
CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 
ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 
CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 
GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 
AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 
AAATTTTGCT TTCT 

Seq ID NO: C245 Protein Sequence 
Protein Accession #: NP_004433 

1 11 21 31 41 51 

I I I I I I 

MALRRLGAAL LLLPLLAAVE ETLMDSTTAT AELGWMVHPP SGWEEVSGYD ENMNTIRTYQ 60 

VCNVFESSQN NWLRTKFIRR RGAHRIHVEM KFSVRDCSSI PSVPGSCKET FNLYYYEADF 120 

DSATKTFPNW MENPWVKVDT IAADESFSQV DLGGRVMKIN TEVRSFGPVS RSGFYLAFQD 180 

YGGCMSLIAV RVFYRKCPRI IQNGAIFQET LSGAESTSLV AARGSCIANA EEVDVPIKLY 240 

CNGDGEWLVP IGRCMCKAGF EAVENGTVCR GCPSGTFKAN QGDEACTHCP INSRTTSEGA 300 

TNCVCRNGYY RADLDPLDMP CTTIPSAPQA VISSVNETSL MLEWTPPRDS GGREDLVYNI 360 

ICKSCGSGRG ACTRCGDNVQ YAPRQLGLTE PRIYISDLLA HTQYTFEIQA VNGVTDQSPF 420 

SPQFASVNIT TNQAAPSAVS IMHQVSRTVD SITLSWSQPD QPNGVILDYE LQYYEKELSE 480 

YNATAIKSPT NTVTVQGLKA GAIYVFQVRA RTVAGYGRYS GKMYFQTMTE AEYQTSIQEK 540 

LPLIIGSSAA GLVFLIAVW IAIVCNRRRG FERADSEYTD KLQHYTSGHM TPGMKIYIDP 600 

PTYEDPNEAV REFAKEIDIS CVKIEQVIGA GEFGEVCSGH LKLPGKREIF VAIKTLKSGY 660 

TEKQRRDFLS EASIMGQFDH PNVIHLEGW TKSTPVMIIT EFMENGSLDS FLRQNDGQFT 720 

VIQLVGMLRG IAAGMKYLAD MNYVHRDLAA RNILVNSNLV CKVSDFGLSR FLEDDTSDPT 780 

YTSALGGKIP IRWTAPEAIQ YRKFTSASDV WSYGIVMWEV MSYGERPYWD MTNQDVINAI 840 

EQDYRLPPPM DCPSALHQLM LDCWQKDRNH RPKFGQIVNT LDKMIRNPNS LKAMAPLSSG 900 

INLPLLDRTI PDYTSFNTVD EWLEAIKMGQ YKESFANAGF TSFDWSQMM MEDILRVGLT 960 

LAGHQKKILN SIQVMRAQMN QIQSVEV 987 

Seq ID NO: C246 Protein Sequence 
Protein Accession #: NP_114148.1 

1 11 21 31 41 51 

I I I I I I 

MDARRVPQKD LRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDNFANTR LQSVREGCRT 60 

RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120 

NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 

TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

PVEEITEGGV GERLQQFSKR RYITVHWALL VINAVRRLLI PKQTAETQTA GAFEASSVAP 300 

AFETVMVKRS GMLCWIRTGI ARLVEESATA VSAGSEMDGV RLGSLCI 347 

Seq ID NO: C247 Protein Sequence 
Protein Accession #: NP_036577.1 

1 11 21 31 

I I I I 



RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 

Seq ID NO: C248 Protein Sequence 
Protein Accession #: NP_063947.1 

1 11 21 31 

I I I I 



LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 

KVSAYLNWIY NVWKAEL 437 

Seq ID NO: C249 Protein Sequence 
Protein Accession #: NP_003036.1 

1 11 21 31 41 51 

MGCKVLLNIG QQMLRRKWD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE 60 

NAGPAIVISF LIAALASVLA GlrCYGEFGAR VPKTGSAYLY SYVTVGELWA FITGWNLILS 120 

YIIGTSSVAR AWSATFDELI GRPIGEFSRT HMTLNAPGVL AENPDIFAVI IILILTGLLT 180 

LGVKESAMVN KIFTCINVLV LGFIMVSGFV KGSVKNWQIiT EEDFGNTSGR LCLNNDTKEG 240 

KPGVGGFMPF GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIFVGI VASLLICFIA 300 

YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GWEGAKYAVA VGSLCALSAS LLGSMFPMFR 360 

VIYAMAEDGL LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD liMSIGTLLAY 420 

SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADQNELAS TNDSQLGFLP EAEMFSLKTI 480 

LSPKNMEPSK ISGLIVNIST SLIAVLIITF CIVTVLGREA LTKGALWAVF LLAGSALLCA 540 

WTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ LDQGTWVRFA VWMLIGFIIY 600 

FGYGLWHSEE ASLDADQART PDGNLDQCK 629 

Seq ID NO: C250 Protein Sequence 
Protein Accession #: NP_002767.1 

1 11 21 31 41 51 

I I I I I I 

MRAPHLHLSA ASGARALAKL LPLLMAQLWA AEAALLPQND TRLDPEAYGA PCARGSQPWQ 60 

VSLFNGLSFH CAGVLVDQSW VLTAAHCGNK PLWARVGDDH LIiLLQGEQLR RTTRSWHPK 120 

YHQGSGPILP RRTDEHDLML LKLARPWPG PRVRAbQLPY RCAQPGDQCQ VAGWGTTAAR 180 

RVKYNKGLTC SSITILSPKE CEVFYPGWT NNMICAGLDR GQDPCQSDSG GPLVCDETLQ 240 

GILSWGVYPC GSAQHPAVYT QICKYMSWIN KVIRSN 276 



41 


SI 




1 

AKYSITFTGK 


I 

WSQTAFPKQY 


60 


GEAHAIiMKEI 


EAAGEALQSV 


120 


WFVGVDSLDL 


CDGDRHREQA 


180 


PANSFYYPRL 


KALPPIARVT 


240 


SLWSSWGLCG 


GHCGRLGTKS 


300 






331 


41 


51 




1 

LSLASIIIW 


1 

VLIKVILDKY 


60 


AVAVRLSKDR 


STLQVLDSAT 


120 


i DLDWEITEN 


SQELRMRNSS 


180 


SIQYDKQHVC 


GGSILDPHWV 


240 



Seq ID NO: C251 Protein Sequence 
Protein Accession #: XP_095088.3 

1 11 21 31 41 51 

I I I I I I 

MTRAATAEPG RVSPASPARS TAGLPRAFLQ SLRTLLDILD DWQRGCVHLR EIQSLWVEAR 60 

ELPSGVLEGL SQRRGPQPGA AVRSRRGGAV PRGARAVPER CAGTETRRGR RCSGLQRLGG 120 

GFRGCPADPC ARGEHRRHTI TSGVDCGLLK QMKELEQEKE VLLQGLEMMA QGRDWYQQQL 180 

QQVQERQCRL GQSRASADFG AVGSPRPLGR LLPKVQEVAR WLGELLAEAC AGRALPTSSS 240 

GPPCSALTST SSPGWQQQII LMLKEQNRLL TQEVTEKSER ITQLEQKSAL IKQLFEARAL 300 

SQQDGGLSPA GPHIEPLTRF RLPVLTWAGA LLSPHSPQLL LPLSADSGGP LHELPDTWFP 360 

AVLLWVPSPG KRTAHARLHF HQRPAEGAWQ LGCGAEAAPE TCGTLPHFES HKTTCEPDSL 420 

GGPCPQEGDR SWSHLGAAFD VAPAVAKVTP NREDAAGSRH GDICPLCPKG LLTFRDIAIE 480 

FSLAEWQCLD HAQQNIiYRDV MLENYRNLFS LGMTVSKPDIi IACLEQNKEP QNIKRNEMAA 540 

KHPVTCSHFN QDLQPEQSIK DSLQKVIPRT YGKCGHENLQ LKKCCKRVDE CEVHKGGYND 600 

LNQCLSNTQN KIFQTHKCVK VFSKFSNSNR HNARYTGKKH LKCKKYGKSF CMFSHLNQHQ 660 

IIHTKEKSYK CEECGKSFNH SSSGTTHKRI LTGEKPYRCE ECGKAFRWPS NLTRHKRIHT 720 

GEKPYACEEC GQAFRRSSTI* TNHKRIHTGE RPYKCEECGK AFSVSSALIY HKRIHTGEKP 780 

YTCEECGKAF NCSSTLKTHK IIHTGEKPYT CEECGRTFNC SSTVKAHKRI HTGEKPYKCE 840 
ECDKAFKWHS SLAKHKIIHT GEKPYKCSDS KALAKSSEVQ KVYSGDGENG IRVHKKKETQ 900 

GWLVRNKNEN RTGLFQIRAA VRPNRDPSWG QQBGSLTDPI QRKEEPDLQN HYDHONALED 960 

QRNTGVGGLL TFRDWIEFS LEEWQCLDHA QQNLYRDVML ENYRNLVSLG IAVSKPDLIT 1020 

CLEQNKEPWN IKRNEMVTKH PDLPPELGIK DSLQKVIPRR YGKSGHDNLQ VKTCKSMGEC 1080 

EVQKGGCNEV NQCLSTTQNK IFQTHKCVKV FGKFSNSNRH KTRHTGKKHF KCKKYGKSFC 1140 

MVSQLHQHQI IHTRENSYQC EECX3KPFNCS STLSKHKRIH TGEKPYRCEE CGKAFTWSST 1200 

LTKHRRIHTG EKPYTCEECG QAFSRSSTLA NHKRIHTGEK PYTCEECGKA FSLSSSLTYH 1260 

KRIHTGEKPY TCEECGKAFN CSSTLKKHKI IHTGEKPYKC KECGKAFAFS STLNTHKRIH 1320 

TGEEPYKCEE CDKAFKWSSS LANHKSMHTG EKPYKCE 1357 



Seq ID NO: C252 Protein Sequence 
Protein Accession #: NP_114433.1 

1 11 21 31 41 51 

I I I I I I 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 60 
YGNGAHLASI LSLKEASTIA EYISGYQRSQ PIWIGfcHDPQ KRQQWQWIDG AMYLYRSWSG 120 
KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 158 



Seq ID NO: C253 Protein Sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

I I I I I I 

MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60 

LEDNVQSWKP GDTLVIASTD YSKYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNFFDFDTFG GHIKFALGFK AAHLEGTELK 180 

HMGQQLVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDWG 240 

YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

DCNAVSTFWM ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 360 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FliSIISARYS PHQDADPLKP REPAIIRHF1 420 

AYKNQDHGAW LRGGDWJLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN SLFVGESGNV 480 

GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHTSA 540 

LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT RSTHYQQYQP WTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 840 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVWIDGNQ GRWSHTSFR 900 

NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKIiGADRG LKLKEQMAFV 960 

GFKGSFRPIW VTLDTEDHKA KIFQWPIPV VKKKKL 996 



Seq ID NO: C254 Protein Sequence 
Protein Accession #: NP_055188.1 

1 11 21 31 41 51 

I I I I I I 

MTALSSENCS FQYQLRQTNQ PLDVNYLLFL. IILGKILLNI LTLGMRRKNT CQNFMEYFCI 60 
SLAFVDLLLL VNISIILYFR DFVLLSIRFT KYHICLFTQI ISFTYGFLHY PVFLTACIDY 120 
CLNFSKTTKL SFKCQKLFYF FTVILIWISV LAYVLGDPAI YQSLKAQNAY SRHCPFYVSI 180 
QSYWLSFFMV MILFVAFITC WEEVTTLVQA IRITSYMNET ILYFPFSSHS SYTVRSKKIF 240 
LSKLIVCFLS TWLPFVLLQV IIVLUCVQIP AYIEMNIPWL YFVNSFLIAT VYWFNCHKLN 300 
LKDIGLPLDP FVNWKCCFIP LTIPNLEQIE KPISIMIC 338 



Seq ID NO: C255 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

IFLYWQPRD FPEDTLRKFL QKAYESK3DY DKTVYYEAGI ILCCVLGLLF IILMPLVGYF 120 

FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 180 

SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 240 

IPVLDEIKSM ATAIKETKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 300 

HPSSETCNSI RLSLSQLKSN PELRQLPPVD AELDNVNNVIi RTDUX3LVQQ GYQSLNDIPD 360 

RVQRQTTTW AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 420 

YDSYWWLGGL VICSLLTLIV IFYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 480 

SFLFCWILMI IWLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 540 

SKMKLTFEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 600 

LGAAGRKNLQ DFAACGIDRM NYDSYIAQTG KSPAGVNLLS FAYDLEAKAN SLPPGNLRNS 660 

LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KIDQRTGNGL LERVTRILAS LDFAQNFITN 720 

NTSSVIIBET KKYGRTIIGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 780 

PLNLFWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM ENGNNGYHKD 840 

HVYGIHNPVM TSPSQH 856 



Seq ID NO: C256 Protein Sequence 
Protein Accession #: NP_149038.1 

1 11 21 31 41 51 

I I I I I I 

MKAIIHLTLL ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT NVNSIATSDI 120 
ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADNSLCVK LHNTSFCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 
LHSEITSLFK DVFGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI LAETTSDNEK 300 

TVTEKINKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360 

VASSLKCPDA CNAQHKQCLI KKSGGAPECA CVPGYQEDAN GNCQKCAFGY SGLDCKDKFQ 420 

LILTIVGTIA GIVILSMIIA LIVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 480 

GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 512 

Seq ID NO: C257 Protein Sequence 
Protein Accession #: NP_001423.1 

1 11 21 31 41 51 

I I I I I I 

MTAGRRMEML CAGRVPALLL CLGFHLLQAV LSTTVIPSCI PGESSDNCTA LVQTEDNPRV 60 

AQVSITKCSS DMNGYCLHGQ CIYLVDMSQN YCRCEVGYTG VRCEHFFLTV HQPLSKEYVA 120 

LTVILIILFL ITWGSTYYF CRHYRNRKSK EPKKEYERVT SGDPELPQV 169 

Seq ID NO: C258 Protein Sequence 
Protein Accession #: AAC63902.1 

1 11 21 31 41 51 

I I I I I I 

MDRSKENCIS GPVKATAPVG GPKRVLVTQQ IPCQNPLPVN SGQAQRVLCP SNSSQRVPLQ 60 

AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLNNTQKSKQ PLPSAPENNP EEELASKQKN 120 

EESKfcRQWAL EDFEIGRPLG KGKFGNVYLA REKQSKFILA LKVLFKAQLE KAGVEHQLRR 180 

EVEIQSHLRH PNILRLYGYF HDATRVYLIL EYAPLGTVYR ELQKLSKFDE QRTATYITEL 240 

ANALSYCHSK RVIHRDIKPE NLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300 

IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVTEGARDLI 360 

SRLLKHNPSQ RPMLREVLEH PW1TANSSKP SNCQNKESAS KQS 403 



Seq ID NO: C259 Protein Sequence 
Protein Accession #: NP_037504.1 

1 11 21 31 41 51 

I I I I I I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 120 

QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 

IDLD 184 

Seq ID NO: C260 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEWGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLLBELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNIASWS ILSNISSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPWKHITY VGLGISIGSL 600 

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLKWSNG SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900 

IMLTQFVSNE 910 

Seq ID NO: C261 Protein Sequence 
Protein Accession #: NP_000575.1 

1 11 21 31 41 51 

I I I I I I 

MTSXLAVALL AAFLTSAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 60 
CANTEIIVKL SDGRELCLDP KENWVQRWE KFLKRAENS 99 



Seq ID NO: C262 Protein Sequence 
Protein Accession #: NP_005594.1 

1 11 21 31 41 51 

I I I I I I 

MSTERDSETT FDEDSQPNDE WPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPFRK 60 

ECTWQVKAND RKYHEQPHFM NTKFLCIKES KYANNAIKTY KYNAFTFIPM NLFEQFKRAA 120 

NLYFLALLIL QAVPQISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 180 

IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP NSLCYVETAE LDGETNLKFK 240 

MSLEITDQYL QREDTLATFD GFIECBEPNN RLDKFTGTLF WRNTSFPLDA DKILLRGCVI 300 

RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFWLIL LSAGLAIGHA 360 

YWEAQVGNSS WYLYDGEDDT PSYRGFLIFW GYIIVLNTMV PISLYVSVEV IRLGQSHFIN 420 

WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 480 

DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL LAVCHTVMVD 540 

RTDGQLNYQA ASPDEGALVN AARNFGFAFL ARTONTITIS ELGTERTYNV LAILDFNSDR 600 

KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFANET LRTLCLCYKE 660 
IEEKEFTEWN KKFMAASVAS TNRDEALDKV YEEIEKDLIL LGATAIEDKL QDGVPETISK 720 

LAKADIKIWV LTGDKKETAE NIGFACELLT EDTTICYGED INSLLBARME NQRNRGGVYA 780 

KFAPPVQESF FPPGGNRALI ITGSWLNEIL LEKKTKRNKI LKLKFPRTEE ERRMRTQSKR 840 

RLEAKKEQRQ KNFVDLACEC SAVICCRVTP KQKAMWDLV KRYKKAITLA IGDGANDVNM 900 

IKTAHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG RWSYIRMCKF LRYFFYKNFA 960 

FTLVHFWYSF FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 1020 

VGQRDLLFNY KRFFVSLLHG VLTSMILFFI PLGAYLQTVG QDGEAPSDYQ SFAVTIASAL 1080 

VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAF QFTGTASNAL 1140 

RQPYIWLTII LTVAVCLLPV VAIRFXSMTI WPSESDKIQK HRKRLKAEEQ WQRRQQVFRR 1200 

GVSTRRSAYA FSHQRGYADL ISSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 1251 

Seq ID NO: C263 Protein Sequence 
Protein Accession #: XM_044533 

1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 60 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSRLFTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SUJWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SGGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ I/3EFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVI* LPVLFLLYRH RNSKKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 

PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 837 

Seq ID NO: C264 Protein Sequence 
Protein Accession #: NP_008950.1 

1 11 21 31 41 51 

I I I I I I 

MASQNRDPAA TSVAAARKGA EPSGGAARGP VGKRLQQELM TLMMSGDKGI SAFPESDNLF 60 
KWVGTIHGAA GTVYEDLRYK LSLEFPSGYP YNAPTVKPX.T PCYHPNVDTQ GNICLDILKE 120 
KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELHKNPT AFKKYLQETY SKQVTSQEP 179 

Seq ID NO: C265 Protein Sequence 
Protein Accession #: NP_055399.1 

1 11 21 31 41 51 

I I I I I I 

MGRGWGFLFG LLGAVWLLSS GHGEEQPPET AAQRCFCQVS GYLDDCTCDV ETIDRFNNYR 60 

LFPRLQKLLE SDYFRYYKVN LKRPCPFWND ISQCGRRDCA VKPCQSDEVP DGIKSASYKY 120 

SEEANNLIEE CEQAERLGAV DESLSEETQK AVLQWTKHDD SSDNFCEADD IQSPEAEYVD 180 

LLLNPERYTG YKGPDAWKIW NVIYEENCFK PQTIKRPLNP LASGQGTSEE NTFYSWLEGL 240 

CVEKRAFYRL ISGLHASINV HLSARYLLQE TWLEKKWGHN ITEFQQRFDG ILTEGEGPRR 300 

LKNLYFLYLI ELRALSKVLP FFERPDFQLF TGNKIQDEEN KMLLLEILHE IKSFPLHFDE 360 

NSFFAGDKKE AHKLKEDFRL HFRNISRIMD CVGCFXCRLW GKLQTQGLGT ALKILFSEKL 420 

IANMPESGPS YEFHLTRQEI VSLFNAFGRI STSVKELENF RNLLQNIH 468 



Seq ID NO: C266 Protein Sequence 
Protein Accession #: NP_002879.1 

1 11 21 31 41 51 

I I I I I I 

MQPRRQRLPA PWSGPRGPRP TAPLLALLLL LAPVAAPAGS GGPDDPGQPQ DAGVPRRLLQ 60 
QKARAALHFF NFRSGSPSAL RVLAEVQEGR AWINPKEGCK VHWFSTERY NPESLLQEGE 120 
GRLGKCSARV FFKNQKPRPT INVTCTRLIE KKKRQQEDYL LYRQMKQLKN PLEIVSIPDN 180 
HGHIDPSLRL IWDLAFLGSS YVMWEMTTQV SHYYUVQLTS VRQWRKT 228 

Seq ID NO: C267 Protein Sequence 
Protein Accession #: NP_005400.1 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIAL AVILCATWQ GPPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 

EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 94 

Seq ID NO: C268 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MLRQVLRRGL QSFCHRLGLC VSRHPVFFLT VPAVLTITFG LSALNRFQPE GDLERLVAPS 60 

HSLAKIERSL ASSLFPLDQS KSQLYSDLHT PGRYGRVILL SPTGDNILLQ AEGILQTHRA 120 

VLEMKVNHKG YNYTFSHLCV LRNQDKKCVL DDIISVLEDL RQAAVSNKTT ARVQVRYPNT 180 

KLKVCSFCML LPIKEAAIiHF LP 202 

Seq ID NO: C269 Protein Sequence 
Protein Accession #: NP_002429.1 
1 11 21 31 41 51 

I I I I I I 

MRLPLLLVFA SVIPGAVLLL DTRQFLIYNE DHKRCVDAVS PSAVQTAACN QDAESQKFRW 60 

VSESQIMSVA FKLCLGVPSK TDWVAITLYA CDSKSEFQKW ECKNDTLLGI KGEDLFFNYG 120 

NRQEKNIMLY KGSGLWSRWK IYGTTDNLCS RGYEAMYTLL GNANGATCAF PFKFENKWYA 180 

DCTSAGRSDG WLWCGTTTDY DTDKLFGYCP LKFEGSBSLW NKDPLTSVSY QINSKSALTW 240 

HQARKSCQQQ NAELLSITEI HEQTYLTGLT SSLTSGLWIG LNSLSFNSGW QWSDRSPFRY 300 

LNWLPGSPSA EPGKSCVSLN PGKNAKWENL ECVQKLGYIC KKGNTTLNSF VIPSESDVPT 360 

HCPSQWWPYA GHCYKIHRDE KKIQRDAL.TT CRKEGGDLTS IHTIEELDFI ISQLGYEPND 420 

ELWIGLNDIK IQMYFEWSDG TPVTFTKWLR GEPSHENNRQ EDCWMKGKD GYWADRGCEW 480 

PLGYICKMKS RSQGPEIVEV EKGCRKGWKK HHFYCYMIGH TLSTFAEANQ TCNNENAYLT 540 

TIEDRYEQAF LTSFVGLRPE KYFWTGLSDI QTKGTFQWTI EEEVRFTHWN SDMPGRKPGC 600 

VAMRTG X AGG LWDVLKCDEK AKFVCKHWAE GVTHPPKPTT TPEPKCPEDW GASSRTSLCF 660 

KLYAKGKHEK KTWFESRDFC RALGGDLASI KNKEEQQTIW RLITASGSYH KLFWLGLTYG 720 

SPSEGFTWSD GSPVSYENWA YGEPNNYQNV EYCGELKGDP TMSWNDINCE HLNNWICQIQ 780 

KGQTPKPEPT PAPQDNPPVT EDGWVIYKDY QYYFSKEKET MDNARAFCKR NFGDLVSIQS 840 

ESEKKFLWKY VNRNDAQSAY FIGLLISLDK KFAWMDGSKV DYVSWATGEP NFANEDENCV 900 

TMYSNSGFWN DINCGYPNAF ICQRHNSSIN ATTVMPTMPS VPSGCKEGWN FYSNKCFKIF 960 

GFMEEERKNW QEARKACIGF GGNLVSIQNE KEQAFLTYHM KDSTFSAWTG LNDVNSEHTF 1020 

LWTDGRGVHY TNWGKGYPGG RRSSLSYEDA DCWIIGGAS NEAGKWMDDT CDSKRGYICQ 1080 

TRSDPSLTNP PATIQTDGFV KYGKSSYSLM RQKFQWHEAE TYCKLHNSLI ASILDPYSNA 1140 

FAWLQMETSN ERVWIALNSN LTDNQYTWTD KWRVRYTNWA ADEPKLKSAC VYLDLDGYWK 1200 

TAHCNESFYF I*CKRSDEIPA TEPPQLPGRC PESDHTAWIP FHGHCYYIES SYTRNWGQAS 1260 

LECLRMGSSL VSIESAAESS FLSYRVEPLK SKTNFWIGLF RNVEGTWLWI NNSPVSFVNW 1320 

NTGDPSGERN DCVALHASSG FWSNIHCSSY KGYICKRPKI IDAKPTHELL TTKADTRKMD 1380 

PSKPSSNVAG WIIVILLIL TGAGLAAYFF YKKRRVHLPQ EGAFENTLYF NSQSSPGTSD 1440 

MKDLVGNIEQ NEHSVI 1456 

Seq ID NO: C270 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MVLLHWCLLW LLFPLSSRTQ KLPTRDEEIiF QMQIRDKAFF HDSSVIPDGA EISSYLFRDT 60 

PKRYFFWEE DNTPLSVTVT PCDAPLEWKL SLQELPEDRS GEGSGDLEPL EQQKQQIINE 120 

EGTELFSYKG NDVEYFISSS SPSGLYQLDIi LSTEKDTHFK VYATTTPESD QPYPELPYDP 180 

RVDVTSLGRT TVTLAWKPSP TASLLKQPIQ YCWINKEHN FKSLCAVEAK LSADDAFMMA 240 

PKPGLDFSPF DFAHFGFPSD NSGKERSFQA KPSPKLGRHV YSRPKVDIQK ICIGNKNIFT 300 

VSDLKPDTQY YFDVFWNIN SNMSTAYVGT FARTKEEAKQ KTVELKDGKI TDVFVKRKGA 360 

KFLRFAPVSS HQKVTFFIHS CLDAVQIQVR RDGKLLLSQN VEGIQQFQLR GKPKAKYLVR 420 

LKGNKKGASM LKILATTRPT KQSFPSLPED TRIKAFDKLR TCSSATVAWL GTQERNKFCI 480 

YKKEVDDNYN EDQKKREQNQ CLGPDIRKKS EKVLCKYFHS QNLQKAVTTE TIKGLQPGKS 540 

YLLDVYVIGH GGHSVKYQSK WKTRKFC 568 

Seq ID NO: C271 Protein Sequence 
Protein Accession #: AAH34229.1 

1 11 21 31 41 51 

I I I I I I 

MEKVQLEFEN QEMEKKLQEF RSTRNKEKED RESSEYYWKS GKVGKLVNQS YMMSQNKGNV 60 

VKFSAGKVKL KLLKEQIQEP VKPTVNYKMA NSSECEKPKI NGKVCGQCEN KAALLVCLEC 120 

GEDYCSGCFA NVHQKGALKL HRTTLLQARS QILFNVLDVA HQFIKDVNPD EPKEENNSTK 180 

ETSKIQHKPK SVLLQRSSSE VEITTMKRAQ RTKPRKSLLC EGSFDEEASA QSFQEVLSQW 240 

RTGNHDDNKK QNLHAAVKDS LEECEVQTNL KIWREPLNIE LKEDILSYME KLWLKKHRRT 300 

PQEQLFKCYQ IRSHIHMKPL VMHSVLKMKT MKIVMVRRPK YNTQLFYCQ 349 

Seq ID NO: C272 Protein Sequence 
Protein Accession #: NP_078963.1 

1 11 21 31 41 51 

I I I I I I 

MEKLWLKKHR RTPQEQLFKM LSDTFPHPHE TTGDAQCSQN ENDEDSDGEE TKVQHTALLL 60 

PVETLNIERP EPSLKTVELD DTYEEEFEEA ENIVPYKVKIi ADADSQRSCA FHDOQKNSFP 120 

YENGIHQHHV FDKGKRDFLN LCLRNSSTYY KDNSKGETSN TDFDNIVDPD VYSSDIEKIE 180 

ESTSFERNLK EKNIGLESKQ KSDDSCVSLE SKDTLLGRDI* EKAPIEEKLS QDIKESLELS 240 

NLYKRPSFEE SKTTKSSLLL QEIACRSKPI TKQYQGLERF FIFDTNERLN LLPSHRLECN 300 

NSSTRITLAE DREWIPDHSL SEYADNAIVL GVLQGAQSPS SSRKQQKMGQ KSQRPSTANF 360 

PLSNSVKESS SCLSSSHPRS RSAAAQSSSR AASEISEIEY IDITDQNELS LDDTTDQHTL 420 

DNIiEKELQVL RSIADTSEKL YSLTSEEFPD FSSQSLNISQ ISTDFLKTSH VRGPCGVEEL 480 

SCSGRDTKIQ SLLSLSESST DEEEEDFLNK QHVITLPWSK ST 522 

Seq ID NO: C273 Protein Sequence 
Protein Accession #: NP_005399.1 

1 11 21 31 41 51 

I I I I I I 

MKVSAVLLCL LLMTAAFNPQ GLAQPDALNV PSTCCFTFSS KKISLQRLKS YVITTSRCPQ 60 

KAVIFRTKLG KEICADPKEK WVQNYMKHLG RKAHTLKT 98 

Seq ID NO: C274 Protein Sequence 
Protein Accession #: BAC05158.1 

1 11 21 31 41 51 
I I I I I I 

MFLLTGGVSL KSAEKNPDPT WLQDKSWEEI CRASEFPAFR GLRQHFCEHI YEWREIYDSK 60 

EPHNAKFPAP MDKNLNELQK IIILRCLRPD KITPAITNYV TDKLGKKFVE PPPFDLTKSY 120 

LDSNCTIPLI FVLSPGADPM ASLLKFANDK SMSGNKFQAI SLGQGQGPIA AKMIKAAIEE 180 

GTWVCLQNCH LAVSWMPMLE KICEDPTSET CNSSFRLWLT SYPSSKFPVT ILQNGVKMTN 240 

EPPTGLRLNL LQSYLTDPVS DPEFPKGCRG KELLFINEYD TIPFEAISYL TGECNYGGRV 300 

TDDWDRRLLb TMLADFYNLY IVENPHYKFS PSGNYFAPPK GTYEDYIEFI KKLPPTQHPE 360 

IFGLHENVDI SKDLQQTKTL FESLLLTQGG SKQTGASGST DQILLEITKD ILNKLPSDFD 420 

IEMALRKYPV RYEESMNTVL VQEMERFNNL IITIRNTLRD LEKAIKGWV MDSALEALSS 480 

SLLVGKVPEI WAKRSYPSLK PLGSYITDFL ARLNFLQDWY NSGKPCVFWL SGFFFTQAFL 540 

TGAMQNYARK YTTPIDLLGY EFEVIPSDTS DTSPEDGVYI HGLYLDGARW DRESGLLAEQ 600 

YPKLLFDLMP IIWIKPTQKS RIIKSDAYVC PLYKTSERKG TLSTTGHSTN FVIAMLLKTD 660 

QPTRHWIKRG VALLCQLDD 679 

Seq ID NO: C275 Protein Sequence 
Protein Accession #: AAA60212.1 

1 11 21 31 41 51 

I I I I 1 I 

MAESHLLQWL LLLLPTLCGP GTAAWTTSSL ACAQGPEFHC QSLEQALQCR ALGHCLQEVW 60 
GHVGADDLCQ ECEDIVHILN KMAKEAIFQD TMRKFLEQEC NVLPLKLLMP QCNQVLDDYF 120 
PLVIDYFQNQ TDSNGICMHL GLCKSRQPEP EQEPGMSDPL PKPLRDPLPD PLLDKLVLPV 180 
LPGALQARPG PHTQDLSEQQ FPIPLPYCWL CRALIKRIQA MIPKGALAVA VAQVCRWPL 240 
VAGGICQCLA ERYSVILLDT LLGRMLPQLV CRLVLRCSMD DSAGPRSPTG EWLPRDSECH 300 
LCMSVTTQAG NSSEQAIPQA MLQACVGSWL DREKCKQFVE QHTPQLLTLV PRGWDAHTTC 360 
QALGVCGTMS SPLQCIHSPD L 381 

Seq ID NO: C276 Protein Sequence 
Protein Accession #: NP_631911.1 

1 11 21 31 41 51 

I 1 I I I I 

MLGCGIPALG LLLLLQGSAD GNGIQGFFYP WSCEGDIWDR ESCGGQAAID SPNLCLRLRC 60 
CYRNGVCYHQ RPDENVRRKH MWALVWTCSG LLLLSCSICL FWWAKRRDVL HMPGFLAGPC 120 
DMSKSVSLLS KHRGTKKTPS TGSVPVALSK ESRDVEGGTE GEGTEEGEET EGEEEED 177 

Seq ID NO: C277 Protein Sequence 
Protein Accession #: NP_473364.1 

1 11 21 31 41 51 

I I I I I I 

MKLVTIFLLV TISLCSYSAT AFLINKVPLP VDKLAPLPLD NILPFMDPLK LLLKTLGISV 60 
EHLVEGLRKC VNELGPEASE AVKKLLEALS HLV 93 



Seq ID NO: C278 Protein Sequence 

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

I I I I I I 

MPLSYAYKNA ETLAGRHTSS WMSRGAYQRR NTRAAGRPEE CTDRNWHAGR TRGIWLGQLE 60 

ERCSDVFGVS FFWWVRGLAG SGAKLQTFTP AQEGAPTVQR QAEALLKCRQ SGRPGRGGAE 120 

SERARDASML SPLSAAMRNY PTSSTIPPRR SYSPTEIAHK SYSCSLPDMK ISMAESGPSL 180 

DSLDILEDGE SGSPFLVTHL YFLGWTTGM EQLDFETGPN IFDLQIYVKD EVGVTDLQVL 240 

TVQVTDVNEP PQFQGNLAED HLRADQPHFN AHSHTYVRW ATALARHRLR SSIGSPFLGT 300 

FCWVGMQYF LISPPKSFRM SANGTLFSTT ELDFEAGHRS FHLIVEVRDS GGLKASTELQ 360 

VNIVNLNDEV PRFTSPTRVY TVLEELSPGT IVANITAEDP DDEGFPSHLL YSITTVSKYF 420 

MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF TVEDVNDNPA 480 

TCQKFTFRSS LHPALCSKTL TWMDTVLDCF HAADKDIPVT GRFTKERGLI GLTVPHGWGS 540 

LTIMAEGKEE QVTSYMDGSR QRDRACVGKL LLIKPSDLMR LSHYHENNSG KTCPHDSISS 600 

YQVPPTTCRN SRIQATNNED TSSVTVTVNI LEENDEKPIC TPNSYFLALP VDLKVGTNIQ 660 

NFKLTCTDtiD SSPRSFRYSI GPGNVNNHFT FSPNAGSNVT RLLLTSRFDY AGGFDKIWDY 720 

KLLVYVTDDN LMSDRKKAEA LVETGTVTLS IKVIPHPTTI ITTTPRPRVT YQVLRKNVYS 780 

PSAWYVPFVI TLGSILLLGL LVYLWLLAK AIHRHCPCKT GKNKEPLTKK GETKTAERDV 840 

WETIQMNTI FDGEAIDPEP EQASLELYAL LPSCCDPSPV TLRKVQVCGE SEETGQCSGH 900 

ITLPGKIPVD DPRKQETGLQ GDFEVWTLCP AVKVWGSPQ AERCIRLALS LKKYSSD 957 

Seq ID NO: C279 Protein Sequence 
Protein Accession #: XP_168571.1 

1 11 21 31 41 51 

I I I I I I 

MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 60 

TCQKFTFSIM VPERTAKGTL LLDLNKFCFD DDSEAPNNRF NFTMPSGVGS GSRFLQDPAG 120 

SGKIVLIGDL DYENPSNLAA GNKYTVIIQV QDVAPPYYKN NVYVYILTSP ENEFPLIFDR 180 

PSYVFDVSER RPAQGHLSGP EEKRLLSICH VRAVCHHFGL HIASGSPRVP GRPIGQSHPQ 240 

TLPLQDWEBQ GTSDKERRNE DCRERRRGGN YPDEHYL 277 

Seq ID NO: C280 Protein Sequence 
Protein Accession #: NP_005257.2 

1 11 21 31 41 51 

I I I I 1 I 

MGDWSFLGNF LEEVHKHSTV VGKVWLTVLF IFRMLVLGTA AESSWGDEQA DFRCDTIQPG 60 

CQNVCYDQAF PISHIRYWVL QIIFVSTPSL VYMGHAMHTV RMQEKRKLRE AERAKEVRGS 120 






GSYEYPVAEK AELSCWEEGN GRIALQGTLL NTYVCSILIR TTMEVGFIVG QYPIYGIFLT 180 

TLHVCRRSPC PHPVNCYVSR PTEKNVFIVF MLAVAALSLL LSLAELYHLG WKKIRQRFVK 240 

PRQHMAKCQL SGPSVGIVQS CTPPPDFNQC LENGPGGKFF NPFSNNMASQ QNTDNLVTEQ 300 

VRGQEQTPGE GFIQVRYGQK PEVPNGVSPG HRLPHGYHSD KRRLSKASSK ARSDDLSV 358 

Seq ID NO: C281 Protein Sequence 
Protein Accession #: NP_055274.2 

1 11 21 31 41 51 

I I I I I ! 

MYLSICCCFL LWAPALTLKN LNYSVPEEQG AGTVIGNIGR DARLQPGLPP AERGGGGRSK 60 

SGSYRVLENS APHLLDVDAD SGLLYTKQRI DRESLCRHKA KCQLSLEVFA NDKEICMIKV 120 

EIQDINDNAP SFSSDQIEMD ISENAAPGTR FPLTSAHDPD AGENGLRTYL LTRDDHGLFG 180 

LDVKSRGDGT KFPELVIQKA LDREQQNHHT LVLTALDGGE PPRSATVQIN VKVIDSNDNS 240 

PVFEAPSYLV ELPENAPLGT WIDLNATDA DEGPNGEVLY SFSSYVPDRV RELFSIDPKT 300 

GLIRVKGNLD YEENGMLEID VQARDLGPNP IPAHCKVTVK LIDRNDNAPS IGFVSVRQGA 360 

LSEAAPPGTV IALVRVTDRD SGKNGQLQCR VLGGGGTGGG GGLGGPGGSV PFKLEENYDN 420 

FYTWTDRPL DRETQDEYNV TIVARDGGSP PLNSTKSFAI KILDENDNPP RFTKGLYVLQ 480 

VHENNIPGEY LGSVLAQDPD LGQNGTVSYS ILPSHIGDVS IYTYVSVNPT NGAIYALRSF 540 

NFEQTKAFEF KVLAKDSGAP AHLESNATVR VTVLDVNDNA PVIVLPTLQN DTAELQVPRN 600 

AGLGYLVSTV RALDSDFGES GRLTYEIVDG NDDHLFEIDP SSGEIRTLHP FWEDVTPWE 660 

LWKVTDHGK PTLSAVAKLI IRSVSGSLPE GVPRVNGEQH HWDMSLPLIV TLSTISIILL 720 

AAMITIAVKC KRENKEIRTY NCRIAEYSHP QLGGGKGKKK KINKNDIMLV QSEVEERNAM 780 

NVMNWSSPS LATSPMYFDY QTRLPLSSPR SEVMYLKPAS NNLTVPQGHA GCHTSFTGQG 840 

TNASETPATR MSIIQTDNFP AEPNYMGSRQ QFVQSISVAP RLRTQKEPA 889 

Seq ID NO: C282 Protein Sequence 
Protein Accession #: NP_005592.1 

1 11 21 31 41 51 

I I I I I I 

MELCRSLALL GGSLGLMFCL IALSTDFWFE AVGPTHSAHS GLWPTGHGDI ISGYIHVTQT 60 
FSIMAVLWAL VSVSFLVLSC FPSLFPPGHG PLVSTTAAFA AAISMWAMA VYTSERWDQP 120 
PHPQIQTFFS WSFYLGWVSA ILLLCTGALS LGAHCGGPRP GYETL 165 

Seq ID NO: C283 Protein Sequence 
Protein Accession #: NP_006424.2 

1 11 21 31 41 51 

I I I i I I 

MATWALLLLA AMLLGNPGLV FSRLSPEYYD LARAHLRDEE KSCPCLAQEG PQGDLLTKTQ 60 
ELGRDYRTCL TIVQKLKKMV DKPTQRSVSN AATRVCRTGR SRWRDVCRNF MRRYQSRVTQ 120 
GLVAGETAQQ ICEDLRLCIP STGPL 145 

Seq ID NO: C284 Protein Sequence 
Protein Accession #: NP_005594.1 

1 11 21 31 41 51 

I I I I I I 

MKVSAAALAV ILIATALCAP ASASPYSSDT TPCCFAYIAR PLPRAHIKEY FYTSGKCSNP 60 

AWFVTRKNR QVCANPEKKW VREYINSLEM S 91 



Seq ID NO: C285 Protein Sequence 
Protein Accession #: NP_071437.1 

1 11 21 31 41 51 

I I I I I I 

MAPGRAVAGL LLLAAAGLGG VAEGPGLAFS EDVLSVFGAN LSLSAAQLQH LLEQMGAASR 60 

VGVPEPGQLH FNQCLTAEEI FSLHGFSNAT QITSSKFSVI CPAVLQQLNF HPCEDRPKHK 120 

TRPSHSEVWG YGFLSVTIIN LASLLGLILT PLIKKSYFPK ILTFFVGLAI GTLFSNAIFQ 180 

LIPEAFGFDP KVDSYVEKAV AVFGGFYLLF FFERMLKMLL KTYGQNGHTH FGNDNFGPQE 240 

KTHQPKALPA INGVTCYANP AVTEANGHIH FDNVSWSLQ DGKKEPSSCT CLKGPKLSEI 300 

GTIAWMITLC DALHNFIDGL AIGASCTLSL LQGLSTSIAI LCEEFPHELG DFVILLNAGM 360 

STRQALLFNF LSACSCYVGL AFGILVGNNF APNIIFALAG GMFLYISLAD MFPEMNDMLR 420 

EKVTGRKTDF TFFMIQNAGM LTGFTAILLI TLYAGEIELE 460 



Seq ID NO: C286 Protein Sequence 
Protein Accession #: NP_004175.1 

1 11 21 31 41 51 

1 I 1 i I I 

MPNSEPASLL ELFNSIATQG ELVRSLKAGN ASKDEIDSAV KKLVSLKMSY KAAAGEDYKA 60 

DCPPGNPAPT SNHGPDATEA EEDFVDPWTV QTSSAKGIDY DKLIVRFGSS KIDKELINRI 120 

ERATGQRPHH FLRRGIFFSH RDMNQVLDAY ENKKPFYLYT GRGPSSEAMH VGHLIPFIPT 180 

KWLQDVFNVP LVIQMTDDEK YLWKDLTLDQ AYGDAVENAK DIIACGFDIN KTFIFSDLDY 240 

MGMSSGFYKN WKIQKHVTF NQVKGIFGFT DSDCIGKISF PAIQAAPSFS NSPPQIFRDR 300 

TDIQCLIPCA IDQDPYFRMT RDVAPRIGYP KPALLHSTFF PALQGAQTKM SASDPNSSIF 360 

LTDTAKQIKT KVNKHAFSGG RDTIEEHRQF GGNCDVDVSF MYLTFFLEDD DKLEQIRKDY 420 

TSGAMLTGEL KKALIEVLQP LIAEHQARRK EVTDEIVKEF MTPRKLSFDF Q 471 



Seq ID NO: C287 Protein Sequence 
Protein Accession #: NP_004929.1 

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1 
I 

MTVFRQENVD 
IEREVSILKE 
KQILNGVYYL 
PEPVAPEIVN 
FSNTSALAKD 
ARKKWKQSVR 
QHLLGSLSNY 
RHGHVDTLKF 
TPLHCAAWHG 
CDKDGHIALH 
NCNLDISNKY 
GLLARLRKDT 
RPRLSSTNSS 
. PTHHPHCSAD 
FSLEEPYEIQ 
GEFGYDKDTS 
PPMTHLCEKI 
EINIMQSETV 
ELLQILDAMD 
FPCGIFHKVQ 
GLETEKIKCC 
AQTLKETSLT 
PDPLGKDWCL 
LRELGRRDAA 



11 

I 

DYYDTGEELG 
IQHPNVITLH 
HSLQIAHFDL 
YEPLiGLEADM 
FIRRLLVKDP 
LISLCQRLSR 
DVNQPNKHGT 
LSENKCPLDV 
YYSVAKALCE 
LAVRRCQMEV 
GRTPLHLAAN 
HRGLFIQQLR 
RFPPSPLASK 
DQSTKAIDIQ 
LNPVIFWLSF 
LLKEIRNRFG 
ISTLPSWRKL 
QDVLLLDPRW 
ICARDLSSGT 
VNLCRWIHQQ 
LLLDSVCSTI 
NTMGGYKESF 
LAMNLGLPDL 
DLLLKASSVF 



21 
I 

SGQFAWKKC 

EVYENKTDVI 

KPENIMLLDR 

WSIGVITYIL 

KKRMTIQDSL 

SFLSRSNMSV 

PPLLIAAGCG 

KDKSGEMALH 

AGCNVNIKNR 

IKTLLSQGCF 

NGILDWRYL 

PTQNLQPRIK 

PTVSVSINNL 

NAYLNGVGDF 

LKSLVPVEEP 

NDLHISNKLF 

NGPNQLMSLQ 

LCTNVLGKLL 

MVDVPALIKT 

STEGDADIRL 

ENVMATTLPG 

SSIMCFGCHD 

VAKYNTNNGA 

KINLDGNGQE 



31 
I 

REKSTGLQYA 
LILELVAGGE 
NVPKPRIKII 
LSGASPFLGD 
QHPWIKPKDT 
ARSDDTLDEE 
NIQILQLLIK 
VAARYGHADV 
EGETPLLTAS 
VDYQDRHGNT 
CLMGASVEAL 
LKLFGHSGSG 
YPGCENVSVR 
SVWEFSGNPV 
IAFGGKLKNP 
VLDAGASGSK 
QFVYDVQDQL 
SVETPRALHH 
DNLHRSWADE 
WVNGCKLANR 
LLTVKHYLSP 
VYSQASLGMD 
PKDFLPSPLH 
AYASSCNSGT 



41 
1 

AKFIKKRRTK 
LFDFLAEKES 
DFGLAHKIDF 
TKQETLANVS 
QQALSRKASA 
DSFVMKAIIH 
RGSRIDVQDK 
AQVTCAASAQ 
ARGYHDIVEC 
PLHVACKDGN 
TTDGKTAEDL 
KTTLVESLKC 



YFCCYDYFAA 
LQWLVATHA 
DMKVLRNKLQ 
NPLASEEDLR 
YRGRYTVEDI 
EDEVMVYGGV 
GAELLVLLVN 
QQLREHHEPV 
IHASDLNLLT 
ALLREWTTYP 
SYNSISSWS 



51 
I 

SSRRGVSRED 

LTEEEATEFL 

GNEFKNIFGT 

AVNYEFEDEY 

VNMEKFKKFA 

AINDDNVPGL 

GGSNAVYWAA 

IPISRTKEEE 

LAEHGADLNA 

MPIWALCEA 

ARSEQHEHVA 

GLLRSFFRRR 

TKGMLEVFVA 

NDPTSIHWV 

DIMNVPRPAG 

EIRSQIVSVC 

RIAQQLHSTG 

QRLVPDSDVE 

RIVPVEHLTP 

HGQGIEVQVR 

MIYQPRDFFR 

RRKLSRLLDP 

ESTVGTLMSK 

R 



Seq ID NO: C288 Protein Sequence 
Protein Accession #: NP_002072.1 



1 11 21 31 41 51 

| | | | | | 

MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 60 
ICPQGYTCCT SEMEENIANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 120 
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 180 
QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDWRKVAQV 240 
PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300 
TDKFWGTSGV ESVIGSVHTW LAEAIKALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 360 
RGKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKMALSTA SDDRCWNGMA 420 
RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480 
ASDDGSGSGS 



Seq ID NO: C289 Protein Sequence 
Protein Accession #: AAH30205.1 

1 11 21 31 41 51 

| | | | | | 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 120 
ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 277 

Seq ID NO: C290 Protein Sequence 
Protein Accession #: NP_001973.1 



MRANDALQVXi 
GNLEIVLTGH 
LNYNTNSSHA 
RSCPPCHEVC 
TDCFACRHFN 
CVRACPPDKM 
LDFTilTGLNG 
RSLYKRGFSL 
RLDIKHNRPR 
REFAHEAECF 
YKYPDVQNEC 
GTFLYWRGRR 
GTVHKGVWIP 
SSLQLVTQYI* 
LLKSPSQVQV 
VWELMTFGAE 
LANEFTRMAR 
NLATTTLGSA 
PVSLHPMPRG 
TPVTPLSPPG 
NRRRRHSPPH 
EYMNRQRDGG 
SAFDNPDYWH 



11 
I 



GLLFSLARGS 
NADLSFLQWI 
LRQLRLTQLT 



DSGACVPRCP 
EVDKNGLKMC 
DPWHKIPALD 
LIMKNLNVTS 
RDCVAEGKVC 
SCHPECQPMG 
RPCHENCTQG 
IQNKRAMRRY 
EGESIKIPVC 
PLGSLLDHVR 
ADFGVADLLP 
PYAGLRIiAEV 
DPPRYLVIKR 
LSLPVGTLNR 



21 
I 

EVGNSQAVCP 
REVTGYVLVA 
EILSGGVYIE 
DCQTLTKTIC 
QPLVYNKLTF 
EPCGGLCPKA 
PEKLNVFRTV 
LGFRSLKEIS 
DPLCSSGGCW 
GTATCNGSGS 
CKGPELQDCL 
LERGESIEPL 
IKVIEDKSGR 
QHRGALGPQL 
PDDKQLLYSE 
PDLLEXGERL 
ESGPGIAPGP 



GTLNGLSVTG 
MNEFSTLPLP 



LEEEDVNGYV 
PPRPSSLEEL 
GPGGDYAAMG 
SRLFPKANAQ 



VTGSEAELQE 
MPDTHLKGTP 
GYEYMDVGSD 
ACPASEQGYE 
RT 



APQCNGHCFG 
QLEPNPHTKY 
CEGTGSGSRF 
REITGYLNIQ 
AGRIYISANR 
GPGPGQCLSC 
DTCAQCAHFR 
GQTLVLIGKT 
DPSEKANKVL 
QSFQAVTDHM 
LLNWGVQIAK 
AKTPIKWMAL 
AQPQICTIDV 
EPHGLTNKKL 
SSGYMPMNQG 



41 

I 

DAENQYQTLY 
NLRWRGTQV 
IDWRDIVRDR 
PNPNQCCHDE 
QYGGVCVASC 
QTVDSSNIDG 
SWPPHMHNFS 
QLCYHHSLNW 



DGPHCVSSCP 
HLTMALTVIA 
ARIFKETELR 
LAIGSLDHAH 
GMYYLEEHGM 
ESIHFGKYTH 
YMVMVKCWMI 
EEVELEPELD 



51 
I 

KLYERCEWM 
YDGKFAIFVM 
DAEIWKDNG 
CAGGCSGPQD 
PHNFWDQTS 
FVNCTKILGN 
VFSNLTTIGG 
TKVLRGPTEE 
THCNFLNGEP 
HGVLGAKGPI 
GLWIFMMLG 
KLKVLGSGVF 
IVRLLGLCPG 
VHRNLAARNV 
QSDVWSYGVT 
DENIRPTFKE 
t .r>T .nr p Kfl l 



SSREGTLSSV 
LSASLGSTQS 
EMRAFQGPGH 



SRSPRPRGDS AYHSQRHSLL 
GLSSVLGTEE EDEDEEYEYN 
CP LHP VP IMP TAGTTPDEDY 
QAPHVHYARL KTLRSLEATD 
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Seq ID NO: C291 Protein Sequence 
Protein Accession #: NP_001207.1 



1 11 21 31 41 51 

I I i I 1 I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSBED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 

DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

VEGHRFPAEI HWHLSTAPA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 

EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASP PAGVDSSPRA AEFVQLNSCL AAGDILALVF 420 

GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 459 

Seq ID NO: C292 Protein Sequence 
Protein Accession #: NP_004198.1 

1 11 21 31 41 51 

I ! ! I I I 

MGGAWDEGP TGVKAPDGGW GWAVLFGCFV ITGFSYAFPK AVSVFFKELI QEFGIGYSDT 60 

AWISSILLAM LYGTGPLCSV CVNRFGCRPV MLVGGLPASL GMVAASFCRS IIQVYLTTGV 120 

ITGLGLALNF QPSLIMLNRY FSKRRPMANG LAAAGSPVFL CALSPLGQLL QDRYGWRGGF 180 

LILGGLLLNC CVCAALMRPL WTAQPGSGP PRPSRRLLDL SVFRDRGFVL YAVAASVMVL 240 

GLFVPPVFW SYAKDLGVPD TKAAFLLTIL GFIDIFARPA AGFVAGLGKV RPYSVYLFSF 300 

SMFFNGLADL AGSTAGDYGG LWFCIFFGI SYGMVGALQF EVLMAIVGTH KFSSAIGLVL 360 

LMEAVAVLVG PPSGGKLLDA THVYMYVFIL AGAEVLTSSL ILLLGNFFCI RKKPKEPQPB 420 

VAAAEEEKLH KPPADSGVDL REVEHFLKAE PEKNGEWHT PETSV 465 



Seq ID NO: C293 Protein Sequence 
Protein Accession #: NP_000349.1 

1 11 21 31 41 51 

I I 1 I 1 I 

MALFVRLLAL ALALALGPAA TLAGPAKSPY QLVLQHSRLR GRQHGPNVCA VQKVIGTNRK 60 

YFTNCKQWYQ RKICGKSTVI SYECCPGYEK VPGEKGCPAA LPLSNLYETL GWGSTTTQL 120 

YTDRTEKLRP EMEGPGSFTI FAPSNEAWAS LPAEVLDSLV SNVNIELLNA LRYHMVGRRV 180 

LTDELKHGMT LTSMYQNSNI QIHHYPNGIV TVNCARLLKA DHHATNGWH LIDKVISTIT 240 

NNIQQIIEIE DTFETLRAAV AASGLNTMLE GNGQYTLLAP TNEAFEKIPS ETLNRILGDP 300 

EALRDLLNNH ILKSAMCAEA IVAGLSVETL EGTTLEVGCS GDMLTINGKA IISNKDILAT 360 

NGVIHYIDEL LIPDSAKTLF ELAAESDVST AIDLFRQAGL GNHLSGSERL TLLAPLNSVF 420 

KDGTPPIDAH TRNLLRNHII KDQLASKYLY HGQTLETLGG KKLRVFVYRN SLCIENSCIA 480 

AHDKRGRYGT LFTMDRVLTP PMGTVMDVLK GDNRFSMLVA AIQSAGLTET LNREGVYTVF 540 

AFTNEAFRAL PPRERSRLLG DAKELANILK YHIGDEILVS GGIGALVRLK SLQGDKLEVS 600 

LKNNWSVNK EPVAEPDIMA TNGWHVITN VLQPPANRPQ ERGDELADSA LEIFKQASAF 660 

SRASQRSVRL APVYQKLLER MKH 683 



Seq ID NO: C294 Protein Sequence 
Protein Accession #: NP_006527.1 

1 11 21 31 41 51 

| | | | | | 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 420 

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720 

ICMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDOGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 

LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKENGT KLL 943 

Seq ID NO: C295 Protein Sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYVTLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 

NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 470 

Seq ID NO: C296 Protein Sequence 
Protein Accession #: Eos sequence 






1 11 21 31 41 51 

I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHAI GHSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 

NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 470 

Seq ID NO: C297 Protein Sequence 
Protein Accession #: NP_008883.1 

1 11 21 31 41 51 

I I I I I I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120 

FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 

LCWTFWVLLG TMFYWSRIEY 260 

Seq ID NO: C298 Protein Sequence 
Protein Accession #: NP_001784.2 

1 11 21 31 41 51 

| | | | | | 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKCGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NIKAANTDPT APPYDTLLVF 780 

DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 829 

Seq ID NO: C299 Protein Sequence 
Protein Accession #: NP_005620.1 

1 11 21 31 41 51 

I I I I I I 

MAKKSAENGI YSVSGDEKKG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRETWTRQMD 60 

FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGIPIFFLEI SLGQFMKAGS 120 

INWINICPLF KGLGYASMVI VFYCNTYYIM VLAWGFYYLV KSFTTTLPWA TCGHTWNTPD 180 

CVEIFRHEDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLWLLVRGV LLPGALDGII YYLKPDWSKL 300 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIIIALINSG TSFFAGFWF 360 

SILGFMAAEQ GVHISKVAES GPGLAFIAYP RAVTLMPVAP LHAALFFFML LLLGLDSQFV 420 

GVEGFITGLL DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 

TTLLWQAFWE CVWAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNW 540 

YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL LRAKGTMAER WQHLTQPIWG 600 

LHHLEYRAQD ADVRGLTTLT PVSESSKWV VESVM 635 

Seq ID NO: C300 Protein Sequence 
Protein Accession #: NP_006507.1 

1 11 21 31 41 51 

I I I I I 1 

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEFYNQTWVH RYGESILPTT 60 

LTTLWSLSVA IFSVGGMIGS FSVGLFVNRF GRRNSMLMMN LLAFVSAVLM GFSKLGKSFE 120 

MLILGRFIIG VYCGLTTGFV PMYVGEVSPT AFRGALGTLH QLGIWGILI AQVFGLDSIM 180 

GNKDLWPLLL SIIFIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 240 

LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAWLQL SQQLSGINAV FYYSTSIFEK 300 

AGVQQPVYAT IGSGIVNTAF TWSLFWER AGRRTLHLIG LAGMAGCAIL MTIALALLEQ 360 

LPWMSYLSIV AIFGFVAFFE VGPGPIPWFI VAELFSQGPR PAAIAVAGFS NWTSNPIVGM 420 

CFQYVEQLCG PYVFIIFTVL LVLFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 480 

BLFHPLGADS QV 492 

Seq ID NO: C301 Protein Sequence 
Protein Accession #: XP_035292.2 

1 11 21 31 41 51 

I I I I I I 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGS GEGVTLQRNI TLLNGVAIIV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 

LEVYGSLPAF LKLHIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 

LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 

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GNIVLALYSG LFAYGGWNYL NFVTEEKINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVP VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 

RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 

KPKWLLQGIF STTVLCQKLM QWPQET 507 

Seq ID NO: C302 Protein Sequence 
Protein Accession #: NP_005259.1 

1 11 21 31 41 51 

I I 1 1 I I 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 

SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120 

GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 180 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 

DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 273 

Seq ID NO: C303 Protein Sequence 
Protein Accession #: NP_005121.1 

1 11 21 31 41 51 

I I I I I I 

MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKF 60 
VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKQVARNL 120 
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 180 
PSSLAVTQTM ATKAPECVED PDMANQRKTA LEFCGETWSS LCTFFLSIVQ DTSC 234 

Seq ID NO: C304 Protein Sequence 
Protein Accession #: AAH22542 

1 11 21 31 41 51 

I 1 I I I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 60 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 120 
ASIWNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 180 
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 225 

Seq ID NO: C305 Protein Sequence 
Protein Accession #: NP_004985.1 



1 
I 

MSLWQPLVLV 



ITYWIQNYSE 
FDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 
CTSEGRGDGR 
PMYRFTEGPP 
PTAGPTGPPS 



LDKLGLGADV 
THDVFQYREK 



11 
I 

LLVLGCCFAA 
LLLLQKQLSL 
DLPRAVIDDA 
AFPPGPGIQG 
DGLPWCSTTA 
YRHCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFIilADKW 
AQVTGALRSG 
AYFCQDRFYW 



21 
I 

PRQRQSTLVL 
PETGELDSAT 
FARAFALWSA 
DAHFDDDELW 
NYDTDDRFGF 
DRDKLFGFCP 



HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 



31 
1 

FPGDLRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGWVPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEPLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



41 
I 

DRQLAEEYLY 
VPDLGRFQTF 
SKDADIVIQF 
RFGNADGAAC 
GNADGKPCQF 
NSAGELCVFP 
HEFGHALGLD 
PTAPPTVCPT 
FDAIAEIGNQ 
FFSGRQVWVY 
MVDPRSASEV 
ILQCPED 



51 
I 

RYGYTRVAEM 



GVAEHGDGYP 
HFPFIFEGRS 
PFIPQGQSYS 
FTFLGKEYST 
HSSVPEALMY 
GPPTVHPSER 
LYLFKDGKYW 
TGASVLGPRR 
DRMFPGVPLD 



Seq ID NO: C306 Protein Sequence 
Protein Accession #: NP_000204 



1 
I 

MAGPRPSPWA 
CNTQAELLAA 
EVFEPLESPV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAFNRIRSNL 
THVCQLPEDQ 
CSEGWSGQTC 
FQCPRTSGFL 
CHCHQQSLYT 
LKRAEEVWR 
LLPLLALLLL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKFGA 
RKIHFNWIiPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
EELDLRRVTW 



11 
I 

RLLLAALISV 
GOQRESIWM 
DLYILMDFSN 
KEPWPNSDPP 
NRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIKLKPSF 
NCSTGSLSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPKGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
RLPPEIilPRL 



21 
1 

SLSGTLANRC 
ESSFQITEET 
SMSDDDDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKMF 
SDGLKMDAGI 
QPCLREGEDK 
QCVCEPGWTG 
IHPGLCEDLR 
TYSYTMEGDG 
ACLALLPCCN 
PGFATHAASI 
VHKIiQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLFQP 
IIRDPDELDR 
YWIQGDSESE 
SEPGRIAFNV 
RMLLIENLRE 
EDYDSFLMYS 



31 
I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
BDVDEFRNKL 
ADGANVLAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHKVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEFQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMLSSQ 
AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
EAPTAPRTTA 



41 
I 

CVRVDKDCAY 
MSPOGLRVRL 
SQLTSDYTIG 
QGERISGNLD 
MSRNDERCHL 
PVSSLGVLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 
GEKKGRTCEE 
HKKKDCPPGS 
HYMLRENLMA 
SLRLARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGEI 
RNGAGWGPER 
QRPSVSDDTG 
ARAGRAAAVP 



51 
I 

CTDEMFRDRR 
RPGEERHFEL 
FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEHVDG 
NGDFVGGQCV 
YBGQFCEYDN 



CNFKVKMVDE 
FWWLIPLLLIi 
SDHLDTPMLR 
NLLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNFNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
CGWKFEPIiLG 
RSATPGPPGE 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

707 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 






HLVNGRMDFA FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH 1500 

SHSTTLPRDY STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY 1560 

SVEYQLLNGG ELHRLNIPNP AQTSVWEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES 1620 

QVHPQSPLCP LPGSAPTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM 1680 

AQGGGPATAF RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG 1740 

PFPQLGSRAG LFQHPLQSEY SSITTTHTSA TEPFLVDGLT LGAQHLEAGG SLTRHVTQEF 1800 

VSRTLTTSGT LSTHMDQQFF QT 1822 

Seq ID NO: C307 Protein Sequence 
Protein Accession #: NP_076404.1 



1 11 21 31 41 51 

I 1 I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIPVASILLN 60 

GIAVWIFFHI RNKTSFIFYL KNIWADLIM TLTPPFRIVH DAGFGPWYFK FILCRYTSVL 120 

PYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH DCSKLKSPLG VKHHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 

ISQSSRKRKH NQSIRVWAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 

FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 358 



Seq ID NO: C308 Protein Sequence 
Protein Accession #: NP_065840.1 



1 11 21 31 41 51 

| | | | | | 

MVWCLGLAVL SLVISQGADG RGKPEWSW GRAEESWLG CDLLPPAGRP PLHVIEWLRF 60 

GPLLPIFIOF GLYSPRIDPD YVGRVRLQKG ASLQIEGLRV EDQGWYECRV FFLDQHIPED 120 

DFANGSWVHL TVNSPPQFQE TPPAVLEVQE LEPVTLRCVA RGSPLPHVTW KLRGKDLGQG 180 

QGQVQVQNGT LRIRRVERGS SGVYTCQASS TEGSATHATQ LLVLGPPVIV VPPKNSTVNA 240 

SQDVSLACHA EAYPANLTYS WFQDNINVFH ISRLQPRVQI LVDGSLRLLA TQPDDAGCYT 300 

CVPSNGLLHP PSASAYLTVL CMPGVIRCPV RANPPLLFVS WTKDGKALQL DKFPGWSQGT 360 

EGSLIIALGN EDALGEYSCT PYNSLGTAGP SPVTRVLLKA PPAFIERPKE EYFQEVGREL 420 

MPCSAQGDP PPWSWTKVG RGLQGQAQVD SNSSLILRPL TKEAHGHWEC SASNAVARVA 480 

TSTNVYVLGT SPHWTNVSV VALPKGANVS WEPGFDGGYL QRFSVWYTPL AKRPDRMHHD 540 

WVSLAVPVGA AHLLVPGLQP HTQYQFSVLA QNKLGSGPFS EIVLSAPEGL PTTPAAPGLP 600 

PTEIPPPLSP PRGLVAVRTP RGVLLHWDPP ELVPKRLDGY VLEGRQGSQG WEVLDPAVAG 660 

TETELLVPGL IKDVLYEFRL VAFAGSFVSD PSNTANVSTS GLEVYPSRTQ LPGLLPQPVL 720 

AGWGGVCFL GVAVLVSIIA GCLLNRRRAA RRRRKRLRQD PPLIFSPTGK SAAPSALGSG 780 

SPDSVAKLKL QGSPVPSLRQ SLLWGDPAGT PSPHPDPPSS RGPLPLEPIC RGPDGRFVMG 840 

PTVAAPQERS GREQAEPRTP AQRLARSFDC SSSSPSGAPQ PLCIEDISPV APPPAAPPSP 900 

LPGPGPLI^Y LSLPPFREMN VDGDWPPLEE PSPAAPPDYM DTRRCPTSSF LRSPETPPVS 960 

PRESLPGAW GAGATAEPPY TALADWTLRE RLLPGLLPAA PRGSLTSQSS GRGSASFLRP 1020 

PSTAPSAGGS YLSPAPGDTS SWASGPERWP RREHWTVSK RRNTSVDENY EWDSEFPGDM 1080 

ELLETLHLGL ASSRLRPEAE TELGVKTPEE GCLLNTAHVT GPEARCAALR EEFIAFRRRR 1140 

DATRARLPAY RQPVPHPEQA TLL 1163 

Seq ID NO: C309 Protein Sequence 
Protein Accession #: 

1 11 21 31 41 51 

| | | | | | 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGFGTVFAGH RLTORLQVAI 60 

KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGVI RLLDWFETQE GFMLVLERPL 120 

PAQDLFDYIT EKGPLGEGPS RCFFGQWAA IQHCHSRGW HRDIKDENIL IDLRRGCAKL 180 

IDFGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT VWSLGILLYD MVCGDIPFER 240 

DQEILEAELH FPAHVSPDCC ALIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 300 

RPCPFGLVLA TLSLAHPGLA PNGQKSHPMA MSQG 334 



Seq ID NO: C310 Protein Sequence 
Protein Accession #: NP_002501.1 

1 11 21 31 41 51 

1 I 1 I I I 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGHSSD ENDWNEKLYP 60 

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGHRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY WTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

HTYVLNGTPS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 360 

RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF WTCQGSIPT EVCTIISDPT 420 

CEITQNTVCS PVDVDEHCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVF 540 

FPGNQEKDPL LKNQEFKGVS 560 

Seq ID NO: C311 Protein Sequence 
Protein Accession #: Eos seq 

1 11 21 31 41 51 

| | | | | | 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCPDA DRFSSFEEAV KGKGKLRALS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGRQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
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TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

KEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTKGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

GRWTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVWHCSAGV GRTGTYIVLD 1080 

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL WMIPDGQNM 1260 

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 

AESLESLV 1448 



Seq ID NO: C312 Protein Sequence 
Protein Accession #: XP_031379 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 



11 
I 



21 



31 



41 



51 



GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMFSQVEP 
SLFSGPSHIP 
PVSVAEFTYT 
LNASLQETSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPLNTLINKL 
PHRDGSVTST 



SPGKSPSANG 
EHETSTDFSF 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 
VHDEHGGVTA 
SLVSTRQEEN 



IQLLCVCRLD WANGYYRQQR 
TQVNVNLKKL KFQGWDKTSL 
GKCNMSSDGS EHSLEGQKFP 
LDFKAIIDGV ESVSRFGKQA 
DTVSISESQL AVFCEVLTMQ 
CSSEPENVQA DPENYTSLLV 
LGAILNNLLP NMSYVLQIVA 
EEEGKDIEEG AIVNPGRDSA 
GKGDVPNTSL NSTSQPVTKL 
MNLSGTAESL NTVSITEYEE 
SENPETITYD VLIPESARNA 
ESFLQTNYTE IRVDESEKTT 
SSRQQDLVST VNWYSQTTQ 
DSALHATPVF PSVDVSFESI 
SDKVPLHASL PVAGGDLLLE 
PSSDAMMHAR SSGPEPSYAL 
IPKSSLITPT ASLLQPTHAL 
TSVFGDDNKA LSKSEIIYGN 
SISSTKGMFP GSLAHTTTKV 
ASSDPASSEM LSPSTQLLFY 
TPKVDKISST MLHLIVSNSA 
PVLLKSESSH QWPSLYSND 
IHSDEILTST KSSVTGKVFA 
KLLFPSKATS ELSHSAKSDA 
SQEKVKNDSD THENSLMDQN 
LSQKHNDGKE ENDIQTGSAL 
ADTNEKDADG ILAAGDSEIT 
LESEKKAVIP LVIVSALTFI 
PISDDVGAIP IKHFPKHVAD 
KHKNRYINIV AYDHSRVKLA 
FWRMIWEHNV EVIVMITNLV 
TLRNTKIKKG SQKGRPSGRV 
VHCSAGVGRT GTYIVLDSML 
EAILSKETEV LDSHIHAYVN 
KNRTSSIIPV ERSRVGISSL 
WDHNAQLWM IPDGQNMAED 
ILEATQDDYV LEVRHFQCPK 
GTFCALTTLM HQLEKENSVD 
PSTSLDSNGA ALPDGNIAES 



KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 

LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 

ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

LPLSPESKAW AVLTSDEBSG SGQGTSDSLN 1560

PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

CLWLVGILI YWRKCFQTAH FYLEDSTSPR 1680 

LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 

QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040 

SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 

EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 

WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 

VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 

LESLV 2315 



Seq ID NO: C313 Protein Sequence 
Protein Accession #: NP_002842



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNAEASNS 



41 
I 

TGALNQKNWG 
TVEINLTNDY 



LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
SHESRIGLAE 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
GLESEKKAVI 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
PLVIVSALTF
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 
GSQKGRPSGR 
TGTYIVLDSM 
VLDSHIHAYV 
VERSRVGISS 
MIPDGQNMAE 
VLEVRHFQCP 
MHQLEKENSV 
AALPDGNIAE 



ICLWLVGIL 
DLHASSGFTE 
AQLAEKDGKL 
VEKGRRKCDQ 
WTQYHYTQW 
LQQIQHEGTV 
NALLIPGPAG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



IYWRKCPQTA 
EFETLKEFYQ 
TDYINANYVD 
YWPADGSEEY 
PDMGVPEYSL 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFADI 



HFYLEDSTSP 
EVQSCTVDLG 
GYNRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL 
TLMAEEHKCL 
EEAANRDGPM 
EQYQFLYKVI 



RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLIIQD
IVHDEHGGVT 
LSLVSTRQEE 



FPISDDVGAI 
NKHKNRYINI 
DFWRMIWEHN 
FTLRNTKIKK 
WHCSAGVGR 
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQLW 
FILEATQDDY 
AGTFCALTTL 
NPSTSLDSNG 



Seq ID NO: C314 Protein Sequence 
Protein Accession #: Eos sequence



1 
I 

MRILKRFLAC 
QSPINIDEDL 
MVFKASKITF 
LSILFEVGTB 
PCTDTDWIVF 
YTGKEEIHEA 
KHEFLTDGYQ
ELIGTEEIIK 
NRSPTRGSEF 
DGSKTVLRSP 
SENISQGYIF 
ITAQPDVGSG 
PTEVTPHAFT 
PLVIVSALTF 
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 
GSQKGRPSGR 
TGTYIVLDSM 
VLDSHIHAYV 
VERSRVGISS 
MIPDGQNMAE 
VLEVRHFQCP 
MHQLEKENSV 
AALPDGNIAE 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
HWGKCNMSSD 
ENLDFKAIID 
KDTVSISESQ 
VCSSEPENVQ 
DLGAILNNLL 
EEEEGKDIEE 
SGKGDVPNTS 
HMNLSGTAES 
SSENPETITY 
RESFLQTNYT 
PSSRQQDLVS 
ICLWLVGIL 
DLHASSGFTE 
AQLAEKDGKL 
VEKGRRKCDQ 
WTQYHYTQW 
LQQIQHEGTV 
NALLIPGPAG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
GSEHSLEGQK 
GVESVSRFGK 
LAVFCEVLTM 
ADPENYTSLL 
PNMSYVLQIV 
GAIVNPGRDS 
LNSTSQPVTK 
LNTVSITEYE 
DVLIPESARN 
EIRVDESEKT 
TVNWYSQTT 
IYWRKCFQTA 
EFETLKEFYQ 
TDYINANYVD 
YWPADGSEEY 
PDMGVPEYSL 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFADI 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
FPLEMQIYCF 
QAALDPFILL 
QQSGYVMLMD 
VTWERPRWY 
AICTNGLYGK 
ATNQIRKKEP 
LATEKDISLT 
EESLLTSFKL 
ASEDSTSSGS 
TKSPSAGPVM 
QPVYNEASNS 
HFYLEDSTSP 
EVQSCTVDLG 
GYNRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL 
TLMAEEHKCL 
EEAANRDGPM 
EQYQFLYKVI 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DADRFSSFEE 
NLLPNSTDKY 
YLQNNFREQQ 
DTMIEKFAVL 
YSDQLIVDMP 
QISTTTHYNR 
SQTVTELPPH 
DTGAEDSSGS 
EESLKDPSME 
SQGPSVTDLE 
SHESRIGLAE 
RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE 
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLIIQD
IVHDEHGGVT 
LSLVSTRQEE 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
AVKGKGKLRA 
YIYNGSLTSP 
YKFSRQVFSS 
YQQLDGEDQT 
TDNPELDLFP 
IGTKYNEAKT 
TVEGTSASLN 
SPATSAIPFI 
GNVWFPSSTD 
MPHYSTFAYF 
GLESEKKAVI 
FPISDDVGAI 
NKHKNRYINI 
DFWRMIWEHN 
FTLRNTKIKK 
WHCSAGVGR 
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQLW 
FILEATQDDY 
AGTFCALTTL 
NPSTSLDSNG 
1080

1140 

1200 

1260 

1320 

1380 

1440 

1456 
Seq ID NO: C315 Protein Sequence
Protein Accession #: Eos sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
SRVAGTILLS 
YIMGYYQSNE 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 



QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 



AANRDGPMIV 
YQPLYKVILS 



ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGL 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVGTRQEENP 



31 
|

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 



ESLKDPSMEG
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
TISAHCNLPL 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
ESLV 



Seq ID NO: C316 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 
TDTVDWIVFK
TGKEEIHEAV 
HEFLTDGYQD 
ESRIGLAEGL 
ISTPPTPIFP 
ADSSNHPDNK 
GPLKSTAEDF 
LAYYTVRNFT 
KRHAVGPWV 
YVFIHDTLVE 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVSTRQEENP 



DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
ESEKKAVIPL 
ISDDVGAIPI 
HKNRYINIVA 
WRMIWEHNVE 
LRNTKIKKGS
HCSAGVGRTG 
AILSKETEVL 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
VIVSALTFIC 
KHFPKHVADL 
YDHSRVKXAQ 
VIVMITNLVE 
QKGRPSGRW 
TYIVLDSMLQ 
DSHIHAYVNA 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
LWLVGILIY 
HASSGFTEEF 
LAEKDGKLTD 
KGRRKCDQYW 
TQYHYTQWPD 
QIQHEGTVNI 
LLIPGPAGKT 
GEGTDYINAS
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 



LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
WRKCFQTAHP 
ETLKEFYQEV 
YINANYVDGY 
PADGSEEYGN 
MGVPEYSLPV 
FGFLKHIRSQ 
KLEKQFQLLS 
YIMGYYQSNE 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



KFSRQVFSSY 
QQLDGEDQTK 
DNPEASNSSH 
YLEDSTSPRV 
QSCTVDLGIT 
NRPKAYIAAQ 
FLVTQKSVQV 
LTFVRKAAYA 
RNYLVQTEEQ 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMIV 
YQFLYKVILS 
Seq ID NO: C317 Protein Sequence
Protein Accession #: Eos sequence



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFIP 
LVIVSALTFI 
IKHFPKHVAD 
AHDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML
LDSHIHAYVN
SRVARTILLS
YIMGYYQSNE 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



11 

I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 



AANRDGPMIV 
YQFLYKVILS 



21 

I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGL 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVSTRQEENP 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
TISAHCNLPL 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 



Seq ID NO: C318 Protein Sequence 
Protein Accession #: Eos sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 
DGRNFLCSDN 



11 

I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 



LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
QLAEKDGKLT 



VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDE 
PYAPTRKRKF 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PINCESFKVT 
RGCLPGSQDD 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 



KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 
QSDEARSLC 



ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 



Seq ID NO: C319 Protein Sequence 
Protein Accession #: XP_002914.4 

11 
51
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSIIPV 
WDHNAQLWM 
ILEATQAWRS 



1 11 21 31 41 51 

| | | | | |

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 
VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 

1357 
60

120 

180 



WO 03/042661 



LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQS ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQVVEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 1437

Seq ID NO: C320 Protein Sequence 
Protein Accession #: NP_005679.1

1 11 21 31 41 51 

| | | | | |

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSIAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQVVEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 1437

Seq ID NO: C321 Protein Sequence 
Protein Accession #: NP_005553.1

1 11 21 31 41 51 

I I I I I I 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420 

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW 480 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL BRKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 

MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 1193 

Seq ID NO: C322 Protein Sequence 
Protein Accession #: NP_066924.1 

1 11 21 31 41 51 

| | | | | |

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 
QIQCKVFDSL LNLSSTLQAT RALMWGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120
IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 211 



Seq ID NO: C323 Protein Sequence 
Protein Accession #: AAM77876 

1 11 21 31 41 51

| | | | | |

MSSWIRWHGP AMARLWGFCW LWGFWRAAP ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVWIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 477 



Seq ID NO: C324 Protein Sequence 
Protein Accession #: NP_006171.1

1 11 21 31 41 51 

| | | | | |

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVWIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQUCPDTFV QHIKRHNIVL 540 

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600 

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 

PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 

EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 822 



Seq ID NO: C325 Protein Sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

| | | | | |

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVWIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 477 

Seq ID NO: C326 Protein Sequence
Protein Accession #: NP_570843.1 

1 11 21 31 41 51 

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60 

LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120 

FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180 

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300 

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360 

FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420 

YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480 

VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540 

IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C 581



Seq ID NO: C327 Protein Sequence 
Protein Accession #: NP_002649.1 

1 11 21 31 41 51 

| | | | | |

MRALLARLLL CVLWSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML 360 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 

SHTKEENGLA L 431 

Seq ID NO: C328 Protein Sequence 
Protein Accession #: XP_087254.1



I 

MQFRECSING 
NETELIKEHD 
AAARIGIVFI 
AESSILPKCI 
REEKLAAVFQ 
SLSCGHFHRT 
EKLFMEVCRN 
GIMGKEGRQA 
QFYCLFSQQT 
SIKTFLYWTI 
MALETHFWTW 
ILMWTCLFIi 
LERVIGRCSP 



11 
I 

MKYQEINGRL 
LFFKAVSLCH 
GNSEETMEVK 
GGEIEKTRIH 
FIEKDLILLG 
MNILELINQK 
CSAVLCCRMA 
ARNSDYAIAR 
LYDSVYLTLY 
LGFSHAFIFF 
INHLVTWGSI 
DIIKKVPDRH 
THISRSWSAS 



21 
I 

VPEGPTPDSS 
TVQISNVQTD 
TLGKLERYKL 
VDEPALKGLR 
ATAVEDRLQD 
SDSECAEQLR 
PLQKAKVIRL 
FKFLSKLLFV 
NICFTSLPIL 
FGSYLLIGKD 
IFYFVFSLFY 
LHPTSTEKAQ 
DPFYTNDRSI 



31 
I 

EGNLSYLSSL 
CTGDGPWQSN 
LHILEFDSDR 
TLCIAYRKFT 
KVRETIEALR 
QLARRITEDH 
IKISPEKPIT 
HGHFYYIRIA 
IYSLLEQHVD 
TSLLGNGQMF 
GGILWPFLGS 
LTETNAGIKC 
LTLSTMDSST 



41 
I 

SHLNNLSHLT 
LAPSQLEYYA 
RRMSVIVQAP 
SKEYEBIDKR 
MAGIKVWVLT 
VIQHGLWDG 
LAVGDGANDV 
TLVQYFFYXN 
PHVLQNKPTL 
GNWTFGTLVF 
QNMYFVFIQL 
LDSMCCFPEG 
C 



51 
I 

TSSSFRTSPE 
SSPDEKALVE 
SGEKLLFAKG 
IFEARTALQQ 
GDKKETAVSV 
TSLSLALREH
SMIQEAHVGI 
VCFITPQFLY 
YRDISKNRLL 
TVMVITVTVK 
LSSGSAWFAI 
EAACASVGRM 



Seq ID NO: C329 Protein Sequence 
Protein Accession #: XP_087461.1 

1 11 21 31 41 51 

| | | | | |
MLPLIAALIA AACPLPPVRG GAADAPGLLG VPSNASVNAS SAASPSPRGC WPRRPPGPPS 
ARARRRRRRR RRLCNISVQR QMLSSLLVRW GRPRGFQCDL LLFSTNAHGR AFFAAAFHRV 
GPPLLIEHLG LAAGGAQQDL RLCVGCGWVR GRRTGRLRPA AAPSAAAATA GAPTALPAYP 
AAEPPGPLWL QGEPLHFCCL DFSLEELQGE PGWRLNRKPI ESTLVACFMT LVIWWSVAA 
LIWPVPIIAG FLPNGMEQRR TTASTTAATP AAVPAGTTAA AAAAAAAAAA AVTSGVATK 

Seq ID NO: C330 Protein Sequence 
Protein Accession #: XP_051522.2 



MDLHLFDYSE 
ANSVWWVNI 
HLIFSINLFG 
YYLKTVTSAS 
ASSDQEKHSS 
QCLSLVHCCV 
TK 



11 
I 

PGNFSDISWP 
QAKTTGYDTH 
SIFFLTCMSV 
NNETYCRSFY 
RKIIFSYVW 
NPVLYSFINR 



21 
1 

CNSSDCIWD 
CYILNLAIAD
DRYLSITYFT 
PEHSIKEWLI 
FLVCWLPYHV 
NYRYELMKAF 



31 
I 

TVMCPNMPNK 
LWWLTIPVW 
NTPSSRKKMV 
GMELVSWLG 
AVLLDIFSIL
IFKYSAKTGL



41 
I 

SVLLYTLSFI 
WSIiVQHNQW 
RRWCILVWL 
FAVPFSIIAV 
KYIPFTCRLE 
TKLIDASRVS 



51 
I 

YIFIFVIGMI 
PMGELTCKVT 
LAFCVSLPDT 
FYFLLARAIS 
HALFTALHVT 
ETEYSALEQS 



60 

120 

180 

240 

300 

360 

420 

480

540 

600 

660 

720 

761 



60 
120 
180 
240 

299 



60 

120 

180 

240 

300 

360 

362 



Seq ID NO: C331 Protein Sequence 
Protein Accession #: NP_000341.1 



1 
I 

MGFVRQIQLL 
MPSAGMLPWL 
HLGRIWTELH 
XjINSQVRPEQ 
EDTLYANVDF 
RPLMQNGGPE 
SYDRRTTSFC 
LEHVRKLVKA
FLYKGPRESQ 
LLEENMFWAG 
FRYIWGGFAY 
IAWIYSVSWT 
HGRILHYSDP 
AWQDRMTAEL 
QMMLLDAACY 



FLGHNGAGKT 
AEHMLFYAQL 
KWILDEPTS 
SGTPLFLKNC 
LDGDVNELMD 
SFGISDTPLE 
SPGAPAAHPE 
ATFVFLALML 
GNRCLKEGWL 
ECPEGAGGLP 
SIGGKLPWP 
NKGWHALVSF 
WAICVIFSM 
LWGIFIGFQ 
PIGINSSAIT 
FGEEHSANPP 
VAEERQRIIT 
TFKMLTGDTT 
GVPAEEIEKV 
MDPQARRMLW 
FGDGYIVTMK 
UjLSHKDSLL 



11 
I 

LWKNWTLRKR 
QGIFCNVNNP 
ILSQFMDTLR 
FAHGVPDLAL
FKLFRVLPTL 
TFTKLMGILS 
NALIQSLESN 
WEEVGPQIWY 
ADDMANFDWR 
WFPDMYPWT 
UJDMVEQGIT 
VKSIVLEKEL 
FILFLFLLAF 
KKAVSLLSPV 
GLLAWYLDQV 
EGIHDSFFER 
TTLSILTGLL 
KGKSQEEAQL 
GVDPYSRRSI 
FGTGLYLTLV 
WLHHVPEAK 
EIFUCVTEDS 
GQPPPEPECP 
SIVILPFGEY 
PEYPCGNSTP 
PPQRTQRSTE 
ITGEALVGFL 
LNVAHNAILR 
SFVPASFVLY 
KKAYTSPENL 
FILELFDNNR 
HWDLIGKNLF 
GGNKTDILRL
VTSGDATVAG 
ANKSIKSLGL 
NVIVSIIRKG 
IKSPKDDLLP 
IEEYSVTQTT 



21 

1 

QKIRFWELV 
CFQSPTPGES 
THPERIAGRG 
KDIACSEALL 
LDSRSQGINL 
DLLCGYPEGG 
PLTKIAWRAA 
FFDNSTQMNM 
DIFNITDRTL 
SSLPPHVKYK 
RSQVQAEAPV 
RLKETLKNQG 
STATIMLCFL 
AFGFGTEYLV 
FPGDYGTPLP 
EHPGWVPGVC 
PPTSGTVLVG 
EMEAMLEDTG 
WDLLLKYRSG 
RKMKNIQSQR 
LVECIGQELI 
DSGPLFAGGA 
GPQLNTGTQL 
PALTLHPWIY 
WKTPSVSPNI 
ILQDLTDRNI 
SDLGRIMNVS 
ASLPKDR8PE 
LIQERVNKSK 
PALVALLLLY 
TLLRFNAVLR 
AMWEGWYF 
HELTKIYLGT 
KSILTNISEV 
TVYADCLAGT 
RAWLTSHSM 
DLNPVEQFFQ 
LDQVFVNFAK 



31 
I 

WPLSLFLVLI 
PGIVSNYNNS 
IRIRDILKDE 
ERFIIFSQRR 
RSWGGILSDM 
GSRVLSFNWY 
KPLLMGKILY 
IRDTLGNPTV 
RLVNQYLECL 
IRMDIDWEK 
GIYLQQMPYP 
VSNAVIWCTW 
LSTFFSKASL 
RFEEQGLGLQ 
WYFLLQESYW
VKNLVKIFEP 
GRDIETSLDA 



41 



RTIIMPTHHM 
KGSEGTCSCS 
FLLPNKNFKH 
QQKRENVNPR 
VLQHVQALLV 
GQQYTFFSMD 
TQLFQKQKWT 
SDFLVKTYPA
GGPITREASK 
EYGITVISQP 
HLQFISGVSP 
GWAVIPMMYP 
KLLIVFPHFC 
LLTLLVQRHF 
SSPAVDRLCV 
HQNMGYCPQF 
YSGGNKRKLS 
EECEALCTRL 
GNFPGSVQRE 
QQTESHDLPL 



WLRNANPLYS 
ILARVYRDFQ 
ETLTLFLIKN 
GAKTVRYALC 
SPRIQEFIHR 
EDNNYKAFLG 
TPDSPAARRI 
KDFLNRQLGE
VLDKFESYND 
TNKIKDRYWD 
CFVDDSFMII 
FLDSFSIMSM 
AAACSGVIYF 
WSNIGNSPTE 
LSGEGCSTRE 
CGRPAVDRLN 
VRQSLGMCPQ 
DLSGGMQRKL 
DEADHQGDRI 
SKGPSTTCPA 
RAYASLFREL 
HPCLGPREKA 
KRFQHTIRSH 
EPGSEQFTVL 
QVNPSPSCRC 
LIRSSLKSKF 
EIPDFLKHLE 
LNLTKEQLSE 
TTYWVTNFLW 
ASFLFDVPST 
LGRGLIDLAL 
FLSQWIAEPT 
GVRPGECFGL
DAIDELLTGR 
TAIALIGCPP 
AIMVKGAFRC 
RHYNMLQFQV 
HPRAAGASRQ 



51 
1 

HHECHFPNKA 
ELLMNAPESQ 
IGLSDSWYL 
SLSQGTLQWI
PSMQDLLWVT 
IDSTRKDPIY 



EGITAEAILN 
ETQLTQRALS 
SGPRADPVED 
LNRCFPIFMV 
SIFLLTIFIM 
TLYLPHILCF 
GDEFSFLLSM 
ERALEKTEPL 
ITFYENQITA 
HNILFHHLTV 
SVAIAFVGDA 
AIIAQGRLYC 
HVDDLTPEQV 
EETLADLGLS 
GQTPQDSNVC 
KDFLAQIVLP 
ADVLLNKPGF 
STREKLTMLP 
WVNEQRYGGI 
TEDNIKVWFN 
ITVLTTSVDA 
DIMNYSVSAG 
AYVALSCANIj 
SQAVTDVYAR 
KEPIVDEDDD 
LGVNGAGKTT 
EHLYLYARLR
LVLLDEPTTG 
MGTIQHLKSK 
SSSSLARIFQ 
AQD 



60 

120 

180 

240 

300 

360 

420 

480 

540

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2273 
Seq ID NO: C332 Protein Sequence
Protein Accession #: NP_006662.2

1 11 21 31 41 51 

| | | | | |

MVPHAILARG RDVCRRNGLL ILSVLSVIVG CLLGFFLRTR RLSPQEISYF QFPGELLMRM 60 

LKMMILPLW SSLMSGLASL DAKTSSRLGV LTVAYYLWTT FMAVIVGIFM VSIIHPGSAA 120 

QKETTEQSGK PIMSSADALL DLIRNMFPAN LVEATFKQYR TKTTPWKSP KVAPEEAPPR 180 

RILIYGVQEE NGSHVQNFAL DLTPPPEWY KSEPGTSDGM NVLGIVFFSA TKGIMLGRMG 240 

DSGAPLVSFC QCLNESVMKI VAVAVWYFPF GIVFLIAGKI LEMDDPRAVG KKLGFYSVTV 300 

VCGLVLHGLF ILPLLYFFIT KKNPIVFIRG ILQALLIALA TSSSSATLPI TFKCLLENNH 360

IDRRIARFVL PVGATINMDG TALYEAVAAI FIAQVNNYEL DFGQIITISI TATAASIGAA 420 

GIPQAGLVTM VIVLTSVGLP TDDITLIIAV DWALDRFRTM INVLGDALAA GIMAHICRKD 480 

FARDTGTEKL LPCETKPVSL QEIVAAQQNG CVKSVAEASE LTLGPTCPHH VPVQVERDEE 540 

LPAASLNHCT IQISELETNV 560 



Seq ID NO: C333 Protein Sequence 
Protein Accession #: NP_005680.1

1 11 21 31 41 51 

| | | | | |

MVTVGNYCEA EGPVGPAWMQ DGLSPCFFFT LVPSTRMALG TLALVLALPC RRRERPAGAD 60 

SLSWGAGPRI SPYVLQLLLA TLQAALPLAG LAGRVGTARG APLPSYLLLA SVLESLAGAC 120 

GLWLLWERS QARQRLAMGI HIKFRHSPGL LLLWTVAFAA ENLALVSWNS PQWWWARADL 180 

GQQVQFSLWV LRYWSGGLF VLGLWAPGLR PQSYTLQVHE EDQDVERSQV RSAAQQSTWR 240 

DFGRKLRLLS GYLWPRGSPA LQLWLICLG LMGLERALNV LVPIFYRNIV NLLTEKAPWN 300 

SLAWTVTSYV FLKFLQGGGT GSTGFVSNLR TFLWIRVQQF TSRRVELLIF SHLHELSLRW 360 

HLGRRTGEVL RIADRGTSSV TGLLSYLVFN VIPTLADIII GIIYFSMFFN AWFGLIVFLC 420 

MSLYLTLTIV VTEWRTKFRR AMNTQENATR ARAVDSLLNF ETVKYYNAES YEVERYREAI 480 

IKYQGLEWKS SASLVLLNQT QNLVIGLGLL AGSLLCAYFV TEQKLQVGDY VLFGTYIIQL 540 

YMPLNWFGTY YRMIQTNFID MENMFDLLKE ETEVKDLPGA GPLRFQKGRI EFENVHFSYA 600 

DGRETLQDVS FTVMPGQTLA LVGPSGAGKS TILRLLFRFY DISSGCIRID GQDISQVTQA 660 

SLRSHIGWP QDTVLFNDTI ADNIRYGRVT AGNDEVEAAA QAAGIHDAIM AFPEGYRTQV 720 

GERGLKLSGG EKQRVAIART ILKAPGIILL DEATSALDTS NERAIQASLA KVCANRTTIV 780 

VAHRLSTWN ADQILVIKDG CIVERGRHEA LLSRGGVYAD MWQLQQGQEE TSEDTKPQTM 840 

ER 842 

Seq ID NO: C334 Protein Sequence 
Protein Accession #: NP_000667.1 

1 11 21 31 41 51 

I I I I I I 

MLLETQDALY VALELVIAAL SVAGNVLVCA AVGTANTLQT PTNYFLVSLA AADVAVGLFA 
IPFAITISLG FCTDFYGCLF LACFVLVLTQ SSIFSLLAVA VDRYLAICVP LRYKSLVTGT 
RARGVIAVLW VLAFGIGLTP PLGWNSKDSA TNNCTEPWDG TTNESCCLVK CLFENWPMS 
YMVYFNFFGC VLPPLLIMLV IYIKIFLVAC RQLQRTELMD HSRTTLQREI HAAKSLAMIV 
GIFALCWLPV HAVNCVTLFQ PAQGKNKPKW AMNMAILLSH ANSWNPIVY AYRNRDFRYT 
FHKIISRYLL CQADVKSGNG QAGVQPALGV GL



Seq ID NO: C335 Protein Sequence 
Protein Accession #: NP_443164 

1 11 21 31 41 51 

| | | | | |

MGLGARGAWA ALLLGTLQVL ALLGAAHESA AMAETLQHVP SDHTNETSNS TVKPPTSVAS 60 

DSSNTTVTTM KPTAASNTTT PGMVSTNMTS TTLKSTPKTT SVSQNTSQIS TSTMTVTHNS 120 

SVTSAASSVT ITTTMHSEAK KGSKFDTGSF VGGIVLTLGV LSILYIGCKM YYSRRGIRYR 180 

TIDEHDAII 189 



Seq ID NO: C336 Protein Sequence 
Protein Accession #: NP_004186.1 

1 11 21 31 41 51 

| | | | | |

MAQHGAMGAF RALCGLALLC ALSLGQRPTG GPGCGPGRLL LGTGTDARCC RVHTTRCCRD 60 

YPGEECCSEW DCMCVQPEFH CGDPCCTTCR HHPCPPGQGV QSQGKFSFGF QCIDCASGTF 120 

SGGHEGHCKP WTDCTQFGFL TVFPGNKTHN AVCVPGSPPA EPLGWLTWL LAVAACVLLL 180 

TSAQLGLHIW QLRSQCMWPR ETQLLLEVPP STEDARSCQF PEEERGERSA EEKGRLGDLW 240 

V 241 



Seq ID NO: C337 Protein Sequence 
Protein Accession #: BAC03767.1



1 11 21 31 41 51 

| | | | | |

MGCDGRVSGL LRRNLQPTLT YVJSVFFSFGL CIAFLGPTLL DLRCQTHSSL PQISWVFFSQ 60 

QLCLLLGSAL GGVFKRTLAQ SLWALFTSSL AISLVFAVIP FCRDVKVLAS VMALAGLAMG 120 

CIDTVANMQL VRMYQKDSAV FLQVLHFFVG FGALLSPLIA DPFLSEANCL PANSTANTTS 180 

RGHLFHVSRV LGQHHVDAKP WSNQTFPGLT PKDGAGTRVS YAFWIMALID LPVPMAVLML 240 

LSKERLLTCC PQRRPLLLSA DELALETQPP EKEDASSLPP KFQSHLGHED LFSCCQRKNL 300 

RGAPYSFFAI HITGALVLPM TDGLTGAYSA FVYSYAVEKP LSVGHKVAGY LPSLFWGPIT 360 

LGRLLSIPIS SRMKPATMVF INWGWVTF LVLLIFSYNV VFLFVGTASL GLFLSSTFPS 420 

MLAYTEDSLQ YKGCATTVLV TGAGVGEMVL QMLVGSIFQA QGSYSFLVCG VTFGCLAFTF 480
YILLLFFHRM HPGLPSVPTQ DRSIGMENSE CYQR

Seq ID NO: C33B Protein Sequence 
Protein Accession fl: NP_002 194-1 



514 



1 11 21 31 41 51 

I I I I I I 

MGPERTGAAP LPLLLVIiALS QGILNCCLAY NVGLPEAKIF SGPSSEQFGY AVQQFINPKG 60 

NWLLVGSPWS GFPENRMGDV YKCPVDLSTA TCEKLNLQTS TSIPNVTEMK TNMSLGLILT 120 

RNMGTGGFLT CGPLWAQQCG NQYYTTGVCS DISPDFQLSA SFSPATQPCP SLIDWWCD 180

ESNSIYPWDA VKNFLEKFVQ GLDIGPTKTQ VGLIQYANNP RWFNLNTYK TKEEMIVATS 240 

QTSQYGGDLT NTFGAIQYAR KYAYSAASGG RRSATKVMW VTDGESHDGS MLKAVIDQCN 300 

HDNILRFGIA VLGYLNRNAL DTKNLIKEIK AIASIPTERY FFNVSDEAAL LEKAGTLGEQ 360 

IFSIEGTVQG GDNFQMEMSQ VGFSADYSSQ NDILMLGAVG AFGWSGTIVQ KTSHGHLIFP 420 

KQAFDQILQD RNHSSYLGYS VAAISTGEST HFVAGAPRAN YTGQIVLYSV NENGNITVIQ 480 

AHRGDQIGSY FGSVLCSVDV DKDTITDVLL VGAPMYMSDL KKEEGRVYLF TIKKGILGQH 540 

QFLEGPEGIE NTRFGSAIAA LSDINMDGFN DVIVGSPLEN QNSGAVYIYN GHQGTIRTKY 600 

SQKILGSDGA FRSHLQYFGR SLDGYGDLNG DSITDVSIGA FGOWQLWSQ SIADVAIEAS 660 

FTPEKITLVN KNAQIILKLC FSAKFRPTKQ NNQVAIVYNI TLDADGFSSR VTSRGLFKEN 720 

NERCLQKNMV VNQAQSCPEH IIYIQEPSDV VNSLDLRVDI SLENPGTSPA LEAYSETAKV 780 

FSIPFHKDCG EDGLCISDLV LDVRQIPAAQ EQPFIVSNQN KRLTFSVTLK NKRESAYNTG 840 

IWDFSENLF FASFSLPVDG TEVTCQVAAS QKSVACDVGY PALKREQQVT FTINFDFNLQ 900 

NLQNQASLSF QALSESQEEN KADNLVNLKI PLLYDAEIKL TRSTNXNFYE ISSOGNVPSI 960 

VHSFEDVGPK FIFSLKVTTG SVPVSMATVI IHIPQYTKEK NPLMYLTGVQ TDKAGDISCN 1020 

ADINPLKIGQ TSSSVSFKSE NFRHTKELNC RTASCSNVTC WLKDVHMKGE YFVNVTTRIW 1080 

NGTFASSTFQ TVQLTAAAEI NTYNPEIYVI EDNTVTIPLM IMKPDEKAEV PTGVIIGSII 1140 

AGILLLLAItV AILWKLGFFK RKYEKMTKNP DEIDETTELS S 1181 



Seq ID NO: C339 Protein Sequence 
Protein Accession #: NP_113648.1

1 11 21 31 41 51 

I I I I I I

MYRPRARAAP EGRVRGCAVP STVLLLLAYL AYLALGTGVF WTLEGRAAQD SSRSFQRDKW 60 
EIiLQNFTCLD RPALDSLIRD WQAYKNGAS LLSNTTSMGR WELVGSFFFS VSTITTIGYG 120 
NLSPNTMAAR LFCIFFALVG IPLNLWLNR LGHLMQQGVN HWASRLGGTW QDPDKARWLA 180 
GSGALLSGLI* LFLLLPPLLF SHMEGWSYTE GFYFAFITLS TVGFGDYVIG MNPSQRYPLW 240 
YKNMVSLWIL FGMAWLALII KLILSQLETP GRVCSCCHHS SKEDFKSQSW RQGPDREPES 300 
HSPQQGCYPE GPMGIIQHIjE PSAHAAGCGK DS 332 

Seq ID NO: C340 Protein Sequence 
Protein Accession #: NP_004145.1 

1 11 21 31 41 51

I I I I I I

MEWDNGTGQA LGLPPTTCVY RENFKQLLLP PVYSAVLAAG LPLNICVITQ ICTSRRALTR 60 

TAVYTLNIiAIi ADLLYACSLP LLIYNYAQGD HWPFGDFACR LVRFLFYANL HSSXXiFLTCI 120 

SFQRYLGICH PLAPWHKRGG RRAAWLVCVA VWLAVTTQCL PTAIFAATGI QRNRTVCYDL 180 

SPPALiATHYM PYGMALTVIG FLLPFAALLA CYCLLACRLC RQDGPAEPVA QERRGKAARM 240 

AVWAAAFAI SFLPFHITKT AYLAVRSTPG VPCTVLEAFA AAYKGTRPFA SANSVLDPIL 300 

FYFTQKKFRR RPHELLQKLT AKWQRQGR 328 

Seq ID NO: C341 Protein Sequence
Protein Accession #: NP_009128.1 

1 11 21 31 41 51

I I I I I I 

MQRPGPRLWL VLQVMGSCAA ISSMDMERPG DGKCQPIEIP MCKDIGYNMT RMPNLMGHEN 60 

QREAAIQLHE FAPLVEYGCH GHLRFFLCSL YAPMCTEQVS TPIPACRVMC EQARLKCSPI 120 

MEQFNFKWPD SLDCRKLPNK NDPNYLCMEA PNNGSDEPTR GSGLFPPLFR PQRPHSAQEH 180 

PliKDGGPGRG GCDNPGKFHH VEKSASCAPL CTPGVDVYWS REDKRFAWW LAIWAVLCFF 240 

SSAFTVLTFL IDPARFRYPE RPIIFLSMCY CVYSVGYIilR LFAGAESIAC DRDSGQLYVI 300 

QEGLESTGCT LVFLVLYYFG MASSLWWWL TLTWFLAAGX KKGHEAIEAN SSYFHLAAWA 360 

IPAVKTILIL VMRRVAGDEL TGVCYVGSMD VNALTGFVLI PLACYLVIGT SFILSGFVAL 420 

FHIRRVMKTG GENTDKLEKL MVRIGLFSVL YTVPATCVIA CYFYERLNMD YMKILAAQHK 480 

CKMNNQTKTL DCLMAASIPA VEIFMVKIFM LLWGITSGM WIWTSKTLQS WQQVCSRRLK 540 

KKSRRKPASV ITSGGIYKKA OHPQKTHHGK YEIPAQSPTC V 581 



Seq ID NO: C342 Protein Sequence 
Protein Accession #: NP_005752.1 

1 11 21 31 41 51 

I I I I I I

MEVSRRKAPP RPPRPAAPLP LLAYLLALAA PGRGADEPVW RSEQAIGAIA ASQEDGVFVA 60 

SGSCLDQLDY SLEHSLSRLY RDQAGNCTEP VSLAPPARPR PGSSFSKLLL PYREGAAGLG 120 

GLLLTGWTFD RGACEVRPLG NLSRNSLRNG TEWSCHPQG STAGWYRAG RNNRWYIAVA 180 

ATYVLPEPET ASRCNPAASD HDTAIALKDT EGRSLATQEL GRLKLCEGAG SLHFVDAFLW 240 

NGSIYFPYYP YNYTSGAATG WPSMARIAQS TEVfcFQGQAS LDCGHGHPDG RRLLLSSSLV 300 

EALDVWAGVF SAAAGEGQER RSPTTTALCL FRMSEIQARA KRVSWDFKTA ESHCKEGDQP 360 

ERVQPIASST LIHSDLTSVY GTWMNRTVL FLGTGDGQLL KVILGENLTS NCPEVIYEIK 420 

EETPVFYKLV PDPVKNIYIY LTAGKEVRRI RVANCNKHXS CSECLTATDP HCGWCHSLQR 480 

CTFOGDCVHS ENLENWLDIS SGAKKCPKIQ IIRSSKEKTT VTMVGSFSPR HSKCMVKNVD 540 

SSRELCQNKS QPNRTCTCSI PTRATYKDVS WNVMFSFGS WNLSDRFNFT NCSSLKECPA 600 

CVETGCAWCK SARRCIHPFT ACDPSDYERN QEQCPVAVEK TSGGGRPKEN KGNKTNQALQ 660 

VFYIKSIEPQ KVSTLGKSNV IVTGANFTRA SNITMILKGT STCDKDVIQV SHVLNDTHMK 720 
FSLPSSRKEM KDVCIQFDGG NCSSVGSLSY IALPHCSLIF PATTWISGGQ NITMMGRNFD 780

VIDNLIISHE LKGNINVSEY CVATYCGFLA PSLKSSKVRT NVTVKLRVQD TYLDCGTLQY 840

REDPRFTGYR VESEVDTELE VKIQKENDNF NISKKDIEIT LFHGENGQLN CSFENITRNQ 900 

DLTTILCKIK GIKTASTIAN SSKKVRVKLG NLELYVEQES VPSTWYFLIV LPVLLVIVIF 960 

AAVGVTRHKS KELSRKQSQQ LELLESELRK EIEDGFAELQ MDKLDWDSF GTVPFLDYKH 1020 

FALRTFFPES GGFTHIFTED MHNRDANDKN ESLTALDALI CNKSFLVTVI HTLEKQKNFS 1080 

VKDRCLFASF LTIALQTKLV YLTSILEVLT RDLMEQCSNM QPKLMLRRTE SWEKLLTKW 1140 

MSVCLSGPLR ETVGEPFYLL VTTLNQKINK GPVDVITCKA LYTLNEDWLL WQVPEFSTVA 1200 

LNWFEKIPE NESADVCRNI SVNVLDCDTI GQAKEKIFQA FLSKNGSPYG LQLNEIGLEL 1260 

QMGTRQKELL DIDSSSVILE DGITKLNTIG HYEISNGSTI KVFKKIANFT SDVEYSDDHC 1320 

HLILPDSEAF QDVQGKRHRG KHKFKVKEMY LTKLLSTKVA IHSVLBKLFR SIWSLPNSRA 1380 

PFAIKYFFDF LDAQAENKKI TDPDWHIWK TNSLPLRFWV NILKNPQFVF DIKKTPHIDG 1440 

CLSVIAQAFM DAFSLTEQQL GKEAPTNKLL YAKDIPTYKE EVKSYYKAIR DLPPLSSSEM 1500 

EEFLTQESKK HENEFNEEVA LTEIYKYIVK YFDEILNKLE RERGLEEAQK QLLHVKVLFD 1560 

EKKKCKHM 1568 

Seq ID NO: C343 Protein Sequence 
Protein Accession #: NP_002176.1 

1 11 21 31 41 51 

I I I I I I

MTILGTTFGM VFSLLQWSG ESGYAQNGDL EDAELDDYSF SCYSQLEVNG SQHSLTCAEJE 60 
DPDVNTTNLE FEIOGALVEV KCLNFRKLQE IYFIETKKFL LIGKSNICVK VGEKSLTCKK 120 
IDLTTIVKPE APFDLSVIYR EGANDFWTF NTSHLQKKYV KVLMHDVAYR QEKDENKWTH 180 
VNLSSTKLTL LQRKLQPAAM YEIKVRSIPD HYFKGFWSEW SPSYYFRTPE INNSSGEMDP 240 
ILLTISILSF FSVALLVILA CVLWKKRIKP IVWPSLPDHK KTLEHLCKKP RKNLNVSFNP 300
ESFLDCQIHR VDDIQARDEV EGFLQDTFPQ QLEESEKQRL GGDVQSPNCP SEDVWTPES 360 
FGRDSSIjTCL AGNVSACDAP ILSSSRSLDC RESGKNGPHV YQDLLLSLGT TNSTLPPPFS 420 
LQSGILTLNP VAQGQPILTS LGSNQEEAYV TMSSFYQNQ 459 



Seq ID NO: C344 Protein Sequence 
Protein Accession #: NP_002713.1



1 11 21 31 41 51 

I I I I I I

MAAARLCLSIi LLLSTCVALL LQPLLGAQGA PLEPVYPGDN ATPEQMAQYA ADLRRYINML 60 
TRPRYGKRHK EDTLAFSEWG SPHAAVPREXi SPLDL 95



Seq ID NO: C345 Protein Sequence
Protein Accession #: NP_115934.1 

1 11 21 31 41 51 

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE 60 

VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120 

SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180 

LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL 223 



Seq ID NO: C346 Protein Sequence 
Protein Accession #: NP_006524.1 

1 11 21 31 41 51 

MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR 60 
FLTIHRGQW YVFSKLKGRG RLFWGGSVQG DYYGDLAARIi GYFPSSIVRE DQTLKPGKVD 120 
VKTDKWDFYC Q 131 



Seq ID NO: C347 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I

KTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 

TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 120 

TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 

FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 235 



Seq ID NO: C348 Protein Sequence 
Protein Accession #: NP_543146.1 

1 11 21 31 41 51

I I I I I I

MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 
TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 120 
TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 
FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 235 

Seq ID NO: C349 Protein Sequence 
Protein Accession #: FGENESH predicted 
MWPRLAFCCW
VERLGQAFRR 
RVAIVTFSSK 
LHARENSTKV 
EHCYLLHSFB 
GHFECICEKG 
VCRBGYRASG 
LPNGLWSGSE 
QGNSQWDGPE 
MLRCTTSGKW 
VSVHVHPAFT 
VSEKVHAASW 
VIKGSPCEIP 
TTEWPDCAKK 
DCRLEENLTK 
SRIKRSAPLS 
PMYSFQLASE 
GSYQDEEGQL 



LACPFYGTTP 
QQLGRGYVCL 
NINECSSSPC 
FLCKCPPGFL 
SNPCRNQATC 
HALTCTFWMK 
RWHHIAITWT 
SFVGSISQLN 
SDCPRLGGSV 
SCGVPPPLEN 
CAVGSDCSEH 
AGVTFSCQEG 
VTYRCNKGYT 
YSCDTGYSLQ 
CKEGYTLAGL 
DGYSLADNSQ 
SFKCMEGFVL 
AYSCNKGFYI 
YQCNPGYKSV 
FCNEGYELVG 
CKEGHVLQGP 
VGGFFLRGNS 
GFELVGNTTT 
RLEGPSALTC 
AGHAMQTCEE 
HPPYHLGAVA 
APSCISIECD 
RCEAISCKKP 
IPVDCSSPPV 
VRCATPPQLA 
CGPPEDLAHG 
TPVIEYGTVN 
MIPNAFISET 
ANAVATGEAH 
HILVHGDDFS 
HGFWGSKYT 
KADIENRTTG 
LSEKEFYVDQ 
GVHYQYGDMI 
SCPEGWMGRL 
PNRCHCIjSSW 



GLALVSGWAT 
RVRLLRELSE 
NYWPRVDYI 
VFLITDGYSN 
EFEALARRAL 
YYGKGLQYEC 
QTCELVHCPA 
SYCRVRTCPH 
PRCVERHCST 
NVGVQAAVCK 
PPYLFPIGDV 
DEPQFSDNSG 
FTPVNGDFIC 
RFANHGFKSF 
KYCLEYNYDY 
DYKIKLIFNI 
ILIADSNSLE
ECKLCPSGMY 
ENTSTVKRGA 
FAGSRSITEC 
CPLGYTGLKC 
LNKGICVDGV 
GTRCGKNVDE 
VDELNSYSCK 
SSDDMNYGTP 
SANGIWKVYI 
LWDYVLSPQQ 
PHLRTASEDL 
GFHSADDFYA 
ASCLNVDGSY 
YQLMGVTKIT 
LAGDKESSCL 
GPSIIECTAS 
DTIECXiADGK 
LLCNAQGKWV 
NTSAKIECMR 
KGEKKSTCEA 
GSPVFVCQAN 
DSSWTCQKSG 
SVLKCLPSQQ 
TTLCQPDGTW 
LCGENGHWLG 
LETGDHDVDA 
SGWSSSIPTC 
KTWENTKESP 
LPTAPENGPL 
NPVKNGSIKG 
SANGQVRGDE 
NGVTEGLDYG 
FPNGFSFIHG 
GTDFDCGKAA 
SSWKENVITY 
TYESEVKLRC 
VNRQVSVSCA 
FESTIIYQCE 
PNWYSCNRG 
NVSIKCREGF 
TYSCYSGYML 
CEEPICILPC 
TGHNCSR 



FQQMSPSRNF 
RLELVFLVDD 
STRRARQHKC 
GGDPRPIAAS 
HEDLPSGSFI 
TACPSGTYKP 
LKPPENGYFI 
LRQPKHGHIS 
FQMPKDVIIS 
DVEAPQINCP 
AIVYTATDLS 
AELVITRSHT 
TPDNTGVNCT 
EMFYKAARCD 
ENGFAIGPGG 
TASVPLPDER 
TKKASPFCRP 
TEYIHSRNIS 
VNISACGVPC 
STSVLNITIF 
ETDIDECSPL 
AGYRCTCVKG 
CLSQPCKNGA 
CQPGFSGKRC 
ISYAVDNGSD 
DGKLSDGGAG 
VKSLATSCPE 
KPGSKVNLFC 
GSTVTYQCNN 
ICSCVPPYTG 
CLESGEWNHL 
ANSSWSHSPP 
GIWDRAPPAC 
WSRSDQQCLA 
PPEGQDMPRC 
GGQWNPSPMS 
TGQWSSPIPT 
RHWHSESPLM 
KWNKKSNPKC 
WNDSFPVCKI 
SSPLPECVPV 
GKPTCKAIEC 
PSCNAIHCDS 
MPIDCGLPPH 
ATHSSNFLYG 
RFTETSMGSA 
SNYTYLSTLY 
YTFQKEIEYT 
FMKEVTFHCH 
GHIQYQCFPG 
RIQCFKGFKL 
SCRSGYVIQG 
LEGYTMDTDT 
EGYTFEGVNI 
PGYELEGNRE 
YSLEGPSEAH 
LLQGHGIITC 
EGFLRSVCLE 
LNGGRCVAPY 



SFRLFPETAP 
SSSVGEWFR 
ALLLQEIPAI 
LRDSGVEIFT 
QDDMVHCSYL 
EGSPGGISSC 
QNTCNNHFNA 
CSTREMLYKT 
PHNCGKQPAK 
KDIEAKTLEQ 
GNQASCIFHI 
QGDIjFPQGET 
LTCLEGYDFT 
DTDLMKKFSE 
WGAANRLDYS 
NDTLEWENQQ 
GSVLRGRMCV 
DCKAQCKOGT 
PEGKFSRSGL 
GGFGHLELLN 
PCLNNGVCKD 
FVGLHCETEV 
TCKDGANSFR 
ETEQSTGFNIj 
NTLLLTDYNG 
LSVGLPIPGG 
ELSKGNVLAW 
DPGFQLVGNP 
GYYLLGDSRM 
DGKNCAEPIK 
IPYCKAVSCG 
VCEPVKCSSP 
HLVFCGEPPA 
VSCDEPPIVD 
IAHFCEKPPS 
IQCIPVRCGE 
CHPVSCGEPP 
CVPLDCGKPP 
MPAKCPEPPL 
VLCTPPPLIS 
ECPQPEEIPN 
LKPKEILNGK 
PQPIENGFVE 
IDFGDCTKLK 
TMVSYTCNPG 
VQYSCKPGHI 
YECDPGYVLN 
CNEGFLLEGA 
EGYILHGAPK 
YKLHGNSSRR 
LGbSEITCEA 
SSDLICTEKG 
DTFTCQKDGR 
SVCQLDGTWE 
RVCQENRQWS 
CTENGTWSHP 
NPDETWTOTS 
NGTWTSPPIC 
QCDCPPGWTG 



GAPGSIPAPP 
SELMFVRKU. 
SYRGGGTYTK 
FGIWQGNIRE 
CDEGKDCCDR 
IPCPDENHTS 
ACGVRCHPGF 
TCLVACDEGY 
FGTICYVSCR 
QDSANVTWQI 
KVIDAEPPVI 
IVQYTATDPS 
EGSTDKYYCA 
AFETTLGKMV 
YDDFLDTVQE 
RLLQTLETIT 
NCPLGTYYNL 
YSYSGLETCE 
MPCHPCPRDY 
CPSEVFHECF 
LVGEFICECP 
NECQSNPCLN 
CLCAAGFTGS 
DFEVSGIYGY 
WVLYVNGREK 
GALVLGQEQD 
PDFLSGIVGK 
VQYCLNQGQW 
FCTDNGSWNG 
CKAPGNPENG 
KPAIPENGCI 
ENINNGKYIL 
IKDAVITGNN 
HASPETAHRL 
VSYSILESVS 
PPSIMNGYAS 
KVENGFLEHT 
PIQNGFMKGE 
LENQLVLKEL 
FGVPIPSSAL 
GIIDVQGLAY 
FSYTDLHYGQ 
GADYSYGAII 
DDQGYFEQED 
YELLGNPVLI 
LAGSDLRLCL 
GTERRTCQDD 
RSRVCLANGS 
LTCQSDGNWD 
CLSNGSWSGS 
DGQWSSGFPH 
VWSQPYPVCE 
WFPERISCSP 



GGVAICKETR 
VPLCKPNPCP 
AKCEKISCGP 
RAVCRFPCQN 
SRCHTAVCQS 



APGDEAAGSR 
SDFPWPTAT 
GAFQQAAQIL 
LNDMASTPKE 
MGSCKCGTHT 
PPGSTSPEDC 
DLVGSSIILC 
RLEGSDKLTC 
QGFILSGVKE 
PTAKDNSGEK 
DWCRSPPPVQ 
GNNRTCDIHI 
YEDGVWKPTY 
PSFCSDAEDI 
TATSIGNAKS 
NKLKRTLNKD 
EHFTCESCRI 
SCPLGTYQPK 
YQPNAGKAFC 
FNPCHNSGTC 
SGYTGQRCEE 
NAVCEDQVGG 
HCELNINECQ 
VMLDGMLPSL 
ITNCPSVNDG 
KKGEGFSPAE 
VKIDSKSIFC 
TQPLPHCERI 
VSPSCLDVDE 
HSSGEIYTVG 
EELAFTFGSK 
SGLTYLSTAS 
FTFRNTVTYT 
FGDIAFYYCS 
KAKFAAGSW 
GSNYSFGAMV 
TGRIFESEVR 
NFEVGSKVQF 
TTEVGWTFS 
HFGSTVKYSC 
LSTALYTCKP 
TVTYSCNRGF 
IYSCFPGFQV 
DMMEVPYVTP 
CQEDGTWNGS 
ENRKWSGASP 
KtJWDEDEPIC 
WSGATPDCVP 
AEIPLCKPVN 
SPSCLPCRCS 
CEHTSCGSLP 
PLSCGSPPSV 
KKCPLPENIT 
VSCGKPESPE 
CETPLlEFLNG 
VPFVIPENAL
PAHVENAIAR 
GGICQRPNAC 
PCLNGGKCVR 



Seq ID NO: C350 Protein Sequence 
Protein Accession #: FGENESH predicted 
360

420 
600

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3557 



1 11 21 31 41 51

I I I I I I

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 
SKCERGRIC 

Seq ID NO: C351 Protein Sequence 
Protein Accession #: AAH35671.1



MVPGARGGGA 
TYPNHTVCEK 
LLLNTSEVTV 
VAGDISGNMV 
GSLSDKRFLF 
SGDSSNKHKP 
IVNNEEKVFQ 
LVWRKTSQST 
RKKKKKGSPY 



LARAAGRGLL 
TITVPKGKRL 



DGYRDTSLLC 
TSNGCSRSLS 
REWLEIDLGE 
GNSNFRDPVQ 
SVSTKKEDET 
GSAEAQKTDC 



21 
I 

ALLLAVSAPL 
ILRLGDLDIE 
RGFLLTYASS 
KAAIHAGIIA 
FEPDGQIRAS 
KKKITGIRTT 
NNFIPPIVAR 
ITRPIPSEET 
WKQIKYPFAR 



31 
I 

RLQAEELGDG 
SQTCASDYLL 
DHPDLITCLE 
DELGGQISVL 
SSWQSVNESG 
GSTQSNFNFY 
YVRWPQTWH 
STGINITTVA 
HQSAEFTISY 



41 
I 

CGHLVTYQDS 
FTSSSDQYGP 
RASHYLKTEY 
QRKGISRYEG 
DQVHWSPGQA 
VKSFVMNFKN 
QRIALKVELI 
IPLVLLWLV 



51 
I 

GTMTSKNYPG 
YCGSMTVPKE 
SKFCPAGCRD 
ILANGVLSRD 
RLQDQGPSWA 
NNSKWKTYKG 
GCQITQGNDS 
FAGMGIFAAF 
DLITSDMAG 



60 

120 

129 



60 

120 

180 

240 

300 

360 

420 

480 

539 
Seq ID NO: C352 Protein Sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I

MGFGAGQRLR PVPAPRSSAE EAARPGQLRL GIRRGEAELA KLAPSGVMVP GARGGGALAR 60 

AAGRGLLALL LAVSAPLRLQ AEELGDGCGH LVTYQDSGTM TSKNYPGTYP NHTVCEKTIT 120 

VPKGKRLILR LGDLDIESQT CASDYLLFTS SSDQYGPYCG SMTVPKELLL NTSEVTVRFE 180 

SGSHISGRGF LLTYASSDHP DLITCLERAS HYLKTEYSKP CPAGCRDVAG DISGNMVDGY 240 

RDTSLLCKAA IHAGIIADEL GGQISVLQRK GISRYEGILA NGVLSRDGSL SDKRFLFTSN 300 

GCSRSLSFEP DGQIRASSSW QSVNESGDQV HWSPGQARLQ DQGPSWASGD SSNNHKPREW 360 

LEIDLGEKKK ITGIRTTGST QSNFNFYVKS FVMNFKNNNS KWKTYKGIVN NEEKVFQGNS 420 

NFRDPVQNNP IPPIVARYVR WPQTWHQRI ALKVELIGCQ ITQGNDSLVW RKTSQSTSVS 480 

TKKEDETITR PIPSEETSTG INITTVAIPL VLLWLVFAG MGIFAAFRKX KKKGSPYGSA 540

EAQKTDCWKQ IKYPFARHQS AEFTISYDNE KEMTQKLDLI TSDMAG 586 

Seq ID NO: C353 Protein Sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51 

I I I I I I

MFQRQERFLD LSSAEAVAAW ILHQHPDIIN KGDGCGHLVT YQDSGTMTSK NYPGTYPNHT 60 

VCEKTITVPK GKRLILRLGD LDIESQTCAS DYLLFTSSSD QYGMQKEEET EVLCLSVAGA 120

QRVDIPVQLL PSFLEGWKGH ADARGPYCGS MTVPKELLLN TSEVTVRFES GSHISGRGFL 180 

LTYASSDHPD LITCLERASH YLKTEYSKFC PAGCRDVAGD ISGNMVDGYR DTSLLCKAAI 240 

HAGIIADELG GQISVLQRKG ISRYEGILAN GVLSRDGSLS DKRFLFTSNG CSRSLSFEPD 300

GQIRASSSWQ SVNESGDQVH WSPGQARLQD QGPSWASGDS SNNHKPREWL EIDLGEKKKI 360 

TGIRTTGSTQ SNFNFYVKSF VMNFKNNNSK WKTYKGIVNN EEKVFQGNSN FRDPVQNNFI 420 

PPIVARYVRV VPQTWHQRIA LKVELIGCQI TQGNDSLVWR KTSQSTSVST KKEDETITRP 480 

IPSEETSTDA MPVQIVGDHT QMISQRENLG PDEGKIPFKG TAESMVRWF AWVNDLGML 540

FLAHTPEEDI DHYCWKQIKY PFARHQSAEF TISYDNEKEM TQKLDLITSD MADYQQPLMI 600 

GTGTVTRKGS TFRPMDTDAE EAGVSTDAGG HYDCPQRAGR HEYALPLAPP EPEYATPIVE 660 

RHVLRAHTFS AQSGYRVPGP QPGHKHSLSS GGFSPVAGVG AQDGDYQRPH SAQPADRGYD 720 

RPKAVSALAT ESGHPDSQKP PTHPGTSDSY SAPRDCLTPL NQTAMTALL 769 

Seq ID NO: C354 Protein Sequence 
Protein Accession #: NP_004607.1

1 11 21 31 41 51 

I I I I I I

MAGVSACIKY SMFTFNFLFW LCGILILALA IWVRVSNDSQ AIFGSEDVGS SSYVAVDILI 60

AVGAIIMILG FLGCCGAIKE SRCMTiT.TtFFI GLLLILLLQV ATGILGAVFK SKSDRIVNET 120 

LYENTKLLSA TGESEKQFQE AIIVFQEEFK CCGLVNGAAD WGNNFQHYPE LCACLDKQRP 180 

CQSYNGKQVY KETCISFIKD FLAKNLIIVI GISFGLAVIE ILGLVFSMVL YCQIGNK 237 

Seq ID NO: C355 Protein Sequence 
Protein Accession #: NP_004606.1

1 11 21 31 41 51 

I I I I I I

MCTGGCARCL GGTLIPIAFP GFLANILLFF PGGKVIDDND HLSQEIWFFG GILGSGVLMI 60 
FPAIiVFLGLK NNDCCGCCGN EGCGKRFAMF TSTIFAWGF LGAGYSFIIS AISINKGPKC 120 
LMANSTWGYP FHDGDYLNDE ALWNKCREPL NWPWNLTLF SILLWGGIQ MVLCAIQWN 180 
GLIX3TLCGDC QCCGCCGGDG PV 202 

Seq ID NO: C356 Protein Sequence
Protein Accession #: NP_002372.1

1 11 21 31 41 51 

I I I I I I 

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 60 

PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTFVSRII DTLDIGPADT 120 

RVAWNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGIAIQTAM DEAFTVEAGA 180 

REPSSNIPKV AIIVTDGRPQ DQVNEVAARA QASGIELYAV GVDRADMASL KMMASEPLEE 240

HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQCQHVCI SDGEGKHHCE CSQGYTLNAD 300

KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 360 

HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG OQHICVSDGA ASYHCDCYPG 420 

YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVSS YI^RLNTKLD DILEKLKINE 480 

YGQIHR 486 

Seq ID NO: C357 Protein Sequence
Protein Accession #: NP_057723.1

1 11 21 31 41 51 

I I I I I I

MARGSLRRLL RLLVLGLVILA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
GCPAVALIQ 129 

Seq ID NO: C358 Protein Sequence 
Protein Accession #: NP_001810.1

1 11 21 31 41 51

I I I I I I 

KQPTLLLSLL GAVGLAAVNS MPVDNRNHNE GMVTRCIIEV LSNALSKSSA PPITPECRQV 60 

LKTSRKDVKD KETTENENTK FEVRLLRDPA DASEAHESSS RGEAGAPGEE DIQGPTKADT 120 

EKWAEGGGHS RERADEPQWS LYPSDSQVSE EVKTRHSEKS QREDEEEEEG ENYQKGERGB 180 

DSSEEKHLEE PGETQNAFLN ERKQASAIKK EELVARSETH AAGHSQEKTH SREKSSQESG 240 

EEAGSQENHP QESKGQPRSQ EESEEGEEDA TSEVDKRRTR PRHHHGRSRP DRSSQGGSLP 300 

SEEKGHPQEE SEESNVSMAS LGEKRDHHST HYRASEEEPE YGEEIKGYPG VQAPEDLEWE 360 

RYRGRGSEEY RAPRPQSEES WDEEDKRNYP SLELDKMAHG YGEESEEERG LEPGKGRHHR 420 

GRGGEPRAYF MSDTREEKRF LGEGHHRVQE NQMDKARRHP QGAWKELDRN YLNYGEEGAP 480 

GKWQQQGDLQ DTKENREEAR FQDKOYSSHH TAEKRKRLGE LFNPYYDPLQ WKSSHFERRD 540 

NMNDNPLEGE EENELTLNEK NFFPEYNYDW WEKKPFSEDV NWGYEKRNIA RVPKLDLKRQ 600 

YDRVAQLDQL LHYRKKSAEF PDFYDSEEPV STHQEAENEK DRADQTVLTE DEKKELEMLA 660 

AMDLELQKIA EKPSQRG 677 

Seq ID NO: C359 Protein Sequence
Protein Accession #: XP_093082.1

1 11 21 31 41 51 

I I I I I I

MKLLCEGLKQ PNCVLQTIiRW YRCLISSASC GALAAVLSTS QWLTELEFSE TKIiEASALKL 60 

LYGGLKDPNC KLQKLNLQFS LSVTAAKLPV GMVGNCSGFS GSLVQSHFGY CQDSSFKCDL 120 

CKLLWPSTRV AAAKDCGSPK SFLSEGLNWA GRLEAVEEVL GIjGVIiVQPGD PASQGGGHCE 180 

NYGSFRDLVD LEVKAEPSLR KGGMDLQRPT LQWLLCKIF SLKLFLFIAL PNSPGQVSW 240 

QVTIPDGFVN VTVGSNVTLI CIYTTTVASR EQLSIQWSFF HKKEMEPISS PWEEGKWPDV 300 

EAVKGTLDGQ QAELQIYFSQ GGQAVAIGQF KDRITGSNDP GNASITISHM QPADSGIYIC 360 

DVNNPPDFLG QNQGILNVSV LVKPSKPLCS VQGRPETGHT ISLSCLSALG TPSPVYYWHK 420 

LEGRDIVPVK ENFNPTTGIL VIGNI/TNFEQ GYYQCTAINR LGNSSCEIDL TSSHPEVGII 480 

VGALIGSLVG AAIIISWCF ARNKAKAKAK ERNSKTIAEL EPMTKINPRG ESEAMPREDA 540 
TQLEVTLPSS IHETGPDTIQ EPDYEPKPTQ EPAPEPAPGS EPMAVPDLDI ELELEPETQS 600 
ELEPEPEPEP ESEPGVWEP LSEDEKGWK A 631 

Seq ID NO: C360 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I

MVFAFWKVFL ILSCLAGQVS WQVTIPDGF VNVTVGSNVT LICIYTTTVA SREQLSIQWS 60 
FFHKKEMEPI SSPWEEGKWP DVEAVKGTLD GQQAELQIYF SQGGQAVAIG QFKDRITGSN 120 
DPGNASITIS HMQPADSGIY ICDVNNPPDF LGQNQGILNV SVLVKPSKPL CSVQGRPETG 180 
HTISLSCLSA I/GTPSPVYYW HKLEGRDIVP VKENFNPTTG ILVIGNLTNF EQGYYQCTAI 240 
NRLGNSSCEI DLTSSHPEVG IIVGALIGSL VGAAIIISW CFARNKAKAK AKERNSKTIA 300 
ELEPMTKINP RGESEAMPRE DATQLEVTLP SSIHETGPDT IQEPDYEPKP TQEPAPEPAP 360 
GSEPMAVPDL DIELELEPET QSELEPEPEP EPESEPGWV EPLSEDEKGV VKA 413 



Seq ID NO: C361 Protein Sequence 
Protein Accession #: NP_003011.1 



1 11 21 31 41 51 

I I I I I I

MVSRMVSTML SGLLFWLASG WTPAFAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 60 

PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDFSED QGYPDPPNPC 120 

PVGKTDDGCL ENTPDTAEFS REFQLHQHLF DPEHDYPGLG KWNKKLLYEK MKGGERRKRR 180 

SVNPYLQGQR LDNWAKKSV PHFSDEDKDP E 211 



Seq ID NO: C362 Protein Sequence 
Protein Accession #: NP_076926.2 



1 11 21 31 41 51 

I I I I I I 

MTTMQGMEQA MPGAGPGVPQ LGNMAVIHSH LWKGLQEKFI. KGEPKVLGW QILTALMSLS 60 

MGITMMCMAS NTYGSNPISV YIGYTIWGSV MFIISGSLSI AAGIRTTKGL VRGSLGMNIT 120 

SSVLAASGIL INTFSLAFYS FHHPYCNYYG NSNNCHGTMS ILMGLDGMVL LLSVLEFCIA 180 

VSLSAFGCKV LCCTPGGWL ILPSHSHMAE TASPTPUJEV 220 



Seq ID NO: C363 Protein Sequence 
Protein Accession #: NP_002082.1



1 11 21 31 41 51 

I I I I I I

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHVJAVGHLM GKKSTGESSS 60 

VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120 

DVGSKGKVGR LSAPGSQREG RNPQLNQQ 148 



Seq ID NO: C364 Protein Sequence 
Protein Accession #: NP_036393.1 

1 11 21 31 41 51 

I I I I I I 

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFW RENGTTCWIA 60 

EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 120 

FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180 

ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEBTLPLI 240 

LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG 280 
Seq ID NO: C365 Protein Sequence
Protein Accession #: NP_003217.1

1 11 21 31 41 51

I I I I I I 

MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 

WCFKPLTRKT ECTF 

Seq ID NO: C366 Protein Sequence 
Protein Accession #: NP_002984.1

1 11 21 31 41 51 

I I I I I I

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN 

Seq ID NO: C367 Protein Sequence 
Protein Accession #: NP_005233.2



1 
I 

MRSPSAAWLL 
VDEFSASVI/T 
DLLSVIWPPL 
MGHSRKKANI 
NYPLSLAIGV 
TPSNLLLWH 
CRSVRTVKQM 



11 
I 

GAAILLAASL 
GKLTTVFLPI 
KIAYHIHANN 
AIGISLAIWL 
FLFPAFLTAS 
YFLIKSQGQS 
QVSLTSKKHS 



21 
I 

SCSGTIQGTN 
VYTIVFWGL 
WIYGEALCNV 
LILLVTIPLY 
AYVLMIRMLR 
HVYALYIVAL 
RKSSSYSSSS 



31 
I 

RSSKGRSLIG 
PSNGMALWVF 
LIGFFYGNMY 
WKQTIFIPA 
SSAMDENSEK 
CLSTLNSCID 
TTVKTSY 



41 
1 

KVDGTSHVTG 
LFRTKKKHPA 
CSILFMTCLS 
LNITTCHDVL 
KRKRAIKLIV 
PFVYYFVSHD 



51 
I 

KGVTVETVFS 
VIYMANIiALA 
VQRYWVIVNP 
PEQLLVGDMF 
TVLAMYLICF 
FRDHAKNALL 



60 
74 



60 
114 



60 

120 

180 

240 

300 

360 

397 



Seq ID NO: C368 Protein Sequence 
Protein Accession #: NP_003460.1



1 
I 

MAEAKTHWLG 
YIENLRQQAH 
AENEPQSAPK 
RTNEIVEEQY 
YEDWGGEDW 
DQLSDDVSKV 
PPEDLIEMLK 
RAGTEALPDG 
AAWIPHVENR 



11 
I 

AALSLIPLIF 
KEESSPDYNP 
ENKPYALNSE 
TPQSIiATLES 
NPVEEKIESQ 
IAYLKRLVNA 
TGEKPNGSVE 
LSVEDILNLL 
QMAYENLNDK 
LNQGSSQETD 
KRAMENM 



21 
I 

LISGAEAASF 
YQGVSVPLQQ 
KNFPMDMSDD 
VFQELGKLTG 
TQEEVRDSKE 
AGSGRLQNGQ 
PERELDIiPVD 
GMESAANQKT 
DQELGEYLAR 
KIAPVSKRFP 



31 
I 

QRNQLLQKEP 
KENGDESHLP 
YETQQWPERK 
PNNQKRERMD 
NIGKNEQIND 
NGERATRLFE 
LDDISEADLD 
SYFPNPYNQE 
MLVKYPEIIN 
VGPPKNDDTP 



41 
I 

DLRLENVQKF 
ERDSLSEEDW 
DKHMQFPPMY 
EEQKLYTDDE 
EMKRSGQLGI 
KPLDSQSIYQ 
HPDLFQNRML 
KVLPRLPYGA 
SNQVKRVPGQ 
NRQYWDEDLL 



51 
I 

PSPEMIRALE 
MRIILEALRQ 
EENSRDNPFK 
DDIYKANNIA 



LIEISRNXiQI 
SKSGYPKTPG 
GRSRSNQLPK 
GSSEDDLQEE 
MKVLEYLNQE 



Seq ID NO: C369 Protein Sequence 
Protein Accession #: NP_112217.1



1 

I 

MPCAQRSWLA 
DASGHFLSYG 
KRYGNLSHVK 
KHPLVEGGYH 
SISKERWVET 
LEEEEQGLKI 
LGLSHLSGMC 
RQLQYDPTPIi 
QYGPNATFCQ 
IPGGWGRWSP 
PTFRQMQCSE 
GGNSRNVCIN 
LIPKGARDIR 
EKLMATGPTN
TGIRRQTAHC 
GEKKRTVLCI 
GGVRIRSVTC 
VPPPTSRPRM 
ISTGSTSQPI 
PFYNTLTKGP 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

617 



KTANRNHLKL 
WSECSTTCGL 
GGFKIREIQC 
GVQERGVFCP 
EGNKTEDQDQ 
TVRAECCFSC 



11 21 31 41 51 

I I I I I

NLSWAQLLN FGALCYGRQP QPGPVRFPDR RQEHFIKGLP EYHWGPVRV 
LHYPITSSRR KRDLDGSEDW VYYRISHEER DLFFNLTVNQ GFLSNSYIME
MMASSAPLCH LSGTVLQQGT RVGTAALSAC HGLTGPFQLP HGDFFIEPVK 
PHIVYRRQKV PETKEPTOGL KDSVNISQKQ ELWREKWERH NLPSRSLSRR 
LWADTKMIE YHGSENVESY ILTIMNMVTG LFHNPSIGNA IHIVWRLIL 
VHHAEKTLSS FCKWQKSINP KSDLNPVHHD VAVLLTRKDI CAGFNRPCET 
QPHRSCNINE DSGLPLAFTI AHELGHSFGI QHDGKENDCE PVGRHPYIMS 
TWSKCSEEYI TRFLDRGWGF CLDDIPKKKG LKSKVIAPGV IYDVHHQCQL 
EVENVCQTLW CSVKGFCRSK LDAAADGTQC GEKKWCMAGK CITVGKKPES 
WSHCSRTCGA GVQSAERLCN NPEPKFGGKY CTGERKRYRL CNVHPCRSEA 
FDTVPYKNEL YHWFPIFNPA HPCELYCRPI DGQFSEKMLD AVIDGTPCPE 
GICKMVGCDY EIDSNATEDR CGVCLGDGSS CQTVRKMFKQ KEGSGYVDIG 
VMEIEGAGNF LAIRSEDPEK YYLKGGFIIQ WNGNYKLAGT VFQYDRKGDL 
ESVWIQLLFQ VTNPGIKYEY TIQKDGLDND VEQMYFWQYG HWTECSVTCG 
IKKGRGMVKA TFCDPETQPN GRQKKCHEKA CPPRWWAGEW EACSATCGPH 
QTMVSDEQAL PPTDOQHLLK PKTLLSCKRD ILCPSDWTVG NWSECSVSCG 
AKNKDEFCDV TRKPNSRALC GLQQCPSSRR VLKPNKGTIS NGKNPPTLKP 
LTTPTGPESM STSTPAISSP SPTTASKEGD LGGKQWQDSS TQPELSSRYD 
LTSQSLSIQP SEENVSSSDT GPTSEGGLVA TTTSGSGDSS SRNPITWPVT 
EMEIHSGSGE EREQPEDKDE SNPVIWTKIR VPGNDAPVES TEMPLAPPI»T 
PFSTVMEGLLi PSQRPTTSET GTPRVEGMVT EKPANTLLPL GGDHQPEPSG 
PNNMNQTKSS EPVLTEEDAT SLITEGFLLN ASNYKQLTNG HGSAHWIVGN 
GAYWKRVECT TQMDSDCAAI QRPDPAKRCH LRPCAGWKVG NWSKCSRNCS 
VDSRDHRNLR PFHCQFLAGI PPPLSMSCNP EPCEAWQVEP WSQCSRSCGG 
GGLCDWTKRP TSTMSCNEHI> CCHWATGNWD LCSTSCGGGF QKRIVQCVPS 
CLCDHKPRPP EFKKCNQQAC KKSADLLCTK DKLSASFCQT LKAMKKCSVP 
PQTHITHTQR QRRQRLLQKS KEU 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1593 



Seq ID NO: C370 Protein Sequence 
Protein Accession #: NP_001053.1
MRQSHQLPLV GLLLFSFIPS QLCEICEVSE ENYIRLKPLL NTMIQSNYNR GTSAVNWLS
LKLVGIQIQT LMQKMIQQIK YNVKSRLSDV SSGELALIIL ALGVCRNAEE NLIYDYHLTD 
KLENKFQAEI ENMEAHNGTP LTNYYQLSLD VLALCLFNGM YSTAEWNHF TPENKNYYFG 
SQFSVDTGAM AVLALTCVKK SLINGQIKAD EGSLKNISIY TKSLVEKILS EKKENGLIGN 
TFSTGEAMQA LFVSSDYYKE NDWNCQQTLN TVLTEISQGA FSNPNAAAQV LPALMGKTFL 
DINKDSSCVS ASGNFNISAD EPITVTPPDS QSYISVNYSV RINETYFTNV TVLNGSVFLS 
VMEKAQKMND TIFGFTMEER SWGPYITCIQ GLCANNNDRT YWELLSGGEP LSQGAGSYW 
RNGENLEVRW SKY 

Seq ID NO: C371 Protein Sequence 
Protein Accession #: NP_004582.1

1 11 21 31 41 51 

I I I I I I

MCCTKSLLLA ALMSVLLLHL CGESEAASNF DCCLGYTDRI LHPKFIVGFT RQLANEGCDI 
NAIIFHTKKK LSVCANPKQT WVKYIVRLLS KKVKNM 

Seq ID NO: C372 Protein Sequence 
Protein Accession #: NP_037403.1

1 11 21 31 41 51 

I I I I I I

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PAliGLDDDPD APAAQLARAL LRARLDPAAIi AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 
RVKRLETPAP QVPARRLLPP 

Seq ID NO: C373 Protein Sequence 
Protein Accession #: NP_002236.1 

1 11 21 31 41 51 

I I I I I I

MLQSLAGSSC VRLVERHRSA WCFGFLVLGY LLYLVFGAW FSSVELPYED LLRQELRKLK 60

RRFLEEHECL SEQQLEQFLG RVLEASNYGV SVLSNASGNW NWDFTSALFF ASTVLSTTGY 120 

GHTVPLSDGG KAFCIIYSVI GIPFTLLFLT AWQRITVHV TRRPVLYFHI RWGFSKQWA 180 

IVHAVLLGFV TVSCFFFIPA AVFSVLEDDW NFLESFYFCF ISLSTIGLGD YVPGEGYNQK 240 

FRELYKIGIT CYLLLGLIAM LWLETFCEL HELKKFRKMF YVKKDKDEDQ VHIIEHDQLS 300 

FSSITDQAAG MKEDQKQNEP FVATQSSACV DGPANH 336 

Seq ID NO: C374 Protein Sequence 
Protein Accession #: NP_005463.1

1 11 21 31 41 51 

I I I I I I

METTNGTETW YESLHAVLKA LNATLHSNLL CRPGPGLGPD NQTEERRASL PGRDDNSYMY 60 
ILFVMFUFAV TVGSLILGYT RSRKVDKRSD PYHVYIKNRV SMI 103 

Seq ID NO: C375 Protein Sequence 
Protein Accession #: NP_005236.1 

1 11 21 31 41 51 

I I I I I I 

MGRHLALLLL LLLLFQHFGD SDGSQRLEQT PLQFTHLEYN VTVQENSAAK TYVGHPVKMG 60 

VYITHPAWEV RYKIVSGDSE NLFKAEEYIL GDFCFLRIRT KGGNTAILNR EVKDHYTLIV 120 

KALEKNTNVE ARTKVRVQVL DTNDLRPLFS PTSYSVSIjPE NTAIRTSIAR VSATDADIGT 180 

NGEFYYSFKD RTDMFAIHPT SGVIVLTGRL DYLETKLYEM EILAADRGMK LYGSSGISSM 240 

AKLTVHIEQA NECAPVITAV TLSPSELDRD PAYAIVTVDD CDQGANGDIA SLSIVAGDLL 300 

QQFRTVRSFP GSKEYKVKAI GDIDWDSHPF GYNLTLQAKD KGTPPQFSSV KVIHVTSPQF 360 

KAGPVKFEKD VYRAEISEFA PPNTPWMVK AIPAYSHLRY VFKRTPGKAK FSLNYNTGLI 420 

SILEPVKRQQ AAHFELEVTT SDRKASTKVL VKVLGANSNP PEFTQTAYKA AFDENVPIGT 480 

TIMSLSAVDP DEGENGYVTY SIANLUHVPF AIDHFTGAVS TSENLDYELM PRVYTLRIRA 540 

SDWGLPYRRE VEVLATITLN NLNDNTPLFE KINCEGTIPR DLGVGEQITT VSAIDADELQ 600 

LVQYQIEAGN ELDLFSLNPN SGVLSLKRSI* MDGLGAKVSF HSLRITATDG ENFATPLYIN 660 

ITVAASHKLV NLQCEETGVA KMLAEKLLQA NKLHNOGEVE DIFFDSHSVN AHIPQFRSTL 720

PTGIQVKENQ PVGSSVIFMN STDLDTGFNG KLVYAVSGGN EDSCFMIDME TGMLKILSPL 780 

DRETTDKYTL NITVYDLGIP QKAAWRLLHV VWDANDNPP EFLQESYFVE VSEDKEVHSE 840 

IIOVEATDKD LGPNGHVTYS ILTDTDTFSI DSVTGWNIA RPLDRELQHE HSLKIEARDQ 900 

AREEPQLFST VWKVSLEDV NDNPPTFIPP NYRVKVREDI* PEGTVIMWLE AHDPDLGQSG 960 

QVRYSLLDHG EGNFDVDKLS GAVRIVQQLD FEKKQVYNLT VRAKDKGKPV SLSSTCYVEV 1020 

EWDVNENLH PPVFSSFVEK GTVKEDAPVG SLVMTVSAHD EDAGRDGEIR YSIRDGSGVG 1080 

VFKIGEETGV IETSDRLDRE STSHYWLTVF ATDQGWPLS SFIEIYIEVE DVNDNAPQTS 1140 

EPVYYPEIME NSPKDVSWQ IEAFDPDSSS NDKLMYKITS GNPQGFFSIH PKTGLITTTS 1200 

RKLDREQQDE HILEVTVTDN GSPPKSTIAR VIVKILDEND NKPQFLQKFY KIRLPEREKP 1260 

DRERNARREP LYRVIATDKD EGPNAEISYS IEDGNEHGKF FIEPKTGWS SKRFSAAGEY 1320 

DILSIKAVDN GRPQKSSTTR LSIEWISKPK QSLEPISFEE SFFTFTVMBS DPVAHMIGVI 1380

SVEPPGIPLW FDITGGNYDS HFDYDKGTGT IIVAKPLDAE QKSNYNLTVE ATDGTTTILT 1440

QVFIKVIDTN DHRPQFSTSK YEWIPEDTA PETEILQISA VDQDEKNKLI YTLQSSRDPL 1500 

SLKKFRLDPA TGSIiYTSEKL DHEAVSPAHL TVMVRDQDVP VKRNFARIW NVSDTNDHAP 1560 

WFTASSYKGR VYESAAVGSV VUQVTALDKD KGKNAEVLYS IESGNIGNIG NSFMIDPVLG 1620 

SIKTAKELDR SNQAEYDLMV KATDKGSPPM SEITSVRIFV TIADNASPKF TSKEYSVELS 1680 

ETVSIGSFVG MVTAHSQSSV VYEIKDGNTG DAFDINPHSG TIITQKALDF ETLPIYTLII 1740 

QGTNMAGLST NTTVLVHLQD ENDNAPVFMQ AEYTGLISES ASINSWLTD RNVPLVIRAA 1800 
DADKDSNALL VYHIVEPSVH TYFAIDSSTG AIHTVLSLDY
FAEYAANVTV HVIDINDCPP VFAKPLYEAS LLLPTYKGVK 
ITEGNIGEKF SMDYKTGALT VQNTTQLRSR YELTVRASDG 
KFTQDVYSAV VKENSTEAET LAVITAIGSP INEPLFYHIL 
PFDREQQEAF DVWEVIEEH KPSAVAHWV KVIVEDQNDN 
HVIRYVTAVD RDSGRNGEVH YYLKEHHEHF QIGPLGEISI* 
DGGNPAFSAE VIVPITVMNK AMPVFEXPFY SAEIAESIQV 
ITDGDPFSQF TINFNTGVIN VIAPLDFEAH PAYKLSIRAT 
DNPPVFAQQS YAVTLSEASV IGTSWQVRA TDSDSEPNRG 
STGLISLLRT LDYEQSRQHT IFVRAVDGGM PTLSSDVIVT 
RISEHAPHGH FVTCVKAYDA DSSDIDKLQY SILSGNDHKH 
KPFYSLNLSV SDGVFRSSTQ VHVTVIGGNL HSPAFLQMEY 
DGDSGIYGHV TYHIVNDFAK DRFYINERGQ IFTLEKLDRE 
AFCTVNVILT DDNDNAPQFR ATKYEVNIGS SAAKGTSWK 
SESVKENLEI NKLSGVITTK ESLIGLENEF FTFFVRAVDN 
QLPKFSEPFY TFTVSEDVPV GTEIDLIRAE HSGTVLYSLV 
RLKLEKSLDH ETTKWYQFSI LARCTQDDHE MVASVDVSIQ 
VENLPGGSRV IQIRASDADS GTNGQVMYSL DQSQSVEVIE 
KRDNYQIKW ASDHGEKIQL SSTAIVDVTV TDVNDSPPRF 
ILSTTDADSE EINRQVTYFI TGGDPLGQFA VETIQNEWKV 
TDGTFSSKAI VEVKVLDAND NSPVCEKTLY SDTIPEDVLP 
TYTLLGSGAE KFKLNPDTGE LKTSTPLDRE EQAVYHLLVR 
NDNAPEFSAD PYAITVFENT EPGTLLTRVQ ATDADAGLNR 
GIIQLEKPLD RELQAVYTLS LKAVDQGLPR RLTATGTVIV 
VSEDILVGTE VLQVYAASRD IEANAEITYS IISGNEHGKF 
HEYYLTVEAT DGGTPSLSDV ATVNYNVTDI NDNTPVFSQD 
ADDADGPSNS HIHYSIIDGN QGSSFTIDPV RGEVKVTKLL 
PRVNTTTVNI DVSDVNDNAP VFSRGNYSVI IQENKPVGFS 
TIVTGNDEKA FEVNPQGVLL TSSAIKRKEK DHYLLQVKVA 
SIYPPAILPL EIFITSSGEE YSGGVIGKIH ATDQDVYDTL 
LIAHKKLDIG QYLLNVSVTD GKFTTVADIT VHIRQVTQEM 
YWRNFQRALR NILGVRRNDI QIVSLQSSEP HPHLDVIiLFV 
VTDIEEIIGV RILNVFQKLC AGLDCPWKFC DEKVSVDESV 
VCLCKEGRCP PVHHGCEDDP CPEGSECVSD PWEEKHTCVC 
YVKYRLTENE NKLEMKLTMR LRTYSTHAW MYARGTDYSI 
VSVQSIQVND GQWHAVALEV NGNYARLVLD QVHTASGTAP 
QGTRHGRSPQ VGNGFRGCMD SIYLNGQELP LNSKPRSYAH 
SNPCQNGGVC NPSPAGGYYC KCSALYIGTH CEISVNPCSS 
RGLYTGQRCQ LSPYCKDEPC KNGGTCFDSL DGAVCQCDSG 
GALCENTHGS YHCNCSHEYR GRHCEDAAPN QYVSTPWNIG 
FVLCRKMISR KKKHQAEPKD KHLGPATAFT. QRPYFDSKLN 
SIPSDSRNNL DRNSFEGSAI PEHPEFSTFN PESVHGHRKA 
DSDSIQKPSW DFDYDTKWD LDPCLSKKPL EEKPSQPYSA 
NGYHWDTSDW MPSVPLPDIQ EFPNYEVIDE QTPLYSADPN 
PEDFPAADEL PPLPPEFSNQ FESIHPPRDM PAAGSLGSSS 
SEPQTKGTGE NSTCREPHAP YPPGYQRHFE APAVESMPMS 
VMMSDYESGD DGHFEEVTIP PLDSQQHTEV 



EETSIFHFTV QVHDMGTPRL 1860 

VITVNATDAD SSAFSQLIYS 1920 

RFAGLTSVKI NVKESKESHL 1980 

NPDRRFKISR TSGVLSTTGT 2040 

APVFVNLPYY AWKVDTEVG 2100 

KKQFELDTLN KEYLVTWAK 2160 

HSPWHVQAN SPEGLKVFYS 2220 

DSLTGAHAEV FVDIIVDDIN 2280 

ISYQMFGNHS KSHDHFHVDS 2340 

VDVTDLNGNP PLFEQQIYEA 2400 

FVIDSATGII TLSNLHRHAL 2460 

EVELAENAPL HTLVMEVKTT 2520 

TPAEKVISVR LMAKDAGGKV 2580 

SASDADEGSN ADITYAIEAD 2640 

GSPSKESWL VYVKILPPEM 2700 

KGNTPESNRD ESFVIDRQSG 2760 

VKDANDNSPV FESSPYEAFI 2820 

SFAINMETGW ITTLKELDHE 2880 

TAEIYKGTVS EDDPQGGVIA 2940 

YVKKPLDREK KDNYLLTITA 3000 

GKLIMQISAT DADIRSNAEI 3060 

ATDGGGRFCQ ASIWTLEDV 3120 

KILYSLIDSA DGQFSINELS 3180 

SVLDINDNPP VFEYREYGAT 3240 

SIDSKTGAVF IIENLDYESS 3300 

TYTTVISEDA VLEQSVITVM 3360 

DRETISGYTL TVQASDNGSP 3420 

VLQLWTDED SSHNGPPFFF 3480 

DNGKPQLSSL TYIDIRVIEE 3540 

TYSLDPQMDN LFSVSSTGGK 3600 

LNHTIAIRFA NLTPEEFVGD 3660 

EKPGSAQIST KQLLHKINSS 3720 

MSTHSTARLS FVTPRHHRAA 3780 

PSGRFGQCPG SSSMTLTGNS 3840 

LEIHHGRLQY KFDCGSGPGI 3900 

GTLKTLNIiDN YVFFGGHIRQ 3960 

IEESVDVSPG CFLTATEDCA 4020 

NPCLYGGTCV VDNGGFVCQC 4080 

FRGERCQSDI DECSGNPCLH 4140 

LAEGIGIWF VAGIFLLVW 4200 

KNIYSDIPPQ VPVRPISYTP 4260 

VAVCSVAPNL PPPPPSNSPS 4320 

RESLSEVQSL SSFQSESCDD 4380 

AIDTDYYPGG YDIESDFPPP 4440 

RNRQRFNLNQ YLPNFYPLDM 4500 

VYASTASCSD VSACCEVESE 4560 
4590 



Seq ID NO: C376 Protein Sequence 
Protein Accession #: NP_055035.1

1 11 21 31 41 51 

MCYGKCARCI GHSLVGIiALL CIAANILLYF PNGETKYASE NHLSRFVWFF SGIVGGGLLM 
LLPAFVFIGL EQDDCCGCCG HENCGKRCAM LSSVLAALIG IAGSGYCVIV AALGIABGPL 
CLDSLGQWNY TFASTEGQYL LDTSTWSECT EPKHTVEWNV SLFSILLALG GIEFILCLIQ 
VINGVIiGGIC GFCCSHQQQY DC 

Seq ID NO: C377 Protein Sequence 
Protein Accession #: NP_003750.1 

1 11 21 31 41 51 

I I I I I I

KSTENVEGKP SNLGERGRAR SSTFLRWQP MFNHSIFTSA VSPAAERIRF ILGEEDDSPA 
PPQLFTELDE IiLAVDGQEME WKETARWIKF EEKVEQGGER WSKPHVATLS LHSLFELRTC 
MEKGSIMLDR EASSLPQLVE MIVDHQIETG LLKPELKDKV TYTLLRKHRH QTKKSNLRSL
ADIGKTVSSA SRMFTNPDNG SPAMTHRNLT SSSLNDISDK PEKDQLKNKF MKKLPRDAEA 
SNVLVGEVDF LDTPFIAFVR LQQAVMLGAL TBVPVPTRFL FILLGPKGKA KSYHEIGRAI 
ATLMSDEVFH DIAYKAKDRH DLIAGIDEFL DEVIVLPPGE WDPAIRIEPP KSLPSSDKRX 
NMYSGGENVQ MNGDTPHDGG HGGGGHGDCE ELQRTGRFCG GLIKDIKRKA PFFASDFYDA 
LNIQALSAIL FIYLATVTNA ITFGGLLGDA TDNMQGVLES FLGTAVSGAI FCLFAGQPLT 
ILSSTGPVLV FERLLFNFSK DNNFDYLEFR LWIGLWSAFL CLILVATDAS FLVQYFTRFT 
EEGFSSLISF IFIYDAFKKM IKLADYYPIN SNFKVGYNTL FSCTCVPPDP ANISISNDTT 
LAPEYLPTMS STDMYHNTTF DWAFLSKKEC SKYGGNLVGN NCNFVPDITL MSFILFLGTY 
TSSMALKKFK TSPYFPTTAR KLISDFAIIL SILIFCVIDA LVGVDTPKLI VPSEFKPTSP 
NRGWFVPPFG ENPWWVCLAA AIPALLVTIL IFMDQQITAV IVNRKEHKLK KGAGYHLDLF 
WVAILMVICS LMALPWYVAA TVISIAHIDS LKMETETSAP GEQPKFLGVR EQRVTGTLVF 
ILTGLSVFMA PILKFIPMPV LYGVFLYMGV ASLNGVQFMD RLKLLLMPLK HQPDFIYLRH 
VPLRRVHLFT FLQVLCLALI> WILKSTVAAI IFPVMIIAIiV AVRKGMDYLF SQHDLSFLDD 
VIPEKDKKKK EDEKKKKKKK GSLDSDNDDS DCPYSEKVPS IKIPMDIMEQ QPFLSDSKPS 
DRERSPTFLE RHTSC 
Seq ID NO: C378 Protein Sequence
Protein Accession #: NP_000949.1

1 11 21 31 41 51

I I I I I I

MSTPGVNSSA SLSPDRLNSP VTIPAVMFIF GWGNLVAIV VLCKSRKEQK ETTFYTLVCG 60 

LAVTDLLGTL LVSPVTIATY MKGQWPGGQP LCEYSTFILL FFSLSGLSII CAMSVERYLA 120 

INHAYFYSHY VDKRLAGLTL FAVYASNVLF CALPNMGLGS SRLQYPDTWC FIDWTTNVTA 180 

HAAYSYMYAG FSSFLILATV LCNVLVCGAL LRMHRQFMRR TSLGTEQHHA AAAASVASRG 240 

HPAASPALPR LSDFRRRRSF RRIAGAEIQM VILLI ATSLV VLICSIPLW RVFVNQLYQP 300 

SLEREVSKNP DLQAIRIASV NPILDPWIYI LLRKTVLSKA IEKIKCLFCR IGGSRRERSG 360 

QHCSDSQRTS SAMSGHSRSF ISRELKEISS TSQTLLPDLS LPDLSENGLG GRNLLPGVPG 420 

MGLAQ EDITS LRTLRISETS DSSQGQDSES VLLVDEAGGS GRAGPAPKGS SLQVTFPSET 480 

LNLSEKCI 488 

Seq ID NO: C379 Protein Sequence 
Protein Accession #: NP_002650.1

1 11 21 31 41 51 

I I I I I I 

MGHPPLLPLL LLLHTCVPAS WGLRCMQCKT NGDCRVEECA LGQDLCRTTI VRLWEEGEEL 60 
ELVEKSCTHS EKTNRTLSYR TGLKITSLTE WCGLDLCNQ GNSGRAVTYS RSRYLECISC 120 
GSSDMSCERG RHQSLQCRSP EEQCLDWTH WIQEGEEGRP KDDRHLRGCG YLPGCPGSNG 180 
FHNNDTFHFL KCCNTTKCNE GPILELENLP QNGRQCYSCK GNSTHGCSSE ETFLIDCRGP 240 
MNQCLVATGT HEPKNQSYMV RGCATASMCQ HAHLGDAFSM NHIDVSCCTK SGCNHPDLDV 300 
QYRSGAAPQP GPAHLSLTIT LLMTARLWGG TLLWT 335

Seq ID NO: C380 Protein Sequence 
Protein Accession #: BAB55406.1

1 11 21 31 41 51 

I I I I I I

MDFESGQVDP LASVILPPNL LENLSPEDSV LVRRAQFTFF NKTGLFQDVG PQRKTLVSYV 60 

MACSIGNITI QNLKDPVQIK IKHTRTQEVH HPICAFWDLN KNKSFGGWNT SGCVAHRDSD 120 

ASETVCLCNH FTHFGVLMDL PRSASQLDAR NTKVLTFISY IGCGISAIFS AATLLTYVAF 180 

EKLRRDYPSK ILMNLSTALL FLNLLFLLDG WITSFNVDGL CIAVAVLLHF FLLATFTWMG 240 

LEAIHMYIAL VKVFNTYIRR YILKFCIIGW GLPALWSW LASRNNNEVY GKESYGKEKG 300 

DEFCWIQDPV IFYVTCAGYF GVMFFLN I AM FIWMVQICG RNGKRSNRTL REEVLRNLRS 360 

WSLTFLLGM TWGFAFPAWG PLNIPFMYLF SIFNSLQGLF IFIFHCAMKE NVQKQWRRHL 420 

CCGRFRLADN SDWSKTATNI IKKSSDNLGK SLSSSSIGSN STYLTSKSKS SSTTYFKRNS 480 

HTDNVSYEHS FNKSGSLRQC FHGQVLVKTG PC 512 

Seq ID NO: C381 Protein Sequence 
Protein Accession #: NP_000565.1

1 11 21 31 41 51 

I I I I I I

MTVARPSVPA ALPLLGELPR LLLLVLLCLP AVWGDCGLPP DVPNAQPALE GRTSFPEDTV 60 
ITYKCEESFV KIPGEKDSVI CLKGSQWSDI EEFCNRSCEV PTRLNSASLK QPYITQNYFP 120 
VGTWEYECR PGYRREPSLS PKLTCLQNLK WSTAVEFCKK KSCPNPGEIR NGQIDVPGGI 180 
LFGATI SFSC NTGYKLFGST SSFCLISGSS VQWSDPLPEC REIYCPAPPQ IDNGIIQGER 240 
DHYGYRQSVT YACNKGFTMI GEHSIYCTVN NDEGEWSGPP PECRGKSLTS KVPPTVQKPT 300 
TVNVPTTEVS PTSQKTTTKT TTPNAQATRS TPVSRTTKHF HETTPNKGSG TTSGTTRLLS 360 
GHTCFTLTGL LGTLVTMGLL T 381 

Seq ID NO: C382 Protein Sequence
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LKSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDL 300 

TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 

EIYEIKVDTF QQLLSLRSLN LAWNKIAIIH PNAFSTLPSL IKLDLSSNLL SSFPITGLHG 420 

LTHLKLTGNH ALQSLI9SEN FPELKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 480

DLHKKDAGMF QAQDERDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 540 

WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 600 

ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGP SVKYSAKFET KAPFSSLKVI 660 

ILLCALLALT MAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 

TKLYCNLDKG DLENIWDCSM VKHIALLLFT NCILNCPVAF LSFSSLINLT FISPEVIKFI 780 

LLWVPLPAC LNPLLYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS 840 

TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCHLSSVAFV PCL 883 

Seq ID NO: C383 Protein Sequence 
Protein Accession #: NP_003658.1

1 11 21 31 41 51 

I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLHLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELGFHSNNIR SIPEKAFVGN PSLITIHFYD NPIQFVGRSA 300 
FQHLPELRTL TLNGASQITE FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS
YNL.LEDLPSF SVCQKLQKID LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AI IHPNAFST 
LPSLIKXiDLS SNLLSSFPIT GLHGLTHLKL TGNHALQSLI SSENPPELKV IEMPYAYQCC 
AFGVCENAYK ISNQWNKGDN SSMDDLHKKD AGMFQAQDER DLEDFLLDFE EDLKALHSVQ 
CSPSPGPFKP CEHLLDGWLI RIGVWTIAVL ALTCNALVTS TVFRSPLYIS PIKLLIGVIA 
AVNMLTGVSS AVIAGVDAFT FGSFARHGAW HENGVGCHVI GFLSIFASES SVFLLTLAAli 
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 
STMGYMVALI LLNSLCFLMM TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC 
PVAFLSFSSL INLTFISPEV IKFILLWVP LPACLNPLLY ILFNPHFKED LVSLRKQTYV 
WTRSKHPSLM SINSDDVEKQ SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS 
VAFVPCL 

Seq ID NO: C384 Protein Sequence
Protein Accession #: NP_003497.1

1 11 21 31 41 51 

I I I I I I

MEMFTFLLTC I FLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60 
FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120 
LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ 180 
CAPPCPNMYP KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240 
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMLLY FFTMAGTVWW 300 
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360 
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420 
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 706 

Seq ID NO: C385 Protein Sequence 
Protein Accession #: NP_000573

1 11 21 31 41 51 

I I I I I I

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180 
SEELNGAYKA I PVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 300 

Seq ID NO: C386 Protein Sequence 
Protein Accession #: NP_002812 

1 11 21 31 41 51 

I I I I I I

MGAARGSPAR PRRLPLLSVL LLPLLGGTQT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 60 

VHVYWLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARS ANAS FN 120 

IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180 

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARWLA PQDVWARYE 240 

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 360 

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTWWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 480 

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 540 

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600 

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660 

RYTCIAGNSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720 

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SI/3SGPAATN 780 

KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840 

LDFRRELEMF GKmHANWR LLGLCREAEP HYMVLEYVDL GDliKQFLRIS KSKDEKLKSO. 900 

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960 

YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVLADLQ 1020 

AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA IX5DSTVDSKP 1070 

Seq ID NO: C387 Protein Sequence
Protein Accession #: NP_002300.1

1 11 21 31 41 51 

I I I I I I

MKVLAAGWP LLLVLHWKHG AGSPLPITPV NATCAIRHPC HNNLMNQIRS QLAQLNGSAN 60 

ALFILYYTAQ GEPFPNNLDK LCGPNVTDFP PFHANGTEKA KLVELYRIW YLGTSI/GNIT 120 

RDQKILNPSA LSLHSKLNAT ADILRGLLSN VLCRLCSKYH VGHVDVTYGP DTSGKDVFQK 180 

KKLGCQLLGK YKQIIAVLAQ AF 202 

Seq ID NO: C388 Protein Sequence 
Protein Accession #: XP_097508 

1 11 21 31 41 51 

I I I I I I

MGRPRLTLVC HVSHISARD LSKNNLTELQ PGLFHHLRFL EELRLSGNHL SHIPGQAFSG 60 

LYSLKILMLQ NNQLGGIPAE ALWELPSLQS LRLDANLISL VPERSFEGLS SLRHLWLDDN 120 

ALTEIPVRAL NNIjPALQAMT LALNRISHIP DYAFQNLTSL WLHLHNNRI QHLGTHSFEG 180 

LHNLETLDLN YNKUQEFPVA IRTLGRLQEL GFHNNNIKAI PEKAFMGNPL LQTIHFYDNP 240 
IQFVGRSAFQ YLPKLHTLSL NGAMDIQEPP DLKGTTSLEI LTLTRAGIRL LPSGMCQQLP 300

RLRVLELSHN QIEELPSLHR CQKLEEIGLQ HNRIWEIGAD TFSQLSSLQA LDLSWNAIRS 360 

IHPEAFSTLH SLVKLDLTDN QLTTLPLAGL GGLMHXKLKG NLALSQAFSK DSFPKLRILE 420 

VPYAYQCCPY GMCASFFKAS GQWEAEDLHL DDEESSKRPL GLIARQAENH YDQDLDELQL 480 

EMEDSKPHPS VQCSPTPGPF KPCEYLFESW GIRLAVWAIV LLSVLCNGLV LLTVFAGGPV 540

PLPPVKFWG AIAGANTLTG ISCGMASVD ALTFGQFSEY GARWETGLGC RATGFLAVLG 600 

SEASVLLLTL AAVQCSVSVS CVRAYGKSPS LGSVRAGVLG CLALAGLAAA LPLASVGEYG 660 

ASPLCLPYAP PEGQPAALGF TVALVMMNSF CFLWAGAYI KXiYCDLPRGD FEAVWDCAMV 720 

RHVAWLIFAD GLLYCPVAFL SFASMLGLFP VTPEAVKSVL LWIiPLPACL NPLLYLLFHP 780 

HFRDDLRRLR PRAGDSGPLA YAAAGELEKS SCDSTQALVA FSDVDLILEA SEAGRPPGLE 840

TYGFPSVTLI SCQQPGAPRL EGSHCVEPEG NHFGNPQPSM DGELLLRAEG STPAGGGLSG 900 

GGGFQPSGLA FASHV 915

Seq ID NO: C389 Protein Sequence 
Protein Accession #: NP_570901

1 11 21 31 41 51

I I I I I I 

MASLVSLELG LLIAVLWTA TASPPAGLLS LLTSGQGALD QEALGGLLNT LADRVHCTNG 60 

PCGKCLSVED ALGLGEPEGS GLPPGPVLEA RYVARLSAAA VLYLSNPEGT CEDTRAGLWA 120

SHADHLLALL. ESPKALTPGL SWLLQRMQAR AAGQT PKTAC VDIPQLLEEA VGAGAPGSAG 180 

GVLAALLDHV RSGSCFHALP SPQYFVDFVF QQHSSEVPMT JUAELSALKQR LGVGREAHSD 240 

HSHRHRGASS RDPVPLISSS NSSSVWDTVC LSARDVMAAY GLSEQAGVTP EAWAQLSPAL 300 

LQQQLSGACT SQSRPPVQDQ LSQSERYLYG SLATLLICLC AVFGLLLLTC TGCRGVAHYI 360 

LQTFLSLAVG ALTGDAVLHlt TPKVLGLHTH SEEGLSPQPT WRLLAMIAGL YAFFLFENLF 420

NLLLPRDPED LEDGPCGHSS HSHGGHSHGV SLQLAPSELR QPKPPHEGSR ADLVAEESPE 480 

LLNPEPRRLS PELRLLPYMI TLGDAVHNFA DGLAVGAAFA SSWKTGLATS LAVFCHELPH 540 

ELGDFAALLH AGLSVRQALL LNLASALTAF AGLYVALAVG VSEESEAWIL AVATGLFLYV 600 

ALCDMLPAML KVRDPRPWLL FLLHNVGLU3 GWTVLLLLSL YEDDITF 648 






Seq ID NO: C390 Protein Sequence 
Protein Accession #: NP_061844



1 11 21 31 41 51 

I I I I I I

MANASEPGGS GGGEAAALGL KLATLSLLDC VSLAGNVLFA LLIVRERSLH RAPYYLLLDL 60 

CLADGLRALA CLPAVMLAAR RAAAAAGAPP GALGCKLLAF LAALFCFHAA FLLLGVGVTR 120 

YLAIAHHRFY AERLAGWPCA AMLVCAAWAL ALAAAFPPVL DGGGDDEDAP CALEQRPDGA 180 

PGALGFLLLL AVWGATHLV YLRLLFFIHD RRKMRPARLV PAVSHDWTFH GPGATGQAAA 240 

NWTAGFGRGP TPPALVGIRP AGPGRGARRL LVLEEFKTEK RLCKMFYAVT LLFLLLWGPY 300

WASYLRVLV RPGAVPQAYL TASVWLTFAQ AGINPWCFL FNRELRDCFR AQFPCCQSPR 360 

TTQATHPCDL KGIGL 375

Seq ID NO: C391 Protein Sequence 
Protein Accession #: NP_005622

1 11 21 31 41 51 

I I I I I I 

MAAARPARGP ELPLLGLLLL LLLGDPGRGA ASSGNATGPG PRSAGGSARR SAAVTGPPPP 60 

LSHCGRAAPC EPLRYNVCLG SVLPYGATST LLAGDSDSQE EAHGKLVLWS GLRNAPRCWA 120

VIQPLLCAVY MPKCENDRVE LPSRTLCQAT RGPCAIVERE RGWPDFLRCT PDRFPEGCTN 180 

EVQNIKFNSS GQCEVPLVRT DNPKSWYEDV EGCGIQCQNP LFTEAEHQDM HSYIAAFGAV 240 

TGLCTLFTLA TFVADWRNSN RYPAVILFYV NACFFVGSIG WLAQFMDGAR REIVCRADGT 300 

MRLGEPTSNE TLSCVIIFVI VYYALMAGW WFWLTYAWH TSFKALGTTY QPLSGKTSYF 360 

HLLTWSLPFV LTVAI LAVAQ VDGDSVSGIC FVGYKNYRYR AGFVLAPIGL VLIVGGYFLI 420

RGVMTLFSIK SNHPGLLSEK AASKINETML RLGIFGFLAF GFVLITFSCH FYDFFNQAEW 480 

ERSFRDYVLC QANVTIGLPT KQPIPDCEIK NRPSLLVEKI NLFAMFGTGI AMSTWVWTKA 540 

TLLIWRRTWC RLTGQSDDEP KRIKKSKMIA KAFSKRHELL QNPGQELSFS MHTVSHDGPV 600 

AGLAFDLNEP SADVSSAWAQ HVTKMVARRG AILPQDISVT PVATPVPPEE QANLWLVEAE 660 

ISPELQKRLG RKKKRRKRKK EVCPIAPPPE LHPPAPAPST IPRLPQLPRQ KCLVAAGAWG 720

AGDSCRQGAW TLVSNPFCPE PSPPQDPFLP SAPAPVAWAH GRRQGLGPIH SRTNLMDTEL 780 

MDADSDF 788 

Seq ID NO: C392 Protein Sequence 
Protein Accession #: BAC04382

1 11 21 31 41 51 

I I I I I I 

MGARSGARGA LLLALLLCWD PRLSQAGRKR SGEVLPDSFP SAPAEPLPYF LQEPQDAYIV 60 

KNKPVEIiRCR AFPATQIYFK CNGEWVSQND BVTQEGLDEA TLGARGGLRV REVQIEVSRQ 120

QVEELFGLED YWCQCVAWSS AGTTKSRRAY VRIAYLRKNF DQEPLGKEVP LDHEVLLQCR 180 

PPEGVPVAEV EWLKNEDVID PTQDTNFLLT IDHNLIIRQA RLSDTANYTC VAKNIVAKRR 240 
STTATVIVYV NGGWSSWAEW SPCSNRCGRG WQKRTRTCTN PAPLNGGAFC EGQAFQKTAC 300 
TTICPVDGAW TEWSKWSACS TECAHWRSRE CMAPPPQNGG RDCSGTLLDS KNCTDGLCMQ 360
NKKTLSDPNS HUJEASGDAA LYAGLWAIF VWAILMAVG VWYRRNCRD FDTDITDSSA 420
ALTGGFHPVN FKTARPSNPQ LLHPSVPPDL TASAGIYRGP VYALQDSTDK IPMTNSPLLD 480 
PLPSLKVEVY SSSTTGSGPG LADGADLLGV LPPGTYPSDF ARDTHFIjHLR SASLGSQQLL 540 
GLPRDPGSSV SGTFGCLGGR LSIPGTGVSL LVPNGAIPQG KFYEMYI*LIN KAESTLPLSE 600 
GTQTVLSPSV TCGPTGLLLC RPVILTMPHC AEVSARDWIF QLKTQAHQGH WEEWTLDEE 660
TLNTPCYCQL EPRACHILLD QLGTYVFTGE SYSRSAVKRL QIAVFAPALC TSLEYSLRVY 720
CLEDTPVALK EVLELERTLG GYLVEEPKPL MFKDSYHNLR LSLHDLPHAH WRSKLLAKYQ 780 
EIPFYHIWSG SQKALHCTFT LERHSLASTE LTCKICVRQV EGEGQIFQLH TTLABTPAGS 840 
LDTLCSAPGS TVTTQLGPYA FKIPLSIRQK ICNSLDAPNS RGNDWRMLAQ KLSMDRYLNY 900 
FATKASPTGV ILDLWEALQQ DDGDLNSLAS ALEEMGKSEM LVAVATDGDC 950 
Seq ID NO: C393 Protein Sequence
Protein Accession #: NP_004616

1 11 21 31 41 51

I I I I I I

MNRKARRCLG HLFLSLGMVY LRIGGFSSW ALGASIICNK IPGLAPRQRA ICQSRPDAII 60 

VIGEGSQMGL DECQFQFRNG RWNCS ALGER TVFGKELKVG SREAAFTYAI IAAGVAHAIT 120 

AACTQGNLSD CGCDKEKQGO YHRDEGWKWG GCSADIRYGI GFAKVFVDAR EIKQNARTLM 180 

NLHNNEAGRK ILEENMKLEC KCHGVSGSCT TKTCWTTLPQ FRELGYVLKD KYNEAVHVEP 240

VRASRNKRPT FLKIKKPLSY RKPMDTDLVY IEKSPNYCEB DPVTGSVGTQ GRACNKTAPQ 300 

ASGCDLMCCG RGYNTHQYAR VWQCNCKFHW CCYVKCNTCS ERTEMYTCK 349 
Seq ID NO: C394 Protein Sequence
Protein Accession #: NP_003777 



1 11 21 31 41 51

I I I I I I

MDALCGSGEL GSKFWDSNLS VHTENPDLTP CFQNSLLAWV PCIYLWVALP CYLLYLRHHC
RGYIILSHLS KLKMVLGVLL WCVSWADLFY SFHGLVHGRA PAPVFFVTPL WGVTMLLAT
LLIQYERLQG VQSSGVLIIF WFLCWCAIV PFRSKILLAK AEGEISDPFR FTTFYIHFAL
VLSALILACF REKPPFFSAK NVDPNPYPET SAGFLSRLFF WWFTKMAIYG YRHPLEEKDL
WSLKEEDRSQ MWQQLLEAW RKQEKQTARH KASAAPGKNA SGEDEVLLGA RPRPRKPSFL
KALLATFGSS FLISACFKLI QDLLSFINPQ LLSILIRFIS NPMAPSWWGF LVAGLMFLCS
MMQSLILQHY YHYIFVTGVK FRTGIMGVIY RKALVITNSV KRASTVGEIV NLMSVDAQRF
MDLAPFLNLL WSAPLQIILA IYFLWQNLGP SVLAGVAFMV IiLIPLNGAVA VKMRAFQVKQ
MKLKDSRIKL MSEILNGIKV LKLYAWEPSF LKQVEGIRQG ELQLLRTAAY LHTTTTFTWM
CSPFLVTLIT LWVYVYVDPN NVLDAEKAFV SVSLFNILRL PLNMLPQLIS NLTQASVSLK
RIQQFLSQEE LDPQSVERKT ISPGYAITIH SGTFTWAQDL PPTLHSLDIQ VPKGALVAW
GPVGCGKSSL VSALLGEMEK LEGKVHMKGS VAYVPQQAWI QNCTLQENVL FGKALNPKRY
QQTLEACALL ADLEMLPGGD QTEIGEKGIN LSGGQRQRVS LARAVYSDAD IFLLDDPLSA
VDSHVAKHIF DHVIGPEGVL AGKTRVLVTH GISFOPQTDF IIVLADGQVS EMGPYPALLQ
RNGSFANFLC NYAPDEDQGH LEDSWTALEG AEDKEALLIE DTLSNHTDLT DNDPVTYWQ
KQFMRQLSAL SSDGEGQGRP VPRRHLGPSE KVOVTEAKAD GALTQEEKAA IGTVELSVFW
DYAKAVGLCT TLAICLLYVG QSAAAIGANV WLSAWTNDAM ADSRQNNTSL RLGVYAALGI
LQGFLVMLAA MAMAAGGIQA ARVLHQALLH NKIRSPQSFF DTTPSGRILN CFSKDIYWD
EVLAPVILML LNSFFNAIST LWIMASTPL FTWILPLAV LYTLVQRFYA ATSRQLKRLE
SVSRSPIYSH FSETVTGASV IRAYNRSRDF EIISDTKVDA NQRSCYPYII ENRWLSIGVE
FVGNCWLFA ALFAVIGRSS LNPGLVGLSV SYSLQVTFAL NWMIRMMSDL ESNIVAVERV
KEYSKTETEA PWWEGSRPP EGWPPRGEVE FRNYSVRYRP GLDLVLRDLS LHVHGGEKVG
IVGRTGAGKS SMTLCLFRIL EAAKGEIRID GLNVADIGLH DLRSQLTIIP QDPILFSGTL
RMNLDPFGSY SEED I WW ALE LSHLHTFVSS QPAGLDFQCS EGGENLSVGQ RQLVCLARAL
LRKSRILVLD EATAAIDLET DNLIQATIRT QFDTCTVLTI AHRLNTIMDY TRVLVLDKGV
VAEFDSPANL IAARGIFYGM ARDAGLA



Seq ID NO: C395 Protein Sequence 
Protein Accession #: NP_004617 

1 11 21 31 41 51 

I I I I I I 

MRARPQVCEA LLFALALQTG VCYGIKWLAL SKTPSALALN QTQHCKQLEG LVSAQVQLCR 60
SNLELMHTW HAAREVMKAC RRAFADMRWN CSSIELAPNY LLDLERGTRE SAFVYALSAA 120
AISHAIARAC TSGDLPGCSC GPVPGEPPGP GNRWGGCADN LSYGLLMGAK FSDAPMKVKK 180
TGSQANKLMR LHNSEVGRQA LRASLEMKCK CHGVSGSCSI RTCWKGLQEL QDVAADLKTR 240
YLSATKWHR PMGTRKHLVP KDLDIRPVKD SELVYLQSSP DFCMKNEKVG SHGTQDRQCN 300
KTSNGSDSCD LMCCGRGYNP YTDRWERCH CKYHWCCYVT CRRCERTVER YVCK 354

Seq ID NO: C396 Protein Sequence
Protein Accession #: NP_114072



1 11 21 31 41 51

I I I I I I

MEWGYLLEVT SLLAALALLQ RSSGAAAASA KELACQEITV PLCKGIGYNY TYMPNQFNHD
TQDEAGIiEVH QFWPLVEIQC SPDLKFFLCS MYTPICLEDY KKPLPPCRSV CERAKAGCAP
LMRQYGFAWP DRMRCDRLPE QGNPDTLCMD YNRTDLTTAA PSPPRRLPPF PPGEQPPSGS
GHGRPPGARP PHRGGGRGGG GGDAAAPPAR GGGGGGKARP PGGGAAPCEP GCQCRAPMVS
VSSERHPLYN RVKTGQIANC ALPCHNPFFS QDERAFTVFW IGLWSVLCFV STFATVSTFL
IDMERFKYPE RPIIFLSACY LFVSVGYLVR LVAGHEKVAC SGGAPGAGGA GGAGGAAAGA
GAAGAGAGGP            AVEQHVRYET TGPALCTWF LLVYFFGMAS SIWWVILSLT
WFLAAGMKWG NEAIAGYSQY FHLAAWLVPS VKSIAVLALS SVDGDPVAGI CYVGNQSIiDN
LRGFVLAPLV IYLFIGTMFL LAGFVSLFRI RSVIKQQDGP TKTHKLEKLM IRLGLFTVLY
TVPAAWVAC LFYEQHNRPR WEATHNCPCL RDLQPBQARR PDYAVFMLKY FMCLWGITS
GVWVWSGKTL ESWRSLCTRC CWASKGAAVG GGAGATAAGG GGGPGGGGGG GPGGGGGPGG
GGGSLYSDVS TGLTWRSGTA SSVSYPKQMP LSQV



Seq ID NO: C397 Protein Sequence 
Protein Accession #: XP_050625 



1 11 21 31 41 51 

I I I I I I 

MLQGPGSLLL LFLASHCCLG SARGLFLFGQ PDFSYKRSNC KPIPANLQLC HGIEYQNMRL 
PNLLGHETMK EVLEQAGAWI PLVMKQCHPD TKKFLCSLFA PVCLDDLDET IQPCHSLCVQ 
VKDRCAPVMS AFGFFWPDML ECDRFPQDND LCIPLASSDH LLPATEEAPK VCEACKNKND 
DDNDIMETLC KNDFALKIKV KEITYINRDT KIILETKSKT IYKLNGVSER DLKKSVLWLK 

DSLQCTCEEM NDINAPYLVM GQKQGGELVI TSVKRWQKGQ REFKRISRSI RKLQC 295

Seq ID NO: C398 Protein Sequence 
Protein Accession #: NP_001297.1

1 11 21 31 41 51 

I I I I I I 

MSMGLEITGT ALAVLGWLGT IVCCALPMWR VSAFIGSNII TSQNIWEGLW MNCWQSTGQ 60 

MQCKVYDSLL ALPQDLQAAR ALIWAILIA AFGLLVALVG AQCTNCVQDD TAKAKITIVA 120 

GVLFLLAALL TLVPVSWSAN TIIRDFYNPV VPEAQKREMG AGLYVGWAAA ALQLLGGALL 180 

CCSCPPREKK YTATKWYSA PRSTGPGASL GTGYDRKDYV 220 

Seq ID NO: C399 Protein Sequence 
Protein Accession #: NP_036581.1

1 11 21 31 41 51 

I I I I I I 

MESRKDITNQ EELWKMKPRR NLEEDDYLHK DTGETSMLKR PVLLHLHQTA HADEFDCPSE 60 
LQHTQELFPQ WHLPIKIAAI IASLTFLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 120 
VSITLIALVY LPGVIAAIVQ LHNGTKYKKF PHWLDKHMLT RKQFGLLSFF FAVLHAIYSL 180 
SYPMRRSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL ALIAVTSIPS 240 
VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 300 
VLIFKSILPL PCLRKKILKI RHGWEDVTKI NKTEICSQL 339 



Seq ID NO: C400 Protein Sequence
Protein Accession #: NP_001766.1



1 11 21 31 41 51 

I I I I I I

MANCEFSPVS GDKPCCRLSR RAQLCLGVSI LVLILWVLA VWPRWRQTW SGPGTTKRFP 60 

ETVLARCVKY TEIHPEMRHV DCQSVWDAFK GAFISKHPCN ITEEDYQPLM KLGTQTVPCN 120 

KILLWSRIKD IiAHQFTQVQR DMFTLEDTLL GYLADDLTWC GEFNTSKINY QSCPDWRKDC 180 

SNNPVSVFWK TVSRRFAEAA CDWHVMIiNG SRSKIFDKNS TFGSVEVHNL QPEKVQTLEA 240 

WVIHGGREDS RDLCQDPTIK ELESIISKRN IQFSCKNIYR PDKFLQCVKN PEDSSCTSEI 300 

Seq ID NO: C401 Protein Sequence 
Protein Accession #: XP_120513.2



1 11 21 31 41 51 

I I I I I I

MVSCTPSGPL RETNENVKKF YALRAFMFRM SSEAAMLGES RTPKPRKHRA TTRAKIFKRF 60 

FSEGSESNSR LVEELAVIHT YSDDPAPTTS PSSVQPREFG VMQGAPRARF GSRTPPAAAE 120 

ASSPHLGIGE AACQSGARAA APRAGARROQ PQRQAAAAAA TAQTHTLPHA RTRADPAGRR 180 

RRHPRSPAPG GEGTCSEGPA PRRRMEEEMQ PAEEGPSVPK IYKQRSPYSV LKTFPSKRPA 240 

LAKRYERPTL VELPHGHLRT PAQPPPASPA ASSSSSFAAV VRLGAPPRPP RRGFRARGTI 300

PPLLPAPGVA GTLLPPPTSS SPPSPRPRPW HAAAPRGGTS HTHMWRSQST LPGSDTMVSV 360 

FGLMAQRRWQ HRSLKQFEWG ILGSWGTWPC GQDWLEKEGQ VAVLLPRSEG NTAPKKSRMI 420 

LDAFAQQCSR VLSLLNCGGK LLDSNHSQSM ISCVKQEGSS YNERQEHCHI GKGVHSQTSD 480 

NVDIEMQYMQ RKQQTSAFLR VFTDSLQNYL LSGSFPTPNP SSASEYGHLA DVDPLSTSPV 540 

HTLENISLDS TASLCKSRHL SREPPVKSDF PNPLQQALAG GASRPFSGAQ QSIAYRVNSE 600

LEDGIRSPVP LSCEALEMDL TSLGSKQLLN NYPVYITSKQ WEEAVNSSKK DGRRIiLRYLI 660 

RFVFTTDELK YSCGLGKRKR SVQSGETGPE RRPLDPVKVT CLRGTASFRS VSPSVISFHR 720 

IGCGSPRTSV QPSVF 735 

Seq ID NO: C402 Protein Sequence
Protein Accession #: BAA92562.1

1 11 21 31 41 51 

I I I I I I 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 60 

EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYI IE ENTTTSFTWA 120 

FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 180 

RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 240 

FSALANTVTL AGGPSFTSKG LKYFHHFTLS LCGNQGRKMS VCTDNVTDLR IPEGESGFSK 300 

SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 360 

PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHFLWES 420 

AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 480 

ISAGTCTAIL LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVEDD 540 

LIFTSKKSIiF GKIKSFTSKQ PAPVTISLSE DS 572 



Seq ID NO: C403 Protein Sequence 
Protein Accession #: NP_055139.1



1 11 21 31 41 51

I I I I I I

MALQGISWE LSGLAPGRXC AMVLADFGAR WRVDRPGSR YDVSRLGRGK RSLVU3LKQP 60 

REPRAAASVQ AVGCAAGALP PRCHGETPAG PRDSAAGKSK AYLCQAEWIW PVQESFCRLA 120 

GHDINYLALS GVLSKIGRSG ENPYAPLNLV ADFAGGGLMC ALGIIMAItFD RTRTDKGQVI 180 

DANMVEGTAY LSSFLWKTQK SSLWEAPRGQ NMLDGGAPFY TTYRTADGEF MAVGAIEPQF 240 

YELLIKGLGL KSDELPNQMS TDDWPEMKKK FADVFAKKTK AEWCQIFDGT DACVTPVLTF 300 

EEWHHDHNK ERGSFITSEE QDVSPRLAPL LUTTPAIPSS KGDPFIGEHT EEILEEFGFS 360 

REEIYQLNSD KIIESNKVKA SL 382 

Seq ID NO: C404 Protein Sequence 




WO 03/042661 



Protein Accession #: XP_091332.1 

1 11 21 31 41 51 

I I I I I I

MQRWTLWAAA FLTLHSAQAP PQTDISISPA LPELPLPSLC PLFWMEFKGH CYRFFPLNKT 60 

WAEADLYCSE FSVGRKSAKL ASIHSWEENV FVYDLVNSCV PGIPADVWTG LHDHRQEGQP 120 

EWTDGSSYDY SYWDGSQPDD GVHADPBEED CVQIWYRPTS EQLQAPEPQL PLSISEATDV 180

YLPEDFPAEP KLMDQSWVSR KSLKPSKSHL MEPPTPVAKH QKAKTRHRSL RGVWWPSGKA 240

GSWKERMNAD YGRRKRSAPR QEGRLRCRER RLRAASGQGR PEGQRKQRQQ ERQERGWEEL 300 

GGVSPMRGAQ AWQHGLGAGS QRGAAPECGE NHQAPELGST WRGQRLQPQT AALCHFALRK 360 

LPGNAHGLAA AFVQPALQVQ EEKNNRTRFS GAYFTMSDPT CDQDSKEQSL RRHGREAEKD 420 

GPYRLVKKKR GPVACPSSFE LQSGGEVCLD FPVELRAGTW IAREPP 466 

Seq ID NO: C405 Protein Sequence
Protein Accession #: XP_054869.2 

1 11 21 31 41 51 

I I I I I I 

MHTCCPPVTL EQDLHRKMHS WMLQTLAFAV TSLVLSCAET IDYYGEICDN ACPCEEKDGI 60 

LTVSCENRGI ISLSEISPPR FPIYHLLLSG NLLNRLYPNE FVNYTGAS1L HLGSNVIQDI 120 

ETGAFHGLRG LRRLHLNNNK LELLRDDTFL GLENLEYLQV DYNYISVIEP NAFGKIjHLLQ 180 

VLILNDNLLS SLPNNLFRFV PLTHLDLRGN RLKLLPYVGL LQHMDKWEI* QLEENPWNCS 240 

CELISLiKDWL DSISYSALVG DWCETPFRL HGRDLDEVSK QELCPRRLIS DYEMRPQTPL 300 

STTGYLHTTP ASVNSVATSS SAVYKPPLKP PKGTRQPNKP RVRPTSRQPS KDLGYSNYGP 360 

SIAYQTKSPV PLECPTACSC NLQISDLGLN VNCQERKIES IAELQPKPYN PKKMYI/TENY 420 

IAWRRTDFL EATGLDLLHL GNNRISMIQD RAFGDLTNLR RLYLNGNRIE RLSPELFYGL 480 

QSLQYLFLQY NLIREIQSGT FDPVPNLQLI* FLNNNLLQAM PSGVFSGLTL LRLNLRSNKF 540 

TSLPVSGVLD QLKSLIQIDL HDNPWDCTCD IVGMKLWVEQ LKVGVLVDEV ICKAPKKFAE 600 

TDMRSIKSEL LCPDYSDVW STPTPSSIQV PARTSAVTPA VRLNSTGAPA SLGAGGGASS 660 

VPLSVLILSL LLVFIMSVFV AAGLFVLVMK RRKKNQSDHT STNNSDVSSF NMQYSVYGGG 720 

GGTGGHPHAH VHHRGPALPK VKTPAGHVYE YIPHPLGHMC KNPIYRSREG NSVEDYKDLH 780 

ELKVTYSSNH HLQQQQQPPP PPQQPQQQPP PQLQLQPGEE ERRESHHLRS PAYSVSTIEP 840 

REDLLSPVQD ADRFYRGILE PDKHCSTTPA GNSLPEYPKF PCSPAAYTFS PNYDLRRPHQ 900 

YLHPGAGDSR LREPVLYSPP SAVFVEPNRN EYLELKAKLN VEPDYLEVLE KQTTFSQF 958 

Seq ID NO: C406 Protein Sequence 
Protein Accession #: NP_000784.2 

1 11 21 31 41 51 

I I I I I I

MGILSVDLLI TLQILPVFFS NCLFLALYDS VILLKHWLL LSRSKSTRGE WRRMLTSEGL 60 

RCVWKSFLLD AYKQVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 120 

ERPLWNFGS ATUPPFTSQI. PAFRKLVEEF SSVADFIiVY IDEAHPSDGW AIPGDSSLSP 180 

EVKKHQNQED RCAAAQQLLE RFSLPPQCRV VADRMDNNAN IAYGVAFERV CIVQRQKIAY 240 

LGGKGPFSYN LQEVRHWLEK NFSKRUKKTR IiAG 273 

Seq ID NO: C407 Protein Sequence 
Protein Accession #: NP_006540.2 

1 11 21 31 41 51 

I I I I I I

MSSCVSSQPS SNRAAPQDEIi GGRGSSSSES QKPCEALRGL SSLSIHLGME SFIWTECEP 60 

GCAVDLGLAR DRPLEADGQE VPLDSSGSQA RPHLSGRKLS LQERSQGGLA AGGSLDMNGR 120 

CICPSLPYSP VSSPQSSPRIi PRRPTVESHH VSITGMQDCV QLNQYTLKDE IGKGSYGWK 180 

LAYNENDNTY YAMKVLSKKK LIRQAGFPRR PPPRGTRPAP GGCIQPRGPI EQVYQE1AIL 240

KKLDHPNWK LVEVLDDPNE DHLYMVFELV NQGPVMEVPT LKPLSEDQAR FYFQDLIKGI 300 

EYLEYQKIIH RDIKPSNLLV GEDGHIKIAD FGVSNEFKGS DALliSNTVGT PAFMAPESLS 360 

ETRKIFSGKA LDVWAMGVTL YCFVFGQCPP MDERIMCLHS KIKSQALEFP DQPDIAEDLK 420 

DLITRMLDKN PESRIWPEI KLHPWVTRHG AEPLPSEDEN CTLVEVTEEE VENSVKHIPS 480 

LATVILVKTM IRKRSFGNPF EGSRREERSL SAPGNLLTKK PTRECESLSE LKEARQRRQP 540

PGHRPAPRGG GGSALVRGSP CVESCWAPAP GSPARMHPLR PEEAMEPE 588 

Seq ID NO: C408 Protein Sequence 
Protein Accession #: NP_061116.2 

1 11 21 31 41 51

I I I I I I

MGLSLPKEKG LILCLWSKFC RWFQRRESWA QSRDEQNLLQ QKRIWESPLL LAAKDNDVQA 60 

LNKLL.KYEDC KVHQRGAMGE TALHIAALYD NLEAAMVLME AAPELVFEPM TSELYEGQTA 120 

LHIAWNQNM NLVRALLARR ASVSARATGT AFRRSPCNLI YFGEHPLSFA ACVNSEEIVR 180 

LLIEHGADIR AQDSLGNTVL HILILQPNKT FAOQMYNLLL SYDRHGDHLQ PLDLVPNHQG 240 

LTPFKLAGVE GNTVMFQHLM QKRKHTQWTY GPLTSTLYDL TEIDSSGDEQ SLLELIITTK 300 

KREARQILDQ TPVKELVSLK WKRYGRPYFC MLGAIYLLYI ICFTMCCIYR PLKPRTNNRT 360 

SPRDNTLLQQ KLLQEAYMTP KDDIRLVGEL VTVIGAIIIL LVEVPDIFRM GVTRFFGQTI 420 

LGGPFHVLI I TYAFMVLVTM VMRLISASGE WPMSFALVL GWQJVMYFAR GFQMLGPFTI 480 

MIQKMIFGDL MRFCWLMAW ILGFASAFYI IFQTEDPEEL GHFYDYPMAL FSTFELFLTI 540 

IDGPANYNVD LPFMYSITYA AFAIIATLLM LNLLIAMMGD THWRVAHERD ELWRAQIVAT 600 

TVMLERKLPR CLWPRSGICG REYGLGDRWF LRVEDRQDLN RQRIQRYAQA FHTRGSEDLD 660 

KDSVEKLELG CPFSPHLSLP MPSVSRSTSR SSANWERLRQ GTLRRDLRGI INRGLEDGES 720 

WEYQI 725 

Seq ID NO: C409 Protein Sequence 
Protein Accession #: NP_068710.1

1 11 21 31 41 51

I I I I I I 

MQKVTLGLLV F1AGFPVLDA NDLEDKNSPF YYDWHSLQVG GLICAGVLCA MGIIIVMSEW 60 
RSSGEQAGRG WGSPPLTTQL SPTGAKCKCK FGQKSGHHPG ETPPLITPGS AQS 113 

Seq ID NO: C410 Protein Sequence 
Protein Accession #: NP_005962.1

1 11 21 31 41 51 

I I I I I I

MQKVTLGLLV FLAGFPVLDA NDLEDKNSPF YYDWHSLQVG GLICAGVLCA MGIIIVMSAK 60 

CKCKFGQKSG HHPGETPPLI TPGSAQS 87 

Seq ID NO: C411 Protein Sequence 
Protein Accession #: NP_004952.1 

1 11 21 31 41 51 

I I I I I I 

MLSKVLPVLL GILLILQSRV EGPQTESKNE ASSRDWYGP QPQPLENQLL SEETKSTETE 60 

TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTWTVE IAVNSLGPLS ILDMEYTIDI 120 

IFSQTWYDER LCYNDTFESL VLNGNWSQL WIPDTFFRNS KRTHEHEITM PNQMVRIYKD 180 

GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSFSSFSYPE NEMIYKWENF KLEINEKNSW 240 

KLFQFDFTGV SNKTEIITTP VGDFMVMT I F FNVSRRFGYV AFONYVPSSV TTMLSWVSFW 300 

IKTESAPART SLGITSVLTM TTLGTFSRKN FPRVSYITAL DFYIAICFVF CFCALLEFAV 360 

LNFLIYNQTK AHASPKLRHP RINSRAHART RARSRACARQ HQEAFVCQIV TTEGSDGEER 420 

PSCSAQQPPS PGSPEGPRSL CSKLACCEWC KRFKKYFCMV PDCEGSTWQQ GRLCIHVYRL 480 

DNYSRWFPV TFFFFNVLYW LVCLNL 506 

Seq ID NO: C412 Protein Sequence 
Protein Accession #: NP_068819.1

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 

INEKNSWKLF QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL 360 

CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 393 

Seq ID NO: C413 Protein Sequence 
Protein Accession #: NP_068822.1

1 11 21 31 41 51 

I I I I I I

MEYTIDIIFS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 60 
SHSCPLSFSS FSYPENEMIY KWENFKLEIN EKNSWKLFQF DFTGVSNKTE I ITTPVGDFM 120 
VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 180 
FSRKNFPRVS YITALDFYIA ICFVFCFCAL LEFAVLNFLI YNQTKAHASP KLRHPRINSR 240 
AHARTRARSR ACARQHQBAF VCQIVTTEGS DGEERPSCSA QQFPSPGSPE GPRSLCSKLA 300 
CCEWCKRFKK YFCMVPDCBG STWQQGRLCI HVYRLDNYSR WFPVTFFFF NVLYWLVCLN 360 
L 361 

Seq ID NO: C414 Protein Sequence 
Protein Accession #: NP_068830.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 
MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 
INEKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 
LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 
ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 
GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQGRL 360 
CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 393 

Seq ID NO: C415 Protein Sequence 
Protein Accession #: NP_068S91.1

1 11 21 31 41 51 

I I I I I I 

MPAVSGPGPL FCLLLLLLDP HSPETGCPPL RRFEYKLSFK GPRLALPGAG IPFWSHHGDA 60 

ILGLEEVRLT PSMRNRSGAV WSRASVPFSA WEVEVQMRVT GLGRRGAHGM AVWYTRGRGH 120 

VGSVLGGLAS WDGIGIFFDS PAEDTQDSPA IRVLASDGHI PSEQPGDGAS QGLGSCHWDF 180 

RNRPHPFRAR ITYWGQRLRM SLNSGLTPSD PGEFCVDVGP LLLVPGGFFG VSAATGTLAD 240 

DHDVLSFLTF SLSEPSPEVP PQPFLEMQQL RLARQLEGLW ARLGLGTRED VTPKSDSEAQ 300 

GEGERLFDLE ETLGRHRRIL QALRGLSKQL AQAERQWKKQ LGPPGQARPD GGWALDASCQ 360 

IPSTPGRGGH LSMSLNKDSA KVGALLHGQW TLLQALQEMR DAAVRMAAEA QVSYLPVGIE 420 

HHFLELDHIL GLLQEELRGP AKAAAKAPRP PGQPPRASSC LQPGIFLFYL LIQTVGFFGY 480 

VHFRQELNKS LQECLSTGSL PLGPAPHTPR ALGILRRQPL PASMPA 526



Seq ID NO: C416 Protein Sequence 
Protein Accession #: XP_117036.1

1 11 21 31 41 51

I I I I I I

MERRTRGALG SRRPPPPLPA LRHLCTGLQA AGMAWPGTLW RHTCQGRAXA AEGPWGLFRP 60 

HRCPREAGQA PVGPSPETQG VAHVCSRARV SVDEREPGGG AYAMHVTPRW KGCHRHSGRT 120 

VRGSVSWKRP EQAAPETGRG PAVARGSGDG NECGWG 156 

Seq ID NO: C417 Protein Sequence 
Protein Accession #: XP_167803.2 

1 11 21 31 41 51 

I I I I I I 

MPGKGQRKTA TNKPGGLPGA PGVGIGGHCL YVLECKCPIK NKTKTHHHKK KNFAAKRNEE 60 
KLKKKKKQEK KNHTKFFHHT YPLSQQDFLF AKSYFCGNGP CFLWQGLF 108 

Seq ID NO: C418 Protein Sequence 
Protein Accession #: NP_079056.1 

1 11 21 31 41 51 

I I I I I I 

MFRLVERYEM PRHEVYVLLI RNIFLKISII GILCVYWLNT VALSGEECWE TLIGQDIYRL 60 

LLMDFVFSLV NSFLGEFLRR IIGMQLITSL GLQEFDIARN VLELIYAQTL VWIGIFFCPL 120 

LPFIQMIMLF IMFYSKNISL MMNFQPPSKA WRASQMMTFF IFLLFFPSFT GVLCTLAITI 180 

WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWW IYRNLIGSVH FFFILTLIVL 240 

IITYLYWQIT EGRKIMIRLL HEQIINEGKD KMFLIEKLIK LQDMEKKANP SSLVLERREV 300 

EQQGFLHLGE EDGSLDLRSR RSVQEGNPRA 330 

Seq ID NO: C419 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MLSDDHVNEI IIQVENVSSG VQSHPSSNQI FQEKVLLDSS INMVLSISDI DVIDSQTVSK 60 

RNDQKGNQVL RFSTSLNESM SQTLHSLECM GIDTPGSSHE TVQGQKLIAS LIPMTSRDRI 120 

KAIRNQPRTM EEKRNLRKIV DKEKSKQTHR ILQLNCCIQC LNSISRAYRR SKNSLSEILN 180 

SISLWQKTLK IIGGKFGTSV LSYFNFLRWL LKFNIFSFIL NFSFIIIPQF TVAKKNTLQF 240 

TGLEFFTGVG YFRDTVMYYG FYTNSTIQHG NSGASYNMQL AYIFTIGACL TTCFFSLLFS 300 

MAKYFRNNFI NPHIYSGGIT KLIFCWDFTV THEKAVKLKQ KNLSTEIREN LSELRQENSK 360 

LTFNQLLTRF SAYMVAWWS TGVAIACCAA VYYLAEYNLE FLKTHSNPGA VLLLPFWSC 420 

INLAVPCIYS MFRLVERYEM PRHEVYVLLI RNIFLKISII GILCYYWLNT VALSGEECWE 480 

TLIGQDIYRL LLMDFVFSLV NSFLGEFLRR IIGMQLITSL GLQEFDIARN VLELIYAQTL 540 

VWIGIFFCPL LPFIQMIMLF IMFYSKNISL MMNFQPPSKA WRASQMMTFF IFLLFFPSFT 600 

GVLCTLAITI WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWW IYRNLIGSVH 660 

FFFILTLIVL IITYLYWQIT EGRKIMIRLL HEQIINEGKD KMFLIEKLIK LQDMEKKANP 720 

SSLVLERREV EQQGFLHLGE HDGSLDLRSR RSVQEGNPRA 760 

Seq ID NO: C420 Protein Sequence 
Protein Accession #: NP_002241.1 

1 11 21 31 41 51 

I I I I I I 

MGGDLVLGLG ALRRRKRLLE QEKSLAGWAL VLAGTGIGLM VLHAEMLWFG GCSWALYLFL 60 

VKCTISISTF LLLCLIVAFH AKEVQLFMTD NGLRDWRVAL TGRQAAQIVL ELWCGLHPA 120 

PVRGPPCVQD LGAPLTSPQP WPGFLGQGEA LLSLAMLLRL YLVPRAVLLR SGVLLNASYR 180 

SIGALNQVRF RHWFVAKLYM NTHPGRLLLG LTLGLWLTTA WVLSVAERQA VNATGHLSDT 240 

LWLIPITFLT IGYGDWPGT MWGKIVCLCT GVMGVCCTAL LVAWARKLE FNKAEKHVHN 300 

FMMDIQYTKE MKESAARVLQ EAWMFYKHTR RKESHAARRH QRKLLAAINA FRQVRLKHRK 360 

LREQVNSMVD ISKMHMILYD LQQNLSSSHR ALEKQIDTLA GKLDALTELL STALGPRQLP 420 

427 



Seq ID NO: C421 Protein Sequence 
Protein Accession #: NP_079533.1 

1 11 21 31 41 51 

I I I I I I 

MGGKQRDEDD EAYGKPVKYD PSFRGPIKNR SCTDVICCVL FLLFILGYIV VGIVAWLYGD 60 

PRQVLYPRNS TGAYCGMGEN KDKPYLLYFN IFSCILSSNI ISVAENGLQC PTPQVCVSSC 120 

PEDPWTVGKN EFSQTVGEVF YTKNRNFCLP GVPWNMTVIT SLQQELCPSF LLPSAPALGR 180 

CFPWTNITPP ALPGITNDTT IQQGISGLID SLNARDISVK IFEDFAQSWY WILVALGVAL 240 

VLSLLFILLL RLVAGPLVLV LILGVLGVLA YGIYYCWEEY RVLRDKGASI SQLGFTTNLS 300 

AYQSVQETWL AALIVLAVLE AILLLVLIFL RQRIRIAIAL LKEASKAVGQ MMSTMFYPLV 360 

TFVLLLICIA YWAMTALYPL PTQPATLGYV LWASNISSPG CEKVPINTSC NPTAHLVNSS 420 

CPGLMCVFQG YSSKGLIQRS VFNLQIYGVL GLFWTLNWVL ALGQCVLAGA FASFYWAFHK 480 

PQDIPTFPLI SAFIRTLRYH TGSLAFGALI LTLVQIARVT LEYIDHKLRG VQNPVARCIM 540 

CCFKCCLWCL EKFIKFLNRN AYIMIAIYGK NFCVSAKNAF MLLMRNIVRV WLDKVTDLL 600 

LFFGKLLWG GVGVLSFFFP SGRIPGLGKD FKSPHLNYYW LPIMTSILGA YVIASGFFSV 660 

FGMCVDTLFL CFLEDLERNN GSLDRPYYMS KSLLKILGKK NEAPPDNKKR KK 712 

Seq ID NO: C422 Protein Sequence 
Protein Accession #: NP_057264.1 

1 11 21 31 41 51 

I I I I I I 

MGSNSGQAGR HIYKSLADDG PFDSVEPPKR PTSRLIMHSM AMFGREFCYA VEAAYVTPVL 60 
LSVGLPSSLY SIVWFLSPIL GFLLQPWGS ASDHCRSRWG RRRPYILTLG VMMLVGMALY 
LNGATWAAL IANPRRKLVW AISVTMIGW LFDFAADFID GPIKAYLFDV CSHQDKEKGL 
HYHALFTGFG GALGYLLGAI DWAHLELGRL LGTEFQVMFF FSALVLTLCF TVHLCSISEA 
PLTEVAKGIP PQQTPQDPPL SSDGMYEYGS IEKVKNGYVN PEIAMQGAKN KNHAEQTRRA 
MTLKSLLRAL VNMPPHYRYL CISHLIGVITA FLSNMLFFTD FMGQIVYRGD PYSAHNSTEF 
LIYERGVEVG CWGFCINSVF SSLYSYFQKV LVSYIGLKGL YFTGYLLFGL GTGFIGLFPN 
VYSTLVLCSL FGVMSSTLYT VPFNLITEYH REEEKERQQA PGGDPDNSVR GKGMDCATLT 
CMVQLAQILV GGGLGFLVNT AGTVWWIT ASAVALIGCC FVALFVRYVD 

Seq ID NO: C423 Protein Sequence 
Protein Accession #: NP_003264.1 

1 11 21 31 41 51 

I I I I I I 

MEGFGGVGGR GTRGFAAKGV WRGRAEEGPV LGAAERGFMV STGSRRRVFE GPGGGGLRWT 
PGKGTGRQRG AWGPRAEDGV RRRTLGMPRG SRRDVRAPCG PAGSKGARGG RRRDGPSRRR 
RGSATAAARH HVPPAPGGPF GPRAPAGSTR VPARAGGAVE PTGAAAVARL ARPAGGALPT 
AGAQGAGPAR GRSGEGSEWA RRGKGRPGPY QSPLGPAVAE GQELKDKSRL RYPINGFQAL 
VLTALLVGLG MSAGLPLGAL PEMLLPLAFV ATLTAFIFSL FLYMKAQVAP VSALAPGGNS 
GNPIYDFFLG RELNPRICFF DFKYFCELRP GLIGWVLINL ALLMKEAELR GSPSLAMWLV 
NGFQLLYVGD ALWHEEAVLT TMDITHDGFG FMLAFGDMAW VPFTYSLQAQ FLLHHPQPLG 
LPMASVICLI NATGYYIFRG ANSQKNTFRK NPSDPRVAGL ETISTATGRK LLVSGWWGMV 
RHPNYLGDLI MAIAWSLPCG VSHLLPYFYL LYFTALLVHR EARDERSACR STAWPGRSTA 
GVCLTASCPT STEAAPPPQV GHVPTHPPAH PGPGASTHLG LKGLHPTQP 

Seq ID NO: C424 Protein Sequence 
Protein Accession #: NP_056535.1 

1 11 21 31 41 51 

I I I I I I 

MGRLLRAARL PPLLSPLLLL LVGGAFLGAC VAGSDEPGPE GLTSTSLLDL LLPTGLEPLD 
SEEPSETMGL GAGLGAPGSG FPSEENEESR ILQPPQYFWE EEEELNDSSL DLGPTADYVF 
PDLTEKAGSI EDTSQAQELP NLPSPLPKKN LVEPPWHMPP REEEEEEEEE EEREKEEVEK 
QEEEEEEELL PVNGSQEEAK PQVRDFSLTS SSQTPGATKS RHEDSGDQAS SGVEVESSMG 
PSLLLPSVTP TTVTPGDQDS TSQEAEATVL PAAGLGVEFE APQEASEEAT AGAAGLSGQH 
EEVPALPSFP QTTAPSGAEH PDEDPLGSRT SASSPLAPGD MELTPSSATL GQEDLNQQLL 
EGQAAEAQSR IPWDSTQVIC KDWSNLAGKN YIILNMTENI DCEVFRQHRG PQLLALVEEV 
LPRHGSGHHG AWHISLSKPS EKEQHLLMTL VGEQGWPTQ DVLSMLGDIR RSLEEIGIQN 
YSTTSSCQAR ASQVRSDYGT LFWLWIGA ICIIIIALGL LYNCWQRRLP KLKHVSHGEE 
LRFVENGCHD NPTLDVASDS QSEMQEKHPS LNGGGALNGP GSWGALMGGK RDPEDSDVFE 
EDTHL 

Seq ID NO: C425 Protein Sequence 
Protein Accession #: NP_001188.1 

1 11 21 31 41 51 

I I I I I I 

MSEVRPLSRD ILMETLLYEQ LLEPPTMEVL GMTDSEEDLD PMEDFDSLEC MEGSDALALR 60 

LACIGDEMDV SLRAPRLAQL SEVAMHSLGL AFIYDQTEDI RDVLRSFMDG FTTLKENIMR 120 

FWRSPNPGSW VSCEQVLLAL LLLLALLLPL LSGGLHLLLK 160 

Seq ID NO: C426 Protein Sequence 
Protein Accession #: AAF76225.1 

1 11 21 31 41 51 

I I I I I I 

MATPLPPPSP RELRLLRLLL SGLVLGAALR GAAAGHPDVA ACPGSLDCAL KRRARCPPGA 60 
HACGPCLQPF QEDQQGLCVP RMRRPPGGGR PQPRLEDEID FLAQELARKE SGQSTPPLPK 120 
DRQRLPEPAT LGFSARGQGL ELGLPSTPGT PTPTPHTSLG SPVSSDPVHM SPLEPRGGQG 180 
DGLALVLILA FCVAGAAALS VASLCWCRLQ REIRLTQKAD YATAKAPGSP AAPRISPGDQ 240 
RLAQSAEMYH YQHQRQQMLC LERHKEPPKE LDTASSDEEN EDGDFTVYEC PGIAPTGEME 300 
VRNPLFDHAA LSAPLPAPSS PPALP 325 

Seq ID NO: C427 Protein Sequence 
Protein Accession #: NP_004436.1 

1 11 21 31 41 51 

I I I I I I 

MVCSLWVLLL VSSVLALEEV LLDTTGETSE IGWLTYPPGG WDEVSVLDDQ RRLTRTFEAC 60 

HVAGAPPGTG QDNWLQTHFV ERRGAQRAHI RLHFSVRACS SLGVSGGTCR ETFTLYYRQA 120 

EEPDSPDSVS SWHLKRWTKV DTIAADESFP SSSSSSSSSS SAAWAVGPHG AGQRAGLQLN 180 

VKERSFGPLT QRGFYVAFQD TGACLALVAV RLFSYTCPAV LRSFASFPET QASGAGGASL 240 

VAAVGTCVAH AEPEEDGVGG QAGGSPPRLH CNGEGKWMVA VGGCRCQPGY QPARGDKACQ 300 

ACPRGLYKSS AGNAPCSPCP ARSHAPNPAA PVCPCXjEGFY RASSDPPEAP CTGPPSAPQE 360 

LWFEVQGSAL MLHWRLPREL GGRGDLLFNV VCKECEGRQE PASGGGGTCH RCRDEVHFDP 420 

RQRGLTESRV LVGGLRAHVP YILEVQAVNG VSELSPDPPQ AAAINVSTSH EVPSAVPWH 480 

QVSRASNSIT VSWPQPDQTN GNILDYQLRY YDQAEDESHS FTLTSETNTA TVTQLSPGHI 540 

YGFQVRARTA AGHGPYGGKV YFQTLPQGEL SSQLPERLSL VIGSILGALA FLLLAAITVL 600 

AWFQRKRRG TGYTEQLQQY SSPGLGVKYY IDPSTYEDPC QAIRELAREV DPAYIKIEEV 660 

IGTGSFGEVR QGRLQPRGRR EQTVAIQALW AGGAESLQMT FI/3RAAVLGQ FQHPNILRLE 720 

GWTKSRPLM VLTEFMEDGP LDSFLRQREG QFSSLQLVAM QRGVAAAMQY LSSFAFVHRS 780 

LSAHSVLVNS HLVCKVARLG HSPQGPSCLL RWAAPEVIAH GKHTTSSDVW SFGILMWEVM 840 

SYGERPYWDM SEQEVLNAIE QEFRLPPPPG CPPGLHLLML DTWQKDRARR PHFDQLVAAF 900 

DKMIRKPDTL QAGGDPGERP SQALLTPVAL DPPCLDSPQA WLSAIGLECY QDNFSKFGLC 960 
TFSDVAQLSL EDLPALGITL AGHQKKLLHH IQLLQQHLRQ QGSVEV 1006 

Seq ID NO: C428 Protein Sequence 
Protein Accession #: XP_043340.2 

1 11 21 31 41 51 

I I I I I I 

MPFDFRRFDI YRKVPKDLTQ PTYTGAIISI CCCLFILFLF LSELTGFITT EWNELYVDD 60 
PDKDSGGKID VSLNISLPNL HCELVGLDIQ DEMGRHEVGH IDNSMKIPLN NGAGCRFEGQ 120 
FSINKVPGNF HVSTHSATAQ PQNPDMTHVI HKLSFGDTLQ VQNIHGAFNA LGGADRLTSN 180 
PLASHDYILK IVPTVYEDKS GKQRYSYQYT VANKEYVAYS HTGRIIPAIW FRYDLSPITV 240 
KYTERRQPLY RFITTICAII GGTFTVAGIL DSCIFTASEA WKKIQLGKMH 290 

Seq ID NO: C429 Protein Sequence 
Protein Accession #: NP_002142.1 

1 11 21 31 41 51 

I I I I I I 

MAQKEGGRTV PCCSRPKVAA LTAGTLLLLT AIGAASWAIV AVLLRSDQEP LYPVQVSSAD 60 
ARLMVFDKTE GTWRLLCSSR SNARVAGLSC EEMGFLRALT HSELDVRTAG ANGTSGFFCV 120 
DEGRLPHTQR LLEVISVCDC PRGRFLAAIC QDCGRRKLPV DRIVGGRDTS LGRWPWQVSL 180 
RYDGAHLCGG SLLSGDWVLT AAHCFPERNR VLSRWRVFAG AVAQASPHGL QLGVQAWYH 240 
GGYLPFRDPN SEENSNDIAL VHLSSPLPLT EYIQPVCLPA AGQALVDGKI CTVTGWGNTQ 300 
YYGQQAGVLQ EARVPIISND VQJGADFYGN QIKPKMFCAG YPEGGIDACQ GDSGGPFVCE 360 
DSISRTPRWR LCGIVSWGTG CALAQKPGVY TKVSDFREWI FQAIKTHSEA SGMVTQL 417 

Seq ID NO: C430 Protein Sequence 
Protein Accession #: BAA92562.1 

1 11 21 31 41 51 

I I I I I I 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 60 
EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE ENTTTSFTWA 120 
FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 180 
RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 240 
FSALANTVTL AGGPSFTSKG LKYFHHFTLS LCGNQGRKMS VCTDNVTDLR IPEGESGFSK 300 
SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 360 
PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHFLWES 420 
AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 480 
ISAGTCTAIL LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVEDD 540 
LIFTSKKSLF GKIKSFTSKQ PAPVTISLSE DS 572 

Seq ID NO: C431 Protein Sequence 
Protein Accession #: NP_004855.1 

1 11 21 31 41 51 

I I I I I I 

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 
EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 
HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 
ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 
IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 
LAKDCHCI 

Seq ID NO: C433 Protein Sequence 
Protein Accession #: NP_443090.1 

1 11 21 31 41 51 

I I I I I I 

MEDPSGAREP RARPRERDPG RRPHPDQGRT HDRPRDRPGD PRRKRSSDGN RRRDGDRDPK 60 

RDQERDGNRD RNRDRERERE RERDPDRGPR RDTHRDAGPR AGEHGVWEKP RQSRTRDGAR 120 

GLTWDAAAPP GPAPWEAPEP PQPQRKGDPG RRRPESEPPS ERYLPSTPRP GREEVEYYQS 180 

EAEGLLECHK CKYLCTGRAC CQMLEVLLNL LILACSSVSY SSTGGYTGIT SLGGIYYYQF 240 

GGAYSGFDGA DGEXAQQLDV QFYQLKLPMV TVAMACSGAL TALCCLFVAM GVLRVPWBCP 300 

LLLVTEGLLD MLIAGGYIPA LYFYFHYLSA AYGSPVCKER QALYQSKGYS GFGCSFHGAD 360 

IGAGIFAALG IWFALGAVL AIKGYRKVRK LKEKPAEMFE F 401 

Seq ID NO: C435 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 

KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 

IQATTMDGVN LSTEWYKKG QDYRPACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420 

STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMRAE VGLLSRNIIV KGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540 

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGH YSPGYSEHIP 720 






LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780 
IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 840 
FSPPCRCLLK LVTTGSPFAH VSLAHS 866 
It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication, accession number, or patent 
application were specifically and individually indicated to be incorporated by reference. 

WHAT IS CLAIMED IS: 



1. A method for determining the presence or absence of a pathological cell in a 
patient, said method comprising detecting a nucleic acid comprising a sequence at least 80% 
identical to a sequence as described in Tables 2A-80 in a biological sample from said patient, 
thereby determining the presence or absence of said pathological cell. 

2. The method of Claim 1, wherein: 
   a) said pathology is described in Table 1, including a cancer; and/or 
   b) said biological sample comprises isolated nucleic acids. 

3. The method of Claim 1, wherein said biological sample is tissue from an organ 
which is affected by said pathology of Table 1, including a cancer. 

4. The method of Claim 2, wherein said nucleic acids are mRNA. 

5. The method of Claim 2: 
   a) further comprising a step of amplifying nucleic acids before said step of detecting 
      said nucleic acid; or 
   b) where said detecting is of a protein encoded by said nucleic acid. 

6. The method of Claim 1, wherein said nucleic acid comprises a sequence as 
described in Tables 2A-80. 

7. The method of Claim 2, wherein: 
   a) said detecting step is carried out by: 
      i) using a labeled nucleic acid probe; 
      ii) utilizing a biochip comprising a sequence at least 80% identical to a sequence 
          as described in Tables 2A-80; or 
      iii) detecting a polypeptide encoded by said nucleic acid; or 
   b) said patient is: 
      i) undergoing a therapeutic regimen to treat said pathology of Table 1; or 
      ii) is suspected of having said pathology or cancer. 

8. An isolated nucleic acid molecule comprising a sequence as described in 
Tables 2A-80. 

9. The nucleic acid molecule of Claim 8, which is labeled. 

10. An expression vector comprising the nucleic acid of Claim 8. 

11. A host cell comprising the expression vector of Claim 10. 

12. An isolated polypeptide which is encoded by a nucleic acid molecule 
comprising a sequence as described in Tables 2A-80. 

13. An antibody that specifically binds a polypeptide of Claim 12. 

14. The antibody of Claim 13: 
   a) conjugated to an effector component; 
   b) conjugated to a detectable label, including a fluorescent label, a radioisotope, or a 
      cytotoxic chemical; 
   c) which is an antibody fragment; or 
   d) which is a humanized antibody. 

15. A method for specifically targeting a compound to a pathological cell in a 
patient, said method comprising administering to said patient an antibody of Claim 13, 
thereby providing said targeting. 

16. A method for determining the presence or absence of a pathological cell in a 
patient, said method comprising contacting a biological sample with an antibody of Claim 13. 



17. The method of Claim 16, wherein: 
   a) said antibody is conjugated to: 
      i) an effector component; or 
      ii) a fluorescent label; or 
   b) said biological sample is a blood, serum, urine, or stool sample. 

18. A method for identifying a compound that modulates a pathology-associated 
polypeptide, said method comprising the steps of: 
   a) contacting said compound with a pathology-associated polypeptide, said 
      polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 
      at least 80% identical to a sequence as described in Tables 2A-80; and 
   b) determining the functional effect of said compound upon said polypeptide. 

19. A drug screening assay comprising the steps of: 
   a) administering a test compound to a mammal having a pathology of Table 1 or a 
      cell isolated therefrom; and 
   b) comparing the level of gene expression of a polynucleotide that selectively 
      hybridizes to a sequence at least 80% identical to a sequence as described in 
      Tables 2A-80 in a treated cell or mammal with the level of gene expression of said 
      polynucleotide in a control cell or mammal, wherein a test compound that 
      modulates said level of expression of the polynucleotide is a candidate for the 
      treatment of said pathology. 


